
What Are the Most Common Issues Faced in Apache Spark in Production? | Most Asked Interview Question
In this video, I explain the most frequently asked interview questions for the Data Engineer role, so that experienced candidates know how production Spark jobs are handled.
Most asked interview questions on common Apache Spark issues in a production environment:
1. What is an OOM (out of memory) error? How do you find the issue, and how do you fix it in a production system?
2. How do you increase the resources for a job that failed with an OOM error?
3. A job is taking a long time in production. How do you identify the issue, and how do you fix it in real time?
4. What is a task or stage failure, and how do you fix those errors?
5. What are data skew and data spill issues? How do you find and fix them in a production environment?
6. What is a GC overhead error, and how do you fix it?
7. What is the CoarseGrainedExecutorBackend error (RECEIVED SIGNAL TERM), and how do you find and fix it?
8. What is a ClassNotFoundException, and how do you fix it?
9. What is a FileNotFoundException, and how do you handle it?
Timeline:
00:36 - Out of Memory Issue
06:06 - Solutions for Out of Memory Issues
09:35 - Executor lost error
11:28 - Debugging the long running Spark job
12:30 - ClassNotFoundException
13:45 - FileNotFoundException
14:33 - Stage failures or task failures reaching max attempts
15:21 - GC Issues
16:24 - CoarseGrainedExecutorBackend: RECEIVED SIGNAL TERM
Please watch my other video on how to resolve data skew and data spills in a Spark job without changing the code:
• How to remove Data skew and Data spil...
#sparkinterviewquestions #realtime #dataengineer #productionjobs