Map Reduce jobs take too long. What can be done to improve the performance of the cluster? | Hadoop admin questions

One the most common reasons for performance problems on Hadoop cluster is uneven distribution of the tasks. The number tasks has to match the number of available slots on the cluster
Hadoop is not a hardware aware system. It is the responsibility of the developers and the administrators to make sure that the resource supply and demand match.

No comments:

Post a Comment