What does the Hadoop administrator have to do after adding new datanodes to the Hadoop cluster? | Hadoop admin questions

Since the new nodes will not have any data on them, the administrator needs to start the balancer to redistribute data evenly between all nodes.
Hadoop cluster will detect new datanodes automatically. However, in order to optimize the cluster performance it is recommended to start rebalancer to redistribute the data between datanodes evenly.

No comments:

Post a Comment