What does a split do? | Hadoop Questions

Before data stored in HDFS reaches the map method, it goes through a phase called the split. A split is a logical chunk of the input that the framework reads from HDFS and hands to a single map task. The split does not write anything: the data it covers is read from the block and passed to the mapper record by record. By default, splitting is taken care of by the framework.
By default, the split size is equal to the HDFS block size, so each block of a file becomes one split. The split size can also be configured so that a file is divided into splits that are smaller or larger than a block.
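
The relationship between block size and split size can be illustrated with a small sketch. It is plain Java with no Hadoop dependency. The size rule follows the one used by Hadoop's FileInputFormat (the larger of the minimum size and the smaller of the maximum size and block size), but the class name SplitSketch, the method planSplits, and the 300 MB file are assumptions made up for illustration. The sketch also ignores details such as the small tolerance the real framework allows for the final split.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class, not part of Hadoop.
class SplitSketch {

    // Split size rule: max(minSize, min(maxSize, blockSize)).
    // With the default min and max, this is simply the block size.
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    // Divide a file of fileLength bytes into {offset, length} pairs.
    // Each pair stands for one logical split, i.e. the input of one map task.
    static List<long[]> planSplits(long fileLength, long splitSize) {
        List<long[]> splits = new ArrayList<>();
        long offset = 0;
        while (offset < fileLength) {
            long length = Math.min(splitSize, fileLength - offset);
            splits.add(new long[] {offset, length});
            offset += length;
        }
        return splits;
    }

    public static void main(String[] args) {
        long blockSize = 128L * 1024 * 1024;   // assumed 128 MB HDFS block
        long fileLength = 300L * 1024 * 1024;  // assumed 300 MB input file

        // Default min/max leave the split size equal to the block size.
        long splitSize = computeSplitSize(blockSize, 1L, Long.MAX_VALUE);

        for (long[] s : planSplits(fileLength, splitSize)) {
            System.out.println("offset=" + s[0] + " length=" + s[1]);
        }
    }
}
```

With the default limits the split size comes out equal to the block size, so the assumed 300 MB file yields three splits: two full 128 MB splits and one 44 MB remainder. Lowering the maximum size below the block size would produce more, smaller splits from the same file.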
