cindygross.wordpress.com
Hadoop Likes Big Files
One of the frequently overlooked yet essential best practices for Hadoop is to prefer fewer, bigger files over more, smaller files. How small is too small and how many is too many? How do you stitc…