Sorting a huge text file using hadoop -



Sorting a huge text file using hadoop -

is possible sort huge text file lexicographically using mapreduce job has map tasks , 0 cut down tasks?

the records of text file separated new line character , size of file around 1 terra byte.

it great if 1 can suggest way accomplish sorting on huge file.

used treeset in map method hold entire info in input split , persisted it. got sorted file!

sorting hadoop mapreduce cloudera

Comments

Popular posts from this blog

web services - java.lang.NoClassDefFoundError: Could not initialize class net.sf.cglib.proxy.Enhancer -

Accessing MATLAB's unicode strings from C -

javascript - mongodb won't find my schema method in nested container -