Parallel Data Sorting and Deduplication in Distributed File Systems