Mirror of https://github.com/apache/druid.git (synced 2025-02-07 18:48:33 +00:00)
de0a7b27e7
Use CombineTextInputFormat instead of TextInputFormat. CombineTextInputFormat combines multiple input splits into a single mapper, which reduces the strain on the Hadoop platform. Job completion time improves considerably because there are fewer mappers to schedule and track.
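
To illustrate why combining splits cuts the mapper count, here is a minimal sketch in plain Java. It is a simplified model, not Hadoop's actual algorithm (which also takes node and rack locality into account) and not the code from this commit. The class name `SplitCombineSketch`, the `combine` helper, the 1 MB file size, and the 128 MB cap are all hypothetical values chosen for illustration. In a real job the switch would typically be made by passing `CombineTextInputFormat.class` to `Job.setInputFormatClass`, but the exact change in this commit is not shown here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: models split combining with a greedy packer.
class SplitCombineSketch {

    /**
     * Greedily packs per-file split sizes (in bytes) into combined splits
     * whose total size stays at or below maxBytes. A single split larger
     * than maxBytes still gets a combined split of its own.
     */
    public static List<List<Long>> combine(long[] splitSizes, long maxBytes) {
        List<List<Long>> combined = new ArrayList<>();
        List<Long> current = new ArrayList<>();
        long currentBytes = 0;
        for (long size : splitSizes) {
            // Close the current group when adding this split would exceed the cap.
            if (!current.isEmpty() && currentBytes + size > maxBytes) {
                combined.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
            current.add(size);
            currentBytes += size;
        }
        if (!current.isEmpty()) {
            combined.add(current);
        }
        return combined;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;

        // Assumed workload: 1000 small input files of 1 MB each.
        long[] smallFiles = new long[1000];
        Arrays.fill(smallFiles, mb);

        // One mapper per file when splits are not combined.
        int mappersWithout = smallFiles.length;
        // One mapper per combined split, with an assumed 128 MB cap.
        int mappersWith = combine(smallFiles, 128 * mb).size();

        System.out.println("mappers without combining: " + mappersWithout);
        System.out.println("mappers with combining:    " + mappersWith);
    }
}
```

Under these assumed numbers the model goes from 1000 mappers to 8, which is the kind of bookkeeping reduction the commit message describes. The real saving depends on file sizes, the configured maximum split size, and data locality.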