HDFS copyFromLocal vs. put Command

“What’s the difference between the copyFromLocal and put commands in the HDFS CLI?” It is a very common interview question. Let’s work out the notable differences between put and copyFromLocal. Both commands serve the same purpose: loading data into HDFS. Let’s demonstrate how they behave. Variation 1: Loading data from local file system and storing the same […]

HDFS Storage Balancer Part 1

In this tutorial, we will learn how to use the HDFS Storage Balancer effectively. We will also go through the options that can be combined in the Hadoop balancer command. HDFS stores data using a ‘Write Once’ paradigm, in which existing files can only be appended to. In production, there exists a scenario, where there might […]
