In this blog, we will see how to archive/delete a file in HDFS if it is n days older. We can use this to check for any number of days. For example, let us say that we need to monitor an HDFS folder and delete the files when they become 7 days older.
In this previous article, we saw how to install Apache Hive in the Ubuntu machine. Both of these articles are written with an assumption that you have already installed the Hadoop framework in the machine. If not, please visit this post and install the Hadoop framework first. Pig is another component of the Hadoop ecosystem… Read More »
In realtime, Hadoop will be installed into a network of machines to form a cluster. Here, in this article, we will see the installation of Hadoop step by step in a single Ubuntu system. The post is written with an assumption that you already know what is a Name node, Data node, HDFS, etc and… Read More »
What is HDFS? This is the common question that everyone will encounter when they start learning about Hadoop. HDFS deals with the way data is stored and managed by Hadoop Framework. What is a Distributed File System? A distributed file system deals with managing data(files and folder) across multiple nodes or computers. It serves the… Read More »
While reading this article, Hadoop has already reached 3.0 version. However, it is important to know the history and how it evolved in the past. This will help people who are working in the migration projects from Hadoop 1.0 to Hadoop 2.0. It will also help the developers to understand and consider future use cases… Read More »