Tutorial: Hadoop: Common Terminologies

Neeraj Sharma

Neeraj Sharma

@neeraj-iAaNcG Oct 21, 2024
For other tutorials and index of tutorials in this series click here : <a href="https://www.crazyengineers.com/threads/tutorial-hadoop-main-thread.70394">Tutorial : Hadoop -- Main Thread</a>

Hello guys. Once you have an idea of what Hadoop and Big Data really are, lets have a look at the next important thing and that is to understand some of the terminologies that will be used in future tutorials.

Node :

A node is just a computer system typically a non enterprise hardware capable of storing data. Please note that it is a combination of nodes that lets Hadoop run with each node performing several functionality which I will go through in later tutorials. Diagrammatically a node is just another computer system as below


3

Rack :

We can add several nodes one after another and this collection of nodes is termed as a Rack. A rack is typically a collection of 30 to 40 nodes physically aligned close to each other and connected via the same network switch.

4
Important note: Network bandwidth any two nodes on a rack is greater than network bandwidth between two nodes on different racks.

Cluster:

A Hadoop Cluster or a Cluster is a collection of Racks.

5

So, these were the basic terminologies that must be understood in order to proceed further Do drop in comments in case you have any doubts.

Check in the Main thread here for links to other tutorials : <a href="https://www.crazyengineers.com/threads/tutorial-hadoop-main-thread.70394">Tutorial : Hadoop -- Main Thread</a>

Replies

Welcome, guest

Join CrazyEngineers to reply, ask questions, and participate in conversations.

CrazyEngineers powered by Jatra Community Platform

  • CodeReaper

    CodeReaper

    @codereaper-dYaglQ Nov 18, 2014

    Small yet good piece of information