Provisioning Hadoop Cluster with Ansible

Hi there! Today we are going to set up and configure a Hadoop cluster using Ansible.

What is Ansible?

Ansible is an open-source automation tool used for IT tasks such as configuration management, application deployment, intra-service orchestration, and provisioning.

Ansible automates and simplifies repetitive, complex, and tedious operations. It saves a lot of time when we install packages or configure a large number of servers. Its architecture is simple and effective: it is agentless, connecting to your nodes (over SSH by default) and pushing small programs, called modules, to them.

Let's Start

We have already set up our configuration file and inventory. You can get an idea of how to set up a basic inventory here.
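For readers who have not built an inventory yet, a minimal sketch in Ansible's YAML inventory format might look like the following. The group names (`namenode`, `datanode`), host aliases, IP addresses, and the `root` login are all placeholders chosen for illustration, not the values used in this setup.

```yaml
# Hypothetical inventory: group names, aliases, addresses, and user are assumptions.
all:
  vars:
    ansible_user: root            # assumed SSH login on the managed nodes
  children:
    namenode:
      hosts:
        master1:
          ansible_host: 192.0.2.10   # placeholder address
    datanode:
      hosts:
        worker1:
          ansible_host: 192.0.2.11   # placeholder address
```

Keeping the name node and data nodes in separate groups lets each playbook target only the machines it is meant to configure.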

With that out of the way, let's move on to the main part: writing a playbook to configure the name node.

[Screenshot: playbook for the name node]

You can see the full playbook here
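In case the screenshot is hard to read, here is a rough sketch of what a name node playbook can look like. It is not the exact playbook used here: it assumes Java and Hadoop 3.x are already installed with `JAVA_HOME` configured, and the group name, install path, metadata directory, address, and port are all hypothetical. Older Hadoop releases use different property names and start commands.

```yaml
# Sketch only, not the author's playbook. Every path, address, port, and
# group name below is an assumption; Hadoop 3.x command names are assumed.
- name: Configure the Hadoop name node
  hosts: namenode                    # hypothetical inventory group
  become: true
  vars:
    hadoop_home: /opt/hadoop         # assumed install location
    namenode_dir: /nn                # hypothetical metadata directory
    namenode_host: 192.0.2.10        # placeholder address of the name node
    namenode_port: 9000              # assumed RPC port
  tasks:
    - name: Create the metadata directory
      ansible.builtin.file:
        path: "{{ namenode_dir }}"
        state: directory
        mode: "0755"

    - name: Tell HDFS where the name node keeps its metadata
      ansible.builtin.copy:
        dest: "{{ hadoop_home }}/etc/hadoop/hdfs-site.xml"
        mode: "0644"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>dfs.namenode.name.dir</name>
              <value>{{ namenode_dir }}</value>
            </property>
          </configuration>

    - name: Set the file system address served by the name node
      ansible.builtin.copy:
        dest: "{{ hadoop_home }}/etc/hadoop/core-site.xml"
        mode: "0644"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>fs.defaultFS</name>
              <value>hdfs://{{ namenode_host }}:{{ namenode_port }}</value>
            </property>
          </configuration>

    - name: Format the name node once (skipped when metadata already exists)
      ansible.builtin.command:
        cmd: "{{ hadoop_home }}/bin/hdfs namenode -format -nonInteractive"
        creates: "{{ namenode_dir }}/current/VERSION"

    - name: Start the name node daemon (not idempotent, guard it before re-running)
      ansible.builtin.command:
        cmd: "{{ hadoop_home }}/bin/hdfs --daemon start namenode"
```

The `creates` argument keeps the format step from running again once the metadata directory has been initialised, which matters because reformatting a name node wipes its metadata.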

Now let's run the playbook and check the results. We can run it with:

ansible-playbook <path_to_playbook>

[Screenshots: playbook run on the Ansible controller]

Let's check the master node.

[Screenshots: master node after the playbook run]

Now let's configure a data node the same way.

[Screenshot: playbook for the data node]

You can see the whole playbook here
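As with the name node, a data node playbook can be sketched roughly as below. Again, this is an illustration under assumptions rather than the playbook shown above: Hadoop 3.x is assumed to be installed already, and the group name, install path, storage directory, and name node address and port are hypothetical.

```yaml
# Sketch only, not the author's playbook. Every path, address, port, and
# group name below is an assumption; Hadoop 3.x command names are assumed.
- name: Configure a Hadoop data node
  hosts: datanode                    # hypothetical inventory group
  become: true
  vars:
    hadoop_home: /opt/hadoop         # assumed install location
    datanode_dir: /dn                # hypothetical block storage directory
    namenode_host: 192.0.2.10        # placeholder address of the name node
    namenode_port: 9000              # assumed RPC port of the name node
  tasks:
    - name: Create the block storage directory
      ansible.builtin.file:
        path: "{{ datanode_dir }}"
        state: directory
        mode: "0755"

    - name: Tell HDFS where the data node stores its blocks
      ansible.builtin.copy:
        dest: "{{ hadoop_home }}/etc/hadoop/hdfs-site.xml"
        mode: "0644"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>dfs.datanode.data.dir</name>
              <value>{{ datanode_dir }}</value>
            </property>
          </configuration>

    - name: Point the data node at the name node
      ansible.builtin.copy:
        dest: "{{ hadoop_home }}/etc/hadoop/core-site.xml"
        mode: "0644"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>fs.defaultFS</name>
              <value>hdfs://{{ namenode_host }}:{{ namenode_port }}</value>
            </property>
          </configuration>

    - name: Start the data node daemon (not idempotent, guard it before re-running)
      ansible.builtin.command:
        cmd: "{{ hadoop_home }}/bin/hdfs --daemon start datanode"
```

The main difference from the name node is that there is no format step, and `fs.defaultFS` points at the name node's address so the data node knows where to register.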

Now let's run the playbook and see the results.

[Screenshot: playbook run on the Ansible controller]

Here you can see the data node has been connected to the master successfully.

[Screenshot: master node showing the connected data node]
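The same check can also be run from the controller instead of logging in to the master. The play below is a small sketch that asks the name node for a cluster report and prints it. The `namenode` group and the `/opt/hadoop` path are assumptions, and the `hdfs dfsadmin -report` form assumes Hadoop 2.x or later.

```yaml
# Sketch only: the group name and install path are assumptions.
- name: Check HDFS from the name node
  hosts: namenode                    # hypothetical inventory group
  become: true
  vars:
    hadoop_home: /opt/hadoop         # assumed install location
  tasks:
    - name: Ask the name node for a cluster report
      ansible.builtin.command:
        cmd: "{{ hadoop_home }}/bin/hdfs dfsadmin -report"
      register: hdfs_report
      changed_when: false            # read-only check, never reports a change

    - name: Show the report
      ansible.builtin.debug:
        var: hdfs_report.stdout_lines
```

If the data node has registered, it should appear in the live data nodes section of the printed report.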
