Hi There! Today we are going to set up and configure Hadoop Cluster using Ansible.
What is Ansible?
Ansible is an open-source automation tool, or platform, used for IT tasks such as configuration management, application deployment, intraservice orchestration, and provisioning.
Ansible automates and simplifies repetitive, complex, and tedious operations. Everybody likes it because it brings huge time savings when we install packages or configure large numbers of servers. Its architecture is simple and effective. It works by connecting to your nodes and pushing small programs to them.
Let's Start
We have already set up our configuration file and inventory. You can get some idea on how to set up a basic inventory here
Now that we have that out of the way let's move to the main part. Now we write a playbook to configure the name node.
You can see the full playbook here
Now let's run the playbook and check the results. We can run the playbook using
ansible-playbook <path_to_playbook>
Let's check the master node.
Now let's configure a data node the same way.
You can see the whole playbook here
Now let's run the playbook and see the results.
Here you can see the data node has been connected to the master successfully.