- This guide is intended to ease the installation of a physical (or virtual) cluster running Nagios, Hadoop and HBase, with everything automated using Ansible.
- The scenario (depicted in the figure below) will be composed of:
- 1 master node with 1 public interface and 1 private interface.
- 1 or more slave nodes, each one with a private Ethernet interface connected to the master node.
Configuration of the different nodes is divided into two Ansible stages:
- provision, which is executed only the first time a node is configured.
- atboot, which is executed each time a node is rebooted (see the sketch below).
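For illustration, the atboot stage is typically triggered by an init script installed during provisioning (on slaves this is /etc/init.d/get_init_slave.sh, checked later in this guide). A minimal sketch of how such a hook could look follows; the playbook name and paths are assumptions, not the repository's actual contents:

#!/bin/sh
# Hypothetical atboot hook: re-run the atboot Ansible stage on every boot.
# Paths and playbook name are illustrative; the real hook is installed by the provision stage.
cd /tmp/provision/ansible && ansible-playbook playbook_atboot.yml --connection=local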
Additionally, several administration Ansible playbooks are provided in the utils folder to shut down, start, restart and test the cluster, as in the example below.
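For example, stopping the whole cluster could look like the following; the playbook file name here is an assumption, so check the utils folder for the actual names:

cd ~/quick_cluster/utils
ansible-playbook shutdown.yml -K   # hypothetical name; use the actual playbook found in utils/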
- Master node configuration
- Slave node configuration
- Slave sign-up in Master
- Warnings and Troubleshooting
(If you decide to install a different Ubuntu distribution, just skip to the next step.)
After booting up the live CD and (if necessary) configuring network access, open up a terminal, assume root access with `sudo -s`, and install mdadm:
apt-get install mdadm
We assume two identical hard disks (`/dev/sd[ab]`), which will be used to create a single mdadm volume `/dev/md0`, which will then be partitioned for `/`, swap and data storage, e.g. `/home`.
For each of the disks, create a partition and enable the RAID flag on it, e.g., by using GParted (a command-line alternative is sketched below).
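If you prefer the command line over GParted, an equivalent preparation could look like this (a sketch; adjust device names and sizes to your disks before running it):

# For each disk: create a partition table, one full-size partition, and set the RAID flag on it.
parted /dev/sda --script mklabel msdos mkpart primary 0% 100% set 1 raid on
parted /dev/sdb --script mklabel msdos mkpart primary 0% 100% set 1 raid on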
Now we create the mdadm volume:
sudo mdadm --create /dev/md0 --bitmap=internal --level=1 -n 2 /dev/sd[ab]1
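You can verify the array and watch the initial synchronization with standard mdadm tooling:

cat /proc/mdstat                 # shows md0, its member disks and the resync progress
sudo mdadm --detail /dev/md0     # detailed state of the new array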
After that we can begin the installation. Make sure to start the installer from the terminal with the -b option, because installing the bootloader will fail anyway:
ubiquity -b
Make sure to go for manual partitioning and "use" the `/dev/md0` device just created for `/`. After the installation the system is not yet bootable, so do not restart the box right away. We need to `chroot` into the installed system and fix up some things:
sudo -s
mount /dev/md0p1 /mnt                          # mount the freshly installed root filesystem
mount -o bind /dev /mnt/dev                    # bind-mount the virtual filesystems needed inside the chroot
mount -o bind /dev/pts /mnt/dev/pts
mount -o bind /sys /mnt/sys
mount -o bind /proc /mnt/proc
cat /etc/resolv.conf >> /mnt/etc/resolv.conf   # give the chroot working DNS
chroot /mnt
apt-get install mdadm                          # install mdadm inside the new system so the initramfs can assemble /dev/md0
nano /etc/grub.d/10_linux                      # change quick_boot to 0
grub-install /dev/sda                          # install GRUB on both disks so either one can boot
grub-install /dev/sdb
update-grub
exit
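Optionally, you can unmount everything cleanly before rebooting (not strictly required, since the reboot drops the mounts anyway):

umount /mnt/proc /mnt/sys /mnt/dev/pts /mnt/dev
umount /mnt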
Remove the installation CD/USB and reboot. Now the system should start.
sudo passwd root
Please do not log in as the root user after doing this.
sudo apt-get install git -y
git clone https://github.com/catedrasaes-umu/quick_cluster.git
4.1. Locate this script: $/provision/master/prepovision_master.sh
4.2. Execute it as sudo:
sudo ./prepovision_master.sh <host_name>
(e.g., use cluster0 as host name)
4.3. Generate public key:
ssh-keygen -t rsa
(Press `Return` 3 times)
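Equivalently, the key can be generated non-interactively (same result, no prompts):

ssh-keygen -t rsa -N "" -f ~/.ssh/id_rsa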
4.4. Configure vars files. Before the Ansible provision can start, some variables must be defined. These variables may be found in the `provision/ansible/vars` folder and they are:
- In `vars_common.yml`:
  - `user`: The user name to be used in the cluster. It will be the same for the master and the slave nodes.
  - `master_hostname`: The hostname for the master node. It is suggested to name it "cluster0", so the slave nodes may be named "cluster1..X".
  - `master_ip`: The IP used by the master to connect to the slaves.
- In `vars_master.yml`:
  - `mac_eth0`: The physical MAC address of the interface to be renamed to eth0.
  - `mac_eth1`: The physical MAC address of the interface to be renamed to eth1.
  - `modprobe_eth0`: The module name of the interface to be renamed to eth0. Obtainable with the command `dmesg | grep 'interfaceName'`.
  - `modprobe_eth1`: The module name of the interface to be renamed to eth1. Obtainable with the command `dmesg | grep 'interfaceName'`.
  - `cluster_repository`: The path to the cluster git repository.
  - `nagios_admin_user`: The admin user to be used in the Nagios installation.
  - `nagios_admin_pass`: The admin user password to be used in the Nagios installation.
  - `nagios_admin_email`: An admin email to be notified when an alert comes up in Nagios.

An example of what these files could contain is sketched below.
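As an illustration, filling in these files could look like the following. Every value shown is a made-up example, and the heredocs overwrite the files, so if your copies already contain other variables, edit them instead; run this from the root of your provision copy:

cat > provision/ansible/vars/vars_common.yml <<'EOF'
# Example values only -- adapt to your cluster.
user: hadoop
master_hostname: cluster0
master_ip: 192.168.1.1
EOF

cat > provision/ansible/vars/vars_master.yml <<'EOF'
# Example values only -- MAC addresses, module names and paths are placeholders.
mac_eth0: "00:11:22:33:44:55"
mac_eth1: "00:11:22:33:44:66"
modprobe_eth0: e1000e
modprobe_eth1: r8169
cluster_repository: /home/hadoop/quick_cluster
nagios_admin_user: nagiosadmin
nagios_admin_pass: changeme
nagios_admin_email: admin@example.com
EOF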
5.1. Launch the Ansible playbook located in the `provision/ansible/` folder:
cd ~/provision/ansible/
ansible-playbook playbook_master.yml -K
5.2. When finished, reboot.
(If you decide to install a different Ubuntu distribution, or choose not to set up RAID 1 devices, just skip to the next step.)
As in the master node case, we assume two identical hard disks (`/dev/sd[ab]`) which will be used entirely for the new install. Follow the instructions of the Ubuntu installer to create the RAID 1 partition.
2.1. Copy or download the `$/provision/slave/prepovision_slave.sh` script.
2.2. Execute it as sudo:
sudo ./prepovision_slave.sh <host_name>
(e.g., use cluster1..X as hostname)
2.3. Once it finishes, there should be a `/tmp/provision/` folder. Check the `provision/ansible/vars` folder just to make sure everything is OK.
2.4. Execute the slave playbook inside the `/tmp/provision` folder:
cd /tmp/provision/ansible/
ansible-playbook playbook_slave.yml -K
2.5. When finished, check that the script `/etc/init.d/get_init_slave.sh` exists (a quick check is sketched below).
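A quick way to check:

ls -l /etc/init.d/get_init_slave.sh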
2.6. Reboot the slave machine.
The signup process must be executed on the cluster master whenever a new slave is added to the cluster. It is responsible for recreating files on the master, such as `/etc/hosts`, and then pushing them to each slave. This process must be started once the newly added slave has been rebooted.
1.1. Open the file `$/provision/master/hostlist.yml`.
1.2. Make sure the existing entries are correct.
1.3. Create a new entry for the added slave and tag it with the role `slave` (a hypothetical example is sketched below).
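The exact schema of hostlist.yml is defined by the repository, so mirror the entries that are already there; a purely hypothetical new entry could look like this:

cat >> provision/master/hostlist.yml <<'EOF'
# Hypothetical entry -- copy the structure of the existing entries in hostlist.yml.
- hostname: cluster3
  ip: 192.168.1.4
  role: slave
EOF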
2.1. On the master machine, execute the signup playbook in the `provision/ansible` folder:
ansible-playbook playbook-signup.yml -K
2.2. The last part of the signup process will restart Nagios, Hadoop and HBase.
- There seem to be some problems in Ansible v2.1 related to the unarchive module, as can be seen in the related bug issue. If unarchive fails for some reason, please check your unarchive.py module (usually located in `/usr/lib/python2.7/dist-packages/ansible/modules/core/files/unarchive.py`) and make sure to replace line 532, `rc, out, err = self.module.run_command(cmd)`, with `rc, out, err = self.module.run_command(cmd, environ_update={'LC_ALL': 'C'})`.
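A hedged sketch of applying that one-line change from the shell; back up the file first, and confirm that line 532 is indeed the run_command call before editing, since the line number may differ between Ansible point releases:

sudo cp /usr/lib/python2.7/dist-packages/ansible/modules/core/files/unarchive.py{,.bak}
sed -n '532p' /usr/lib/python2.7/dist-packages/ansible/modules/core/files/unarchive.py   # confirm the line first
sudo sed -i "532s/run_command(cmd)/run_command(cmd, environ_update={'LC_ALL': 'C'})/" /usr/lib/python2.7/dist-packages/ansible/modules/core/files/unarchive.py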