Asterisk Cluster
Asterisk Cluster
Asterisk Cluster
3 1 of 16
Asterisk-Cluster based on
Trixbox 1.2.3
1. Scenario
This tutorial will help you to build a cluster with two Asterisk server, two Redfone FoneBridges with one PRI (E1) each.
The idea behind this cluster is if one of the server’s and one of the FoneBridge’s fails the user is still able to conduct
phone calls.
The fail-over will happen automatically within a couple of seconds. The Asterisk servers are in an active-passive cluster
relationship.
• Server 1 = asteriskmaster.local
• Server 2 = asteriskslave.local
Everything written on the Master or Slave server will be immediately replicated to the other machine. Therefore if you
cerate a extension on the master the extension will be automatically “created” on the slave, and so on…
For this synchronization we will use DRBD, which will help us to build a network RAID 1.
To monitor our servers if there are up and running we will use Heartbeat, a Linux tool to build high availability systems.
In our scenario each of the Asterisk servers has two network cards (NIC’s). The second network card (ETH1) on both
servers is connected to a switch. This NIC’s will handle the normal traffic between IP phones and the Asterisk server.
• asteriskmaster.local: 192.168.0.200
• asteriskslave.local: 192.168.0.201
Asterisk Cluster based on Trixbox 1.2.3 3 of 16
We will need a third IP address in the 192.168.0.201.xxx range this will be our floating or virtual IP-Address. Our
floating IP-Address is:
• 192.168.0.201.203
• The currently active Asterisk server will automatically bind the floating IP-Address to it’s ETH1. All IP-phones
will point to this floating IP-Address, or in a real setup your DNS-Server need’s to resolve to this IP. Let’s say
your phones pointing to talk.mycompany.com as register server, then talk.mycompany.com needs to resolve to
192.168.0.203
The two other NIC’s (ETH0) on both Asterisk servers are connected via crossover cable. Please be aware that if you
use 100Mbit NIC’s you effective shared drive speed will not be more than 10 MBit/s. So if you like to synchronize a
large RAID 1 it can initially take a long while to do that. I recommend that you use GBit NIC’s for the DRBD
synchronization.
Assumptions:
For our synchronization we need a additional dedicated partition, therefore we need to crate a new one and
re-partition our current systems (AsteriskMaster and AsteriskSlave).
• Enter linux rescue and press Enter. Choose the charset, answer “No” when asked about network
and SKIP when asked about previous installations.
A little remark: commands may differ depending on type of HDD. If you have SATA or SCSI drive installed,
then do as said. For IDE drive, substitute “hda”<->“sda” everywhere.
• Run e2fsck –f /dev/sda2 , to test the partition (you should not get any error). Don’t proceed if you get
errors, resolve the first!
• Now we resize the current sda2 partition. Run resize2fs –f –p /dev/sda2 3000M The “3000M” part
here is saying that you want to resize file system to the size of 3Gb. You should change that,
depending on your hard drive size. When deciding, you should remember:
o You don’t need swap bigger than 1Gb, and less than 512Mb.
o If you create too big DRBD partition it will take lots of time for full synchronization between
nodes. But if you create it too small, you may run out of disk space. Minimum size defined by
DRBD is 4Gb. You may want to leave it so. Maybe twice as big. It’s up to you to decide. I
would recommend you anything between 4 and 16 Gb.
So, the math is easy. For example, you have a 8.5Gb hard drive. You decide to create 512Mb of swap, 4Gb
DRBD and rest of the drive for main partition. So, main partition should be 4Gb. But due to bug(or feature)
in resize2fs(or fdisk?), multiply main partition size by 0.75 , and that will be our file system size. For our
example, you need to resize fs to 3000M.
Now we run run fdisk /dev/sda. Press “p” and enter. You should see the following output. Pls. write
down on which cylinder sda2 starts! In this case it’s cylinder 14.
• Run fsck –f /dev/sda2 It should not report any error. If it reports error about wrong block count, run
fdisk again, and recreate partition (a little bit bigger, remember there is a bug in resize2fs).
• Run resize2fs –f –p /dev/sda2 (without size, that will enlarge the file system size to maximum
allowed on that partition).
• Run fdisk /dev/sda again. Now, we’ll recreate swap partition and create one for DRBD.
o Press “n”
o Enter
o Press “p”
o Enter
o Press “3”
o Enter
o Enter
o Press “+768M” The size of the swap partition you want to create
o Enter
o Press “t’
o Enter
o Press “3”
o Enter
o Press “82”
o Enter
o Enter “n”
o Enter
o Enter “p”
o Enter
o Enter
o Enter
o Press “w”
o Enter
You can reboot your machines now, and boot back into “normal” mode.
Asterisk Cluster based on Trixbox 1.2.3 7 of 16
# mkdir ha
Download, and Unpack the downloaded files into this new ha directory.
# cd ha
# wget http://www.danielaliaman.com/blog/files/phonecube/cluster/onecpu/ha.tar.bz2
Attention: you must create the ha directory in the root directory of your servers, if not the installation
scripts will not work!
The scripts will only work with a Trixbox 1.2.3 installation. If you install on a later Trixbox or plain Asterisk
installation, you need to modify the scripts.
Before we can install these packages you need to do some “pre-flight” checks.
• ETH1 on each server should have personal IP addresses. It doesn’t really matter exactly what addresses,
but so you can access servers from outer network.
• ETH0 should be preconfigured too. Primary server – 10.10.10.10, secondary server 10.10.10.11
If you want other addresses, substitute them in drbd.conf file.
Attention: You need a “.” In your hostname. If you use just something like asteriskmaster sendmail will start
very very slowly, therefore your failover will take minutes instead of seconds!
• Make sure, that servers are accessible by each other. I.e. « ping 10.10.10.11 » from primary node, and «
ping 10.10.10.10 » from secondary node. And there should be no firewall on ETH0 interface. It’s not a
security treat, because ETH0 network is closed.
• If you have Gbit NICs installed as ETH0, change drbd.conf « rate » parameter to «120M».
• If you have sata or scsi drive – change nothing, if your hdd is IDE – replace «sda» with «hda» in drbd.conf
.(Currently, I’ve changed the drbd.conf , so the Primary servers hdd is /dev/sda, and secondary - /dev/hda).
• Run /root/ha/drbd script. On both servers. Script should not return any errors (If you don’t do that
simultaneously, DRBD may report that it’s waiting for peer node to appear. If timeout passes or you type
«yes» - it’s ok. But after running drbd.primary you will need to make sure that nodes synchronize).
• You can check DRBD now by running cat /proc/drbd . Both nodes are in secondary/inconsistent state now.
It’s ok.
• On the primary node run /root/ha/drbd.primary script now. It will take some time for it to finish. But no
errors should be reported either. (In my case, it reports lots of warnings about “timestamp in future”, but it’s
safe to ignore them).
• Wait. DRBD needs to synchronize disks, it may take hours depending on the size of DRBD partition and
speed of the network(on Gbit NICs should be pretty fast). You can check the process state by executing cat
/proc/drbd command.
7) When synchronization is finished, you’ll see the following on the primary server:
Asterisk Cluster based on Trixbox 1.2.3 9 of 16
• Run /root/ha/ha script from the package. On both servers. It will stop following services: mysql, sendmail,
asterisk, httpd, munin-node, ircd, xinetd, xplhub, vsftpd.
• After some time, heartbeat should bring services up on primary node. Check it by executing ps ax|grep
asterisk command. It should give output like:
If you run a ifconfig eth1:0 you should see the floating IP-address 192.168.0.203 bounf to the eth1 of the
primary server.
• Now, let’s test the cluster! Just turn power off the Primary node. In a couple of seconds all the services and
ethernet interface alias on Secondary node should be up. Now start Primary node. After it boots, all services
on Secondary node should go down, and come up on Primary node.
Make sure that your FoneBridge’s are isolated in a VLAN (at least for the production environment).
VLAN1
FoneBridge
Asterisk Servers
VLAN3
VLAN2
The main goal is to isolate the FoneBridge TDMoE-Traffic from the rest of the network.
To run TDMoE with the Redfone FoneBridge’s we need to patch a small bug in the kernel-source code.
# cd /usr/src/kernels/2.6.9-34.0.2.EL-i686/include/linux
(you need to adjust the path if you use multi cpu)
# mv spinlock.h spinlock.h.orig
http://www.danielaliaman.com/blog/files/phonecube/cluster/zaptel-1.2.12.tar.gz
Copy this file into the tmp directory and untar it.
# cd /usr/src/zaptel-1.2.12
# make clean
# make linux26
# make install
# modprobe Zaptel
# shutdown –r now
To make the FoneBrigdes boxes work, you need to do a couple of installation steps.
In a first step you need to download, compile and install the Fonulator software. This software will basically
send the configuration information (from the redfone.conf files) to the Redfone box. As soon the Redfone
box receives it’s configuration file it will start fonulating back to server.
The communication is based on TDMoE (Time Division Multiplexing over Ethernet) via MAC-addresses.
http://www.danielaliaman.com/blog/files/phonecube/cluster/fonulator-0.2.4d.tar.gz
#cd fonulator-0.2.4d
#CFLAGS=-static ./configure
#make clean
#make
#make install
This last step will place the fonulator in the /usr/local/bin/ directory
# cd /usr/local/bin
http://www.danielaliaman.com/blog/files/phonecube/cluster/fonulator
Asterisk Cluster based on Trixbox 1.2.3 13 of 16
# cd /etc/init.d
redfone1.conf
http://www.danielaliaman.com/blog/files/phonecube/cluster/redfone1.conf
redfone2.conf
http://www.danielaliaman.com/blog/files/phonecube/cluster/redfone2.conf
redfone1.conf
[globals]
fb1=00:50:C2:65:D1:62
fb2=00:50:C2:65:D1:63
server1=00:1A:64:11:3B:76
card=eth1,fb1
[span1]
span=1,0,0,ccs,hdb3,crc4
server1
fb1
pri
Change fb1 and fb2 to the MAC address written on the back of your first FoneBridge.
Change server1 to the MAC address of your eth1 network card
Attention: the server1 parameter on the slave will have different MAC address, therefore redfone1.conf
and redfone2.conf are different on the master and slave!
redfone2.conf
[globals]
Asterisk Cluster based on Trixbox 1.2.3 14 of 16
fb1=00:50:C2:65:D1:6A
fb2=00:50:C2:65:D1:6B
server1=00:1A:64:11:3B:76
card=eth1,fb1
[span1]
span=1,0,0,ccs,hdb3,crc4
server1
fb1
pri
Change fb1 and fb2 to the MAC address written on the back of your second FoneBridge.
Change server1 to the MAC address of your eth1 network card
Attention: the server1 parameter on the slave will have different MAC address, therefore redfone1.conf
and redfone2.conf are different on the master and slave!
The fonulator script in /etc/init.d will use this two config files to initialize the fonebridges.
http://www.danielaliaman.com/blog/files/phonecube/cluster/zaptel.conf
This zaptel.conf is configured for two fonebridges with one E1 line each.
zaptel.conf
# Global data
loadzone = us
defaultzone = us
Please adjust the MAC addresses according to your FoneBridge MAC addresses. You only need to configure
the MAC address of the FB1 interface of each FoneBridge.
http://www.danielaliaman.com/blog/files/phonecube/cluster/zapata.conf
# vi /etc/ha.d/haresources
This will start the fonulator script via the cluster software.
Power-Up the FoneBridge’s and make sure they are connected to the switch.
If the servers are restarted login into the master server and run the following
command:
# zttool
Asterisk Cluster based on Trixbox 1.2.3 16 of 16
You should see the two SPAN and the alarms should be cleared.