Ambari Install v20
Ambari Install v20
Ambari Install v20
Table of Contents
Determine Stack Compatibility ................................................................................................... 6
Meet Minimum System Requirements ........................................................................................ 7
Hardware Recommendations .............................................................................................. 7
Operating Systems Requirements ...................................................................................... 7
Browser Requirements........................................................................................................ 8
Software Requirements....................................................................................................... 8
JDK Requirements .............................................................................................................. 9
Database Requirements ..................................................................................................... 9
Memory Requirements .......................................................................................................10
Package Size and Inode Count Requirements ...................................................................10
Check the Maximum Open File Descriptors .......................................................................10
Collect Information ....................................................................................................................11
Prepare the Environment ..........................................................................................................12
Check Existing Package Versions ......................................................................................12
Set Up Password-less SSH................................................................................................13
Set up Service User Accounts............................................................................................14
Using a Local Repository ..........................................................................................................17
Obtaining the Repositories .................................................................................................17
Setting Up a Local Repository ............................................................................................21
Download the Ambari Repo ...............................................................................................26
Set Up the Ambari Server ..................................................................................................31
Next Steps .........................................................................................................................33
Start the Ambari Server......................................................................................................33
Next Steps .........................................................................................................................34
Install, Configure and Deploy a HDP Cluster .............................................................................35
Log In to Apache Ambari .......................................................................................................35
Launching the Ambari Install Wizard ..................................................................................36
Name Your Cluster ................................................................................................................36
Select Stack ..........................................................................................................................36
Install Options........................................................................................................................38
Confirm Hosts........................................................................................................................39
Choose Services ...................................................................................................................39
This section describes the information and materials you should get ready to install a HDP
cluster using Ambari. Ambari provides an end-to-end management and monitoring solution for
your HDP cluster. Using the Ambari Web UI and REST APIs, you can deploy, operate, manage
configuration changes, and monitor services for all nodes in your cluster from a central point.
Collect Information
1.7.0
1.6.1
1.6.0
1.5.1
1.5.0
1.4.4.23
1.4.3.38
1.4.2.104
1.4.1.61
1.4.1.25
1.2.5.17
Installing Accumulo, Hue, and Solr services, see Installing HDP Manually.
Hardware Recommendations
Browser Requirements
Software Requirements
JDK Requirements
Database Requirements
Hardware Recommendations
There is no single hardware requirement set for installing Hadoop.
For more information about hardware components that may affect your installation, see
Hardware Recommendations For Apache Hadoop.
CentOS v6.x
The installer pulls many packages from the base OS repositories. If you do not
have a complete set of base OS repositories available to all your machines at the
time of installation you may run into issues.
If you encounter problems with base OS repositories being unavailable, please
contact your system administrator to arrange for these additional repositories to be
proxied or mirrored. For more information see Using a Local Repository.
Browser Requirements
The Ambari Install Wizard runs as a browser-based Web application. You must have a machine
capable of running a graphical browser to use this tool. The minimum required browser versions
are:
Windows (Vista, 7, 8)
o
Firefox 18
Google Chrome 26
Firefox 18
Safari 5
Google Chrome 26
Firefox 18
Google Chrome 26
On any platform, we recommend updating your browser to the latest stable version.
Software Requirements
On each of your hosts:
apt (Ubuntu)
python v2.6
The Python version shipped with SUSE 11, 2.6.0-8.12.2, has a critical bug that may
cause the Ambari Agent to fail within the first 24 hours. If you are installing on
SUSE 11, please update all your hosts to Python version 2.6.8-0.15.1.
Python v2.7.9 or later is not supported due to changes in how Python performs
certificate validation.
JDK Requirements
The following Java runtime environments are supported:
OpenJDK 7 64-bit (not supported on SLES) To install OpenJDK 7 for RHEL, run the
following command on all hosts:
yum install java-1.7.0-openjdk
Database Requirements
Ambari requires a relational database to store information about the cluster configuration and
topology. If you install HDP Stack with Hive or Oozie, they also require a relational database.
The following table outlines these database requirements:
Component
Description
Ambari
By default, will install an instance of PostgreSQL on the Ambari Server host. Optionally, to use an
existing instance of PostgreSQL, MySQL or Oracle. For further information, see Using Non-Default
Databases - Ambari.
Hive
By default (on RHEL/CentOS/Oracle Linux 6), Ambari will install an instance of MySQL on the Hive
Metastore host. Otherwise, you need to use an existing instance of PostgreSQL, MySQL or Oracle.
See Using Non-Default Databases - Hive for more information.
Oozie
By default, Ambari will install an instance of Derby on the Oozie Server host. Optionally, to use an
existing instance of PostgreSQL, MySQL or Oracle, see Using Non-Default Databases - Oozie for
more information.
For the Ambari database, if you use an existing Oracle database, make sure the
Oracle listener runs on a port other than 8080 to avoid conflict with the default
Ambari port. Alternatively, refer to the Ambari Reference Guide for information on
Changing the Default Ambari Server Port.
Memory Requirements
The Ambari host should have at least 1 GB RAM, with 500 MB free.
The Ambari Metrics Collector host should have the following memory and disk space available
based on cluster size:
Number of hosts Memory Available Disk Space
1
1024
10 GB
10
1024
20 GB
50
2048
50 GB
100
4096
100 GB
300
4096
100 GB
500
8096
200 GB
1000
12288
200 GB
2000
16384
500 GB
Inodes
Ambari Server
100MB 5,000
Ambari Agent
8MB
225MB 4,000
1MB
100
100
N/A
4,000
N/A
500
N/A
200
1,000
10
Collect Information
Before deploying an HDP cluster, you should collect the following information:
The fully qualified domain name (FQDN) of each host in your system. The Ambari install
wizard supports using IP addresses. You can use hostname -f to check or verify the
FQDN of a host.
Deploying all HDP components on a single host is possible, but is appropriate only
for initial evaluation purposes. Typically, you set up at least three hosts; one master
host and two slaves, as a minimum cluster. For more information about deploying
HDP components, see the descriptions for a Typical Hadoop Cluster.
The base directories you want to use as mount points for storing:
o
NameNode data
DataNodes data
Oozie data
11
Check DNS
Configure iptables
None
SLES 11
Component - Description
None
UBUNTU 12
Component - Description
zlibc_0.9k-4.1_amd64
12
libffi 3.0.5-1.el5, python26 2.6.8-2.el5, python26-libs 2.6.82.el5, postgresql 8.4.13-1.el6_3, postgresql-libs 8.4.13-1.el6_3,
postgresql-server 8.4.13-1.el6_3
Generate public and private SSH keys on the Ambari Server host.
ssh-keygen
Copy the SSH Public Key (id_rsa.pub) to the root account on your target hosts.
.ssh/id_rsa .ssh/id_rsa.pub
Add the SSH Public Key to the authorized_keys file on your target hosts.
cat id_rsa.pub >> authorized_keys
Depending on your version of SSH, you may need to set permissions on the .ssh
directory (to 700) and the authorized_keys file in that directory (to 600) on the target
hosts.
chmod 700 ~/.ssh chmod 600 ~/.ssh/authorized_keys
From the Ambari Server, make sure you can connect to each host in the cluster using
SSH, without having to enter a password.
ssh root@<remote.target.host> where <remote.target.host> has the
value of each host name in your cluster.
If the following warning message displays during your first connection: Are you sure
you want to continue connecting (yes/no)? Enter Yes.
Retain a copy of the SSH Private Key on the machine from which you will run the webbased Ambari Install Wizard.
13
It is possible to use a non-root SSH account, if that account can execute sudo
without entering a password.
Using a text editor, open the hosts file on every host in your cluster. For example:
vi /etc/hosts
Add a line for each host in your cluster. The line should consist of the IP address and the
FQDN. For example:
1.2.3.4 <fully.qualified.domain.name>
14
Do not remove the following two lines from your hosts file. Removing or editing the
following lines may cause various programs that require network functionality to fail.
127.0.0.1 localhost.localdomain localhost ::1
localhost6.localdomain6 localhost6
Use the "hostname" command to set the hostname on each host in your cluster. For
example:
hostname <fully.qualified.domain.name>
Using a text editor, open the network configuration file on every host and set the desired
network configuration for each host. For example:
vi /etc/sysconfig/network
Modify the HOSTNAME property to set the fully qualified domain name.
NETWORKING=yes NETWORKING_IPV6=yes HOSTNAME=<fully.qualified.domain.name>
Configuring iptables
For Ambari to communicate during setup with the hosts it deploys to and manages, certain ports
must be open and available. The easiest way to do this is to temporarily disable iptables, as
follows:
chkconfig iptables off /etc/init.d/iptables stop
You can restart iptables after setup is complete. If the security protocols in your environment
prevent disabling iptables, you can proceed with iptables enabled, if all required ports are open
and available. For more information about required ports, see Configuring Network Port
Numbers.
Ambari checks whether iptables is running during the Ambari Server setup process. If iptables is
running, a warning displays, reminding you to check that required ports are open and available.
The Host Confirm step in the Cluster Install Wizard also issues a warning for each host that has
iptables running.
15
You must temporarily disable SELinux for the Ambari setup to function. On each host in
your cluster,
setenforce 0
UMASK (User Mask or User file creation MASK) sets the default permissions or base
permissions granted when a new file or folder is created on a Linux machine. Most Linux
distros set 022 as the default umask value. A umask value of 022 grants read, write,
execute permissions of 755 for new files or folders. A umask value of 027 grants read,
write, execute permissions of 750 for new files or folders. Ambari supports a umask
value of 022 or 027. For example, to set the umask value to 022, run the following
command as root on all hosts, vi /etc/profile then, append the following line:
umask 022
16
No Internet Access
Ambari Repositories
Ambari Repositories
If you do not have Internet access for setting up the Ambari repository, use the link appropriate
for your OS family to download a tarball that contains the software.
17
SLES 11
wget -nv http://public-repo1.hortonworks.com/ambari/suse11/2.x/updates/2.0.0/ambari.repo
UBUNTU 12
wget -nv http://public-repo1.hortonworks.com/ambari/ubuntu12/2.x/updates/2.0.0/ambari.list
RHEL/CentOS/ORACLE Linux 5 (DEPRECATED)
wget -nv http://public-repo1.hortonworks.com/ambari/centos5/2.x/updates/2.0.0/ambari.repo
HDP Stack Repositories
If you do not have Internet access to set up the Stack repositories, use the link appropriate for
your OS family to download a tarball that contains the HDP Stack version you plan to install.
18
SLES 11
wget -nv http://public-repo-1.hortonworks.com/HDP/suse11sp3/HDP2.1.10.0-suse11sp3-rpm.tar.gz
wget -nv http://public-repo-1.hortonworks.com/HDP-UTILS1.1.0.19/repos/suse11/HDP-UTILS-1.1.0.19-suse11.tar.gz
UBUNTU 12
wget -nv http://public-repo-1.hortonworks.com/HDP/ubuntu12/HDP2.1.10.0-ubuntu12-tars-tarball.tar.gz
wget -nv http://public-repo-1.hortonworks.com/HDP-UTILS1.1.0.19/repos/ubuntu12/
RHEL/CentOS/ORACLE Linux 5 (DEPRECATED)
wget -nv http://public-repo-1.hortonworks.com/HDP/centos5/HDP2.1.10.0-centos5-rpm.tar.gz
wget -nv http://public-repo-1.hortonworks.com/HDP-UTILS1.1.0.19/repos/centos5/HDP-UTILS-1.1.0.17-centos5.tar.gz
19
SLES 11SP3
wget -nv http://public-repo1.hortonworks.com/HDP/suse11sp3/2.x/updates/2.2.4.2/hdp.repo -O
/etc/zypp/repos.d/HDP.repo
UBUNTU 12
wget -nv http://public-repo1.hortonworks.com/HDP/ubuntu12/2.x/updates/2.2.4.2/hdp.list -O
/etc/apt/sources.list.d/HDP.list
RHEL/CentOS/ORACLE Linux 5 (DEPRECATED)
wget -nv http://public-repo1.hortonworks.com/HDP/centos5/2.x/updates/2.2.4.2/hdp.repo -O
/etc/yum.repos.d/HDP.repo
20
RHEL/CentOS/ORACLE 5 (DEPRECATED)
wget -nv http://public-repo1.hortonworks.com/HDP/centos5/2.x/updates/2.0.13.0/hdp.repo -O
/etc/yum.repos.d/HDP.repo
No Internet Access
This option involves downloading the repository tarball, moving the tarball to the selected
mirror server in your cluster, and extracting to create the repository.
Both options proceed in a similar, straightforward way. Setting up for each option presents some
key differences, as described in the following sections:
Select an existing server in, or accessible to the cluster, that runs a supported operating
system.
Enable network access from all hosts in your cluster to the mirror server.
Ensure the mirror server has a package manager installed such as yum (RHEL /
CentOS / Oracle Linux), zypper (SLES), or apt-get (Ubuntu).
Optional: If your repository has temporary Internet access, and you are using
RHEL/CentOS/Oracle Linux as your OS, install yum utilities:
yum install yum-utils createrepo
On the mirror server, install an HTTP server (such as Apache httpd) using the
instructions provided here .
Ensure that any firewall settings allow inbound HTTP access from your cluster
nodes to your mirror server.
21
If you are using Amazon EC2, make sure that SELinux is disabled.
For SLES:
mkdir -p /srv/www/htdocs/rpms
For Ubuntu:
mkdir -p /var/www/html/
If you are using a symlink, enable the followsymlinks on your web server.
After you have completed the steps in Getting Started Setting up a Local
Repository, move on to specific setup for your repository internet access type..
Obtain the tarball for the repository you would like to create. For options, see Obtaining
the Repositories.
Copy the repository tarballs to the web server directory and untar.
o
22
Repository Content
Ambari Repository
Repository Location
Untar under <web.server.directory>
Repository
URL
http://<web.server>/ambari/<OS>/2.x/updates/2.0.0
http://<web.server>/hdp/HDP/<OS>/2.x/updates/<latest.version>
where <web.server> = FQDN of the web server host, and <OS> is centos5, centos6,
sles11, or ubuntu12.
Be sure to record these Base URLs. You will need them when installing Ambari and
the cluster.
Optional: If you have multiple repositories configured in your environment, deploy the
following plug-in on all the nodes in your cluster.
o
Put the repository configuration files for Ambari and the Stack in place on the host. For
options, see Obtaining the Repositories.
23
For SLES:
zypper repos
For Ubuntu:
dpkg -list
For Ambari:
createrepo <web.server.directory>/ambari/<OS>/Updates-ambari2.0.0
Repository
URL
http://<web.server>/ambari/<OS>/Updates-ambari-2.0.0
http://<web.server>/hdp/<OS>/HDP-<latest.version>
24
Repository
URL
where <web.server> = FQDN of the web server host, and <OS> is centos5, centos6,
sles11, or ubuntu12.
Be sure to record these Base URLs. You will need them when installing Ambari and
the Cluster.
Optional. If you have multiple repositories configured in your environment, deploy the
following plug-in on all the nodes in your cluster.
o
Download the ambari.repo file from the mirror server you created in the preceding
sections or from the public repository.
o
where <web.server> = FQDN of the web server host, and <OS> is CENTOS6,
SLES11, or UBUNTU12.
Edit the ambari.repo file using the Ambari repository Base URL obtained when setting up
your local repository. Refer to step 3 in Setting Up a Local Repository with No Internet
Access, or step 5 in Setting Up a Local Repository with Temporary Internet Access, if
necessary.
Repository
URL
25
where <web.server> = FQDN of the web server host, and <OS> is CENTOS6, SLES11,
or UBUNTU12.
Place the ambari.repo file on the machine you plan to use for the Ambari Server.
For RHEL/CentOS/Oracle Linux:
/etc/yum.repos.d/ambari.repo
For SLES:
/etc/zypp/repos.d/ambari.repo
For Ubuntu:
/etc/apt/sources.list.d/ambari.list
Log in to your host as root. You may sudo as su if your environment requires such
access. For example, type:
<username> ssh <hostname.FQDN> sudo su - where <username> is your user name and
<hostname.FQDN> is the fully qualified domain name of your server host.
26
Do not modify the ambari.repo file name. This file is expected to be available on
the Ambari Server host during Agent registration.
You should see values similar to the following for Ambari repositories in the list.
Version values vary, depending on the installation.
repo id
repo name
Status
Base
CentOS-6 - Base
6,518
Extras
CentOS-6 - Extras
15
Updates
Install the Ambari bits. This also installs the default PostgreSQL Ambari database.
yum install ambari-server
27
Accept the warning about trusting the Hortonworks GPG Key. That key will be
automatically downloaded and used to validate packages from Hortonworks. You
will see the following message:
Importing GPG key 0x07513CAD: Userid: "Jenkins (HDP Builds)
<jenkin@hortonworks.com>" From :
http://s3.amazonaws.com/dev.hortonworks.com/ambari/centos6/RPM-GPGKEY/RPM-GPG-KEY-Jenkins
SLES 11
On a server host that has Internet access, use a command line editor to perform the following
steps:
Log in to your host as root. You may sudo as su if your environment requires such
access. For example, type:
Download the Ambari repository file to a directory on your installation host. wget -nv
http://public-repo1.hortonworks.com/ambari/suse11/2.x/updates/2.0.0/ambari.repo -O
/etc/zypp/repos.d/ambari.repo
Do not modify the ambari.repo file name. This file is expected to be available on
the Ambari Server host during Agent registration.
Name
Ambari 2.x
Enabled Refresh
Yes
http-demeter.uni-regensburg.de- SUSE-Linux-Enterprise-Software-Development-KitYes
c997c8f9
11-SP1 11.1.1-1.57
Yes
Opensuse
Yes
OpenSuse
Yes
No
28
UBUNTU 12
On a server host that has Internet access, use a command line editor to perform the following
steps:
Log in to your host as root. You may sudo as su if your environment requires such
access.
For example, type:
<username> ssh <hostname.FQDN> sudo su
where <username> is your user name and <hostname.FQDN> is the fully qualified
domain name of your server host.
Do not modify the ambari.list file name. This file is expected to be available on
the Ambari Server host during Agent registration.
Confirm that Ambari packages downloaded successfully by checking the package name
list.
apt-cache pkgnames
29
Name
Log in to your host as root. You may sudo as su if your environment requires such
access. For example, type:
<username> ssh <hostname.FQDN> sudo su
where <username> is your user name and <hostname.FQDN> is the fully qualified
domain name of your server host.
repo Name
status
Base
CentOS-5 Base
3,667
Epel
Puppet
Puppet
433
Updates
CentOS-5 Updates
118
30
When deploying HDP on a cluster having limited or no Internet access, you should
provide access to the bits using an alternative method.
Ambari Server by default uses an embedded PostgreSQL database. When you install the
Ambari Server, the PostgreSQL packages and dependencies must be available for install.
These packages are typically available as part of your Operating System repositories. Please
confirm you have the appropriate repositories available for the postgresql-server packages.
If you have not temporarily disabled SELinux, you may get a warning. Accept the default
(y), and continue.
By default, Ambari Server runs under root. Accept the default (n) at the Customize
user account for ambari-server daemon prompt, to proceed as root. If you
want to create a different user to run the Ambari Server, or to assign a previously
created user, select y at the Customize user account for ambari-server
daemon prompt, then provide a user name.
If you have not temporarily disabled iptables you may get a warning. Enter y to
continue.
Accept the Oracle JDK license when prompted. You must accept this license to
download the necessary JDK from Oracle. The JDK is installed during the deploy phase.
31
If you are using an existing PostgreSQL, MySQL, or Oracle database instance, use one
of the following prompts:
You must prepare a non-default database instance, using the steps detailed in
Using Non-Default Databases-Ambari, before running setup and entering advanced
database configuration.
o
To use an existing Oracle 11g r2 instance, and select your own database name,
user name, and password for that database, enter 2.
Select the database you want to use and provide any information requested at
the prompts, including host name, port, Service Name or SID, user name, and
password.
To use an existing MySQL 5.x database, and select your own database name,
user name, and password for that database, enter 3.
Select the database you want to use and provide any information requested at
the prompts, including host name, port, database name, user name, and
password.
To use an existing PostgreSQL 9.x database, and select your own database
name, user name, and password for that database, enter 4.
Select the database you want to use and provide any information requested at the
prompts, including host name, port, database name, user name, and password.
Setup completes.
If your host accesses the Internet through a proxy server, you must configure
Ambari Server to use this proxy server. See How to Set Up an Internet Proxy
Server for Ambari for more information.
32
Setup Options
The following table describes options frequently used for Ambari Server setup.
Option
Description
Specifies the JAVA_HOME path to use on the Ambari Server and all hosts in the cluster. By default
when you do not specify this option, Ambari Server setup downloads the Oracle JDK 1.7 binary and
accompanying Java Cryptography Extension (JCE) Policy Files to /var/lib/ambari-server/resources.
Ambari Server then installs the JDK to /usr/jdk64.
-j (or -javahome)
Use this option when you plan to use a JDK other than the default Oracle JDK 1.7. See JDK
Requirements for more information on the supported JDKs. If you are using an alternate JDK, you must
manually install the JDK on all hosts and specify the Java Home path during Ambari Server setup. If
you plan to use Kerberos, you must also install the JCE on all hosts.
This path must be valid on all hosts. For example: ambari-server setup j
/usr/java/default
--jdbcdriver
Should be the path to the JDBC driver JAR file. Use this option to specify the location of the JDBC
driver JAR and to make that JAR available to Ambari Server for distribution to cluster hosts during
configuration. Use this option with the --jdbc-db option to specify the database type.
--jdbc-db
Specifies the database type. Valid values are: [postgres | mysql | oracle] Use this option with the --jdbcdriver option to specify the location of the JDBC driver JAR file.
-s (or -silent)
-v (or -verbose)
Prints verbose info and warning messages to the console during Setup.
-g (or -debug)
Next Steps
Start the Ambari Server
33
If If you plan to use an existing database instance for Hive or for Oozie, you must
complete the preparations described in Using Non-Default Databases-Hive and
Using Non-Default Databases-Oozie before installing your Hadoop cluster.
Next Steps
Install, configure and deploy an HDP cluster
34
Select Stack
Install Options
Confirm Hosts
Choose Services
Assign Masters
Customize Services
Review
Complete
Log in to the Ambari Server using the default user name/password: admin/admin. You
can change these credentials later.
For a new cluster, the Ambari install wizard displays a Welcome page from which you
launch the Ambari Install wizard.
35
In Name your cluster, type a name for the cluster you want to create. Use no white
spaces or special characters in the name.
Choose Next.
Select Stack
The Service Stack (the Stack) is a coordinated and tested set of HDP components. Use a radio
button to select the Stack version you want to install. To install an HDP 2x stack, select the HDP
2.2, HDP 2.1, or HDP 2.0 radio button.
36
Expand Advanced Repository Options to select the Base URL of a repository from which Stack
software packages download. Ambari sets the default Base URL for each repository, depending
on the Internet connectivity available to the Ambari server host, as follows:
For an Ambari Server host having Internet connectivity, Ambari sets the repository Base
URL for the latest patch release for the HDP Stack version. For an Ambari Server having
NO Internet connectivity, the repository Base URL defaults to the latest patch release
version available at the time of Ambari release.
You can override the repository Base URL for the HDP Stack with an earlier patch
release if you want to install a specific patch release for a given HDP Stack version. For
example, the HDP 2.1 Stack will default to the HDP 2.1 Stack patch release 7, or HDP2.1.7. If you want to install HDP 2.1 Stack patch release 2, or HDP-2.1.2 instead, obtain
the Base URL from the HDP Stack documentation, then enter that location in Base URL.
If you are using a local repository, see Using a Local Repository for information about
configuring a local repository location, then enter that location as the Base URL instead
of the default, public-hosted HDP Stack repositories.
37
The UI displays repository Base URLs based on Operating System Family (OS
Family). Be sure to set the correct OS Family based on the Operating System you
are running. The following table maps the OS Family to the Operating Systems.
OS Family
Operating Systems
redhat6
suse11
ubuntu12
redhat5
Install Options
In order to build up the cluster, the install wizard prompts you for general information about how
you want to set it up. You need to supply the FQDN of each of your hosts. The wizard also
needs to access the private key file you created in Set Up Password-less SSH. Using the host
names and key file information, the wizard can locate, access, and interact securely with all
hosts in the cluster.
Use the Target Hosts text box to enter your list of host names, one per line. You can
use ranges inside brackets to indicate larger sets of hosts. For example, for
host01.domain through host10.domain use host[01-10].domain
If you are deploying on EC2, use the internal Private DNS host names.
If you want to let Ambari automatically install the Ambari Agent on all your hosts using
SSH, select Provide your SSH Private Key and either use the Choose File
button in the Host Registration Information section to find the private key file
that matches the public key you installed earlier on all your hosts or cut and paste the
key into the text box manually.
If you are using IE 9, the Choose File button may not appear. Use the text box to
cut and paste your private key manually.
Fill in the user name for the SSH key you have selected. If you do not want to use root
, you must provide the user name for an account that can execute sudo without entering
a password.
If you do not want Ambari to automatically install the Ambari Agents, select Perform
manual registration. For further information, see Installing Ambari Agents
Manually.
38
Confirm Hosts
Confirm Hosts prompts you to confirm that Ambari has located the correct hosts for your
cluster and to check those hosts to make sure they have the correct directories, packages, and
processes required to continue the install.
If any hosts were selected in error, you can remove them by selecting the appropriate
checkboxes and clicking the grey Remove Selected button. To remove a single host, click the
small white Remove button in the Action column.
At the bottom of the screen, you may notice a yellow box that indicates some warnings were
encountered during the check process. For example, your host may have already had a copy of
wget or curl. Choose Click here to see the warnings to see a list of what was
checked and what caused the warning. The warnings page also provides access to a python
script that can help you clear any issues you may encounter and let you run Rerun Checks.
If you are deploying HDP using Ambari 1.4 or later on RHEL 6.5 you will likely see
Ambari Agents fail to register with Ambari Server during the Confirm Hosts step
in the Cluster Install wizard. Click the Failed link on the Wizard page to display
the Agent logs. The following log entry indicates the SSL connection between the
Agent and Server failed during registration:
INFO 2014-04-02 04:25:22,669 NetUtil.py:55 - Failed to connect to
https://<ambari-server>:8440/cert/ca due to [Errno 1] _ssl.c:492:
error:100AE081:elliptic curve
routines:EC_GROUP_new_by_curve_name:unknown group
For more information about this issue, see the Ambari Troubleshooting Guide.
When you are satisfied with the list of hosts, choose Next.
Choose Services
Based on the Stack chosen during Select Stack, you are presented with the choice of Services
to install into the cluster. HDP Stack comprises many services. You may choose to install any
other available services now, or to add services later. The install wizard selects all available
services for installation by default.
Choose none to clear all selections, or choose all to select all listed services.
39
Assign Masters
The Ambari install wizard assigns the master components for selected services to appropriate
hosts in your cluster and displays the assignments in Assign Masters. The left column shows
services and current hosts. The right column shows current master component assignments by
host, indicating the number of CPU cores and amount of RAM installed on each host.
To change the host assignment for a service, select a host name from the drop-down
menu for that service.
To remove a ZooKeeper instance, click the green minus icon next to the host address
you want to remove.
Use all or none to select all of the hosts in the column or none of the hosts, respectively.
If a host has a red asterisk next to it, that host is also running one or more master
components. Hover your mouse over the asterisk to see which master components are
on that host.
40
Customize Services
Customize Services presents you with a set of tabs that let you manage configuration
settings for HDP components. The wizard sets reasonable defaults for each of the options here,
but you can use this set of tabs to tweak those settings. You are strongly encouraged to do so,
as your requirements may be slightly different. Pay particular attention to the directories
suggested by the installer.
To prevent out-of-memory errors during the install, at the Customize Services step in the Cluster
Install wizard browse to Hive > hive-site.xml, then modify the following configuration
settings:
Property Name
Purpose
true
true
False
Review
The assignments you have made are displayed. Check to make sure everything is correct. If
you need to make changes, use the left navigation bar to return to the appropriate screen.
To print your information for later reference, choose Print.
When you are satisfied with your choices, choose Deploy.
41
Complete
The Summary page provides you a summary list of the accomplished tasks. Choose
Complete. Ambari Web GUI displays.
42