SAP HANA Smart Data Access Using Hadoop Hive PDF
SAP HANA Smart Data Access Using Hadoop Hive PDF
SAP HANA Smart Data Access Using Hadoop Hive PDF
By
Debajit Banerjee
Table of Contents
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 1
Steps/Procedure :
Hadoop Installation
Data Load in Hadoop system
Activities on Unstructured Data in Hadoop system
ODBC Driver installation & configuration on HANA Server for Hadoop system data access
Smart Data Access in SAP HANA (through SAP HANA Studio), using HADOOP as a remote data source
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 2
In Linux
In Windows
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 3
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 4
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 5
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 6
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 7
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 8
i.
ii.
iii.
iv.
v.
vi.
Open the Control Panel -> System pane and click on Advanced system
settings.
Click on the Advanced tab.
Click the Environment Variables button.
Under System variables, click New.
Enter the Variable Name as JAVA_HOME.
Enter the Variable Value, as the installation path for the Java Development Kit.
For example, if your JDK is installed at C:\Java\jdk1.6.0_31, then you must
provide this path to the Variable Value.
vii.
viii.
Click OK.
Click OK to close the Environment Variables dialog box.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 9
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 10
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 11
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 12
II. HDP 1.3 for Windows (Hortonworks Data Platform) Standalone Installation
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 13
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 14
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 15
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 16
Services were not started due to 0 bytes in .xml files(master & regionserver)
Also rest/thrift/thrift2.xml are also of zero bytes.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 17
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 18
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 19
Hadoop Smoketest
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 20
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 21
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 22
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 23
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 24
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 25
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 26
It canperfect !!
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 27
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 28
After refresh
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 29
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 30
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 31
Click on user
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 32
If one can click on .out file, then one can see the part file
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 33
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 34
Created a table called debajit_wc for wordcount part file. But right now, it is empty.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 35
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 36
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 37
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 38
Download the license file from email and deployed. Problem solved.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 39
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 40
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 41
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 42
SIMBA Driver
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 43
UNIXODBC
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 44
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 45
So, now the connection is working between HANA Server and Hadoop system from OS level.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 46
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 47
So, now the connection is working between HANA Server and Hadoop system from SAP HANA Studio.
Creating a schema in HP7
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 48
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 49
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 50
One can do Query and Connection Monitoring when click on Smart Data Access under Provisioning.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 51
Thats all.
http://debajitb.wix.com/debajitbanerjee | http://debajitb.wix.com/debajitbanerjee/apps/blog
Page 52