Quantcast
Channel: SCN : Document List - SAP HANA and In-Memory Computing
Viewing all articles
Browse latest Browse all 1183

SAP HANA SP100 SDA setup with Apache Hadoop

$
0
0

In my documentation I’ll explain how to setup and configure a SAP Hana SP10 SDA with Apache Hadoop. I will show in detail step and configuration point to achieve this it.

 

HANA revision 100 reserve a lot new features, refer to the following link for the complete list

SAP Hana SP10 what's new

 

In order execution

 

  • Apahce Hadooop installation
  • Setup Hana to consume Hadoop data
  • Connect SAP Hana studio to Hadoop for SDA
  • Manage Hadoop Cluster with Ambari

 

Guide used

Simba ODBC Driver for Apache Hive

HDP installation GuideHANA SDA guide


Note used

2165826 - SAP HANA Platform SPS 10 Release Note

2177918 - SAP HANA Hadoop Ambari Cockpit SP10


Link used

http://hortonworks.com/products/releases/hdp-2-0-ga/#installhttp://www.simba.com/connectors/apache-hadoop-hive-odbc

 

 

Architecture overview

7-5-2015 1-51-00 AM.jpg

Installation of Apache Hadoop

 

Apache Hadoop will be installed on our Windows environment, before installing the package, the following software needs to be installed as requirement :

  • Microsoft Visual C++ 2010 Redistributable Package (64 bit)
  • Oracle JDK 7 64-bit
  • Microsoft.NET framework 4.0
  • Python 2.7

 

Once the required software are installed download the latest version from the website (the current version is 2.3)

 

Open DOS and run "msiexec /lv d:\hdplog.txt /i "D:\Software\Hadoop\hdp-2.0.6.0.winpkg.msi" to launch the program3-21-2014 10-42-38 AM.jpg3-21-2014 10-43-31 AM.jpg

 

Choose Derby as DB flavor

3-21-2014 10-44-46 AM.jpg3-21-2014 10-59-39 AM.jpg

 

Open the command line shortcut and start the hadoop services

3-21-2014 11-00-51 AM.jpg3-21-2014 11-04-08 AM.jpg

 

All the services are running

3-21-2014 11-07-08 AM.jpg

 

Validate the installation by making a SmokeTest

3-21-2014 11-17-03 AM.jpg

 

Check the node status and the cluster status

7-4-2015 10-59-01 PM.jpg

Setup Hana to consume Hadoop Data

The Hadoop server is now up and running but before creating a connection from Hana, two ODBC drivers need to be download on the Hana server.

UnixODBC driver and SimbaODBC driver

 

 

UnixODBC driver can be download from the following website: http://www.unixodbc.org/3-20-2014 2-50-53 PM.jpg

 

Once both are downloaded, start by decompressing the SimbaODBC driver3-21-2014 11-29-06 AM.jpg

 

Use the command “gunzip” to remove the “gz” and use after the command “tar xvf” to decompress the tar file. The simba folder is the extracte. Do the same thing for unixODBC

3-21-2014 11-36-57 AM.jpg

 

Move the two folders at the root level

3-21-2014 11-38-41 AM.jpg

 

As <SID>adm user, move into the samba setup folder and copy the samba.hiveodbc.ini in the home directory, then do a VI and change parameters

3-21-2014 12-34-43 PM.jpg3-21-2014 12-43-58 PM.jpg

 

Now install the last version of unixODBC driver for Simba, from unixODBC folder run the following command:

1) ./configure

2) Make

3) Make install

3-21-2014 12-51-06 PM.jpg

 

Configure the classpathby creating customer.sh file using VI (~/.customer.sh) with the fallowing entry3-21-2014 1-32-19 PM.jpg

 

And create an odbc.ini file using VI (~/.odbc.ini) with the following entry:

  • DSN name
  • Driver location
  • Host ip of the Hadoop server
  • Port to use for Hiveserver(default)
  • Hive server type
  • Athentication method
  • User for authentication
  • Password for user auth.

3-21-2014 2-25-31 PM.jpg

 

And link this file to the customer.sh file created before by adding the following line:
export ODBCINI=$HOME/.odbc.ini

 

Do a test connection from Hana server to Hadoopby running : isql –v HIVE

3-21-2014 2-13-56 PM.jpg

 

The license for samba driver needs to be installed once it’s done do the test again

3-21-2014 2-18-16 PM.jpg

 

Successfully connected

3-21-2014 2-26-53 PM.jpg

 

Do a “show tables “ to make sure we are on the right system

3-21-2014 2-34-57 PM.jpg

 

 

Connect SAP Hana to Hadoop for SDA

 

In provisioning, choose create remote source and create a new

3-21-2014 8-50-00 PM.jpg

 

Fill up all the required information

3-21-2014 8-52-09 PM.jpg

 

Refresh the remote source panel

3-21-2014 8-53-34 PM.jpg

 

The connection is made and can see the tables available

 

 

 

Manage Hadoop cluster with Ambari

 

Stating SP10, HANA provide a new delivery unit which allow to manage your Hadoop cluster over Ambari

"HANAHADOOPAMBR10_0-80001012.zip"

7-5-2015 12-16-09 AM.jpg

 

Once upload, the new role needs to be assign

7-5-2015 12-49-56 AM.jpg

 

and the application is available in the catalogue

7-5-2015 12-41-17 AM.jpg

 

Access it and provide the necessary information

7-5-2015 12-53-06 AM.jpg

 

and access the cockpit

7-5-2015 1-30-55 AM.jpg

 

The simple Hadoop connection over SDA is done.

 

Williams


Viewing all articles
Browse latest Browse all 1183

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>