Josephreddy

- May 09, 2019

RapidMiner and SAP HANA DB integration:

Objective:

If you have an SAP HANA database that stores your enterprise transactional data and you want to apply predictive or machine learning algorithms to its tables or views, this blog gives you the steps to connect to SAP HANA from RapidMiner, retrieve data from tables, views, or procedures, and apply RapidMiner statistical algorithms or machine learning techniques to gain insights from the data.

Background and use case:

RapidMiner is a data science platform for teams that unites data prep, machine learning, text mining, and predictive model deployment. It is used for business and commercial applications as well as for research, education, training, rapid prototyping, and application development, and it supports all steps of the machine learning process, including data preparation, results visualization, model validation, and optimization.

SAP HANA is an in-memory, column-oriented, relational database management system developed and marketed by SAP SE. Its primary function as a database server is to store and retrieve data as requested by the applications.

Using the SAP HANA data platform, which holds both structured and unstructured data, we wanted to build proofs of concept for several data science use cases in RapidMiner Studio: text analytics, market basket analysis, customer churn, and customer segmentation.

The step-by-step procedure below shows how to connect to HANA DB tables, views, and procedures from RapidMiner Studio.

RapidMiner & HANA integration steps:

1.     Prerequisites: You need the following before trying out RapidMiner models on HANA data.

     HANA client tools (these install the required drivers on the client machine)

     RapidMiner Studio client (the interface for developing RapidMiner models)

     A user account in the HANA DB with read access to the DB objects

     Optional: Install HANA Studio or Web IDE to analyze or preview SAP HANA data.

2. Once you install the HANA client tools, you will find the ngdbc.jar file in the path below:

C:\Program Files\SAP\hdbclient

Copy this ngdbc.jar file into the RapidMiner installation folder below:

C:\Program Files\RapidMiner\RapidMiner Studio\lib\jdbc
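
If you prefer to script the copy in step 2, here is a minimal sketch in Python. The function name install_hana_driver is only an illustration (it is not part of RapidMiner or the HANA client), and the two folders are the default paths from this blog, so adjust them if your installation differs.

```python
import shutil
from pathlib import Path


def install_hana_driver(client_dir, rapidminer_jdbc_dir, jar_name="ngdbc.jar"):
    """Copy the HANA JDBC driver jar into RapidMiner's lib/jdbc folder.

    Hypothetical helper: both folders are passed in by the caller, nothing
    is discovered automatically.
    """
    source = Path(client_dir) / jar_name
    if not source.is_file():
        raise FileNotFoundError(f"{source} not found; is the HANA client installed?")

    target_dir = Path(rapidminer_jdbc_dir)
    if not target_dir.is_dir():
        raise NotADirectoryError(f"{target_dir} is not an existing folder")

    target = target_dir / jar_name
    shutil.copy2(source, target)  # copies the file contents and timestamps
    return target


# Example call with the default paths from this blog (assumed, not verified):
# install_hana_driver(
#     r"C:\Program Files\SAP\hdbclient",
#     r"C:\Program Files\RapidMiner\RapidMiner Studio\lib\jdbc",
# )
```

On Windows, writing under C:\Program Files usually needs an elevated (administrator) prompt.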

3. Open RapidMiner Studio and create a database driver.

Go to Manage Database Drivers and enter the following parameters:

Name of driver: HANA_JDBC

Jar file: C:\Program Files\RapidMiner\RapidMiner Studio\lib\jdbc\ngdbc.jar

Port: 30115

Driver class: com.sap.db.jdbc.Driver
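
To see how these driver parameters fit together, here is a small Python sketch that builds the JDBC URL the SAP HANA driver expects, in the form jdbc:sap://host:port. The helper names and the host hana.example.com are hypothetical. The port helper assumes the common 3<instance number>15 SQL port convention (instance 01 gives 30115); multi-tenant systems can use other ports, so the port from your HANA administrator takes priority.

```python
def hana_sql_port(instance_number):
    """Return the SQL port 3<NN>15 for a two-digit HANA instance number.

    Assumption: the system follows the usual 3<NN>15 convention.
    """
    if not 0 <= instance_number <= 99:
        raise ValueError("instance number must be between 0 and 99")
    return int(f"3{instance_number:02d}15")


def hana_jdbc_url(host, port):
    """Build a JDBC URL for the SAP HANA driver (com.sap.db.jdbc.Driver)."""
    return f"jdbc:sap://{host}:{port}"


# Hypothetical host name, instance 01 as implied by port 30115 in this blog.
url = hana_jdbc_url("hana.example.com", hana_sql_port(1))
print(url)
```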

4. Now go to Manage Database Connections and create a database connection with the parameters below. Select the previously created driver as the database system, then enter the HANA system host, port, user ID, and password.

Run the connection test; you should see a success message.
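
If the connection test fails, it helps to first rule out a network problem. The sketch below is a plain TCP reachability check using only the Python standard library. It does not log in to HANA or validate the user ID and password; it only tells you whether the host and port can be reached from your machine. The function name can_reach is an illustration.

```python
import socket


def can_reach(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolved host names.
        return False


# Example call with a hypothetical host and the port used in this blog:
# can_reach("hana.example.com", 30115)
```

A False result usually points to a firewall, a wrong port, or a wrong host name rather than a problem in RapidMiner.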

5. Now use the database connection in a process to import the data and process it using RapidMiner models.

From the Read Database parameters, you can select tables or views, or you can write a custom HANA SQL query.

Writing a custom SQL query:
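
As an illustration of a custom query, the Python sketch below builds a small preview SELECT that you could paste into the query field. The schema SALES, the table ORDERS, and the column names are hypothetical, and the helper names are my own. It uses double-quoted identifiers and SELECT TOP, both of which HANA SQL accepts.

```python
def quote_identifier(name):
    """Wrap an identifier in double quotes, doubling any embedded quotes."""
    return '"' + name.replace('"', '""') + '"'


def build_preview_query(schema, table, columns=None, limit=100):
    """Build a SELECT TOP query for previewing a HANA table or view."""
    if not isinstance(limit, int) or limit <= 0:
        raise ValueError("limit must be a positive integer")
    if columns:
        column_list = ", ".join(quote_identifier(c) for c in columns)
    else:
        column_list = "*"
    source = f"{quote_identifier(schema)}.{quote_identifier(table)}"
    return f"SELECT TOP {limit} {column_list} FROM {source}"


# Hypothetical schema, table, and columns, used only for illustration.
print(build_preview_query("SALES", "ORDERS", ["CUSTOMER_ID", "AMOUNT"], limit=10))
```

Limiting the rows while you design the process keeps the import fast; remove the TOP clause once the model works on the sample.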

Date: 09/17/2018
Prepared By : Joseph Yeruva
https://twitter.com/josredy
