Closed
Hadoop/Hive integration with Pega
- We want Pega to integrate with Hive tables on Hortonworks Hadoop cluster.
- We have imported all the jars recommended by Pega.
- We are able to create a hadoop connectivity to Pega directly. This connection is not stable right now.
- Later we want to establish the connection using Kerboros authentication. We are using Pega 7.3.1 running on WebSphere. Unfortunately this is not working in our application. I wanted to check if there is any way to connect to Hadoop using Kerboros on WebSphere. Seems like this is a product limitation.
- We want to read ORC file formats stored on top of HDFS. But currently Pega is not supporting ORC formats on HDFS. Let me know if there is any other alternative approach to read ORC using third party jars.
- If the above approach doesn’t work, we are planning to establish a connection to hive tables directly using hive JDBC drivers. Please let me know if there is any limitation with this approach. Pega is not able to recognize hive JDBC drivers in our WebSphere path.
***Edited by Moderator Marissa to add SR Details***