A complete Big Data platform ready to be explored
Hadoop-PaaS environment offers a complete Big Data platform based on Hortonworks distribution of Apache Hadoop. You can get all your jobs done, be it batch job, real time data streaming, machine learning, you name it. Currently you will have access to following tools: Apache Spark Apache Kafka Apache hive R/Python Apache Pig Apache Zeppelin Apache Oozie Apache Mahout Apache Sqoop Apache Ranger Apache Flume Apache Knox
Spark/Hive as a service
At Hadoop-PaaS, Spark is fully integrated into the Hadoop ecosystem. Customers can benefit not just from Spark but also from other MapReduce and Non-MapReduce tools. All these tools run on the same HDFS cluster managed by YARN.