Introduction to Data Science with Apache Spark
Apache Spark provides a lot of valuable tools for data science. With our release of Apache Spark 1.3.1 Technical Preview, the powerful Data Frame API is available on HDP. Data scientists use data...
View ArticleHadoop All Grown Up
It’s amazing the growth Apache Hadoop and the extended ecosystem has had in the last 10 years. I read through Owen’s “Ten Years of Herding Elephants” blog and downloaded the early docker image of his...
View ArticleThree Open Source Software Projects Transforming Oil & Gas Companies
We are already more than a month into 2016 and it’s anything but business as usual in Oil and Gas. Current markets are making companies rethink every aspect of their business model, foundational cost...
View ArticleAnnouncing GA of Apache Spark 1.6 in Hortonworks Data Platform 2.4
As Apache Spark continues to gain popularity, the rapid march of new Spark releases continues. With HDP 2.4, we are announcing the general availability of Spark 1.6, which is the latest Spark version...
View ArticleApache Spark & Apache Zeppelin: What’s coming in HDP 2.4.2
In March 2016 we announced Apache Spark 1.6 GA on HDP 2.4 and provided the 2nd technical preview of Apache Zeppelin. Since then, Apache Spark 1.6.1, a patch release with bug fixes, has been released by...
View ArticleApache Zeppelin: The Road Ahead
The below blog has been co-authored by Vinay Shukla, Hortonworks, Moon So Lee, Apache Zeppelin PMC & NFLabs, Prabhjyot Singh, Apache Zeppelin PMC & Hortonworks” Recently the Apache Software...
View ArticleIntro: Play-by-Play: Data Hacks & Demos @ #HS16SJ
So, it’s been a month since Hadoop Summit San Jose, where over 5000 of the leading tech innovators in big data came together to share their inventions, wisdom and know-how. One of the sessions – a...
View ArticleDemo #1 Play-by-Play: Data Hacks & Demos @ #HS16SJ
Match image to an identifier, correlate with data and initiate personalized, real time electronic convo with customer in store During the 1st demo of the Data Hacks & Demos session, at Hadoop...
View ArticleDemo #2: Play-by-Play: Data Hacks & Demos @ #HS16SJ
Apache NiFi to prioritize which images should be sent to Spark in the cloud for computer vision machine learning During the 2nd demo of the Data Hacks & Demos session, at Hadoop Summit San Jose,...
View ArticleDemo #3: Play-by-Play: Data Hacks & Demos @ #HS16SJ
Use IoT to get real-time feedback on customer preferences and respond to them During the 3rd demo of the Data Hacks & Demos session, at Hadoop Summit San Jose, it was audience participation time!...
View ArticleDemo #4 & Summary: Play-by-Play: Data Hacks & Demos @ #HS16SJ
Streaming analytics to create an accurate single buyer identity in real-time The 4th and final demo of the Data Hacks & Demos session, at Hadoop Summit San Jose, was done by Simon Ball and it...
View ArticleAnnouncing the Availability of Hortonworks Data Platform 2.5
Hortonworks Empowers Organizations to Maximize the Outcome of their Big Data Initiatives through improvements in security, governance, and operations. We are very pleased to announce that Hortonworks...
View ArticleWhat’s New in SmartSense 1.3
This April, Hortonworks launched a multi-phase initiative to streamline Apache Hadoop operations, and the 1.3 release of SmartSense marks the delivery of the second phase of that initiative, and that...
View ArticleTry the Latest Innovations in Apache Spark and Apache Zeppelin with...
With the release of Hortonworks 2.5 Sandbox several new exciting features have been added to Apache Spark and Apache Zeppelin. Apache Spark Updates One of the most powerful new Hortonworks 2.5 Sandbox...
View ArticleHDF 2.0 Flow Processing Real-Time Tweets from Strata Hadoop with Slack,...
Original post in HCC I had a few hours in the morning before the Strata+ Hadoop World conference schedule kicked in, so I decided to write a little HDF 2.0 flow to grab all the tweets about the Strata...
View ArticleTry Apache Spark 2.1 & Zeppelin in Hortonworks Data Cloud
Apache Spark 2.1 was released recently in the community. The main focus of this release was improvements in Structured Streaming and Machine Learning. Structured Streaming: Kafka .10 support, Metrics...
View ArticleWelcome to Apache Zeppelin 0.7.0
We are very excited about the release of Apache Zeppelin 0.7.0 and want to thank the Apache Foundation along with the Apache Zeppelin community. The long awaited release introduces several key features...
View ArticleThree Open Source Software Projects Transforming Oil & Gas Companies
We are already more than a month into 2016 and it’s anything but business as usual in Oil and Gas. Current markets are making companies rethink every aspect of their business model, foundational cost...
View ArticleAnnouncing GA of Apache Spark 1.6 in Hortonworks Data Platform 2.4
As Apache Spark continues to gain popularity, the rapid march of new Spark releases continues. With HDP 2.4, we are announcing the general availability of Spark 1.6, which is the latest Spark version...
View ArticleApache Spark & Apache Zeppelin: What’s coming in HDP 2.4.2
In March 2016 we announced Apache Spark 1.6 GA on HDP 2.4 and provided the 2nd technical preview of Apache Zeppelin. Since then, Apache Spark 1.6.1, a patch release with bug fixes, has been released by...
View Article
More Pages to Explore .....