Install Tez, Pig and Hive on HDP
Check Support Matrix Check Hortonworks support matrix to check which version of MySQL is supported :https://supportmatrix.hortonworks.com/In support matrix, I will choose :Ubuntu 18.04 under Operating...
View Article"Pig Latin / SQL Challenge" or "Analytic / Window Functions in PIG"
Here is a challenge for those who are new to Pig Latin.We'll first download MovieLens movie ratings data. Our goal is to prepare a report showing : The best rated movie of each decade, beginning with...
View ArticleInstall a single-node Hortonworks Data Platform (HDP) version 3.1.4 on...
This post will lead you through the setup process of a single-node Hortonworks Data Platform, working on a Kubuntu 18.04 workstation.Of course you can easily download a Hortonworks Sandbox image here....
View ArticleAdvanced SQL Challenge
The ChallengeHere is a challenge for SQL enthusiasts.I'll solve it here using PostgreSQL 10.10 on Kubuntu 18.04; but feel free to give it a try in your favorite RDBMS.We have a simple table with two...
View ArticleGet Started with Nifi : Partitioning CSV files based on column value
This tutorial demonstrates how incoming data file can be divided into multiple files based on a column value, using Apache Nifi.The Nifi Flow will :Fetch files from a local folderDivide the content...
View ArticleNiFi Revisited : Aggregate Movie Ratings Data To Find Top 10 Movies
This post is a sample of data aggregation in NiFi.If you just started learning NiFi, check this blog post, which is a much more detailed sample than this one.Our goal is :Fetch the movie ratings...
View ArticleOne challenge with 10 solutions
Technologies we use for Data Analytics has evolved a lot, recently. Good old relational database systems become less popular every day. Now we have to find our way through several new technologies,...
View Article
More Pages to Explore .....