Wednesday, August 20, 2014

Big Data , Hadoop - Where do I start from ?

Lately many people have shown interest in Big Data as this is one of the hottest technology  in market right now.

But,still lot of them do not know exactly what it is or from where to start this journey.(I frequently get messages asking me  where to start from. )

So, In this post I would like to highlight few of the resources as well as tips on how to start your Big Data Journey.

I would suggest that the best notes are always the documentation, but sometimes this can be tricky for a beginner.

Initially I would suggest to buy this book , which is pretty good compared to so many books available now.

Hadoop In Action - by Chuck Lam

Buy Hadoop In Action - on Flipkart

I would suggest to read this at least 5 times. Yes , i mean it , every time you read this, you find something new, I am still reading this!!

Next would be to setup a pseudo node Hadoop cluster. Please go through the blog for other  posts regarding this.

Next step or simultaneously,go to IBM Big Data University free course.
They have lots of free resources , explained clearly , try not to rush through the course but , understand and complete each course one by one.

Go to IBM Big Data University

They also provide a free course completion certificate,if you get more than 60% in quiz at end of each course. If you want to see a sample of it , please visit my LinkedIn profile,find the link on the right center of this page.


Once you are thorough with the basics, Now is the time for some real Action.
Hortonworks provides the best tutorials for beginners as well as some real time implementations.

  Hortonworks Tutorial

Also , Hortonworks provide a VM for beginners. If your interest is only development , then this is for you.
Please go to hortonworks website and download the sandbox.

Other option would be Cloudera's  CDH 4.7 VM.

Next, you can read Hadoop the Definitive Guide- This is also a good book but I feel is this is a bit advanced , but for people who have followed the initial steps , this should be a cakewalk.

Once you have completed these basic steps , by this time you would probably already have idea of what next to do.

Do connect with  me for any doubts or just a general discussion or just to say Hi.




Tuesday, August 12, 2014

Install R Latest Version on Ubuntu

Steps To install R Base
On Ubuntu 12.04: The latest cran repository  is not updated.
So before we directly install, few additional steps needs to be done.
Step 1: go to sudo gedit /etc/apt/sources.list
Add the following default repository at the end of the file 
deb http://cran.rstudio.com/bin/linux/ubuntu precise/
Save and Close the File
Step 2: sudo apt-get update 
Step 3: sudo apt-get install r-base

Now we have latest version of R installed.

To install R studio
Go to http://www.rstudio.com/products/rstudio/download/
and Download the latest version of deb file here in this case rstudio-0.98.994-amd64.deb
Right click and install using Ubuntu Software Center
11ac3e901c8011e4b187ffbd5ac10637.png