Intro to MapReduce

110 views
Skip to first unread message

krishna chaitanya

unread,
Sep 28, 2012, 6:34:31 AM9/28/12
to chenn...@googlegroups.com
Hi,
 
I am a hadoop beginner. I am aware of setting up the hadoop environment and have basic idea of map-reduce functionality.
Can you elaborate on Map-Reduce functionality and how to start with map-reduce alogorithms.
 
 
 
Message has been deleted

Prasad

unread,
Sep 28, 2012, 7:10:47 AM9/28/12
to chenn...@googlegroups.com
Hi Krishna,
Just like hello world example in any java application, word count is the basic exercise for learning map reduce. I would suggest you to go through the tutorial from apache.

Ask us any doubts if you face any difficulties in understanding the logic.

Regards,
Prasad

Prasad

unread,
Sep 28, 2012, 7:14:13 AM9/28/12
to chenn...@googlegroups.com
Probably you can go through this example from latest packages http://wiki.apache.org/hadoop/WordCount . The previous post is with deprecated ones.

Chintan Jain

unread,
Sep 28, 2012, 9:34:49 AM9/28/12
to chenn...@googlegroups.com
If you are a fan pf hadoop and big database, then you should been attending the amazon web services AWS today.

I guess you missed out.. Lot of potential learning..

--
 
 



--

CHINTAN R JAIN

Facebook Fan Page - http://www.facebook.com/meetchintanjain
Twitter Profile - http://twitter.com/meetchintanjain
Facebook Profile - http://www.facebook.com/chintanjain25


Pavan Kulkarni

unread,
Sep 28, 2012, 1:44:36 PM9/28/12
to chenn...@googlegroups.com
Please check the book Hadoop "A Definitive Guide " from Tom White . It is written very comprehensively and gives you an overall idea of how Hadoop works and how to write MapReduce programs.

--
 
 



--

--With Regards
Pavan Kulkarni

siva

unread,
Oct 4, 2012, 9:52:06 AM10/4/12
to chenn...@googlegroups.com
  Hi experienced hadoop professionals,

     Can you share some real time use cases where and how to apply MapReduce?
     This will really add more interest to this topic...

Prasad

unread,
Oct 4, 2012, 12:16:47 PM10/4/12
to chenn...@googlegroups.com
Hi Siva,
The primary constraint in using hadoop and map reduce is this can be used for scenarios where there is a huge volume of data and batch processing is applicable.
It is definitely not suited for real time processing.
 
To quote a few examples,
1. Consider deriving various analytics such as audit log reports from monthly system log report of all employees in organization
2. An example to analyse the trend in customers buying capacity of various products in an organisation. (Behavioural Marketing)
3.  Credit cards fraud detection by analysing users' history of banking records where sampling will not be best possible solution to analyse the entire data set
4. Call Records Processing in telecommunication industries.
5. Image Processing areas etc..

kumar vaibhav

unread,
Nov 20, 2012, 3:48:52 AM11/20/12
to chenn...@googlegroups.com
Hi,

I have mentioned this earlier too but I think what beginners need are not books like Hadoop:The Definitive Guide or Hadoop In Action. Although they are good books but there is no fun at all in reading huge volumes like these. What beginners need are simple step-by-step POC ideas that someone can guide them through and helpl building them. 

@Prasad - Hadoop is being used by Stumbleupon recommendation engine and companies like Explorys. I agree it cannot be real time since its a batch processing system but frameworks like Storm are a big help.

Thanks,
Kumar

Senthil Kumar

unread,
Nov 22, 2012, 6:04:31 AM11/22/12
to chenn...@googlegroups.com
@Vaibhav

Sorry for the delayed reply. We will discuss the same in presence with other members during next meetup.

Senthil
Reply all
Reply to author
Forward
0 new messages