Hi All
Thanks for attending the january meet.
We had a roughly around 50+ attendees.
Minutes of the Meeting
- Started off with a MapReduce session
- Explained how an rdbms query [select sum(amount) from table1 group by drug] can be converted into MR job.
- Went through Key Value pair in each phase (Map,Reduce) and also about Writables.
- How parallel processing in MR and much more.. Thanks to all who actively participated in interaction
- Dividied into five teams headed by senthil, siva, prasad, bini, ashwin respectively
- Datasets - MobieLens and Million Songs
In discussion with the team members, team leads mentioned above implemented MR for a query of their choice.
- In between, we had refreshment.
Some of the feedback given are:
- to discuss one real time usecase (or casestudy) in every meet.
- Ask beginners (for Hadoop Ecosystem / Big Data) to come early(half an hour before the scheduled time) for a introduction session
- Some web links for getting started to Hadoop. (we will soon launch our group webite having all links)
Please post us your feedback.
When shall we have our next meet - 9 feb or 16 feb?
What shall we discuss??
A small thought
- Shall we take up a single dataset and understand it completely before coming for the meet? or use hive for the queries??
- Everyone can come with their one simple query and try to implement it on their own assisted by others.
- How about million songs dataset??
PFA: MapReduce code and patient dataset used for the session.
photo snaps of the meet(reduced size for attachment purpose)
Happy Hadooping!!!!!!
Thanks
Senthil
Chennai Hadoop User Group - A Community run by the community for the community