How are you involved in Big Data?


Wouter de Bie

Oct 4, 2013, 5:11:51 AM
to big-dat...@googlegroups.com
Hi all,

To get some discussion going, it might be nice to share how we are each involved in Big Data, so we can understand what the people who have joined here do.

Let me start:

I work for Spotify as a Team Lead for Data Infrastructure. My team builds infrastructure so that others at Spotify can use data easily and quickly. Part of this is managing our Hadoop cluster and related technologies, but we also spend a lot of effort building the right tools for Spotify employees, partners and record labels/rights holders. For developers we have things like luigi (http://github.com/spotify/luigi); for analysts and people who want to access large datasets we built a data warehouse interface on top of Hive and Postgres databases; and for people interested in aggregates we have built a dashboarding system.
We use data because it allows us to iterate on the product more quickly, gain user insights and report to our external stakeholders.

// Wouter

Iván de Prado

Oct 4, 2013, 5:58:51 AM
to Wouter de Bie, big-data-europe
Hi all,

I'm Iván de Prado Alonso, CEO of www.datasalt.com. We offer consulting and training in the Big Data space. Our clients are big companies (banking, etc.) and startups. We love open source and have made some contributions to the cause. The first one is Pangool (www.pangool.net), an advanced Java API for Hadoop based on a new paradigm, "Tuple MapReduce". The second project is Splout SQL (www.sploutsql.com), a web-latency SQL spout for Hadoop. It allows you to create materialized SQL views over Hadoop data that can be used to build your web or mobile application over Big Data. We also like to publish technical posts on our blog (http://www.datasalt.com/blog/).


--
Iván de Prado
CEO & Co-founder

Daniel Olmedilla

Oct 8, 2013, 5:01:00 AM
to big-dat...@googlegroups.com
Hi all,

My name is Daniel Olmedilla, and I am Vice President of Data Science at XING (www.xing.com), the leading professional social network in German-speaking countries. My team is responsible for all backend services that involve the storage, usage and analysis of large amounts of data, and for the development of any new service that requires it. We have run Hadoop for some years, and we have now upgraded it to run YARN (it has been in production for only a couple of months). We also use Elasticsearch as a search engine, and in some situations also as a NoSQL key-value store (where we also use Redis and are considering other options such as HBase).

    D.

Joseph Pollack

Oct 8, 2013, 7:00:47 AM
to big-dat...@googlegroups.com
Dear Big Data Euro,

I'm a PhD candidate at ERCIS. I use big data for time-critical evaluations of mass casualty emergencies and large-scale (humanitarian) crises. I want to connect with all of you and perhaps meet up with you at academic and professional events. I'm interested in LOD, open government, financial markets, mapping, development economics, and data visualisation. Looking forward to participating in this community!

Warm Regards,

-Joseph.

James Kinley

Oct 8, 2013, 7:54:34 AM
to Wouter de Bie, big-dat...@googlegroups.com
Hi all,

I'm a Senior Solutions Architect at Cloudera, and part of Cloudera's London-based Field Technical Services team. I work with our customers in EMEA to ensure that they are successful with Hadoop, whether that means helping them go from soup to nuts (from the whiteboard to a complete end-to-end production solution using the Hadoop stack) or providing data science and training.

I've been a Hadoop engineer since 2009 and pre-Cloudera I worked in the UK defence industry where I specialised in cyber security (lots of data there!). I think this group is a great idea and I'm looking forward to discussing and sharing ideas with everyone involved.

Cheers,
James.

@jrkinley
--

Markus Schmidberger

Oct 10, 2013, 6:26:39 AM
to big-dat...@googlegroups.com
Hi,

I am Markus, and I work as a Big Data Analyst and HPC & Cloud Computing Expert at comSysto GmbH in Munich, Germany. I did my PhD at the University of Munich in ‘Parallel Computing for Biological Data’. My main research focus is on high performance computing and big data analyses with the R (R-Project, #rstats) language. My work and publications have had a strong influence on the R, computational statistics and computational biology communities.
Currently I am working on a project for Telefonica called SmartSteps, and I am happy to be processing real-world big data with Hadoop and R. I am the organiser of the Munich useR and Hadoop user groups.

See you soon
Markus

Chris Harris

Oct 11, 2013, 2:43:56 PM
to big-dat...@googlegroups.com
I work for Hortonworks as an EMEA Solution Engineer and spend my life architecting Apache Hadoop projects for enterprises across EMEA.

Prior to Hortonworks, I worked as a Solution Architect at MongoDB, where I was responsible for driving MongoDB projects across Europe. Before joining MongoDB I spent time at both SpringSource and Red Hat/JBoss, where I worked with major clients on their next-generation Java platforms. You will also see me regularly at conferences and events.

Chris


Michael Hausenblas

Oct 15, 2013, 4:52:43 PM
to big-dat...@googlegroups.com

Wouter, thanks a lot for starting this great forum—much appreciated!

I'm Chief Data Engineer EMEA at MapR Technologies. I'm originally from Austria and moved to Ireland in 2009; our kids speak more Gaelic than I do ;)

I found my way into Big Data essentially as an end-user of Apache ecosystem components, from Lucene to Hadoop. My background is in large-scale data integration, the Internet of Things, and Web applications and I also gathered some experience in the standardisation domain (six years at W3C and IETF, in various Working Groups).

As a Data Engineer at MapR, I often travel throughout Europe, talking at HUGs and other events. The other part of my work revolves around working with our customers on solutions, pretty much everything from use case discovery (incl. business goals, TCO and ROI questions) through POCs to system architecture.

Last but certainly not least I'm contributing to Apache Drill, a distributed system for interactive, ad-hoc analysis and query of large-scale datasets (and also giving talks on this topic on a regular basis; please let me know if I haven't been to your local HUG, yet).


Some of the upcoming events I'll be speaking at, where I hope to meet some of the more than 143 members of this group:

* Apache Drill at JAX, London http://jaxlondon.com/sessions/large-scale-interactive-ad-hoc-queries-over-different-datastores-apache-drill
* Where Polyglot Persistence meets the Lambda Architecture at Strata 2013, London http://strataconf.com/strataeu2013/public/schedule/detail/31701
* On Solr and Machine Learning at the LUCENE/SOLR REVOLUTION EU 2013, Dublin http://www.lucenerevolution.org/sessions
* Harnessing the Internet of Things with NoSQL at NoSQL Matters 2013, Barcelona http://2013.nosql-matters.org/bcn/abstracts/#abstract_342628039


Feel free to get in touch with me:

* Twitter: https://twitter.com/mhausenblas
* Google+: https://plus.google.com/u/0/102497386507936526460/about
* Skype: mhausenblas
* GitHub: https://github.com/mhausenblas


Cheers,
Michael

--
Michael Hausenblas
Ireland, Europe
http://mhausenblas.info/

Tóth Zoltán

Oct 24, 2013, 4:20:34 PM
to Michael Hausenblas, big-dat...@googlegroups.com
Hi, 

I am Zoltan, Data Engineer at Prezi. I am mainly responsible for developing and running Prezi's Hadoop infrastructure, ETL and our toolset around these. Most of the time I work with Pig, native MapReduce, Redshift and some Web/JavaScript. Furthermore, of course, there is no data analytics without extensive use of R and Bash. :) Haskell is also coming up big time at Prezi. So basically I am a classic startup hacker guy, simply solving problems and pushing the data infrastructure into the state that best satisfies our internal customers.

Prior to joining Prezi I was mostly involved in usability engineering, component-based systems, web and pharmaceutical market analytics.

I love this initiative, thanks Wouter!

Let me invite Zoli Prekopcsak, CEO at Radoop, Andras Balogh, Business Analyst at Prezi and Zoltan Papp, Quant. Partner at Morgan Stanley.

I'd be honoured if you added me:

Zoltan Prekopcsak

Nov 3, 2013, 10:54:54 AM
to big-dat...@googlegroups.com
Hi All,

Zoltan, thank you for the invite. This group is a great idea, and I am happy to be part of it.

About me: I have a background in data science in both academic research and startup/enterprise projects. I have been working with Hadoop for more than 3 years, and now I am running a company called Radoop, which combines Hadoop with RapidMiner, the popular data mining tool. We mainly focus on advanced analytics and data mining on Hadoop, so feel free to reach out; I am happy to discuss anything related.

Best, Zoli

Bence Arató

Feb 6, 2014, 2:35:41 AM
to big-dat...@googlegroups.com
Hi,

I'm Bence Arato from Budapest, Hungary, and I work as an industry analyst, architect, and advisor. I have been in the BI and analytics space for almost 20 years. I'm the founder and CEO of BI Consulting Hungary.

We run two international data-related events every year in Hungary, the Budapest DW & Big Data Forum in June (http://budapestdwforum.com), and the Budapest BI Forum in November (http://budapestbiforum.com/).
I'm also the main organizer of the Budapest Big Data Meetup (http://www.meetup.com/Big-Data-Meetup-Budapest).


Best, Bence




Tristan Dorsey

Feb 16, 2015, 8:37:59 AM
to big-dat...@googlegroups.com
Hi,

I have always been part of the Hadoop vs MongoDB conversation: which is the better platform?

The amount of data produced across the world is increasing exponentially, and is currently doubling in size every two years. It is estimated that by the year 2020, the data available will reach 44 zettabytes (44 trillion gigabytes). The processing of large amounts of data not suitable for traditional methods has become known as Big Data, and although the term only gained popularity in recent years, the concept has been around for over a decade.


If you’re not using the data you have in all the ways you want to, you just might have an opportunity to embrace Big Data. Our process of working with Big Data initiatives begins by educating you on Big Data and exploring how it may apply to your company. With the help of use cases from various industries, we’ll bring Big Data concepts to life and see where there might be value in introducing this new form of data management to your organization.