http://thenoisychannel.com/2010/01/31/linkedin-search-a-look-beneath-the-hood/
We do use Voldemort directly (i.e., it isn't extract and processed
separately) for People You May Know, who viewied my profile and other
features. People You May Know is pre-computed in Hadoop and then
pushed out to a read-only Voldemort cluster (using the read only
storage engine).
"Who viewed my profile" (another site feature that uses aggregation
and counting) is powered by read/write Voldemort. Unfortunately, I am
not sure exactly how much about information I am allowed to provide
about the working of this feature (how much is public and how much
isn't), but I've CC'd one of the engineers familiar with this feature
to see if he can point you to some public information on this.
- Alex
> --
> You received this message because you are subscribed to the Google Groups "project-voldemort" group.
> To post to this group, send email to project-...@googlegroups.com.
> To unsubscribe from this group, send email to project-voldem...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/project-voldemort?hl=en.
>
>
A simple suggestion may be to have stores mapping:
* Time sent -> message ids (e.g., a list of message ids sent between
0830 and 0900 on a specific day)
* Time received -> message ids
* Message id -> status object (when was it sent, when was it received)
You may want to look at papers related to OLAP cubing:
http://en.wikipedia.org/wiki/OLAP_cube
Here's a public presentation on Avatara, or scalable OLAP layer built
on top of Hadoop and Voldemort (both read-write and read-only):
http://sna-projects.com/sna/images/avatara-sam-sig.ppt
- Alex