OK, so the idea is:
Data -> CloudBase (on top of Hadoop)
SQL -> CloudBase -> Aggregates/Rollups -> Import to external RDBMS
SQL -> RDBMS -> Real-time Report
Is this correct?
Would there be an additional stage between Aggregates/Rollups and Import
to external RDBMS? That is, where do Aggregates/Rollups live before
they are imported into an RDBMS? I can imagine this being up to the
implementer, but could you recommend an approach?
For example, maybe Aggregates/Rollups would get stored in HDFS? Or in
custom CloudBase tables? Or stored in memory and loaded into RDBMS
after in-memory Aggregates/Rollups are created?
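
To make the question concrete, here is a rough sketch of the flow as I
understand it. Everything in it is an assumption for illustration only:
Python's sqlite3 stands in for the external RDBMS, an in-memory list
stands in for the data in Hadoop, a Counter stands in for the nightly
CloudBase SQL job, and CSV text stands in for a staging file (for
example, one written to HDFS). The table name daily_hits and the helper
names are hypothetical, not CloudBase APIs.

```python
import csv
import io
import sqlite3
from collections import Counter

# Hypothetical raw events. In the real pipeline these would live in Hadoop
# and the rollup would be a nightly CloudBase SQL job (GROUP BY day, page).
RAW_EVENTS = [
    ("2009-03-01", "/home"),
    ("2009-03-01", "/home"),
    ("2009-03-01", "/search"),
    ("2009-03-02", "/home"),
]


def build_rollup(events):
    """Batch step: count hits per (day, page)."""
    return sorted((day, page, n) for (day, page), n in Counter(events).items())


def stage_as_csv(rows):
    """Staging step: serialize the rollup to CSV text (stand-in for a staged file)."""
    buf = io.StringIO()
    csv.writer(buf).writerows(rows)
    return buf.getvalue()


def load_into_rdbms(conn, csv_text):
    """Import step: bulk-load the staged rollup into the reporting database."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS daily_hits ("
        "day TEXT, page TEXT, hits INTEGER, PRIMARY KEY (day, page))"
    )
    rows = [(d, p, int(n)) for d, p, n in csv.reader(io.StringIO(csv_text))]
    # INSERT OR REPLACE keeps a re-run of the nightly load idempotent.
    conn.executemany("INSERT OR REPLACE INTO daily_hits VALUES (?, ?, ?)", rows)
    conn.commit()


def report(conn, page):
    """Real-time step: the report reads only the small rollup table."""
    cur = conn.execute(
        "SELECT day, hits FROM daily_hits WHERE page = ? ORDER BY day", (page,)
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")
load_into_rdbms(conn, stage_as_csv(build_rollup(RAW_EVENTS)))
print(report(conn, "/home"))
```

The part I am unsure about is the stage_as_csv step: whether the
rollups should sit in a staging file like this, in CloudBase tables, or
go straight from memory into the RDBMS.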
Thanks,
Otis
On Mar 2, 7:13 pm, "Leo Dagum" <leo_da...@yahoo.com> wrote:
> Most real-time reports work off aggregate or rollup tables. Where CloudBase
> fits in is generating the aggregates as part of nightly batch processing,
> while the real-time reporting is executed by a traditional RDBMS.
>
> - leo
>
> _____
>
> From: cloudba...@googlegroups.com
> [mailto:cloudba...@googlegroups.com] On Behalf Of Tarandeep Singh
> Sent: Monday, March 02, 2009 3:17 PM
> To: cloudba...@googlegroups.com
> Subject: Re: Real-time/quick analytics reports?
>
> Hi Otis,
>
> CloudBase is not meant for real-time queries. CloudBase provides a database
> abstraction layer on top of Hadoop and, coupled with its SQL interface, makes
> it easier to use Hadoop to query/mine your data.
>
> We do use CloudBase to generate reports, but not real-time reports. With the
> CloudBase-1.2 release, it is possible to index table data, which helps
> reduce query execution time, but again the performance won't be close to
> real-time/quick reporting.
>
> -Taran
>
> On Mon, Mar 2, 2009 at 2:48 PM, otis.gospodne...@gmail.com