Gmail Calendar Documents Reader Web more »
Recently Visited Groups | Help | Sign in
Google Groups Home
Business Intelligence solution in Cloud Computing
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  Messages 26 - 50 of 201 - Collapse all  -  Translate all to Translated (View all originals) < Older  Newer >
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post will appear after it is approved by moderators
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Ray Nugent  
View profile  
 More options Jun 19 2008, 2:17 pm
From: Ray Nugent <rnug...@yahoo.com>
Date: Thu, 19 Jun 2008 11:17:13 -0700 (PDT)
Local: Thurs, Jun 19 2008 2:17 pm
Subject: Re: Business Intelligence solution in Cloud Computing

Chris, couple of thoughts -

1) is it possible to have the app run on AWS so that the derived data does not need to traverse back down in real time (that way you could use a lazy download in the background to archive it in their DC while their app accesses the copy in real time on AWS.)?

2) I've been thinking about the problem of upload times as well (in the context of large DNA data sets). The cost of loading into AWS is not that prohibitive so if one where to pre-process that data such that it could be uploaded in a bunch of parrallel processes to AWS you could reduce the bottleneck considerably. In theory.

Ray

----- Original Message ----
From: Chris K Wensel <ch...@wensel.net>
To: cloud-computing@googlegroups.com
Sent: Thursday, June 19, 2008 9:27:51 AM
Subject: Re: Business Intelligence solution in Cloud Computing

> On Jun 19, 2008, at 8:08 AM, Chris K Wensel wrote:

>> That said, with bandwidth being the bottleneck in the face of the
>> ability to spin up 100 or 1000 nodes to crunch numbers

> how big are the datasets you're working with?  Random or linear  
> access ?

total data is 100's of G. Individual work loads are ~10G. All linear  
(this being Hadoop), but there is much joining, binning, and crunching  
between the multiple input datasets (the actual workload translates to  
~60 MapReduce jobs, all rendered and managed by Cascading).

So it kinda sucks to have uploads of data to the cluster take longer  
than it does to compute on it. Worse since my client then has to fetch  
the derived data back.

ckw

--
Chris K Wensel
ch...@wensel.net
http://chris.wensel.net/
http://www.cascading.org/


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Issues of data in the cloud..." by Marc Evans
Marc Evans  
View profile  
 More options Jun 19 2008, 2:45 pm
From: Marc Evans <m...@softwarehackery.com>
Date: Thu, 19 Jun 2008 14:45:44 -0400
Local: Thurs, Jun 19 2008 2:45 pm
Subject: Re: Issues of data in the cloud...
In my experiences, there are cases where having the data / computation
as close to the customer edge as possible is what is required for an
acceptable user experience. In other cases, the relationship of the user
/ data / computation is not important. Most often, there is a mix of
both. One of the ideas behind Hadoop as I understand it is to bring the
computation to the data location, while also providing for the data to
be in several locations. The scheduler is critical to making good use of
data locality. So yes, I believe that what you are looking for does
exist within Hadoop at a minimum, though I also believe that there is
alot of room to evolve the techniques that it uses.

- Marc


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Jim Peters  
View profile  
 More options Jun 19 2008, 2:40 pm
From: "Jim Peters" <jazzman...@gmail.com>
Date: Thu, 19 Jun 2008 11:40:35 -0700
Local: Thurs, Jun 19 2008 2:40 pm
Subject: Re: Issues of data in the cloud...

Even if the cloud providers come up with excellent answers to the security
and reliability questions, who's going to trust them? Credit card numbers
are one thing, but cloud data is something else entirely.

+J

--
Jim Peters
+415-608-0851

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chaz.  
View profile  
 More options Jun 19 2008, 3:10 pm
From: "Chaz." <eprparad...@gmail.com>
Date: Thu, 19 Jun 2008 15:10:33 -0400
Local: Thurs, Jun 19 2008 3:10 pm
Subject: Re: Issues of data in the cloud...
Jim,

  I definitely agree with your point. I can't think of very many
multi-nationsls that would let there data out to wander around. I'd
think they would want to protect their data and move the computing
resources close to it....

Chuck


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Stuart Altenhaus  
View profile  
 More options Jun 19 2008, 2:39 pm
From: "Stuart Altenhaus" <saltenh...@gmail.com>
Date: Thu, 19 Jun 2008 18:39:03 +0000
Local: Thurs, Jun 19 2008 2:39 pm
Subject: Re: Issues of data in the cloud...
If the programs are moved to the data, then what is the distinction between cloud computing and CORBA? Seems like the same basic tenets would have to be in place.

(I'm new to the concept of cloud computing, but do see the opportunities for advancing a network of computers that renders geo location trivial. Surely enhancing existing network clouds such that the computing power were placed at each node, a net-centric approach is achieved... The telcos do that today, right?)

Sent from my Verizon Wireless BlackBerry


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Utpal Datta  
View profile  
 More options Jun 19 2008, 3:43 pm
From: "Utpal Datta" <utpal8...@gmail.com>
Date: Thu, 19 Jun 2008 15:43:50 -0400
Local: Thurs, Jun 19 2008 3:43 pm
Subject: Re: Issues of data in the cloud...
May be this is a redundant question, where is this protected data
residing? In the cloud or in the user's data center?

If it is in the cloud then we are still dealing with Security,
Availability and Recoverability isues (that everyone agrees on).

If is in the users data center then how will the computing resources
offered (and controlled by Amazon) be brought to that specific user's
datacenter?

--utpal


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Lynne VanArsdale  
View profile  
 More options Jun 19 2008, 3:57 pm
From: "Lynne VanArsdale" <lynnevanarsd...@gmail.com>
Date: Thu, 19 Jun 2008 13:57:37 -0600
Local: Thurs, Jun 19 2008 3:57 pm
Subject: Re: Issues of data in the cloud...

Just joined cloud-computing and this is the first conversation I've
received.

A couple of weeks ago I attended Gartner Security where Neil MacDonald spoke
on "Adaptive Security." In a nutshell, this approach builds a resilient
system for secure data, acting much like the human immune system.  It
involves whitelisting as the foundation, blacklisting as a mid-tier and
learned/adaptive mechanisms at the top.  In such an environment, elements
would be "autonomic" and self-managing to a large degree, and would share
and communicate with other elements to protect workloads and information (as
opposed to endpoints).  There is a lot more to this vision, and it is
probably a number of years away, but it may be a reasonable approach to
address the concerns about data security being discussed here.

In any case, does anyone know of any product or standards efforts for the
industry to collaborate on a more cohesive architecture for security in the
cloud?

On 6/19/08, Chaz. <eprparad...@gmail.com> wrote:

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chris K Wensel  
View profile  
 More options Jun 19 2008, 4:03 pm
From: Chris K Wensel <ch...@wensel.net>
Date: Thu, 19 Jun 2008 13:03:21 -0700
Local: Thurs, Jun 19 2008 4:03 pm
Subject: Re: Issues of data in the cloud...
CORBA isn't about mobility, it's just typesafe OO RPC. There was work  
done by ObjectSpace and GeneralMagic in the 90's on agent based  
computing (move code to the data). but that movement died off.

if the Cloud is a collection of compute resources, and you need to  
apply them to lots of your data, you have little choice but to move  
your data. you can't move the compute power. (unless you order a  
shipping container of servers I guess)

ckw

On Jun 19, 2008, at 11:39 AM, Stuart Altenhaus wrote:

--
Chris K Wensel
ch...@wensel.net
http://chris.wensel.net/
http://www.cascading.org/

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Business Intelligence solution in Cloud Computing" by Chris K Wensel
Chris K Wensel  
View profile  
 More options Jun 19 2008, 3:55 pm
From: Chris K Wensel <ch...@wensel.net>
Date: Thu, 19 Jun 2008 12:55:05 -0700
Local: Thurs, Jun 19 2008 3:55 pm
Subject: Re: Business Intelligence solution in Cloud Computing

> 1) is it possible to have the app run on AWS so that the derived  
> data does not need to traverse back down in real time (that way you  
> could use a lazy download in the background to archive it in their  
> DC while their app accesses the copy in real time on AWS.)?

The pattern is roughly this:

-- load dataset to S3 from datacenter (in small pieces, in parallel),  
repeat

- identify current dataset
- boot hadoop cluster
- start job on given dataset
- head of job pulls down parts from S3 in parallel (very natural with  
Hadoop)
- compete middle of job
- tail of job stuffs results sets into S3 in parallel (again fairly  
natural with Hadoop)
-- repeat above concurrently as datasets become available (easy to  
have concurrent Hadoop clusters in EC2).

-- pull data from S3 in parts in parallel

note 'job' above means a given data processing flow. in terms of  
Hadoop, the 'job' could be dozens of MapReduce jobs on the cluster.

> 2) I've been thinking about the problem of upload times as well (in  
> the context of large DNA data sets). The cost of loading into AWS is  
> not that prohibitive so if one where to pre-process that data such  
> that it could be uploaded in a bunch of parrallel processes to AWS  
> you could reduce the bottleneck considerably. In theory.

you will see a boost if you spawn multiple connects from one location  
to S3. it seems (was clearly in the past, unsure as of today) that  
individual connections were throttled, and up to a point bandwidth  
from a given ip was throttled. so doing things in parallel by breaking  
your big data into small parts give you a boost. I can't remember the  
numbers, else i'd share. its been a couple months since that project.

one benefit of using small parts, is that a given part will be  
available before the 'whole' is available. S3 won't show things for  
download that aren't finished uploading. So this also improves things  
(especially when coupled with SQS).

by 'parts' i mean, I may have locally 10G of data. I will break it  
into n MB  pieces (compressed) and push them up to S3 (in parallel).  
having a manifest (*.parts file) is great when you need to manage the  
integrity of individual parts (MD5) and the whole (parts list all  
available, MD5 on parts file). This in part guarantees you aren't  
processing a job on partial data (because the upload failed an no one  
noticed).

ckw

--
Chris K Wensel
ch...@wensel.net
http://chris.wensel.net/
http://www.cascading.org/


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Issues of data in the cloud..." by Chaz.
Chaz.  
View profile  
 More options Jun 19 2008, 4:18 pm
From: "Chaz." <eprparad...@gmail.com>
Date: Thu, 19 Jun 2008 16:18:45 -0400
Local: Thurs, Jun 19 2008 4:18 pm
Subject: Re: Issues of data in the cloud...
Security is a funny issue. Can you ever use a cloud computing complex
and know for certain your data is protected?  I'm betting there is no
fool proof way that it can be. So the only real way is to fall back to
what we know today: maintain physical control of it for once that is
gone you are on your own baby.

Chuck Wegrzyn

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chaz.  
View profile  
 More options Jun 19 2008, 4:30 pm
From: "Chaz." <eprparad...@gmail.com>
Date: Thu, 19 Jun 2008 16:30:30 -0400
Local: Thurs, Jun 19 2008 4:30 pm
Subject: Re: Issues of data in the cloud...
I don't believe it is possible to have data security in the "cloud"
without having physical security of the data. After all whenever I use a
cloud computer I hope that no one has hacked it to replace the security
modules, or to map memory and look into a running program, etc.

Now if you have to build out an autonomic system we will never have
secure cloud computing. No system today is so tight that it can't be
hacked. Just look at all the attempts to protect DVDs or BD disks...

Chuck Wegrzyn

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chaz.  
View profile  
 More options Jun 19 2008, 4:34 pm
From: "Chaz." <eprparad...@gmail.com>
Date: Thu, 19 Jun 2008 16:34:38 -0400
Local: Thurs, Jun 19 2008 4:34 pm
Subject: Re: Issues of data in the cloud...
And CORBA isn't what I am thinking of, or even HADOOP but things like
JavaSpaces (?).

I'm not sure I would agree you have to ship your data to somewhere else.
After all a "cloud data provider" could create just the secure
environment for holding the data and processing it (isn't that really
what S3 is all about?). The only thing the using company needs to do is
write the program and have it installed, more or less automagically, on
the machines that hold the user's data.

Chuck Wegrzyn


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chris K Wensel  
View profile  
 More options Jun 19 2008, 4:39 pm
From: Chris K Wensel <ch...@wensel.net>
Date: Thu, 19 Jun 2008 13:39:58 -0700
Local: Thurs, Jun 19 2008 4:39 pm
Subject: Re: Issues of data in the cloud...

If you are deploying an application in EC2, you must architect it to  
survive failure, because it will fail in varying degrees. Subsequently  
features of AWS allow you to do that, roughly (booting a pre-
configured xen vm, simple db, sqs, s3, etc etc).

I suggest you do the same regarding security, just assume it's a  
hostile environment.

The question is, what features of AWS support you in this? shared  
keychains/stores, encrypted volumes, CA, kerberos, ?? or will this  
always be left to the user. or could you ever really trust those  
services the same way you trust them to not lose data.

That said, not being a security person. What 'cloud security services'  
could a provider provide? Or should they even bother.

ckw

On Jun 19, 2008, at 1:18 PM, Chaz. wrote:

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ray Nugent  
View profile  
 More options Jun 19 2008, 4:51 pm
From: Ray Nugent <rnug...@yahoo.com>
Date: Thu, 19 Jun 2008 13:51:48 -0700 (PDT)
Local: Thurs, Jun 19 2008 4:51 pm
Subject: Re: Issues of data in the cloud...

Hey Chuck,

I think the front page of the Wall Street Journal proves that even having physical security of your data does not provide security! :-)

Security is really a business issue. Each layer of security should cost no more than the data is worth. So the concept of "secure enough" becomes important. What security is appropriate for a given type of data and is it more or less secure in the cloud than in the corp DC? Is data inherently "less secure" by virtue of being in the cloud than, say, an employees laptop or flash dongle or "on the wire"? I don't think corporate data centers are a secure as you're suggesting they are...

Ray

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Business Intelligence solution in Cloud Computing" by Ray Nugent
Ray Nugent  
View profile  
 More options Jun 19 2008, 4:55 pm
From: Ray Nugent <rnug...@yahoo.com>
Date: Thu, 19 Jun 2008 13:55:41 -0700 (PDT)
Local: Thurs, Jun 19 2008 4:55 pm
Subject: Re: Business Intelligence solution in Cloud Computing

Chris, it's the last step I wonder about. If you leave the resultant data on S3 and run whatever app they have that operates against that data on EC2 it seems you could save some time?

Ray


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chris K Wensel  
View profile  
 More options Jun 19 2008, 5:47 pm
From: Chris K Wensel <ch...@wensel.net>
Date: Thu, 19 Jun 2008 14:47:57 -0700
Local: Thurs, Jun 19 2008 5:47 pm
Subject: Re: Business Intelligence solution in Cloud Computing

If there was a next processing step, then yes it would save time. But  
those jobs represent all the work being done that isn't done by client/
customers of my client.

On Jun 19, 2008, at 1:55 PM, Ray Nugent wrote:

--
Chris K Wensel
ch...@wensel.net
http://chris.wensel.net/
http://www.cascading.org/

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Issues of data in the cloud..." by Chaz.
Chaz.  
View profile  
 More options Jun 19 2008, 6:52 pm
From: "Chaz." <eprparad...@gmail.com>
Date: Thu, 19 Jun 2008 18:52:32 -0400
Local: Thurs, Jun 19 2008 6:52 pm
Subject: Re: Issues of data in the cloud...
Ray,

  You are absolutely correct. Once you have a person involved it can be
compromised. It is all about risk and how to make it so small it would
take an act of God (or a really large budget) to breach it!

Chuck

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chaz.  
View profile  
 More options Jun 19 2008, 6:50 pm
From: "Chaz." <eprparad...@gmail.com>
Date: Thu, 19 Jun 2008 18:50:51 -0400
Local: Thurs, Jun 19 2008 6:50 pm
Subject: Re: Issues of data in the cloud...
I think you missed the point. Even having shared keychains, use of X509,
etc. there is no guarantee you data is safe. Once it is in the hands of
a 3rd party you better assume it is compromised.

Perhaps the real solution is to carefully architect your solution to
provide "bulk" services outside the company and leave the critical
things - those that are absolutely vital - to inside the company.

Chuck Wegrzyn

...

read more »


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Business Intelligence solution in Cloud Computing" by Alan Ho
Alan Ho  
View profile  
 More options Jun 20 2008, 2:00 am
From: Alan Ho <karlu...@yahoo.ca>
Date: Thu, 19 Jun 2008 23:00:07 -0700 (PDT)
Local: Fri, Jun 20 2008 2:00 am
Subject: Re: Business Intelligence solution in Cloud Computing

Hi Chris,

I've looked at this issue quite a bit too. There are a few ways that I think the problem can be "relieved"

1. Don't encourage your clients to download the entire data-set. As long as you provide URLs to the "crunched data", they should only have to pull the data as needed. You can index the data too using SDB too - a nice convenience function for searching the data.

2. See if the customers can split the dataset into sub-datasets, each reachable via some sort of URL. When you run your Map job, each of the Map nodes will be responsible for downloading the data from your clients - you might get some benefits from the parallelization of the download.

3. Use S3 for more of a backing store - If you don't have many clients consuming the data, or you think that the clients will download the data soon after the mapreduce job is complete, they can download it directly from the HDFS (http://hadoop.apache.org/core/docs/r0.17.0/hdfs_design.html#Browser+I...)

I don't know if that helps.

Regards,
Alan Ho


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Issues of data in the cloud..." by Nati Shalom
Nati Shalom  
View profile  
 More options Jun 20 2008, 3:33 am
From: "Nati Shalom" <na...@gigaspaces.com>
Date: Fri, 20 Jun 2008 10:33:55 +0300
Local: Fri, Jun 20 2008 3:33 am
Subject: RE: Issues of data in the cloud...
A practical way for dealing with data in the cloud IMO is to decouple
the way we persist the data from the application.
What that basically means is that the application loads the data into an
in-memory cloud and that memory cloud keeps the data synchronized with
persistent storage asynchronously.
There is an opensource version that does that with Amazon -SimpleDB and
GigaSpaces as the data-grid
http://www.openspaces.org/display/EDS/External+Data+Source+by+Amazon+Sim
pleDB - It basically means that the data is stored in Amazon S3 as the
persistent storage. When the application boots, the datagrid loads the
data from S3 using SimpleDB interface to the memory of the cloud
resources.
The application use the in-memory data. Updates to the memory is being
propagated asynchronously back to S3 through the same datagrid and
SimpleDB interface. You can do pretty much the same thing with MySQL
instead of SimpleDB as I noted in one of my previous posts:
http://natishalom.typepad.com/nati_shaloms_blog/2008/03/scaling-out-mys.
html
Once the persistent storage is decoupled from our application we can
easily use the same model for keeping our persistent data outside of the
cloud i.e. in our local IT.

The nice thing is that we can be very flexible with our strategy as it
relates to where the data will reside, how it will be stored, and at
what rate it will be synchronized from the application. We can change it
overtime to best fit our application scenario and constraints without
touching our application code.

" And CORBA isn't what I am thinking of, or even HADOOP but things like
JavaSpaces (?)."

JavaSpaces is indeed more relevant for this type of scenarios.
What's unique about JavaSpaces IMO is that it can be used for handling
both the compute side and the data storage. The references above shows
how you could use space-based storage for handling the data side.
Now that its stored in the in-memory space cluster you can easily use
the same space to route business logic on those in-memory instances in
parallel. There's a nice way to abstract that from the user using a
remoting abstraction - see more details on how that works here:
http://uri-cohen.blogspot.com/2008/02/openspaces-svf-remoting-on-steroid
s.html

Nati S.
GigaSpaces


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Oscar Koeroo  
View profile  
 More options Jun 20 2008, 4:00 am
From: Oscar Koeroo <okoe...@nikhef.nl>
Date: Fri, 20 Jun 2008 10:00:55 +0200
Local: Fri, Jun 20 2008 4:00 am
Subject: Re: Issues of data in the cloud...

Isn't the discussion relative to the level of assurance (in terms of
security here) that is supplied and demanded per use case?

For absolute control, you should stay absolutely in control of the
resources and I don't think Cloud Computing is something for you.

If you want a secured environment you should understand that the
administrator of the resource can read the memory. If you want to
prevent that, then you should look into secured techniques to hide the
memory contents. Ultimately you must get your instructions through a
(virtual) CPU which can also be obscured on what it is doing, but that's
the only threat left in that solution.

Most cloud computing solution don't allow for that level of security in
the cloud.

The other more generic good thing to do IMHO is to encrypt all your data
that resides in the Cloud. Irregardless if this is somewhere between
absolutely needed and a tiny wish. This would also solve issues where
inadvertently some transfer protocol are unencrypted.

For some bio-medical use-cases in Grid computing (more my main field)
this approach is also being used. Decryption happens just prior to the
actual processing. A more advanced solution is the sliding decrypted
window approach. Where the dataset is decrypted per section or block.
CPU usage goes up, but most of the file/database stays encrypted and
opportunities to snoop around on the resource is very limited in its
opportunity.

cheers,

        Oscar Koeroo

...

read more »

  smime.p7s
4K Download

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ray Nugent  
View profile  
 More options Jun 20 2008, 10:54 am
From: Ray Nugent <rnug...@yahoo.com>
Date: Fri, 20 Jun 2008 07:54:59 -0700 (PDT)
Local: Fri, Jun 20 2008 10:54 am
Subject: Re: Issues of data in the cloud...

Nati, I agree that decoupling will help. However your point here confuses me -

"Once the persistent storage is decoupled from our application we can easily use the same model for keeping our persistent data outside of the cloud i.e. in our local IT."

Keeping persistence that far away is bound to have pretty significant impact on your performance isn't it?

Ray


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Business Intelligence solution in Cloud Computing" by Chris K Wensel
Chris K Wensel  
View profile  
 More options Jun 20 2008, 11:15 am
From: Chris K Wensel <ch...@wensel.net>
Date: Fri, 20 Jun 2008 08:15:29 -0700
Local: Fri, Jun 20 2008 11:15 am
Subject: Re: Business Intelligence solution in Cloud Computing

Thanks for the comments Alan. My previous post should outline how we  
have parallelized much of the infrastructure to alleviate my clients  
issues to a reasonable degree. In short, we employed the patterns you  
suggest, but not the specific technologies for various reason. I'd be  
happy to go into a little more detail offline.

The gist of my comments in this thread are to complain that you can't  
unfortunately scale bandwidth into a cloud to match the relative scale  
of the compute resources, currently. many hours to upload, and  
relatively few minutes to crunch, is an annoying imbalance.

For the analytics in the cloud space, there is an opportunity for a  
vendor to offer whatever services (many introduced in this thread by  
others) to alleviate the imbalance.

cheers,
ckw

On Jun 19, 2008, at 11:00 PM, Alan Ho wrote:

--
Chris K Wensel
ch...@wensel.net
http://chris.wensel.net/
http://www.cascading.org/

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Alan Ho  
View profile  
 More options Jun 20 2008, 12:11 pm
From: Alan Ho <karlu...@yahoo.ca>
Date: Fri, 20 Jun 2008 09:11:41 -0700 (PDT)
Local: Fri, Jun 20 2008 12:11 pm
Subject: Re: Business Intelligence solution in Cloud Computing

Yeah. The whole issue with SOA as it is today is that you are expected to move the data to where the data is processed. What we really need is the ability to move the processing to where the data is (Which is kinda the point of Hadoop)

Cheers,
Alan Ho


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Gavan Corr  
View profile  
 More options Jun 20 2008, 12:32 pm
From: Gavan Corr <gc...@nyx.com>
Date: Fri, 20 Jun 2008 12:32:42 -0400
Local: Fri, Jun 20 2008 12:32 pm
Subject: Re: Business Intelligence solution in Cloud Computing

Hadoop, fantastic idea (it would be great if it worked...)

if you need a production ready environment in Finance, it's a long way  
off. The distributed caching products, Gemfire, Oracle's Coherence and  
Nati's gigaspaces are all miles ahead of hadoop at this point, some  
more than others ;-)

On Jun 20, 2008, at 12:11 PM, Alan Ho wrote:

</pre>
<P><hr size=1></P>
<P><STRONG>
Visit our website at <a href="http://www.nyse.com">http://www.nyse.com</a> <br>

*************************************************************************** **
<br>
Note:  The information contained in this message and any attachment to it is privileged, confidential and protected from disclosure.  If the reader of this message is not the intended recipient, or an employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this communication is strictly prohibited.  If you have received this communication in error, please notify the sender immediately by replying to the message, and please delete it from your system. Thank you.  NYSE Euronext, Inc.

</STRONG></P><pre>


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Messages 26 - 50 of 201 < Older  Newer >
« Back to Discussions « Newer topic     Older topic »

Create a group - Google Groups - Google Home - Terms of Service - Privacy Policy
©2009 Google