Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Is Couchbase suitable for a small-reco​rd write-inte​nsive billion-re​cord applicatio​n?
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  5 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
yassen  
View profile  
 More options Oct 23 2012, 1:38 pm
From: yassen <yassen....@gmail.com>
Date: Tue, 23 Oct 2012 10:38:22 -0700 (PDT)
Local: Tues, Oct 23 2012 1:38 pm
Subject: Is Couchbase suitable for a small-reco​rd write-inte​nsive billion-re​cord applicatio​n?

Hi everyone,

Absolutely new (and ignorant) to NoSQL solutions and to Couchbase (my
apologies; but extensive experience with SQL RDBMS).

We consider a NoSQL DB deployment for a mission-critical application where
we need to store several hundreds of millions of data records, each record
consisting of about 6 string fields, record total length is 160 bytes.
There is a unique key in each record that seems suitable for hashing (20+
bytes string, e.g. "cle01_tpls01_2105328884").

The application should be able to write several hundreds of new records per
second, but first check if the unique key already exists. Writing is to be
done only if it is not there. If it is, the app needs to retrieve the whole
record and return it to the client and no writing is done in this case.

We need to have a cluster of at least 2-3 nodes, which must be able to grow
easily if a need be.
I need to know if Couchbase would be suitable for such application. Please,
advice, thank you!


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chad Kouse  
View profile  
 More options Oct 23 2012, 3:01 pm
From: Chad Kouse <c...@tunewiki.com>
Date: Tue, 23 Oct 2012 15:01:33 -0400
Local: Tues, Oct 23 2012 3:01 pm
Subject: Re: Is Couchbase suitable for a small-reco​rd write-inte​nsive billion-re​cord applicatio​n?

Well, shallowly, there is an "add" method in couchbase that you could use -- it only adds a record if one doesn't already exist.  You can check the result code to detect if it already existed or not.

From there it's just a "get" to retrieve the document

The part I'm not great at answering is how much space will be eaten up by metadata on such small keys.  Maybe one of the couchbase pros can do that.

--
Chad Kouse


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Zeming Jin  
View profile  
 More options Oct 23 2012, 3:36 pm
From: Zeming Jin <zeming....@gmail.com>
Date: Tue, 23 Oct 2012 14:36:34 -0500
Local: Tues, Oct 23 2012 3:36 pm
Subject: Re: Is Couchbase suitable for a small-reco​rd write-inte​nsive billion-re​cord applicatio​n?

57-bytes for each entry is a good number to estimate metadata size.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chad Kouse  
View profile  
 More options Oct 23 2012, 5:03 pm
From: Chad Kouse <chad.ko...@gmail.com>
Date: Tue, 23 Oct 2012 17:03:00 -0400
Subject: Re: Is Couchbase suitable for a small-reco​rd write-inte​nsive billion-re​cord applicatio​n?

btw, this sounds similar to our use case. (see screenshot: http://grab.by/gZ0C ) these 2 buckets both reside on the same 16 nodes. -- we've been really happy with performance.. although I'm not sure where those 2 disk fetches per second are creeping in from.. must investigate!

We also carry 1 replica copy so couchbase is handling over a billion data points for us (by the way our document sizes can be a lot larger than 160 bytes)

--
Chad Kouse


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
yassen  
View profile  
 More options Oct 29 2012, 8:35 am
From: yassen <yassen....@gmail.com>
Date: Mon, 29 Oct 2012 05:35:21 -0700 (PDT)
Local: Mon, Oct 29 2012 8:35 am
Subject: Re: Is Couchbase suitable for a small-reco​rd write-inte​nsive billion-re​cord applicatio​n?

Chad, Musician, thank you guys!

> 57-bytes for each entry is a good number to estimate metadata size

This sounds great; 57 bytes is really nice metatdata overhead :)

> Well, shallowly, there is an "add" method in couchbase that you could use

-- it only adds a record if one doesn't already exist.

That sounds very nice as well!

Can anyone tell me something about b-tree vs. hashing for the primary key?
Do we have both options in couchbase? How about different data stores for a
similar use case? (Heading me to the proper documentation sources would be
great.)

Thank you!
Yassen


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »