Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
GC overhead limit exceeded
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  Messages 26 - 39 of 39 - Collapse all  -  Translate all to Translated (View all originals) < Older 
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Peter Neubauer  
View profile  
 More options Apr 8 2012, 11:46 am
From: Peter Neubauer <peter.neuba...@neotechnology.com>
Date: Sun, 8 Apr 2012 17:46:21 +0200
Local: Sun, Apr 8 2012 11:46 am
Subject: Re: [Neo4j] GC overhead limit exceeded
Darren,
any chance you can profile this? Would be interesting to see what
exactly is happening, or you could give us the dataset and a sample
CURL query?

Cheers,

/peter neubauer

G:  neubauer.peter
S:  peter.neubauer
P:  +46 704 106975
L:   http://www.linkedin.com/in/neubauer
T:   @peterneubauer

Neo4j                                - Graphs rule.
Program or be programmed - Computer Literacy for kids.
http://foocafe.org/#CoderDojo


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
James Thornton  
View profile  
 More options Apr 8 2012, 12:58 pm
From: James Thornton <james.thorn...@gmail.com>
Date: Sun, 8 Apr 2012 09:58:18 -0700 (PDT)
Local: Sun, Apr 8 2012 12:58 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

On Sunday, April 8, 2012 6:31:42 AM UTC-5, project2501 wrote:

> In some cases, it looks like it can return 1000 nodes. I'm reusing
> vertices but not edges. So there can be many duplicate edges (for counting).

> How does one return just node properties? That's all I really need, but I
> see a lot of other stuff in nodes like URLs.

> Perhaps its a question for James as well since I'm using bulbs neo4jserver
> graph.

Darren, you are returning all the paths emanating from a node -- you're not
returning the nodes so returning the node properties is not relevant in
this case.

To get an idea of the size of the data you are returning, return a count of
the paths instead:

g.idx(index_name)[[name:vname]].outE.inV.loop(2){it.loops<=3}{true}.paths.c ount()

- James


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Michael Hunger  
View profile  
 More options Apr 8 2012, 1:14 pm
From: Michael Hunger <michael.hun...@neopersistence.com>
Date: Sun, 8 Apr 2012 19:14:28 +0200
Local: Sun, Apr 8 2012 1:14 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Aldo 64 MB is by far too little for a prod db you
Will get the gc churn w/o finishing.

Run a real db with 2 to 8 GB RAM

and it will not add up.

Sent from mobile device

Am 08.04.2012 um 17:24 schrieb Darren Govoni <darreng5...@gmail.com>:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Darren Govoni  
View profile  
 More options Apr 9 2012, 3:49 pm
From: Darren Govoni <darreng5...@gmail.com>
Date: Mon, 9 Apr 2012 12:49:01 -0700 (PDT)
Local: Mon, Apr 9 2012 3:49 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

I will. I bumped the maxmemory to 2GB and things are working, but when I
have a chance
I will run experiments with lower memory. Because I hope not to get hit
with this in production
setting. So its on my TODO list.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Darren Govoni  
View profile  
 More options Apr 11 2012, 4:50 pm
From: Darren Govoni <darreng5...@gmail.com>
Date: Wed, 11 Apr 2012 13:50:53 -0700 (PDT)
Local: Wed, Apr 11 2012 4:50 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Here's my basic query.

g.idx('vertex')[[name:'document']].bothE.bothV.loop(2){it.loops<=4}{true}.p aths.count()

for it.loops<2, count is 288
for it.loops<3, count is 3278
for it.loops<4, count is 46,960

My dashboard reads.

156 nodes
957 properties
180 relationships
61 relationship types


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Darren Govoni  
View profile  
 More options Apr 11 2012, 4:53 pm
From: Darren Govoni <darreng5...@gmail.com>
Date: Wed, 11 Apr 2012 13:53:03 -0700 (PDT)
Local: Wed, Apr 11 2012 4:53 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

What I see happening here is a combinatorial explosion. As you can see, my
data set is not that rich..... :(


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Peter Neubauer  
View profile  
 More options Apr 11 2012, 5:03 pm
From: Peter Neubauer <peter.neuba...@neotechnology.com>
Date: Wed, 11 Apr 2012 23:03:39 +0200
Local: Wed, Apr 11 2012 5:03 pm
Subject: Re: [Neo4j] GC overhead limit exceeded
Yes,
looks like it. What is the reason for this query, and is there any way
you can prune it down by e.g. including directions in the relationship
traversals?

Cheers,

/peter neubauer

G:  neubauer.peter
S:  peter.neubauer
P:  +46 704 106975
L:   http://www.linkedin.com/in/neubauer
T:   @peterneubauer

Neo4j                                - Graphs rule.
Program or be programmed - Computer Literacy for kids.
http://foocafe.org/#CoderDojo


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Darren Govoni  
View profile  
 More options Apr 11 2012, 8:20 pm
From: Darren Govoni <darreng5...@gmail.com>
Date: Wed, 11 Apr 2012 17:20:37 -0700 (PDT)
Local: Wed, Apr 11 2012 8:20 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

I want to display a graph centered on a vertex and fanning out some number
of levels, most likely 3.
I was told to use paths for this and it seems to make sense.

My graph is composed of sentence triples like.

document contains text
index has documents

But for a repeat occurrences, I reuse the same vertex so path finding will
work, but
I don't know how to re-use an edge (is it even possible?) so if "document
contains text"
appears 20 times, then this adds a factorial of combinations to the path
finding.

If I had 100 mllion nodes in my graph (which is supported by neo4j?) do
these gremlin queries
scale? I'm thinking maybe not, if any vertex has a handful of unique edges,
the combinations will be
off the charts when you add even half a dozen vertices with their own edges.

Is there a best practice or known limits to neo4j's ability to find paths
with more than a few dozen vertex?


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Marko Rodriguez  
View profile  
 More options Apr 11 2012, 8:27 pm
From: Marko Rodriguez <okramma...@gmail.com>
Date: Wed, 11 Apr 2012 18:27:50 -0600
Local: Wed, Apr 11 2012 8:27 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Hi,

Gremlin is a lazy language so you can next() results all day long. If you
are trying to save all these results in memory, then you will run into
problems.

What is your query scenario?

Marko.

http://markorodriguez.com
On Apr 11, 2012 6:20 PM, "Darren Govoni" <darreng5...@gmail.com> wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
James Thornton  
View profile  
 More options Apr 11 2012, 8:55 pm
From: James Thornton <james.thorn...@gmail.com>
Date: Wed, 11 Apr 2012 17:55:30 -0700 (PDT)
Local: Wed, Apr 11 2012 8:55 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Yes, this Gremlin "paths" expression will return all possible path
combinations emanating from all vertices named "document" so it will be
quite large -- is this really what you want?

Please describe your context and end goal, and maybe we can help you craft
a better solution.

- James


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Darren Govoni  
View profile  
 More options Apr 11 2012, 10:13 pm
From: Darren Govoni <darreng5...@gmail.com>
Date: Wed, 11 Apr 2012 19:13:03 -0700 (PDT)
Local: Wed, Apr 11 2012 10:13 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Thanks James. Well, I guess I want all minimal paths emanating (both ins
and outs) from a vertex. For example,
let's say I have the following vertex-edge-vertex triples (directed).

v1-e1-v2
v1-e2-v2
v1-e3-v2
v2-e4-v3
v2-e5-v3
v2-e6-v3

where e1-e3 and e4-e6 have the same "name" property (but different id's).
So reducing the above list to:

v1-e1-v2
v2-e4-v3

or perhaps

v1-"buy"-v2
v2-"likes"-v3

such that any one of e1,e2,e3 would suffice because they have the same name
(e.g. "buy").
I duplicate the edges because I'm not sure how to reuse one or if that
concept exists,
but also because I want to count the number of edges with "buy" for Page
Rank purposes.
And I use properties on edges that trace back to other objects elsewhere.

The above example is only 2 edges away, but it can vary (e.g. it.loops<X)

I'm still wrestling with gremlin syntax/semantics to understand how to
affect this.
Maybe something with simplePath. Still trying. Slow but sure.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Marko Rodriguez  
View profile  
 More options Apr 11 2012, 4:57 pm
From: Marko Rodriguez <okramma...@gmail.com>
Date: Wed, 11 Apr 2012 14:57:36 -0600
Local: Wed, Apr 11 2012 4:57 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Hi,

That is odd. Here are some notes:
        1. Use 'both' instead of bothE.bothV (and then loop(1))
        2. Try without paths and simply do count() (just for testing)

That is such a tiny graph that I don't know why you are having a GC overhead limit exceeded. Are you doing this from the Gremlin REPL, a Groovy class, or from the Web Admin?

Oh reading lower, I see you are doing some Bulbs related stuff----can you speak more to that as James is the Bulbs guy.....

Thanks,
Marko.

http://markorodriguez.com

On Apr 11, 2012, at 2:53 PM, Darren Govoni wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
James Thornton  
View profile  
 More options Apr 12 2012, 3:31 pm
From: James Thornton <james.thorn...@gmail.com>
Date: Thu, 12 Apr 2012 12:31:37 -0700 (PDT)
Local: Thurs, Apr 12 2012 3:31 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Marko, he's returning all possible paths over REST -- all 47,000 of them --
but he doesn't really want or need this, he just wants the shortest path.

- James


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Marko Rodriguez  
View profile  
 More options Apr 12 2012, 3:53 pm
From: Marko Rodriguez <okramma...@gmail.com>
Date: Thu, 12 Apr 2012 13:53:15 -0600
Local: Thurs, Apr 12 2012 3:53 pm
Subject: Re: [Neo4j] GC overhead limit exceeded

Hey,

> Marko, he's returning all possible paths over REST -- all 47,000 of them -- but he doesn't really want or need this, he just wants the shortest path.

        startVertex.both.loop(1){true}{it.object == endVertex}.paths[0]

To be safe, you can do a "max loop" so you don't go on forever (e.g. not a strongly connected graph).

        startVertex.both.loop(1){it.loops < 4}{it.object == endVertex}.paths[0]

Take care,
Marko.

http://markorodriguez.com

On Apr 12, 2012, at 1:31 PM, James Thornton wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages < Older 
« Back to Discussions « Newer topic     Older topic »