Hi there,

I have been looking at the docs for 2.0, particularly around support for labels during batch import.

I see there is support for adding labels to nodes during batch import, directly querying labels for nodes and so on. However, unless I am missing something, I don't see that there is support for locating a node by label and ID. I have found I have needed to do this when I import a large dataset where the relationships come separately from the nodes (say a dump from a relational database) and I need to use an external ID to find the nodes for the relationship.

I wondered what the intended approach for looking up a node by label and ID is during batch import. I can see the following choices:

- Use the standard EmbeddedGraphDatabase (making sure to have shut down the batch inserter of course) to look up the nodes for a bunch of relationship inserts before going into insert mode.
- Use the BatchInserterIndexProvider to somehow hack into the underlying index that I believe is created for labels.
- Be patient and wait for support to appear in the batch API for querying nodes by label and ID :)

Thanks
Jen
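
For illustration only, a fourth option that is not among the choices listed above: keep the mapping from (label, external ID) to node ID in memory while the nodes are being inserted, and consult it when the relationship rows arrive. The sketch below is plain Java with no Neo4j dependency. The class name `ExternalIdLookup` and its methods are hypothetical, and it assumes the whole mapping fits in memory, which may not hold for a very large dataset.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical helper (not part of any Neo4j API): remembers which node id
 * was assigned to each (label, external id) pair during the node import.
 */
final class ExternalIdLookup {
    // Outer key: label name. Inner key: external id from the source data.
    private final Map<String, Map<String, Long>> idsByLabel =
            new HashMap<String, Map<String, Long>>();

    /** Records the node id assigned to the given label and external id. */
    void register(String label, String externalId, long nodeId) {
        Map<String, Long> ids = idsByLabel.get(label);
        if (ids == null) {
            ids = new HashMap<String, Long>();
            idsByLabel.put(label, ids);
        }
        ids.put(externalId, nodeId);
    }

    /** Returns the recorded node id, or -1 if the pair was never registered. */
    long find(String label, String externalId) {
        Map<String, Long> ids = idsByLabel.get(label);
        if (ids == null) {
            return -1L;
        }
        Long nodeId = ids.get(externalId);
        return nodeId == null ? -1L : nodeId.longValue();
    }
}
```

During the node pass, `register` would be called with whatever node ID the batch inserter returned when the node was created. During the relationship pass, `find` would resolve both endpoints before the relationship is created. Whether this beats the options in the question depends on dataset size and available heap.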
--
You received this message because you are subscribed to the Google Groups "Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email to neo4j+unsubscribe@googlegroups.com.
Yago has a ridiculously deep taxonomy. Most ontologies have several thousand classes, though, due to the nature of any RDF store out there. Traversal and property queries (in SPARQL) are complicated and very slow because lots of things are postfiltered (collect nodes first, filter by property later). Querying by class/type and relationships, on the other hand, is strongly optimized and very fast. That's why most ontologies have lots of classes and put each resource into several of them (multiclassing).
Aside from that: isn't denormalization the main point of NoSQL stores? Stuff like this shouldn't exist in a proper triple store, though; I just came across it in a Yago sample data set and found it funny...