We are seeing several data problems in our graph. We have gone through our application code and found no path that could write such data, so we suspect a problem in Titan, in Cassandra, or in our Cassandra/Titan configuration or operations. Any ideas that might help would be much appreciated.
* 1. missing user-defined labels - Whenever our application creates a vertex, the vertex is given a label. We never use the label 'vertex'; however, we sometimes find vertices that have a label of 'vertex'. I don't know where that string is coming from.
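A query along these lines lists a sample of the affected vertices. It is only a sketch: it assumes `g` is the same traversal source used in the console sessions in this post, and because it filters on the label alone it will probably scan the whole graph, so expect it to be slow on a large one.

```groovy
// List up to 10 vertices that carry the unexpected 'vertex' label.
// valueMap(true) includes the id and label alongside the properties.
g.V().hasLabel('vertex').limit(10).valueMap(true)
```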
* 2. unindexed vertices - We initialized the graph indices before adding any vertices. We also tried rebuilding the index in question. Still, we have vertices that cannot be found via indexed lookups. We can only find these via vertex id.
# When I do a lookup by properties and label I don't find it:
gremlin> g.V().hasLabel('accessPoint').has('bssid', '50602816b3a1').has('ssid', '415641432057494649')
gremlin>
# But if I do the lookup by vertex id I do find it:
gremlin> g.V(76362871024).label()
==>accessPoint
gremlin> g.V(76362871024).values('bssid', 'ssid')
==>50602816b3a1
==>415641432057494649
gremlin>
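One thing that can be checked here is the status of the index itself. The following is a sketch using the Titan management API, assuming `graph` is the open TitanGraph and that the composite index is called `byBssidAndSsid`; that name is hypothetical and stands in for whatever the real index is named.

```groovy
// Assumption: 'graph' is the open TitanGraph; 'byBssidAndSsid' is a made-up index name.
mgmt = graph.openManagement()
index = mgmt.getGraphIndex('byBssidAndSsid')   // returns null if no index has that name
// Print the status of every property key covered by the index
// (one of INSTALLED, REGISTERED, ENABLED, DISABLED).
index.getFieldKeys().each { key -> println "${key.name()}: ${index.getIndexStatus(key)}" }
// Nothing was changed, so discard the management transaction.
mgmt.rollback()
```

Any key reported as something other than ENABLED would mean the index is not being used for lookups on that key.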
* 3. missing properties - I don't have an example handy for this one, but the idea is that our application code always writes certain properties to vertices of a given label. However, we sometimes find such vertices missing the expected properties.
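A traversal like the one below should surface such vertices. It is a sketch only: the `accessPoint` label and `bssid` key are borrowed from the other examples as stand-ins for the affected label and property, and `g` is assumed to be the same traversal source. Since `hasNot` cannot use an index, this is likely to be a full scan.

```groovy
// Count vertices of the given label that have no 'bssid' property at all.
g.V().hasLabel('accessPoint').hasNot('bssid').count()
// Show a sample of them, with id and label included.
g.V().hasLabel('accessPoint').hasNot('bssid').limit(10).valueMap(true)
```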
* 4. indexed vertices but the indexed properties are missing from the vertex - This is an odd one. I do a lookup using indexed properties, and the lookup returns a vertex, but that vertex doesn't have the properties I just searched on (valueMap() comes back empty).
gremlin> g.V().hasLabel('accessPoint').has('bssid', '8e705a7fef00').has('ssid', '44532d4775657374').valueMap()
==>[:]
gremlin>
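To dig into a vertex like this, its id can be captured from the indexed lookup and then re-read directly by id, bypassing the index. This is a sketch that assumes `g` is the same traversal source as above; `next()` throws if the lookup returns nothing.

```groovy
// Capture the vertex that the indexed lookup returns.
v = g.V().hasLabel('accessPoint').has('bssid', '8e705a7fef00').has('ssid', '44532d4775657374').next()
v.id()
v.label()
// Re-read the same vertex by id, without going through the index.
g.V(v.id()).valueMap(true)
```

If the lookup by id comes back empty, the index entry may be stale rather than the vertex being incomplete, though that is only a hypothesis.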