Gremlin JanusGraph BulkLoad报告无接触元素错误,已加载节点但没有边

Gremlin JanusGraph BulkLoad报告无接触元素错误,已加载节点但没有边,gremlin,janusgraph,Gremlin,Janusgraph,我正在探索在JanusGraph中加载csv的方法。我尝试了官方文件中给出的感恩死亡的例子,效果很好。方法如下: hadoop-load-csv.properties gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph gremlin.hadoop.graphReader=org.apache.tinkerpop.gremlin.hadoop.structure.io.script.ScriptInput

我正在探索在JanusGraph中加载csv的方法。我尝试了官方文件中给出的感恩死亡的例子,效果很好。方法如下:

hadoop-load-csv.properties

gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph
gremlin.hadoop.graphReader=org.apache.tinkerpop.gremlin.hadoop.structure.io.script.ScriptInputFormat
gremlin.hadoop.scriptInputFormat.script=./data/script-input-grateful-dead.groovy
gremlin.hadoop.inputLocation=./data/grateful-dead.txt

gremlin.hadoop.graphWriter=org.apache.hadoop.mapreduce.lib.output.NullOutputFormat
gremlin.hadoop.graphOutputFormat=org.apache.hadoop.mapreduce.lib.output.NullOutputFormat
gremlin.hadoop.outputLocation=output
gremlin.hadoop.jarsInDistributedCache=true
gremlin.graph=org.janusgraph.core.JanusGraphFactory
storage.hbase.table=grateful
storage.hostname=
janusgraph-properties

gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph
gremlin.hadoop.graphReader=org.apache.tinkerpop.gremlin.hadoop.structure.io.script.ScriptInputFormat
gremlin.hadoop.scriptInputFormat.script=./data/script-input-grateful-dead.groovy
gremlin.hadoop.inputLocation=./data/grateful-dead.txt

gremlin.hadoop.graphWriter=org.apache.hadoop.mapreduce.lib.output.NullOutputFormat
gremlin.hadoop.graphOutputFormat=org.apache.hadoop.mapreduce.lib.output.NullOutputFormat
gremlin.hadoop.outputLocation=output
gremlin.hadoop.jarsInDistributedCache=true
gremlin.graph=org.janusgraph.core.JanusGraphFactory
storage.hbase.table=grateful
storage.hostname=
run.groovy

outputGraphConfig = [path to janusgraph-grateful.properties]
:load ./data/grateful-dead-jansugraph-schema.groovy
graph = JanusGraphFactory.open(outputGraphConfig)
defineGratefulDeadSchema(graph)
graph.close()

readGraph = GraphFactory.open([path to hadoop-load-csv.properties])

blvp = BulkLoaderVertexProgram.build().writeGraph(outputGraphConfig).create(readGraph)
readGraph.compute(SparkGraphComputer).program(blvp).submit().get()
g = GraphFactory.open(outputGraphConfig).traversal()
g.V().count()
g.E().count()
在那之后,我放弃了整个图形,对数据进行二次采样并再次加载,结果失败了

1,song,HEY BO DIDDLEY,cover,5   followedBy,2,1|followedBy,3,2|followedBy,4,1|followedBy,5,1|followedBy,6,1  
2,song,IM A MAN,cover,1     followedBy,1,1
3,song,NOT FADE AWAY,cover,531  followedBy,5,572    followedBy,5,40|followedBy,1,2
4,song,BERTHA,original,394  followedBy,10,4 followedBy,1,1
5,song,GOING DOWN THE ROAD FEELING BAD,cover,293        
6,song,MONA,cover,1 sungBy,3|writtenBy,5    followedBy,1,1|followedBy,2,1
7,song,WHERE HAVE THE HEROES GONE,,0    followedBy,8,1  followedBy,9,1
8,song,OH BOY,cover,2   followedBy,9,1|followedBy,3,1|followedBy,7,1|sungBy,5|writtenBy,4   followedBy,1,1|followedBy,7,1|followedBy,6,1
800,song,WINING BOY BLUES,cover,1   sungBy,5|writtenBy,4    
9,song,HERE COMES SUNSHINE,original,65  followedBy,10,1 followedBy,6,2  
10,song,HERE COMES SUNSHINE,original,65     
我得到了一个NoTouchElement错误,当我查看图表时,
g.V().count()
返回10,而
g.E().count()
返回0。
有人知道发生了什么吗?非常感谢您给我一些建议。

很难说有什么问题,因为我不确定您的代码加载该csv文件的方法。我想你需要提供更多的信息。@stephenmallette谢谢你的回复!我已经了解了更多的细节,如果有帮助的话?这有助于改善情况,但我不认为有问题。有趣的是,标准文件加载可以找到,但您的子集不能找到。我想这让我相信你的子样本数据有问题,但我不知道那是什么。我对BLVP的记忆有点模糊。也许有人最近有过JanusGraph+BLVP的经验可以帮上忙。