
Spark Cassandra connector returns no results when running on a single-node cluster

Tags: apache-spark, cassandra, spark-cassandra-connector

I am using DSE 5.0.0. I created the following table on a single-node Cassandra cluster:

CREATE TABLE IF NOT EXISTS dummy (
  id uuid,
  txt text,
  PRIMARY KEY (id)
);

INSERT INTO dummy(id, txt) values (uuid(), 'hello world');
Then, when I query for a specific id using the Spark Cassandra connector, I get no results:

val df = sqlc.read.format("org.apache.spark.sql.cassandra")
         .options(Map("table" -> "mytable", "keyspace" -> "myks"))
         .load()

df.show(false)

// +------------------------------------+-----------+
// |id                                  |txt        |
// +------------------------------------+-----------+
// |2b69ddc1-2c15-485d-a30f-1b2d7f86c200|hello world|
// +------------------------------------+-----------+

df.filter("id = '2b69ddc1-2c15-485d-a30f-1b2d7f86c200'").show

// 16/07/28 08:51:43 DEBUG CassandraTableScanRDD: Fetching data for range (token("id") <= ?,List(-9223372036854775808)) with SELECT "id", "txt" FROM "myks"."mytable" WHERE token("id") <= ? AND "id" = ?   ALLOW FILTERING with params [-9223372036854775808,2b69ddc1-2c15-485d-a30f-1b2d7f86c200]
// +---+---+
// | id|txt|
// +---+---+
// +---+---+

Does the single-node cluster matter here? It looks like a problem with UUID filtering...

Yes, the problem only occurs when running this on a single-node cluster. I use a single-node cluster in my continuous integration environment.

It looks like a bug in the range code, which should switch the inequality. Investigating.

This works for me on DSE 5.0. Do you have any more information on the repro?
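To make the inequality problem from the comments concrete, here is a minimal sketch of how a range whose start and end are both the minimum token could turn into a predicate that matches nothing. It is not the connector's actual code: the class TokenRangeSketch and both methods are hypothetical, and it assumes Murmur3-style tokens, i.e. signed 64-bit values whose minimum token is Long.MIN_VALUE and which no stored row actually uses.

```java
public class TokenRangeSketch {
    // Assumption: Murmur3-style partitioner, minimum token is Long.MIN_VALUE.
    static final long MIN_TOKEN = Long.MIN_VALUE;

    // Hypothetical naive translation of the range ]start, end] into a filter.
    // When start is the minimum token, the lower bound is dropped and only
    // "token <= end" is kept, which is the shape of the logged query.
    static boolean naiveContains(long start, long end, long token) {
        if (start == MIN_TOKEN) {
            return token <= end;
        }
        if (end == MIN_TOKEN) {
            return token > start;
        }
        return token > start && token <= end;
    }

    // Wrap-aware variant: ]min, min] is treated as the whole ring, any other
    // range with start == end as empty, and start > end as wrapping around.
    static boolean wrapAwareContains(long start, long end, long token) {
        if (start == end) {
            return start == MIN_TOKEN;
        }
        if (start < end) {
            return token > start && token <= end;
        }
        return token > start || token <= end;
    }

    public static void main(String[] args) {
        long rowToken = 42L; // arbitrary token standing in for the stored row
        // Single-token ring: the only range is ]minToken, minToken].
        System.out.println(naiveContains(MIN_TOKEN, MIN_TOKEN, rowToken));     // prints "false"
        System.out.println(wrapAwareContains(MIN_TOKEN, MIN_TOKEN, rowToken)); // prints "true"
    }
}
```

Under these assumptions the naive predicate reduces to token <= Long.MIN_VALUE, so the row is filtered out even though its id matches, which is consistent with the empty result in the question.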
     WHERE token("id") <= Long.MinValue
// com.datastax.driver.core.Metadata, line 671
    private static Set<TokenRange> makeTokenRanges(List<Token> ring, Token.Factory factory) {
        ImmutableSet.Builder<TokenRange> builder = ImmutableSet.builder();
        // JAVA-684: if there is only one token, return the range ]minToken, minToken]
        if (ring.size() == 1) {
            builder.add(new TokenRange(factory.minToken(), factory.minToken(), factory));