Warning: file_get_contents(/data/phpspider/zhask/data//catemap/0/jpa/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Scala 如何在Spark中读取多个Elasticsearch索引?_Scala_Apache Spark_<img Src="//i.stack.imgur.com/RUiNP.png" Height="16" Width="18" Alt="" Class="sponsor Tag Img">elasticsearch_Spark Dataframe - Fatal编程技术网 elasticsearch,spark-dataframe,Scala,Apache Spark,elasticsearch,Spark Dataframe" /> elasticsearch,spark-dataframe,Scala,Apache Spark,elasticsearch,Spark Dataframe" />

Scala 如何在Spark中读取多个Elasticsearch索引?

Scala 如何在Spark中读取多个Elasticsearch索引?,scala,apache-spark,elasticsearch,spark-dataframe,Scala,Apache Spark,elasticsearch,Spark Dataframe,我需要从Elasticsearch的多个索引中读取数据。但所有这些指数都具有相同的数据结构 例如: val df1 = spark.read.format("org.elasticsearch.spark.sql") .option("query", myquery) .option("pushdown", "true") .load("news_01/myitem") val df2 = spark.read

我需要从Elasticsearch的多个索引中读取数据。但所有这些指数都具有相同的数据结构

例如:

val df1 = spark.read.format("org.elasticsearch.spark.sql")
              .option("query", myquery)
              .option("pushdown", "true")
              .load("news_01/myitem")

val df2 = spark.read.format("org.elasticsearch.spark.sql")
              .option("query", myquery)
              .option("pushdown", "true")
              .load("news_02/myitem")
如果我得到索引名数组[news\u 01,news\u 02],会发生什么


如何避免像我现在这样创建df1、df2?

鉴于ElasticSearch允许您在搜索请求期间同时针对多个索引,您可以执行以下操作:

val df = spark.read.format("org.elasticsearch.spark.sql")
              .option("query", myquery)
              .option("pushdown", "true")
              .load("news_01,news_02")

你是说,你想合并两个索引中的数据?