Scala: Erasing old Spark Structured Streaming event-time windows (tags: scala, apache-spark, spark-structured-streaming)

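I am aggregating data over event-time windows, but the only data I need is what falls in the latest 3 windows. What is the correct way to erase the other, already-processed old windows that I no longer need?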

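// `lines` is the streaming DataFrame; spark.implicits._ is assumed to be in scope for $"..."
import org.apache.spark.sql.functions.window

val query = lines
  .withWatermark("timestamp", "20 seconds")
  .groupBy(window($"timestamp", "10 seconds"), $"value")
  .count()
  .sort($"window".desc) // newest window first; unary minus is not defined for a struct column
  .writeStream
  .outputMode("complete")
  .format("console")
  .start()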
I only want to see the 3 most recent windows. Is filtering out windows older than 30 seconds good enough?

Thanks
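
On the 30-second filter: the sketch below is plain Scala with no Spark dependency, and it only illustrates the window arithmetic behind the question. The object and helper names (`WindowCutoffSketch`, `windowStart`, `keepByAge`, `keepLatest`) are made up for this example, timestamps are assumed to be epoch seconds, and the sample events are invented. It compares an age-based cutoff against keeping a fixed number of windows.

```scala
// Hypothetical helpers, not part of any Spark API.
object WindowCutoffSketch {
  // Length of the tumbling window, matching the "10 seconds" in the query.
  val WindowSeconds = 10L

  // Start of the 10-second tumbling window that contains an event time.
  def windowStart(eventTime: Long): Long =
    Math.floorDiv(eventTime, WindowSeconds) * WindowSeconds

  // Age-based rule: keep windows whose start is at most maxAge seconds before `now`.
  def keepByAge(starts: Seq[Long], now: Long, maxAge: Long): Seq[Long] =
    starts.distinct.filter(_ >= now - maxAge).sorted

  // Count-based rule: keep the n windows with the greatest start.
  def keepLatest(starts: Seq[Long], n: Int): Seq[Long] =
    starts.distinct.sorted.takeRight(n)
}

import WindowCutoffSketch._

// Invented event times (epoch seconds), mapped to their window starts.
val starts = Seq(12L, 25L, 31L, 38L, 44L, 52L, 60L).map(windowStart)

val onBoundary  = keepByAge(starts, now = 60L, maxAge = 30L) // List(30, 40, 50, 60)
val offBoundary = keepByAge(starts, now = 65L, maxAge = 30L) // List(40, 50, 60)
val latestThree = keepLatest(starts, n = 3)                  // List(40, 50, 60)
```

With this sample data, the age-based rule keeps 4 windows when `now` sits exactly on a window boundary and 3 otherwise, while the count-based rule always keeps 3. Whether a 30-second cutoff is "good enough" therefore seems to depend on which clock the cutoff is compared against and on whether every recent window actually received data.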