How do I read a streaming Dataset from a socket in Apache Spark?

apache-spark, spark-structured-streaming

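The code below reads from a socket, but I don't see any input going into the job. I ran nc -l 1111 and dumped data into it, but I'm not sure why the Spark job is unable to read data from 10.176.110.112:1111.

Dataset<Row> d = sparkSession.readStream().format("socket")
    .option("host", "10.176.110.112")
    .option("port", 1111).load();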

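Well, honestly, you don't read anything from anywhere yet. You have only described what should happen once the streaming pipeline is started.

Since you use Structured Streaming to read a Dataset from a socket, you have to call the start operator to trigger data fetching (and only after you have defined a sink).

start(): StreamingQuery
Starts the execution of the streaming query, which will continually output results to the given path as new data arrives. The returned StreamingQuery object can be used to interact with the stream.

Before calling start, you should define where the data is streamed to. It could be Kafka, files, a custom streaming sink (perhaps using the foreach operator) or the console.

I use the console sink (aka format) in the example below. I also use Scala and leave rewriting it in Java as a home exercise.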

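import org.apache.spark.sql.streaming.Trigger

d.writeStream.  // <-- this is the most important part: define the sink
  trigger(Trigger.ProcessingTime("10 seconds")).  // check for new data every 10 seconds
  format("console").  // print each micro-batch to stdout
  option("truncate", false).  // do not shorten long column values
  start()       // <-- and this: actually starts the streaming query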
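
For readers who want to check their own Java port, the same pipeline could be sketched roughly as follows. The host and port come from the question; the class name, application name and the local[2] master are assumptions made only so the example is self-contained, and awaitTermination is added so the driver does not exit before any batch is printed. The socket source connects to the given host and port as a client, so nc has to be listening there before the query starts.

```java
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;
import org.apache.spark.sql.streaming.StreamingQuery;
import org.apache.spark.sql.streaming.Trigger;

// Hypothetical class name, used for illustration only.
public class SocketToConsole {
    public static void main(String[] args) throws Exception {
        // Assumption: a local session with two threads; adjust for a real cluster.
        SparkSession sparkSession = SparkSession.builder()
            .appName("SocketToConsole")
            .master("local[2]")
            .getOrCreate();

        // This only describes the source; nothing is read yet.
        Dataset<Row> d = sparkSession.readStream()
            .format("socket")
            .option("host", "10.176.110.112")
            .option("port", 1111)
            .load();

        // Defining a sink and calling start() is what triggers data fetching.
        StreamingQuery query = d.writeStream()
            .trigger(Trigger.ProcessingTime("10 seconds"))
            .format("console")
            .option("truncate", false)
            .start();

        // Block the main thread until the query is stopped or fails.
        query.awaitTermination();
    }
}
```

With nc -l 1111 running on 10.176.110.112, each line typed into nc should then show up in a console batch at the next trigger.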




