Getting Twitter home timeline tweets with Spark Streaming and Scala
How can I get Twitter home timeline tweets using Spark Streaming and Scala?
Tags: scala, apache-spark, twitter, spark-streaming, twitter4j
val ssc = new StreamingContext(sc, Seconds(1))
val output = TwitterUtils.createStream(ssc, None)
When I use createStream, it does not return my timeline.

To get home timeline tweets with Spark Streaming and Scala, we need to set the Twitter OAuth credentials using TwitterFactory from twitter4j:
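import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.twitter.TwitterUtils
import twitter4j.TwitterFactory
import twitter4j.auth.AccessToken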
// Twitter Authentication credentials
val consumerKey = "twitter_consumer_key"
val consumerSecret = "twitter_consumer_secret"
val accessToken = "twitter_access_token"
val accessTokenSecret = "twitter_access_token_secret"
// Authorizing with your Twitter Application credentials
val twitter = new TwitterFactory().getInstance()
twitter.setOAuthConsumer(consumerKey, consumerSecret)
twitter.setOAuthAccessToken(new AccessToken(accessToken, accessTokenSecret))
// Setting up the streaming context with a batch interval of 1 second
val ssc = new StreamingContext(sc, Seconds(1))
val output = TwitterUtils.createStream(ssc, Option(twitter.getAuthorization()))
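Alternatively, if you do not want to pass the access credentials to createStream along with ssc, you can set them as system properties with the following code: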
System.setProperty("twitter4j.oauth.consumerKey", "twitter_consumer_key")
System.setProperty("twitter4j.oauth.consumerSecret", "twitter_consumer_secret")
System.setProperty("twitter4j.oauth.accessToken", "twitter_access_token")
System.setProperty("twitter4j.oauth.accessTokenSecret", "twitter_access_token_secret")
val ssc = new StreamingContext(sc, Seconds(1))
val output = TwitterUtils.createStream(ssc, None)
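To avoid hard-coding secrets in the snippet above, the four properties could be filled from environment variables. This is only a minimal sketch: the environment variable names and the helper loadTwitterCredentials are hypothetical, and only the twitter4j.oauth.* property keys come from the answer itself.

```scala
// Hypothetical environment variable names mapped to the twitter4j
// system property keys used in the answer above.
val envToProperty: Map[String, String] = Map(
  "TWITTER_CONSUMER_KEY"        -> "twitter4j.oauth.consumerKey",
  "TWITTER_CONSUMER_SECRET"     -> "twitter4j.oauth.consumerSecret",
  "TWITTER_ACCESS_TOKEN"        -> "twitter4j.oauth.accessToken",
  "TWITTER_ACCESS_TOKEN_SECRET" -> "twitter4j.oauth.accessTokenSecret"
)

// Copies every credential found in `env` into the JVM system properties
// and returns the (sorted) names of the variables that were not set.
def loadTwitterCredentials(env: Map[String, String] = sys.env): Seq[String] = {
  val (present, absent) = envToProperty.keys.toSeq.partition(env.contains)
  present.foreach(name => System.setProperty(envToProperty(name), env(name)))
  absent.sorted
}

// Report missing variables before creating the stream.
val missingVars = loadTwitterCredentials()
if (missingVars.nonEmpty)
  println(s"Missing Twitter credentials: ${missingVars.mkString(", ")}")
```

Call it once on the driver before TwitterUtils.createStream(ssc, None), so that twitter4j can pick the values up from the system properties.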
For a complete example with Spark Streaming and Scala, see the following example:
I did the same and set the access credentials along with ssc, but it receives a global stream of tweets rather than the tweets from my timeline.
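That matches how TwitterUtils.createStream works as far as I know: without filters it connects to the public sample stream, so the credentials only authenticate the connection and do not select the home timeline. One possible workaround, offered as a sketch rather than a confirmed solution, is a custom Spark receiver that polls the twitter4j REST call getHomeTimeline. The class HomeTimelineReceiver and the 90 second poll interval are my own assumptions (the endpoint is rate-limited, so check the limit that applies to your account), and it assumes your Twitter application is allowed to call the home timeline endpoint.

```scala
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.receiver.Receiver
import twitter4j.{Paging, Status, TwitterFactory}
import twitter4j.conf.ConfigurationBuilder
import scala.collection.JavaConverters._

// Hypothetical receiver (not part of Spark or twitter4j): it polls the REST
// home timeline instead of opening a streaming connection.
class HomeTimelineReceiver(
    consumerKey: String,
    consumerSecret: String,
    accessToken: String,
    accessTokenSecret: String,
    pollIntervalMs: Long = 90000L // assumed value; keep it within your rate limit
) extends Receiver[Status](StorageLevel.MEMORY_AND_DISK_2) {

  // Start polling on a background thread so that onStart returns immediately.
  override def onStart(): Unit = {
    val thread = new Thread("home-timeline-poller") {
      override def run(): Unit = poll()
    }
    thread.setDaemon(true)
    thread.start()
  }

  // Nothing to release: the loop in poll() exits once isStopped() is true.
  override def onStop(): Unit = ()

  private def poll(): Unit = {
    // Build the client here, on the executor, from plain string credentials.
    val conf = new ConfigurationBuilder()
      .setOAuthConsumerKey(consumerKey)
      .setOAuthConsumerSecret(consumerSecret)
      .setOAuthAccessToken(accessToken)
      .setOAuthAccessTokenSecret(accessTokenSecret)
      .build()
    val twitter = new TwitterFactory(conf).getInstance()
    var lastId = 0L // highest status id stored so far
    try {
      while (!isStopped()) {
        // After the first poll, ask only for statuses newer than lastId.
        val paging = if (lastId > 0) new Paging().sinceId(lastId) else new Paging()
        val statuses = twitter.getHomeTimeline(paging).asScala
        statuses.foreach(status => store(status))
        if (statuses.nonEmpty) lastId = statuses.map(_.getId).max
        Thread.sleep(pollIntervalMs)
      }
    } catch {
      // Let Spark restart the receiver on any failure (network, rate limit, ...).
      case e: Exception => restart("Error polling the home timeline", e)
    }
  }
}

// Usage, assuming `ssc` and the four credential strings from the answer above:
// val timeline = ssc.receiverStream(
//   new HomeTimelineReceiver(consumerKey, consumerSecret, accessToken, accessTokenSecret))
// timeline.map(_.getText).print()
```

Because this polls instead of streaming, new tweets show up only once per poll interval, not once per batch interval.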