Python 从包含给定关键字的文本文件中筛选句子。行之间用倒逗号分隔_Python_Sentiment Analysis

Python 从包含给定关键字的文本文件中筛选句子。行之间用倒逗号分隔

python

Python 从包含给定关键字的文本文件中筛选句子。行之间用倒逗号分隔,python,sentiment-analysis,Python,Sentiment Analysis,我使用python中的tweepy访问推特上的数据，并将其存储在文本文件中。文本文件包含来自30个twitter句柄的tweet，并以逗号分隔。我现在只想选择那些包含关键字“@Tcs”的帖子，并将其存储在一个新的文本文件中 import tweepy import json import json import csv # Consumer keys and access tokens, used for OAuth CONSUMER_KEY = "86YITDk6xVLNZOizW

我使用

python

中的

tweepy

访问推特上的数据，并将其存储在文本文件中。文本文件包含来自30个twitter句柄的tweet，并以逗号分隔。我现在只想选择那些包含关键字“@Tcs”的帖子，并将其存储在一个新的文本文件中

import tweepy
import json
import json
import csv

# Consumer keys and access tokens, used for OAuth
CONSUMER_KEY = "86YITDk6xVLNZOizWf"
CONSUMER_SECRET = "exf2t2f7txKObmOlYqrHRFt820D4kflCGzCxOStNyKXf"
ACCESS_TOKEN = "578789959-shMwxAvEZc3JW9c3F9QzbBnjwneIqsRo88"
ACCESS_SECRET = "XGXhI80cX30P20RYjuedicbrcBoFKzfkM5ckW"

# OAuth process, using the keys and tokens
auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
auth.set_access_token(ACCESS_TOKEN, ACCESS_SECRET)
List_Screen=['@livemint', '@ReutersIndia', '@EconomicTimes', '@NDTVProfit', '    @forbes_india','@moneycontrolcom','@ETNOWlive', ' 
 @ETmarkets','@Investopedia','@BloombergTVInd']
# Creation of the actual interface, using authentication
api = tweepy.API(auth)
with open('data.txt', 'w') as outfile:
   for i in range(len(List_Screen)):
      for status in tweepy.Cursor(api.user_timeline, 
screen_name=List_Screen[i]).items(10):
         print status._json['text']
         data=status._json['text']
         json.dump(data, outfile)

下面是访问tweets并将其存储在文本文件中的代码

import tweepy
import json
import json
import csv

# Consumer keys and access tokens, used for OAuth
CONSUMER_KEY = "86YITDk6xVLNZOizWf"
CONSUMER_SECRET = "exf2t2f7txKObmOlYqrHRFt820D4kflCGzCxOStNyKXf"
ACCESS_TOKEN = "578789959-shMwxAvEZc3JW9c3F9QzbBnjwneIqsRo88"
ACCESS_SECRET = "XGXhI80cX30P20RYjuedicbrcBoFKzfkM5ckW"

# OAuth process, using the keys and tokens
auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
auth.set_access_token(ACCESS_TOKEN, ACCESS_SECRET)
List_Screen=['@livemint', '@ReutersIndia', '@EconomicTimes', '@NDTVProfit', '    @forbes_india','@moneycontrolcom','@ETNOWlive', ' 
 @ETmarkets','@Investopedia','@BloombergTVInd']
# Creation of the actual interface, using authentication
api = tweepy.API(auth)
with open('data.txt', 'w') as outfile:
   for i in range(len(List_Screen)):
      for status in tweepy.Cursor(api.user_timeline, 
screen_name=List_Screen[i]).items(10):
         print status._json['text']
         data=status._json['text']
         json.dump(data, outfile)

这是需要过滤的文本文件的屏幕截图。

您可以使用列表

考虑到每条tweet都是列表中的一个字符串：

tweet\u list=[tweet for tweet in data if'@Tcs'in tweet]

到目前为止，您在过滤@Tcs方面有何尝试？我是一名新的python程序员，因此无法理解如何编写上述代码。我是python新手，您能给我完整的代码来过滤所有包含关键字“@Tcs”的tweet吗。我还附上了输出文本文件的截图，请给出相应的代码。