Python 将JSON文件读入数据帧
我正在尝试读取一系列JSON文件并将其转换为Pandas DataFrame,但是,我所遵循的示例中没有一个适用于读取部分 这是我拥有的JSON文件的一个示例:Python 将JSON文件读入数据帧,python,json,pandas,Python,Json,Pandas,我正在尝试读取一系列JSON文件并将其转换为Pandas DataFrame,但是,我所遵循的示例中没有一个适用于读取部分 这是我拥有的JSON文件的一个示例: { "created_at": "Thu Nov 02 01:09:12 +0000 2017", "text": "RT @coindesk: SEC: Celebrity ICO Endorsements Could Be Illegal gHoWduXOBp t.co/iyWla0Ryuk", "tweet
{
"created_at": "Thu Nov 02 01:09:12 +0000 2017",
"text": "RT @coindesk: SEC: Celebrity ICO Endorsements Could Be Illegal gHoWduXOBp t.co/iyWla0Ryuk",
"tweet_id": 925892516087558145,
"user_id": 153962533,
"user_name": "Christine Duhaime"
}{
"created_at": "Thu Nov 02 01:09:44 +0000 2017",
"text": "Cornell Professor C t.co/RuNu6UQyr9",
"tweet_id": 925892650884108289,
"user_id": 1255045351,
"user_name": "Local SEO Somerset"
}
我试过:
with codecs.open('./output/streamer_20171022-2010.json', 'r+', encoding='utf-8') as data_file:
data = json.load(data_file)
这导致了
JSONDecodeError: Extra data: line 1 column 416 (char 415)
我还试着一行一行地读……没有成功
有什么想法吗?您的JSON文件格式无效。在有效的JSON中只能有一个顶级元素 尝试将顶级对象放置到阵列中
[
{ "created_at": "Thu Nov 02 01:09:12 +0000 2017",
"text": "RT @coindesk: SEC: Celebrity ICO Endorsements Could Be Illegal gHoWduXOBp t.co/iyWla0Ryuk",
"tweet_id": 925892516087558145,
"user_id": 153962533,
"user_name": "Christine Duhaime"
}, {
"created_at": "Thu Nov 02 01:09:44 +0000 2017",
"text": "Cornell Professor C t.co/RuNu6UQyr9",
"tweet_id": 925892650884108289,
"user_id": 1255045351,
"user_name": "Local SEO Somerset"
}
]
嗨,Caio,我在这里有一个不同。谢谢你的提示。干杯:快乐编码