Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/362.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 使用嵌套字典拆分字符串并转换为数据帧_Python_Pandas_Dictionary - Fatal编程技术网

Python 使用嵌套字典拆分字符串并转换为数据帧

Python 使用嵌套字典拆分字符串并转换为数据帧,python,pandas,dictionary,Python,Pandas,Dictionary,我对这些数据有这个问题 CSV的第一行 {'grade1': '47.614465', 'grade2': '-122.32174', 'grade3': '{"addr": "123 AV MOUNTIAN", "town": "HAMBOURG", "dep": GR", "code": ""}'} {'grade1': '47.61699416', 'grade2': '-122.320405', 'grade3': '{"addr": "5555 WALL STREET", "town"

我对这些数据有这个问题

CSV的第一行

{'grade1': '47.614465', 'grade2': '-122.32174', 'grade3': '{"addr": "123 AV MOUNTIAN", "town": "HAMBOURG", "dep": GR", "code": ""}'}
{'grade1': '47.61699416', 'grade2': '-122.320405', 'grade3': '{"addr": "5555 WALL STREET", "town": "NY", "dep": "NY", "code": "98122"}'}
{'grade1': '47.61676902', 'grade2': '-122.3215492', 'grade3': '{"addr": "6776  SPAU - 65 ", "town": "GHAN", "dep": "IU", "code": "122"}'}
导入csv文件后,我得到以下数据帧:

Grade
0   {'grade1': '47.614465', 'grade2': '-122.32174', 'grade3': '{"addr": "123 AV MOUNTIAN", "town": "HAMBOURG", "dep": GR", "code": ""}'}
1   {'grade1': '47.61699416', 'grade2': '-122.320405', 'grade3': '{"addr": "5555 WALL STREET", "town": "NY", "dep": "NY", "code": "98122"}'}
2   {'grade1': '47.61676902', 'grade2': '-122.3215492', 'grade3': '{"addr": "6776 SPAU - 65 ", "town": "GHAN", "dep": "IU", "code": "122"}'}
只有一列,数据类型为对象

我需要把它转换成一个数据帧并得到这个输出

  grade1        grade2         addr              town         dep       code
47.614465     -122.32174    123 AV MOUNTIAN    HAMBOURG       GR            
47.61699416   -122.320405   5555 WALL STREET      NY          NY        98122 
我尝试了以下代码:

dic_loc=[]
#adress=[]
cordinates=[]
address=[]
for key, value in df['grade'][:3].items():  
    print (key,value, type(value), pd.Series(value), type(pd.Series(value)))
    dic_loc.append(value)  ### I get a string
结果是:

{'grade1': '47.614465', 'grade2': '-122.32174', 'grade3': '{"addr": "123 AV MOUNTIAN", "town": "HAMBOURG", "dep": GR", "code": ""}'} <class 'str'> 0    
{'grade 1':'47.614465','grade 2':'-122.32174','grade 3':{'addr:'123 AV MOUNTIAN','town:'HAMBOURG','dep:'GR','code:'.}0
问题是如何迭代该字符串并将其转换为数据帧

欢迎提出任何意见
非常感谢您的帮助

我认为可以通过以下方式解决:

import json, pandas

def fix_line(line):
    # first convert the string to proper JSON
    json_string = line.replace("'",'"').replace('"{', '{').replace('}"', '}')
    # convert JSON to dict
    d = json.loads(json_string)
    # convert dict to a tuple 
    return (float(d['grade1']), float(d['grade2']), d['grade3']['addr'],
        d['grade3']['town'], d['grade3']['dep'], d['grade3']['code'])

# create a dataframe from a list of tuples
df = pandas.DataFrame.from_records([fix_line(line) for line in df['Grade']], 
     columns=['grade1', 'grade2', 'addr', 'town', 'dep', 'code'])

print(df)

您的csv最初是json文件吗?使用它来导入会更容易。您应该确定如何导入数据。csv不是json文件您的文件不是csv。它是json字符串的一个序列,每行一个。嗯,我有一部分是对的-这既不是有效的csv也不是有效的json。干得好。我在这个链接上也发现了类似的东西,希望不是他的建议可能会有所帮助。