用python格式化数据
嗨,伙计们,我有这个文件,我需要清理它,不知道从哪里开始用python格式化数据,python,formatting,Python,Formatting,嗨,伙计们,我有这个文件,我需要清理它,不知道从哪里开始 ['GB', 'LINCOLN'] ['GB;ENG;Lincoln\n'] 3396 (['IT', 'SEGRATE'], "IT~25~Segrate'\n", 3397) (['IT', 'SEGRATE'], "IT~25~Segrate'\n", 3398) (['MX', 'IZTAPALAPA', 'CIUDAD DE MEXICO'], "MX~CMX~Iztapalap
['GB', 'LINCOLN'] ['GB;ENG;Lincoln\n'] 3396
(['IT', 'SEGRATE'], "IT~25~Segrate'\n", 3397)
(['IT', 'SEGRATE'], "IT~25~Segrate'\n", 3398)
(['MX', 'IZTAPALAPA', 'CIUDAD DE MEXICO'], "MX~CMX~Iztapalapa'\n", 3399)
(['US', 'VA', 'CHESAPEAKE'], "US~VA~Chesapeake'\n", 3400)
(['US', 'GA', 'UNION CITY'], "US~GA~Union City'\n", 3401)
['GB', 'HULL'] ['GB;ENG;Sutton', 'on', 'Hull\n'] 3402
理想情况下,我希望输出的是一个csv文件
GB,LINCOLN,GB,ENG,Lincoln
MX,IZTAPALAPA,CIUDAD DE MEXICO,MX,CMX,Iztapalapa
此文件被视为极不一致的数据,因此我不确定如何执行此操作
我不是程序员,所以我在这里有些挣扎。您可以使用以下代码来完成这项工作:
with open("filetoopen.txt","r") as f:
lines = f.readlines()
no = ['[',']','(',')','\'','\"',' ']
com = [';','~']
li = []
for i in range(len(lines)):
for j in range(len(lines[i])):
if lines[i][j] == '\\':
break
elif lines[i][j] in com:
li.append(',')
elif lines[i][j] not in no and lines[i][j] not in com:
li.append(lines[i][j])
li.append("\n")
li.pop()
ans = ''.join(li)
with open("mycsv.csv", "w") as g:
g.write(ans)
f.close()
g.close()