Python pd.read\u csv解析错误
我有一个CSV文件,如下所示:Python pd.read\u csv解析错误,python,pandas,csv,Python,Pandas,Csv,我有一个CSV文件,如下所示: Date,Time,Mood,Tags,Medications,Notes "Jul 25, 2018",9:41 PM,8,,,"", "Jul 26, 2018",10:05 AM,4,,,"", "Jul 26, 2018",12:00 PM,3,,,"", "Jul 26, 2018",7:00 PM,8,,,"", "Jul 27, 2018",12:01 PM,8,,,"", 我运行以下代码: import pandas as pd df = pd
Date,Time,Mood,Tags,Medications,Notes
"Jul 25, 2018",9:41 PM,8,,,"",
"Jul 26, 2018",10:05 AM,4,,,"",
"Jul 26, 2018",12:00 PM,3,,,"",
"Jul 26, 2018",7:00 PM,8,,,"",
"Jul 27, 2018",12:01 PM,8,,,"",
我运行以下代码:
import pandas as pd
df = pd.read_csv("./data/MoodLog_2018_09_14.csv",
dtype={'Date': str, 'Time': str, 'Mood': str, 'Tags': str,
'Medications': str, 'Notes': str})
print(df['Time'].head(5))
并打印以下内容:
Jul 25, 2018 8
Jul 26, 2018 4
Jul 26, 2018 3
Jul 26, 2018 8
Jul 27, 2018 8
Name: Time, dtype: object
它在时间
列中包含情绪
列
为什么会这样?问题在于您的行后面有一个
,
,而标题没有。将标题更改为:
日期、时间、情绪、标签、药物、注释,
,您将获得一个额外的列,然后可以删除该列
输入:test.csv
代码:
输出:df
嗯,不知道为什么会这样?我添加了我使用的.csv,因为我没有注意到您更改了csv。
Date,Time,Mood,Tags,Medications,Notes,
"Jul 25, 2018",9:41 PM,8,,,"",
"Jul 26, 2018",10:05 AM,4,,,"",
"Jul 26, 2018",12:00 PM,3,,,"",
"Jul 26, 2018",7:00 PM,8,,,"",
"Jul 27, 2018",12:01 PM,8,,,"",
df = pd.read_csv("test.csv",
dtype={'Date': str, 'Time': str, 'Mood': str, 'Tags': str,
'Medications': str, 'Notes': str}).iloc[:, :-1]
Date Time Mood Tags Medications Notes
0 Jul 25, 2018 9:41 PM 8 NaN NaN NaN
1 Jul 26, 2018 10:05 AM 4 NaN NaN NaN
2 Jul 26, 2018 12:00 PM 3 NaN NaN NaN
3 Jul 26, 2018 7:00 PM 8 NaN NaN NaN
4 Jul 27, 2018 12:01 PM 8 NaN NaN NaN