Python 使用Pandas从CSV中提取列
我试图从csv文件中提取起始站,示例数据如下Python 使用Pandas从CSV中提取列,python,pandas,csv,Python,Pandas,Csv,我试图从csv文件中提取起始站,示例数据如下 Start Time,End Time,Trip Duration,Start Station,End Station,User Type,Gender,Birth Year 1423854,2017-06-23 15:09:32,2017-06-23 15:14:53,321,Wood St & Hubbard St,Damen Ave & Chicago Ave,Subscriber,Male,1992.0 我遇到的问题是,当
Start Time,End Time,Trip Duration,Start Station,End Station,User Type,Gender,Birth Year
1423854,2017-06-23 15:09:32,2017-06-23 15:14:53,321,Wood St & Hubbard St,Damen Ave & Chicago Ave,Subscriber,Male,1992.0
我遇到的问题是,当我尝试提取数据时,我收到以下错误消息:
AttributeError:“Series”对象没有属性“start”
我相信我的问题源于转换起始站,但似乎不知道为什么
# convert the Start Station column to dataframe
df['Start Station'] = pd.DataFrame(df['Start Station'])
# extract street names from Start Station and End Station to create new columns
df['start'] = df['Start Station'].start
def station_stats(df):
"""Displays statistics on the most popular stations and trip."""
# TO DO: display most commonly used start station
popular_start_station = df['start']
print(popular_start_station)
你的代码很混乱。试试这个:
df = pd.read_csv(CITY_DATA, index = True) # load data file into a one df
start_data_series = df[['Start Station']] # create series with column of interest
您可以根据自己的喜好在第二行中添加更多列。要进一步阅读,请参阅post。我认为您是将数据帧分配给列,而不是系列
df = pd.read_csv(CITY_DATA, index = True) # load data file into a one df
start_data_series = df[['Start Station']] # create series with column of interest