Python 查找从startdate到enddate之间的日期 < P> >我想将它们分组,即代码< PARISFAN < /COD>, LoSangeSununtEdt/,然后 DpTcTyTyDPTWORD (相同),然后考虑日期(即 DATEAR 和 DateDpt < /代码>)。
比如说Python 查找从startdate到enddate之间的日期 < P> >我想将它们分组,即代码< PARISFAN < /COD>, LoSangeSununtEdt/,然后 DpTcTyTyDPTWORD (相同),然后考虑日期(即 DATEAR 和 DateDpt < /代码>)。,python,csv,pandas,Python,Csv,Pandas,比如说 ParisFrance[它应该列出ID,DateDpt,DateAr,列出所有与ParisFrance相关的内容,而不必重复写入ParisFrance,但可以列出与之相关的内容] LosAngelesUnitedState[它应该列出ID,DateDpt,DateAr,用于所有与LosAngelesUnitedState相关的内容,而无需重复写入LosAngelesUnitedState,但可以列出与之相关的内容] ID ArCityArCountry DptCit
ParisFrance
[它应该列出ID
,DateDpt
,DateAr
,列出所有与ParisFrance
相关的内容,而不必重复写入ParisFrance
,但可以列出与之相关的内容]
LosAngelesUnitedState
[它应该列出ID
,DateDpt
,DateAr
,用于所有与LosAngelesUnitedState
相关的内容,而无需重复写入LosAngelesUnitedState
,但可以列出与之相关的内容]
ID ArCityArCountry DptCityDptCountry DateDpt DateAr
1922 ParisFrance NewYorkUnitedState 2008-03-10 2001-02-02
1002 LosAngelesUnitedState California UnitedState 2008-03-10 2008-12-01
1901 ParisFrance LagosNigeria 2001-03-05 2001-02-02
1922 ParisFrance NewYorkUnitedState 2011-02-03 2008-12-01
1002 ParisFrance CaliforniaUnitedState 2003-03-04 2002-03-04
1099 ParisFrance BeijingChina 2011-02-03 2009-02-04
1901 LosAngelesUnitedState ParisFrance 2001-03-05 2001-02-02
听起来您正在寻找以下内容:
ParisFrance
[1922, NewYorkUnitedState, 2008-03-10, 2001-02-02], [1901,LagosNigeria, 2001-03-05 2001-02-02], [1922,NewYorkUnitedState,2011-02-03, 2008-12-01]
LosAngelesUnitedState
[1901,ParisFrance,2001-03-05, 2001-02-02]
这使您接近您指定的格式-当然可以进一步调整print()
您可以发布一个所需的输出来澄清您的问题吗?我已经输入了所需的输出-fabioThanks,但是日期格式不是相同的numpy.datetime64('2008-03-09T20:00:00.000000000-0400'),numpy.datetime64('2001-02-01T19:00:00.000000000-0500')。这是我想要的格式2001-03-05,2001-02-02
ParisFrance
[1922, NewYorkUnitedState, 2008-03-10, 2001-02-02], [1901,LagosNigeria, 2001-03-05 2001-02-02], [1922,NewYorkUnitedState,2011-02-03, 2008-12-01]
LosAngelesUnitedState
[1901,ParisFrance,2001-03-05, 2001-02-02]
df['DateAr'] = pd.to_datetime(df['DateAr'])
df['DateDpt'] = pd.to_datetime(df['DateDpt'])
dept_cities = df.groupby('ArCityArCountry')
for city, departures in dept_cities:
print(city)
print([list(r) for r in departures.loc[:, ['ID', 'DptCityDptCountry', 'DateDpt', 'DateAr']].to_records()])
LosAngelesUnitedState
[[1, 1002, 'California UnitedState', numpy.datetime64('2008-03-09T18:00:00.000000000-0600'), numpy.datetime64('2008-11-30T18:00:00.000000000-0600')], [6, 1901, 'ParisFrance', numpy.datetime64('2001-03-04T18:00:00.000000000-0600'), numpy.datetime64('2001-02-01T18:00:00.000000000-0600')]]
ParisFrance
[[0, 1922, 'NewYorkUnitedState', numpy.datetime64('2008-03-09T18:00:00.000000000-0600'), numpy.datetime64('2001-02-01T18:00:00.000000000-0600')], [2, 1901, 'LagosNigeria', numpy.datetime64('2001-03-04T18:00:00.000000000-0600'), numpy.datetime64('2001-02-01T18:00:00.000000000-0600')], [3, 1922, 'NewYorkUnitedState', numpy.datetime64('2011-02-02T18:00:00.000000000-0600'), numpy.datetime64('2008-11-30T18:00:00.000000000-0600')], [4, 1002, 'CaliforniaUnitedState', numpy.datetime64('2003-03-03T18:00:00.000000000-0600'), numpy.datetime64('2002-03-03T18:00:00.000000000-0600')], [5, 1099, 'BeijingChina', numpy.datetime64('2011-02-02T18:00:00.000000000-0600'), numpy.datetime64('2009-02-03T18:00:00.000000000-0600')]]