
Multiple operations on a DataFrame in Python

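I am trying to group by one or two columns, then find the sum of the fourth column and the median of the fifth column. Each operation has to be written to a separate output. It felt a bit tricky to get started.

Input: no header row, more than 100k rows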

StartTime, EndTime,Day,SumCount,UniqueCount
00:00:00,01:00:00,Mon,13534,594
01:00:00,02:00:00,Mon,16674,626
02:00:00,03:00:00,Mon,23736,671
03:00:00,04:00:00,Mon,16977,671
00:00:00,01:00:00,Tue,17262,747
01:00:00,02:00:00,Tue,19072,777
02:00:00,03:00:00,Tue,18275,785
03:00:00,04:00:00,Tue,13589,757
04:00:00,05:00:00,Tue,16053,735
05:00:00,06:00:00,Tue,11440,636
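
Because the file has no header row, the column names have to be supplied when the data is loaded. A minimal sketch, assuming pandas and a comma-separated input; the in-memory `SAMPLE` text stands in for the real file, and `COLUMNS` and `load_counts` are names introduced here for illustration:

```python
import io

import pandas as pd

# Column names taken from the question; the file itself is assumed to have no header row.
COLUMNS = ["StartTime", "EndTime", "Day", "SumCount", "UniqueCount"]

# A few rows from the question stand in for the real file (hypothetical sample).
SAMPLE = """00:00:00,01:00:00,Mon,13534,594
01:00:00,02:00:00,Mon,16674,626
00:00:00,01:00:00,Tue,17262,747
01:00:00,02:00:00,Tue,19072,777
"""


def load_counts(source):
    """Read the headerless CSV and attach the column names."""
    return pd.read_csv(source, header=None, names=COLUMNS)


# Pass a file path instead of the StringIO object to read the real input.
df = load_counts(io.StringIO(SAMPLE))
print(df.dtypes)
```
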
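What I want to find is:

  • Group by StartTime and EndTime: find the sum of SumCount and the median of UniqueCount
  • Group by Day: find the sum of SumCount and the median of UniqueCount

I do not know how to put these two kinds of groupby into one program and get two different outputs.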

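Try:

    import numpy as np
    # Nested dicts inside agg() are no longer supported since pandas 1.0, so pass the functions directly.
    df.groupby(['StartTime', 'EndTime']).agg({'SumCount': 'sum',
                                              'UniqueCount': lambda x: np.median(x).round(0)})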
    

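Or, grouping by Day:

    df.groupby(['Day']).agg({'SumCount': 'sum',
                             'UniqueCount': lambda x: np.median(x).round(0)})

Thanks! But the median cannot have a float value. How do I handle that?

SitzBlogz: Thanks a lot. If I run into any errors, I will write back. Thanks again.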
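
To get both results from one program, the two groupby calls can be run one after the other on the same DataFrame, each producing its own output. A minimal sketch, assuming pandas 0.25 or later (for named aggregation); the helper `summarize` and the output column name `UniqueCountMedian` are hypothetical, and the rows are the sample from the question. The median is rounded and cast to int so that it is not a float.

```python
import pandas as pd

# Sample rows copied from the question; the real input has more than 100k rows.
rows = [
    ("00:00:00", "01:00:00", "Mon", 13534, 594),
    ("01:00:00", "02:00:00", "Mon", 16674, 626),
    ("02:00:00", "03:00:00", "Mon", 23736, 671),
    ("03:00:00", "04:00:00", "Mon", 16977, 671),
    ("00:00:00", "01:00:00", "Tue", 17262, 747),
    ("01:00:00", "02:00:00", "Tue", 19072, 777),
    ("02:00:00", "03:00:00", "Tue", 18275, 785),
    ("03:00:00", "04:00:00", "Tue", 13589, 757),
    ("04:00:00", "05:00:00", "Tue", 16053, 735),
    ("05:00:00", "06:00:00", "Tue", 11440, 636),
]
df = pd.DataFrame(
    rows, columns=["StartTime", "EndTime", "Day", "SumCount", "UniqueCount"]
)


def summarize(frame, keys):
    """Sum SumCount and take the rounded integer median of UniqueCount per group."""
    out = frame.groupby(keys).agg(
        SumCount=("SumCount", "sum"),
        UniqueCountMedian=("UniqueCount", "median"),
    )
    # median() yields floats; round (halves go to the even neighbour) and cast to int.
    out["UniqueCountMedian"] = out["UniqueCountMedian"].round().astype(int)
    return out.reset_index()


by_hour = summarize(df, ["StartTime", "EndTime"])
by_day = summarize(df, ["Day"])

# to_csv() without a path returns the CSV text; pass a file name to write to disk instead.
print(by_hour.to_csv(index=False))
print(by_day.to_csv(index=False))
```

Passing a different file name to each `to_csv` call would write the two results to two separate files.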