Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/299.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 为数字列和文本列重新编制索引_Python_Python 3.x_Pandas_Python 2.7_Numpy - Fatal编程技术网

Python 为数字列和文本列重新编制索引

Python 为数字列和文本列重新编制索引,python,python-3.x,pandas,python-2.7,numpy,Python,Python 3.x,Pandas,Python 2.7,Numpy,我在python中,我有一个数据框,它包含这样的数字列和文本列 subject_id hour_measure urinecolor blood pressure 1 1.00 red 40 1 1.15 red high 2 2.00 yellow low

我在python中,我有一个数据框,它包含这样的数字列和文本列

subject_id   hour_measure urinecolor blood pressure  
    1          1.00        red             40           
    1          1.15        red           high          
    2          2.00     yellow            low          
    2          3.00     yellow             20   
subject_id  hour_measure urinecolor blood pressure  
    1          1.00        red             40           
    1          2.00     yellow            low            
    1          3.00     yellow             20           
    1          4.00     yellow             20
我想根据hour measure从1到6对数据集重新编制索引,这样对于数字列,但以小时为单位的平均值,对于文本列,使用最频繁的值,最后是这样

subject_id   hour_measure urinecolor blood pressure  
    1          1.00        red             40           
    1          1.15        red           high          
    2          2.00     yellow            low          
    2          3.00     yellow             20   
subject_id  hour_measure urinecolor blood pressure  
    1          1.00        red             40           
    1          2.00     yellow            low            
    1          3.00     yellow             20           
    1          4.00     yellow             20
以下代码可以正确处理数字列,但对于文本列,它将被删除

df2= pd.read_csv('file path')
mux = pd.MultiIndex.from_product([df['subject_id'].unique(), np.arange(1,24)],
                                  names=['subject_id','hour_measure'])
df = df.groupby(['subject_id','hour_measure']).mean().reindex(mux).reset_index()

如何更新此代码以重新编制索引而不删除文本列

似乎缺少列
主题id
如何工作解决方案?是的,更新它以包括主题id似乎缺少列
主题id
如何工作解决方案?是的,更新它以包括主题id