Python 如何从数据帧中的分类变量中找到定量变量的平均值？_Python_Pandas

Python 如何从数据帧中的分类变量中找到定量变量的平均值？

python pandas

Python 如何从数据帧中的分类变量中找到定量变量的平均值？,python,pandas,Python,Pandas,假设我有以下熊猫数据框： data = {'State':['CA', 'CA', 'CA', 'CA', 'NY', 'NY', 'TX'], 'Cost':[20, 30, 40, 50, 60, 70, 70]} test = pd.DataFrame(data) print(test.head(7)) 下面是哪一个 State Cost 0 CA 20 1 CA 30 2 CA 40 3 CA 50 4 NY

假设我有以下熊猫数据框：

data = {'State':['CA', 'CA', 'CA', 'CA', 'NY', 'NY', 'TX'],
        'Cost':[20, 30, 40, 50, 60, 70, 70]}
test = pd.DataFrame(data)
print(test.head(7))

下面是哪一个

  State  Cost
0    CA    20
1    CA    30
2    CA    40
3    CA    50
4    NY    60
5    NY    70
6    TX    70

在这种情况下，加利福尼亚州的平均成本为35，纽约州的平均成本为65，德克萨斯州的平均成本为70

我的问题是：在pandas中，我们可以通过什么查询找到给定州的平均成本？

使用

groupby

和

mean

：

print(test.groupby('State').mean())

哪些产出：

       Cost
State      
CA       35
NY       65
TX       70

如果您想要一个更干净的

数据帧

：

print(test.groupby('State', as_index=False).mean())

其中：

  State  Cost
0    CA    35
1    NY    65
2    TX    70