Pandas 如何在传递到Plotly函数之前重塑数据帧?
我正在尝试使用Pandas 如何在传递到Plotly函数之前重塑数据帧?,pandas,plotly,plotly-dash,plotly-python,Pandas,Plotly,Plotly Dash,Plotly Python,我正在尝试使用绘图中的table()函数创建一个数据表 我的数据如下: import pandas as pd test_df = pd.DataFrame({'Manufacturer':['Mercedes', 'Buick', 'Ford', 'Buick', 'Buick', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Ford', 'Ford', 'Chrysler', 'Ch
绘图中的table()
函数创建一个数据表
我的数据如下:
import pandas as pd
test_df = pd.DataFrame({'Manufacturer':['Mercedes', 'Buick', 'Ford', 'Buick', 'Buick', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Ford', 'Ford', 'Chrysler', 'Chrysler', 'Ford', 'Chrysler', 'Chrysler', 'Chrysler', 'Buick'],
'Metric':['MPG', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score'],
'Statistic':['External', 'Min', 'Max', 'Average', 'Median', '90th', '95th', '99th', 'Min', 'Max', 'Average', 'Median', '90th', '95th', '99th','Min', 'Max', 'Average', 'Median', '90th', '95th', '99th'],
'Value':[22, 3.405, 100.29, 4.62, 4.425, 5.34, 5.83, 7.75, 2.6323, 210, 4.193, 3.28, 5.04, 6.36, 11.01, 3.72, 43, 4.98, 4.82, 5.775, 6.18, 7.182],
})
Manufacturer Min Max Average Median 90th 95th 99th
Buick 3.405 210 4.62 4.425 5.04 5.83 7.182
Chrysler 3.72 43 4.193 4.82 5.775 6.18 7.75
Ford 2.6323 100.29 4.98 3.28 5.34 6.36 11.01
# THIS IS THE EXAMPLE FROM THE DOCS (SHOWING THE USE OF A DATA FRAME)
fig = go.Figure(data=[go.Table(
header=dict(values=list(df.columns),
fill_color='paleturquoise',
align='left'),
cells=dict(values=[df.Rank, df.State, df.Postal, df.Population],
fill_color='lavender',
align='left'))
])
我希望能够创建如下所示的表:
import pandas as pd
test_df = pd.DataFrame({'Manufacturer':['Mercedes', 'Buick', 'Ford', 'Buick', 'Buick', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Ford', 'Ford', 'Chrysler', 'Chrysler', 'Ford', 'Chrysler', 'Chrysler', 'Chrysler', 'Buick'],
'Metric':['MPG', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score'],
'Statistic':['External', 'Min', 'Max', 'Average', 'Median', '90th', '95th', '99th', 'Min', 'Max', 'Average', 'Median', '90th', '95th', '99th','Min', 'Max', 'Average', 'Median', '90th', '95th', '99th'],
'Value':[22, 3.405, 100.29, 4.62, 4.425, 5.34, 5.83, 7.75, 2.6323, 210, 4.193, 3.28, 5.04, 6.36, 11.01, 3.72, 43, 4.98, 4.82, 5.775, 6.18, 7.182],
})
Manufacturer Min Max Average Median 90th 95th 99th
Buick 3.405 210 4.62 4.425 5.04 5.83 7.182
Chrysler 3.72 43 4.193 4.82 5.775 6.18 7.75
Ford 2.6323 100.29 4.98 3.28 5.34 6.36 11.01
# THIS IS THE EXAMPLE FROM THE DOCS (SHOWING THE USE OF A DATA FRAME)
fig = go.Figure(data=[go.Table(
header=dict(values=list(df.columns),
fill_color='paleturquoise',
align='left'),
cells=dict(values=[df.Rank, df.State, df.Postal, df.Population],
fill_color='lavender',
align='left'))
])
执行此操作的代码如下所示(硬编码时):
根据位于的文档,cells
参数需要一个列表,可以获取一个数据帧(太好了!)
使用文档中的示例,传递熊猫数据帧的代码如下所示:
import pandas as pd
test_df = pd.DataFrame({'Manufacturer':['Mercedes', 'Buick', 'Ford', 'Buick', 'Buick', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Chrysler', 'Ford', 'Buick', 'Ford', 'Ford', 'Chrysler', 'Chrysler', 'Ford', 'Chrysler', 'Chrysler', 'Chrysler', 'Buick'],
'Metric':['MPG', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score', 'Score'],
'Statistic':['External', 'Min', 'Max', 'Average', 'Median', '90th', '95th', '99th', 'Min', 'Max', 'Average', 'Median', '90th', '95th', '99th','Min', 'Max', 'Average', 'Median', '90th', '95th', '99th'],
'Value':[22, 3.405, 100.29, 4.62, 4.425, 5.34, 5.83, 7.75, 2.6323, 210, 4.193, 3.28, 5.04, 6.36, 11.01, 3.72, 43, 4.98, 4.82, 5.775, 6.18, 7.182],
})
Manufacturer Min Max Average Median 90th 95th 99th
Buick 3.405 210 4.62 4.425 5.04 5.83 7.182
Chrysler 3.72 43 4.193 4.82 5.775 6.18 7.75
Ford 2.6323 100.29 4.98 3.28 5.34 6.36 11.01
# THIS IS THE EXAMPLE FROM THE DOCS (SHOWING THE USE OF A DATA FRAME)
fig = go.Figure(data=[go.Table(
header=dict(values=list(df.columns),
fill_color='paleturquoise',
align='left'),
cells=dict(values=[df.Rank, df.State, df.Postal, df.Population],
fill_color='lavender',
align='left'))
])
我最勇敢的尝试失败了:
仅按“分数”记录筛选:
test_df_subset = test_df[(test_df['Metric'] == 'Score') & (test_df['Manufacturer'].isin(['Buick', 'Ford', 'Chrysler']))]
创建数据透视表:
temp_df = pd.pivot_table(data=test_df_subset,index=['Statistic', 'Manufacturer'])
temp_df.unstack(0)
取消堆叠数据透视表:
temp_df = pd.pivot_table(data=test_df_subset,index=['Statistic', 'Manufacturer'])
temp_df.unstack(0)
问题:我将如何重塑我的test_df
数据帧,以便能够将其传递给data
和cells
函数中的参数?
提前谢谢 你们很接近,这里有一个方法
import plotly.graph_objects as go
cols_ = ["Manufacturer", "Min", "Max",
"Average", "Median", "90th",
"95th", "99th"]
manufacturers = ['Buick', 'Ford', 'Chrysler']
#this is what you are looking for
df_ = (test_df[test_df['Manufacturer'].isin(manufacturers)]
.set_index(['Manufacturer', 'Statistic'])
['Value'].unstack()
.reset_index()[cols_]
)
go.Figure(go.Table(
header=dict(
values=cols_,
font=dict(size=10),
align="left"
),
cells=dict(
values=df_.T, # note the T here
align = "left")
))
与您的方法相比,我认为df
(在我的符号中)相当于temp\u df.unstack(0)['Value'].reset\u index()[cols\uz]
使用您的符号,并使用cols
按预期对它们进行排序您是否尝试过用df.columns.to\u list()替换标题中的值
参数和单元格中的值
参数
以及df.values
?谢谢,@Ben.T。您的解决方案在测试和生产数据上运行良好。