Python 运行func(df)创建新的数据帧并重命名它们
我可以保持df10和df20的名称相同,并在运行func(df)后分别调用它们,甚至可以重命名它们吗Python 运行func(df)创建新的数据帧并重命名它们,python,pandas,Python,Pandas,我可以保持df10和df20的名称相同,并在运行func(df)后分别调用它们,甚至可以重命名它们吗 df = pd.DataFrame( { 'A': ['d','d','d','d','d','d','g','g','g','g','g','g','k','k','k','k','k','k'], 'B': [5,5,6,4,5,6,-6,7,7,6,-7,7,-8,7,-6,6,-7,50], 'C': [1,1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2
df = pd.DataFrame( {
'A': ['d','d','d','d','d','d','g','g','g','g','g','g','k','k','k','k','k','k'],
'B': [5,5,6,4,5,6,-6,7,7,6,-7,7,-8,7,-6,6,-7,50],
'C': [1,1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2],
'S': [2012,2013,2014,2015,2016,2012,2012,2014,2015,2016,2012,2013,2012,2013,2014,2015,2016,2014]
} );
df10 = (df.B + df.C).groupby([df.A, df.S]).agg(['sum','size']).unstack(fill_value=0)
df20 = (df['B'] - df['C']).groupby([df.A, df.S]).agg(['sum','size']).unstack(fill_value=0)
def func(df):
df1 = df.groupby(level=0, axis=1).sum()
new_cols= list(zip(df1.columns.get_level_values(0),['total'] * len(df.columns)))
df1.columns = pd.MultiIndex.from_tuples(new_cols)
df2 = pd.concat([df1,df], axis=1).sort_index(axis=1).sort_index(axis=1, level=1)
df2.columns = ['_'.join((col[0], str(col[1]))) for col in df2.columns]
df2.columns = df2.columns.str.replace('sum_','')
df2.columns = df2.columns.str.replace('size_','T')
return df2
dfs = []
for df in [df10, df20]:
dfs.append(func(df))
dfs
您可以使用
dict
进行存储,并使用list创建存储在dfs
中的DataFrames
的新名称:
names = ['a','b']
dfs = {names[i]:func(df) for i,df in enumerate([df10, df20])}
print (dfs)
{'a': T2012 2012 T2013 2013 T2014 2014 T2015 2015 T2016 2016 Ttotal \
A
d 2 13 1 6 1 7 1 5 1 6 6
g 2 -11 1 8 1 8 1 8 1 7 6
k 1 -6 1 9 2 48 1 8 1 -5 6
total
A
d 37
g 20
k 54 , 'b': T2012 2012 T2013 2013 T2014 2014 T2015 2015 T2016 2016 Ttotal \
A
d 2 9 1 4 1 5 1 3 1 4 6
g 2 -15 1 6 1 6 1 6 1 5 6
k 1 -10 1 5 2 40 1 4 1 -9 6
total
A
d 25
g 8
k 30 }
但如果需要相同名称的
DataFrames
,则可以将函数func
的输出分配给相同的变量:
df10 = func(df10)
df20 = func(df20)
print (df10)
T2012 2012 T2013 2013 T2014 2014 T2015 2015 T2016 2016 Ttotal \
A
d 2 13 1 6 1 7 1 5 1 6 6
g 2 -11 1 8 1 8 1 8 1 7 6
k 1 -6 1 9 2 48 1 8 1 -5 6
total
A
d 37
g 20
k 54
print (df20)
T2012 2012 T2013 2013 T2014 2014 T2015 2015 T2016 2016 Ttotal \
A
d 2 9 1 4 1 5 1 3 1 4 6
g 2 -15 1 6 1 6 1 6 1 5 6
k 1 -10 1 5 2 40 1 4 1 -9 6
total
A
d 25
g 8
k 30
您还可以为新变量赋值:
dfa = func(df10)
dfb = func(df20)
太好了,谢谢。它们是重命名后的常规数据帧,对吗?是的,没错。正常数据帧。
df10 = func(df10)
df20 = func(df20)
print (df10)
T2012 2012 T2013 2013 T2014 2014 T2015 2015 T2016 2016 Ttotal \
A
d 2 13 1 6 1 7 1 5 1 6 6
g 2 -11 1 8 1 8 1 8 1 7 6
k 1 -6 1 9 2 48 1 8 1 -5 6
total
A
d 37
g 20
k 54
print (df20)
T2012 2012 T2013 2013 T2014 2014 T2015 2015 T2016 2016 Ttotal \
A
d 2 9 1 4 1 5 1 3 1 4 6
g 2 -15 1 6 1 6 1 6 1 5 6
k 1 -10 1 5 2 40 1 4 1 -9 6
total
A
d 25
g 8
k 30
dfa = func(df10)
dfb = func(df20)