Ipython [sklearn decomposition.PCA]我的变量应该包含哪一列?
这是我的部分数据:Ipython [sklearn decomposition.PCA]我的变量应该包含哪一列?,ipython,pca,decomposition,Ipython,Pca,Decomposition,这是我的部分数据: yahoo msft goog close open adj 105.349998 54.799999 741.840027 105.349998 102.610001 103.057063 102.709999 55.049999 742.580017 102.709999 105.750000 100.474523 100.699997 54.049999 743.619995
yahoo msft goog close open adj
105.349998 54.799999 741.840027 105.349998 102.610001 103.057063
102.709999 55.049999 742.580017 102.709999 105.750000 100.474523
100.699997 54.049999 743.619995 100.699997 100.559998 98.508268
96.449997 52.169998 726.390015 96.449997 98.680000 94.350769
我将它们转换为数据帧
变量“data4”包含我的这部分数据
这是我进行主成分分析的代码:
from sklearn import decomposition
import pandas as pd
pca1=decomposition.PCA(n_components=3)
data5=pca1.fit_transform(data4)
data5=pd.DataFrame(data5)
我在ipython笔记本中输出数据5:
0 1 2
-2.185894 -0.503732 -0.035038
-1.986696 1.970558 0.270880
-0.785015 7.550989 0.682496
18.013318 7.836848 1.146809
28.745800 2.697891 0.101636
这是变量“data5”的数据部分。
我想知道data5列的平均值,例如data5[0]的原始数据是data4['yahoo']