Python 属性错误:';numpy.ndarray和#x27;对象没有属性';列';
我试图创建一个函数来删除彼此高度相关的功能。但是,我得到的错误是,Python 属性错误:';numpy.ndarray和#x27;对象没有属性';列';,python,numpy,pandas,Python,Numpy,Pandas,我试图创建一个函数来删除彼此高度相关的功能。但是,我得到的错误是,“AttributeError:'numpy.ndarray'对象没有属性“columns”… 我只想打电话给熊猫队,看看他们的号码。下一步我能做什么 import pandas as pd import numpy as np def remove_features_identical(DataFrame,data_source): n=len(DataFrame.columns) print 'dealing
“AttributeError:'numpy.ndarray'对象没有属性“columns”…
我只想打电话给熊猫队,看看他们的号码。下一步我能做什么
import pandas as pd
import numpy as np
def remove_features_identical(DataFrame,data_source):
n=len(DataFrame.columns)
print 'dealing with %d features of %s data......... \n' % (n,data_source)
remove_ind = []
R = np.corrcoef(DataFrame.T)
for i in range(n-1):
for j in range(i+1,n):
if R[i,j]==1:
remove_ind.append(j)
DataFrame.drop(remove_ind, axis=1, inplace=True)
DataFrame.drop(remove_ind, axis=1, inplace=True)
print ('deleting %d columns with correration factor >0.99') % ( len(remove_ind))
return DataFrame
if __name__ == "__main__":
# load data and initialize y and x from train set and test set
df_train = pd.read_csv('train.csv')
df_test = pd.read_csv('test.csv')
y_train=df_train['TARGET'].values
X_train =df_train.drop(['ID','TARGET'], axis=1).values
y_test=[]
X_test = df_test.drop(['ID'], axis=1).values
# delete identical feartures in raw data
X_train = remove_features_identical(X_train,'train set')
X_test = remove_features_identical(X_test,'test set')
检查熊猫的文档,但我认为
X_train =df_train.drop(['ID','TARGET'], axis=1).values
.values
返回一个numpy
数组,而不是数据帧。数组没有列
属性
删除\u功能\u相同
-如果将其传递给数组,请确保仅使用数组而不是数据帧功能。否则,请确保向其传递数据帧。不要使用变量名,如DataFrame