Python 将DF转换为Numpy数组进行计算
我有数据帧格式的数据,我将使用用户构建的函数进行线性回归计算。代码如下:Python 将DF转换为Numpy数组进行计算,python,arrays,numpy,dataframe,toarray,Python,Arrays,Numpy,Dataframe,Toarray,我有数据帧格式的数据,我将使用用户构建的函数进行线性回归计算。代码如下: from sklearn.datasets import load_boston boston = load_boston() bos = pd.DataFrame(boston.data) # convert to DF bos.columns = boston.feature_names bos['PRICE'] = boston.target y = bos.PRICE x = bos.drop('PRICE',
from sklearn.datasets import load_boston
boston = load_boston()
bos = pd.DataFrame(boston.data) # convert to DF
bos.columns = boston.feature_names
bos['PRICE'] = boston.target
y = bos.PRICE
x = bos.drop('PRICE', axis = 1) # DROP PRICE since only want X-type variables (not Y-target)
xw = df.to_array(x)
xw = np.insert(xw,0,1, axis = 1) # to insert a column of "1" values
但是,我得到了一个错误:
AttributeError Traceback (most recent call last)
<ipython-input-131-272f1b4d26ba> in <module>()
1 import copy
2
----> 3 xw = df.to_array(x)
AttributeError: 'int' object has no attribute 'to_array'
无功而返
有什么想法吗?它是x.as\u matrix()
不是df.to\u数组(x)
有关详细信息,请参阅熊猫文件 下面是有效的代码
from sklearn.datasets import load_boston
import pandas as pd
import numpy as np
boston = load_boston()
bos = pd.DataFrame(boston.data) # convert to DF
bos.columns = boston.feature_names
bos['PRICE'] = boston.target
y = bos.PRICE
x = bos.drop('PRICE', axis = 1) # DROP PRICE since only want X-type variables (not Y-target)
xw = x.as_matrix()
xw = np.insert(xw,0,1, axis = 1) # to insert a column of "1" values
我甚至检查了转换后的类型(type(xw),它给了我一个np.ndarray作为一个类型。不确定问题出在哪里,你得到了什么?或者可能包括一个
x
的打印输出,我想我不知怎的得到了它。不确定,但什么是有效的:xw=copy.deepcopy(x)xw=np.c_uu[np.ones(lnY),xw]不确定这是否有效(插入“1”的第一列,另一种方法没有,但这里的结果是(lnY是目标数组的大小(Y值)什么是df
?它没有在任何地方定义,但似乎是int
。您的意思是x.as_matrix()
?
from sklearn.datasets import load_boston
import pandas as pd
import numpy as np
boston = load_boston()
bos = pd.DataFrame(boston.data) # convert to DF
bos.columns = boston.feature_names
bos['PRICE'] = boston.target
y = bos.PRICE
x = bos.drop('PRICE', axis = 1) # DROP PRICE since only want X-type variables (not Y-target)
xw = x.as_matrix()
xw = np.insert(xw,0,1, axis = 1) # to insert a column of "1" values