Scikit learn ValueError:HistGradientBoostingRegressionr至少需要一个数组或数据类型
我正在使用HistGradientBoostingRegressionor 我的代码:-Scikit learn ValueError:HistGradientBoostingRegressionr至少需要一个数组或数据类型,scikit-learn,Scikit Learn,我正在使用HistGradientBoostingRegressionor 我的代码:- X_train, X_test, y_train, y_test = train_test_split(Train, target, test_size=0.2, random_state=16) model = HistGradientBoostingRegressor(learning_rate = 0.1, max_ite
X_train, X_test, y_train, y_test = train_test_split(Train, target, test_size=0.2, random_state=16)
model = HistGradientBoostingRegressor(learning_rate = 0.1,
max_iter=800,
random_state = 16,
validation_fraction=None,
verbose = 0,
max_depth=12,
min_samples_leaf=25,
l2_regularization=0.05)
model.fit(X_train, y_train)
我从.fit()方法中得到了标题中所示的错误,但由于我是ML新手,所以对文档了解不多。
错误:
ValueError回溯(最近一次调用)
在里面
7分钟样品叶=25,
8 l2_正则化=0.05)
---->9型装配(X_系列、y_系列)
/opt/conda/lib/python3.6/site-packages/sklearn/employee//u hist\u gradient\u boosting/gradient\u boosting.py in fit(self,X,y)
102#为梯度和hessians更新预测X所花费的时间
103 acc\u预测时间=0。
-->104 X,y=check_X_y(X,y,dtype=[X_dtype],force_all_finite=False)
105 y=自我编码y(y)
106
/检查中的opt/conda/lib/python3.6/site-packages/sklearn/utils/validation.py(X,y,接受稀疏,接受大稀疏,数据类型,顺序,复制,强制所有有限,确保二维,允许多输出,确保最小样本,确保最小特征,数字,警告数据类型,估算器)
753确保最小功能=确保最小功能,
754 warn_on_dtype=warn_on_dtype,
-->755估算器=估算器)
756如果多输出:
757 y=检查数组(y,'csr',强制所有有限=真,确保2d=假,
/检查数组中的opt/conda/lib/python3.6/site-packages/sklearn/utils/validation.py
473
474如果数据类型中的数据类型为all(isinstance(数据类型,np.dtype)):
-->475数据类型原始=np.结果类型(*数据类型原始)
476
477如果数据类型为数字:
结果类型中(*args,**kwargs)
ValueError:至少需要一个数组或数据类型
谢谢你的帮助
ValueError Traceback (most recent call last)
in
7 min_samples_leaf=25,
8 l2_regularization=0.05)
----> 9 model.fit(X_train, y_train)
/opt/conda/lib/python3.6/site-packages/sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py in fit(self, X, y)
102 # time spent predicting X for gradient and hessians update
103 acc_prediction_time = 0.
--> 104 X, y = check_X_y(X, y, dtype=[X_DTYPE], force_all_finite=False)
105 y = self._encode_y(y)
106
/opt/conda/lib/python3.6/site-packages/sklearn/utils/validation.py in check_X_y(X, y, accept_sparse, accept_large_sparse, dtype, order, copy, force_all_finite, ensure_2d, allow_nd, multi_output, ensure_min_samples, ensure_min_features, y_numeric, warn_on_dtype, estimator)
753 ensure_min_features=ensure_min_features,
754 warn_on_dtype=warn_on_dtype,
--> 755 estimator=estimator)
756 if multi_output:
757 y = check_array(y, 'csr', force_all_finite=True, ensure_2d=False,
/opt/conda/lib/python3.6/site-packages/sklearn/utils/validation.py in check_array(array, accept_sparse, accept_large_sparse, dtype, order, copy, force_all_finite, ensure_2d, allow_nd, ensure_min_samples, ensure_min_features, warn_on_dtype, estimator)
473
474 if all(isinstance(dtype, np.dtype) for dtype in dtypes_orig):
--> 475 dtype_orig = np.result_type(*dtypes_orig)
476
477 if dtype_numeric:
<array_function internals> in result_type(*args, **kwargs)
ValueError: at least one array or dtype is required