Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/337.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python LinearRegression()中的系数值代表什么?_Python_Machine Learning_Scikit Learn_Linear Regression - Fatal编程技术网

Python LinearRegression()中的系数值代表什么?

Python LinearRegression()中的系数值代表什么?,python,machine-learning,scikit-learn,linear-regression,Python,Machine Learning,Scikit Learn,Linear Regression,我是机器学习的初学者。这只是一个简单的问题,线性回归()中的系数代表什么?我知道它代表系数,但我不理解这些值,高的正系数是否意味着更强的关系 而且,如果coef_u值是指数型的,这是否意味着我的线性回归是错误的 array([-3.12840684e+02, -1.01279891e+13, -1.42682874e+13, -1.42682874e+13, -1.42682873e+13, -1.42682873e+13, -1.23057091e+04, -6.08730443e+02

我是机器学习的初学者。这只是一个简单的问题,线性回归()中的系数代表什么?我知道它代表系数,但我不理解这些值,高的正系数是否意味着更强的关系

而且,如果coef_u值是指数型的,这是否意味着我的线性回归是错误的

array([-3.12840684e+02, -1.01279891e+13, -1.42682874e+13, -1.42682874e+13,
   -1.42682873e+13, -1.42682873e+13, -1.23057091e+04, -6.08730443e+02,
   -2.25836847e+12, -2.25836846e+12, -2.25836846e+12, -2.25836846e+12,
   -2.25836845e+12, -2.25836846e+12, -2.25836846e+12, -2.25836847e+12,
   -2.25836849e+12,  5.41669310e+11,  5.41669341e+11,  5.41669325e+11,
    2.47680574e+12,  5.41669172e+11,  5.41669322e+11,  5.41669336e+11,
    5.41669320e+11, -1.92388659e+12, -1.92388658e+12, -1.92388659e+12,
   -1.92388654e+12, -1.43004842e+12, -1.92388655e+12, -1.92388658e+12,
   -1.92388665e+12, -1.92388657e+12, -1.92388658e+12, -1.92388653e+12,
   -1.92388658e+12, -1.92388660e+12, -1.92388658e+12, -1.92388660e+12,
   -9.83609518e+11, -9.83609497e+11, -9.83609551e+11, -9.83609529e+11,
   -1.47744767e+12, -9.83609560e+11, -9.83609506e+11, -9.83609465e+11,
   -9.83609519e+11, -9.83609455e+11, -9.83609513e+11, -9.83609555e+11,
   -9.83609535e+11, -9.83609497e+11, -9.83609511e+11, -9.83609514e+11,
   -1.54590339e+13, -1.54590339e+13, -1.54590339e+13, -1.54590339e+13,
   -1.30678844e+12, -1.30678843e+12, -1.30678843e+12, -1.30678847e+12,
   -1.30678844e+12, -1.62361585e+13, -1.62361585e+13, -1.62361585e+13,
   -1.62361585e+13, -1.62361586e+13, -1.62361585e+13,  5.88718912e+12,
    5.88718906e+12,  5.88718908e+12,  5.88718907e+12, -5.88718905e+12,
   -5.88718905e+12, -5.88718906e+12, -5.88718904e+12,  3.45085841e+11,
    3.45085830e+11,  3.45085861e+11,  3.45085814e+11,  3.45085829e+11,
    3.45085830e+11,  3.11126022e+12,  3.11126021e+12,  3.11126021e+12,
    1.13335966e+07,  3.11126021e+12, -7.58191433e+11, -7.58191424e+11,
    3.04834491e+03,  3.74262207e+03, -1.19176646e+04, -1.16855749e+04,
    2.00192065e+03,  3.75148918e+12,  3.75148916e+12,  3.75148918e+12,
    3.75148928e+12,  2.24187815e+13,  2.24187815e+13,  2.24187815e+13,
    2.24187815e+13,  6.26624305e+04,  1.61187017e+04,  2.00000527e+04,
   -3.10534619e+03,  2.39790901e+04, -7.55683101e+12, -7.55683096e+12,
   -7.55683096e+12, -7.55683096e+12, -7.55683097e+12, -7.55683097e+12,
    2.32335100e+13,  2.32335100e+13,  2.32335100e+13, -1.15363807e+13,
   -1.15363808e+13, -1.15363807e+13, -1.15363807e+13, -1.15363807e+13,
    2.70555822e+12,  2.70555822e+12,  2.70555823e+12, -3.19955267e+11,
   -3.19955228e+11, -3.19955165e+11, -3.19955294e+11, -3.19955247e+11,
   -3.19955264e+11, -3.19955255e+11, -3.19955270e+11, -3.19955263e+11,
   -4.86759426e+12, -4.86759423e+12, -4.86759425e+12, -4.86759428e+12,
   -4.86759425e+12, -4.86759427e+12])

也许至少阅读您正在使用的scikit学习模块的首页是明智的

对于线性回归: $\hat{y}(w,x)=w_0+w_1 x_1+…+w_p x_p$


在整个模块中,我们将系数向量w指定为coef_uu和intercept_u。

coef_u为您提供了通过线性回归估计的权重数组。它的形状(n个目标,n个特征)。在您的情况下,它是1D阵列,因为您只有一个目标


如果权重值很大,请确保对特征进行规格化,并对模型使用正则化。

系数给出数据集特征的系数。此外,e只是表示与数字相关的第10次方,那么这是否意味着线性回归中使用的所有变量都有意义的影响?由于所有特性都有很高的值,我不知道LR的输出,但从技术上讲是这样的,而且如果你发现数组的维数和输入数据的维数是相同的,你可以使用套索来抑制不重要的特征。我在这里做的只是使用分类变量来预测一个连续变量,在这个例子中是“价格”。我对分类变量的数据进行了模拟,我认为这些变量可能有助于预测“价格”。你认为套索能处理分类变量吗?