Python sklearn.model_selection.train_test_split中的样本重量是多少_Python_Machine Learning_Scikit Learn

Python sklearn.model_selection.train_test_split中的样本重量是多少

python machine-learning scikit-learn

Python sklearn.model_selection.train_test_split中的样本重量是多少,python,machine-learning,scikit-learn,Python,Machine Learning,Scikit Learn,在from scikit learn中，有一部分代码我在文档中找不到解释 centers = [(-5, -5), (0, 0), (5, 5)] X, y = make_blobs(n_samples=n_samples, n_features=2, cluster_std=1.0, centers=centers, shuffle=False, random_state=42) y[:n_samples // 2] = 0 y[n_samples //

在from scikit learn中，有一部分代码我在文档中找不到解释

centers = [(-5, -5), (0, 0), (5, 5)] X, y =
make_blobs(n_samples=n_samples, n_features=2, cluster_std=1.0,
                  centers=centers, shuffle=False, random_state=42)

y[:n_samples // 2] = 0 
y[n_samples // 2:] = 1 
sample_weight = np.random.RandomState(42).rand(y.shape[0])

# split train, test for calibration 
X_train, X_test, y_train, y_test, sw_train, sw_test = \
    train_test_split(X, y, sample_weight, test_size=0.9, random_state=42)

在

train\u test\u split

中

sample\u weight

做什么

train\u test\u分割的源代码如何处理sample\u weight


非常感谢。
train\u test\u split
不仅仅需要x
和y
。它可以获取具有相同第一维度的任意数组序列，并沿该维度将它们随机但一致地拆分为两组
在您的示例中，有一个随机权重数组（每个观察值一个权重），它被分成训练和测试数组，sw\u train
和sw\u test

为观察值指定权重有很多原因。有关进一步讨论，请参阅：




谢谢。但是从源代码中，我看不出train\u test\u split
如何处理sample\u weight
。它处理sample weight
的方式与处理X
和y
的方式相同。它只是另一个输入数组。