Warning: file_get_contents(/data/phpspider/zhask/data//catemap/8/python-3.x/17.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
机器学习的问题—在python中学习_Python_Python 3.x_Machine Learning_Scikit Learn - Fatal编程技术网

机器学习的问题—在python中学习

机器学习的问题—在python中学习,python,python-3.x,machine-learning,scikit-learn,Python,Python 3.x,Machine Learning,Scikit Learn,我正试图复制一个教程 在我将.fit方法添加到我的训练集之前,一切都很顺利 以下是我的代码示例: # TRAINING PART train_dir = 'pdf/learning_set' dictionary = make_dic(train_dir) train_labels = np.zeros(20) train_labels[17:20] = 1 train_matrix = extract_features(train_dir) model1 = MultinomialNB(

我正试图复制一个教程

在我将.fit方法添加到我的训练集之前,一切都很顺利

以下是我的代码示例:

# TRAINING PART

train_dir = 'pdf/learning_set'
dictionary = make_dic(train_dir)

train_labels = np.zeros(20)
train_labels[17:20] = 1
train_matrix = extract_features(train_dir)
model1 = MultinomialNB()
model1.fit(train_matrix, train_labels)


# TESTING PART

test_dir = 'pdf/testing_set'
test_matrix = extract_features(test_dir)
test_labels = np.zeros(8)
test_labels[4:7] = 1
result1 = model1.predict(test_matrix)
print(confusion_matrix(test_labels, result1))
这是我的回溯:

Traceback (most recent call last):
File "ML.py", line 65, in <module>
model1.fit(train_matrix, train_labels)
File "/usr/local/lib/python3.6/site-packages/sklearn/naive_bayes.py", 
line 579, in fit
X, y = check_X_y(X, y, 'csr')
File "/usr/local/lib/python3.6/site-
packages/sklearn/utils/validation.py", line 552, in check_X_y
check_consistent_length(X, y)
File "/usr/local/lib/python3.6/site-
packages/sklearn/utils/validation.py", line 173, in 
check_consistent_length
" samples: %r" % [int(l) for l in lengths])
ValueError: Found input variables with inconsistent numbers of 
samples: [23, 20]
我想知道如何解决这个问题?
我正在使用Python3.6开发Ubuntu16.04。

ValueError:发现输入变量的数量不一致 样本:[23,20]

这意味着你有23个训练向量,训练矩阵有23行 但只有20个训练标签train_标签是20个值的数组

更改列车标签=np.zeros20 训练_标签=np.zeros23
而且它应该能工作。

非常感谢,它工作得非常好!那是个愚蠢的错误啊哈