Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/305.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python TF-IDF计算中的名称定义_Python - Fatal编程技术网

Python TF-IDF计算中的名称定义

Python TF-IDF计算中的名称定义,python,Python,我有一个数据集,其中包含一组文章。我合并了元数据和json文件,并创建了一个数据框架。这是我的密码: from sklearn.feature_extraction.text import TfidfVectorizer from sklearn.feature_extraction.text import CountVectorizer from sklearn.metrics.pairwise import cosine_similarity import pandas as pd imp

我有一个数据集,其中包含一组文章。我合并了元数据和json文件,并创建了一个数据框架。这是我的密码:

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics.pairwise import cosine_similarity

import pandas as pd
import numpy as np 

vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(merged_df['Title'][39100])

print(X.shape)

query = "How to prevent covid19"
query_vec = vectorize.transform([query])
result = cosine_similarity(X,query_vec).reshape((-1,))

for i in result.argsort()[-10:][::-1]:
    print(merged_df.iloc['Title'][i,0], "--", merged_df.iloc['Title'][i,1])
我想计算Title的TFIDF来处理查询,这有助于我找到一些相关的论文。
为什么提示名称“merged_df”未定义?

在代码中,merged_df未定义。数据帧从未创建,因此未定义