Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/297.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 无法检索具有kb.get\u候选项的任何候选项_Python_Python 3.x_Machine Learning_Nlp_Spacy - Fatal编程技术网

Python 无法检索具有kb.get\u候选项的任何候选项

Python 无法检索具有kb.get\u候选项的任何候选项,python,python-3.x,machine-learning,nlp,spacy,Python,Python 3.x,Machine Learning,Nlp,Spacy,我创建了一个csv文件,如下所示: "CAMERA", "Camera", "kamera", "cam", "Kamera" "PICTURE", "Picture", "bild", "photograph" nlp = de_core_news_sm.load() text = "Cam is not

我创建了一个csv文件,如下所示:

"CAMERA", "Camera", "kamera", "cam", "Kamera"
"PICTURE", "Picture", "bild", "photograph"
nlp = de_core_news_sm.load()
text = "Cam is not good"
doc = nlp(text)

name_dict, desc_dict = load_entities()

kb = KnowledgeBase(vocab=nlp.vocab, entity_vector_length=96)

for qid, desc in desc_dict.items():
    desc_doc = nlp(desc)
    desc_enc = desc_doc.vector
    kb.add_entity(entity=qid, entity_vector=desc_enc, freq=342)  # 342 is an arbitrary value here

for qid, name in name_dict.items():
        kb.add_alias(alias=name, entities=[qid], probabilities=[1])  # 100% prior probability P(entity|alias)
print(f"Entities in the KB: {kb.get_entity_strings()}")
print(f"Aliases in the KB: {kb.get_alias_strings()}")
用起来有点像这样:

"CAMERA", "Camera", "kamera", "cam", "Kamera"
"PICTURE", "Picture", "bild", "photograph"
nlp = de_core_news_sm.load()
text = "Cam is not good"
doc = nlp(text)

name_dict, desc_dict = load_entities()

kb = KnowledgeBase(vocab=nlp.vocab, entity_vector_length=96)

for qid, desc in desc_dict.items():
    desc_doc = nlp(desc)
    desc_enc = desc_doc.vector
    kb.add_entity(entity=qid, entity_vector=desc_enc, freq=342)  # 342 is an arbitrary value here

for qid, name in name_dict.items():
        kb.add_alias(alias=name, entities=[qid], probabilities=[1])  # 100% prior probability P(entity|alias)
print(f"Entities in the KB: {kb.get_entity_strings()}")
print(f"Aliases in the KB: {kb.get_alias_strings()}")
打印如下值:

"CAMERA", "Camera", "kamera", "cam", "Kamera"
"PICTURE", "Picture", "bild", "photograph"
nlp = de_core_news_sm.load()
text = "Cam is not good"
doc = nlp(text)

name_dict, desc_dict = load_entities()

kb = KnowledgeBase(vocab=nlp.vocab, entity_vector_length=96)

for qid, desc in desc_dict.items():
    desc_doc = nlp(desc)
    desc_enc = desc_doc.vector
    kb.add_entity(entity=qid, entity_vector=desc_enc, freq=342)  # 342 is an arbitrary value here

for qid, name in name_dict.items():
        kb.add_alias(alias=name, entities=[qid], probabilities=[1])  # 100% prior probability P(entity|alias)
print(f"Entities in the KB: {kb.get_entity_strings()}")
print(f"Aliases in the KB: {kb.get_alias_strings()}")
给我:

Entities in the KB: ['PICTURE', 'CAMERA']
Aliases in the KB: [' "Camera"', ' "Picture"']
但是,如果我尝试检查候选人,我只会得到一个空列表:

candidates = kb.get_candidates("Camera")
print(candidates)
for c in candidates:
    print(" ", c.entity_, c.prior_prob, c.entity_vector)
知识库中的别名:[“照相机”、“图片”]

在我看来,您的解析脚本似乎在KB中添加了文字字符串
“Camera”
,其中包含空格、引号和全部,而不仅仅是原始字符串
Camera

知识库中的别名:[“照相机”、“图片”]

在我看来,您的解析脚本似乎在KB中添加了文字字符串
“Camera”
,其中包含空格、引号和全部,而不仅仅是原始字符串
Camera