
Python: How do I extract chemical entities with ChemDataExtractor?


I am trying to process a text with ChemDataExtractor (Python) in order to extract the chemical entities it mentions. A possible example is

from chemdataextractor import Document
doc = Document('UV-vis spectrum of 5,10,15,20-Tetra(4-carboxyphenyl)porphyrin in Tetrahydrofuran (THF).')
Then typing
doc.cems
gives the following result

[Span('THF', 82, 85),
 Span('5,10,15,20-Tetra(4-carboxyphenyl)porphyrin', 19, 61),
 Span('Tetrahydrofuran', 65, 80)]
I only want to extract
'THF'
'5,10,15,20-Tetra(4-carboxyphenyl)porphyrin'
'Tetrahydrofuran'
as plain strings, without the Span wrappers. How can I do that?

The solution is to read the text attribute of each Span:

doc.cems[0].text

doc.cems[1].text

doc.cems[2].text
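
Indexing each entry by hand only works when the number of mentions is known in advance, and the result shown in the question is not in text order ('THF' comes first). A minimal sketch of a more general approach follows, assuming each Span exposes text, start and end attributes as the printed output suggests. The FakeSpan type and the mention_texts helper are hypothetical names introduced here so the example runs without the library installed; they are not part of ChemDataExtractor.

```python
from collections import namedtuple

# Hypothetical stand-in for a ChemDataExtractor Span, so this sketch
# runs without the library. Assumes a Span has .text, .start and .end.
FakeSpan = namedtuple("FakeSpan", ["text", "start", "end"])


def mention_texts(spans):
    """Return the plain text of each span, ordered by start offset."""
    return [s.text for s in sorted(spans, key=lambda s: s.start)]


# The three spans from the question, in the order they were printed.
spans = [
    FakeSpan("THF", 82, 85),
    FakeSpan("5,10,15,20-Tetra(4-carboxyphenyl)porphyrin", 19, 61),
    FakeSpan("Tetrahydrofuran", 65, 80),
]

names = mention_texts(spans)
print(names)
# ['5,10,15,20-Tetra(4-carboxyphenyl)porphyrin', 'Tetrahydrofuran', 'THF']
```

With the real library, the same helper would presumably be called as mention_texts(doc.cems); if ordering does not matter, [cem.text for cem in doc.cems] is enough.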