Warning: file_get_contents(/data/phpspider/zhask/data//catemap/7/elixir/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/sql-server-2005/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Cmusphinx 脱机语言模型的解码结果不正确_Cmusphinx - Fatal编程技术网

Cmusphinx 脱机语言模型的解码结果不正确

Cmusphinx 脱机语言模型的解码结果不正确,cmusphinx,Cmusphinx,我目前正在使用pocketsphinx,并离线构建自己的语言模型,而不是使用 我遵循中的步骤 我的步骤如下: #!/bin/bash text2wfreq < 1.txt | wfreq2vocab > 1.vocab text2idngram -vocab 1.vocab -idngram 1.idngram < 1.txt idngram2lm -vocab_type 0 -idngram 1.idngram -vocab 1.vocab -arpa 1.arpa sphi

我目前正在使用pocketsphinx,并离线构建自己的语言模型,而不是使用

我遵循中的步骤 我的步骤如下:

#!/bin/bash
text2wfreq < 1.txt | wfreq2vocab > 1.vocab
text2idngram -vocab 1.vocab -idngram 1.idngram < 1.txt
idngram2lm -vocab_type 0 -idngram 1.idngram -vocab 1.vocab -arpa 1.arpa
sphinx_lm_convert -i 1.arpa -o 1.lm.DMP
sphinx_lm_convert -i 1.lm.DMP -ifmt dmp -o 1.lm -ofmt arpa
#pocketsphinx_continuous -lm 1.lm -dict 1.dic
#/bin/bash
text2wfreq<1.txt | wfreq2vocab>1.vocab
text2idgram-vocab 1.vocab-idngram 1.idngram<1.txt
idngram2lm-vocab_类型0-idngram 1.idngram-vocab 1.vocab-arpa 1.arpa
sphinx_lm_convert-i 1.arpa-o 1.lm.DMP
sphinx_lm_convert-i 1.lm.DMP-ifmt DMP-o 1.lm-ofmt arpa
#pocketsphinx_连续-lm 1.lm-dict 1.dic
1.txt:

<s> Children  I want you to draw your bedroom </s>
<s> In my room  there is a big bed next to the window and a picture on the door </s>
.... #more
孩子们我要你们画你们的卧室 在我的房间里,窗户旁边有一张大床,门上有一幅画 .... #更多 然而当我运行pocketsphinx_continuous-lm 1.lm-dict 1.dic时,结果根本不正确,我的步骤是否有问题

我注意到,与在线构建lm文件相比,我自己的lm文件的输出是完全错误的

INFO: ngram_search_fwdflat.c(951): fwdflat 0.01 wall 0.006 xRT
**INFO: ngram_search.c(1214): </s> not found in last frame, using OK.150 instead**
INFO: ngram_search.c(1266): lattice start node <s>.0 end node OK.115
INFO: ngram_search.c(1294): Eliminated 1 nodes before end node
INFO: ngram_search.c(1399): Lattice has 30 nodes, 18 links
INFO: ps_lattice.c(1365): Normalizer P(O) = alpha(OK:115:150) = -1308625
INFO: ps_lattice.c(1403): Joint P(O,S) = -1309458 P(S|O) = -833
INFO: ngram_search.c(888): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(891): bestpath 0.00 wall 0.000 xRT
000000001: TV OK

READY....
INFO:ngram\u search\u fwdflat.c(951):fwdflat 0.01墙0.006 xRT
**信息:ngram_search.c(1214):在最后一帧中找不到,请使用OK.150**
信息:ngram_search.c(1266):晶格开始节点。0结束节点正常。115
信息:ngram_search.c(1294):在结束节点之前消除1个节点
信息:ngram_search.c(1399):Lattice有30个节点,18个链接
信息:ps_lattice.c(1365):归一化器P(O)=α(OK:115:150)=-1308625
信息:ps|u lattice.c(1403):关节P(O,S)=-1309458 P(S|O)=-833
信息:ngram_search.c(888):最佳路径0.00CPU 0.000XRT
信息:ngram_search.c(891):最佳路径0.00墙0.000 xRT
000000001:电视正常
准备好的

我尝试使用我自己的lm和系统dic,结果显示完全不匹配。

我们需要了解更多关于您所做的事情。 张贴指向1.txt和1.dic文件以及1.lm文件的指针。 这里有一个(随机)想法:您的.lm看起来是混合情况。你的.dic是大写的吗

我不确定这是怎么回事:

sphinx_lm_convert -i 1.arpa -o 1.lm.DMP
sphinx_lm_convert -i 1.lm.DMP -ifmt dmp -o 1.lm -ofmt arpa
.arpa和.lm本质上是同一个文件。DMP文件是为了预先计算语言模型在线表示的一些内容。

Myy dictionay(1.dic)是大写的,而我的txt(1.txt)是混合大小写的。在我将它们全部改为大写后,它就可以工作了。虽然我的回答是高达70%。