Neural network 窗口字嵌入中的启动标记_Neural Network_Nlp_Nltk_Lstm_Named Entity Recognition

Neural network 窗口字嵌入中的启动标记

neural-network nlp

Neural network 窗口字嵌入中的启动标记,neural-network,nlp,nltk,lstm,named-entity-recognition,Neural Network,Nlp,Nltk,Lstm,Named Entity Recognition,我正在使用预先训练过的SENNA嵌入，并将一个3个单词的窗口输入到一个密集的神经网络中 senna是否有开始或结束标记嵌入？还是创建一个随机向量 Sentence: 'McDonalds sells fries' input 1: ['<s>', 'McDonalds', 'sells'] 一句：“麦当劳卖薯条” 输入1:[''，'麦当劳'，'销售'] 但是没有嵌入。。。我自己创造吗？（例如，all-1）？阅读主页上的标记化和（包括特殊的句子边界标记）正在内部处理你想把

我正在使用预先训练过的SENNA嵌入，并将一个3个单词的窗口输入到一个密集的神经网络中

senna是否有开始或结束标记嵌入？还是创建一个随机向量

Sentence:  'McDonalds sells fries'
input 1:  ['<s>', 'McDonalds', 'sells']

一句：“麦当劳卖薯条”
输入1:[''，'麦当劳'，'销售']

但是没有嵌入

。。。我自己创造吗？（例如，all-1）？

阅读主页上的标记化和（包括特殊的句子边界标记）正在内部处理

你想把它们包括在内有什么原因吗？从网站上看，它们似乎隐含在输入上下文中（一句话）

摘自网站：

Usage

SENNA reads input sentences from the standard input 
and outputs tags into the standard output. 
The most likely command line usage for SENNA is therefore:

senna [options] < input.txt > output.txt

Of course you can run SENNA in an interactive mode 
without the "pipes" < and >.

Each input line is considered as a sentence. 
SENNA has its own tokenizer for separating words, 
which can be deactivated with the -usrtokens option.

用法
塞纳从标准输入法中读出输入句
并将标记输出到标准输出中。
因此，SENNA最可能的命令行用法是：
senna[选项]output.txt
当然，您可以在交互模式下运行SENNA
没有“管道”<和>。
每个输入行都被视为一个句子。
塞纳有自己的分词标记器，
可以使用-usrtokens选项停用。

阅读主页上的标记化（包括特殊的句子边界标记）似乎在内部处理

你想把它们包括在内有什么原因吗？从网站上看，它们似乎隐含在输入上下文中（一句话）

摘自网站：

Usage

SENNA reads input sentences from the standard input 
and outputs tags into the standard output. 
The most likely command line usage for SENNA is therefore:

senna [options] < input.txt > output.txt

Of course you can run SENNA in an interactive mode 
without the "pipes" < and >.

Each input line is considered as a sentence. 
SENNA has its own tokenizer for separating words, 
which can be deactivated with the -usrtokens option.

用法
塞纳从标准输入法中读出输入句
并将标记输出到标准输出中。
因此，SENNA最可能的命令行用法是：
senna[选项]output.txt
当然，您可以在交互模式下运行SENNA
没有“管道”<和>。
每个输入行都被视为一个句子。
塞纳有自己的分词标记器，
可以使用-usrtokens选项停用。