Python spaCy：如何使用create_pipe和add_pipe设置n_线程_Python_Multithreading_Spacy

Python spaCy：如何使用create_pipe和add_pipe设置n_线程

python multithreading

Python spaCy：如何使用create_pipe和add_pipe设置n_线程,python,multithreading,spacy,Python,Multithreading,Spacy,我遵循Spacy的一个流行例子：它们使用“创建_管道”和“添加_管道”来构建管道： def main(model=None, output_dir=None, n_iter=100): """Load the model, set up the pipeline and train the entity recognizer.""" if model is not None: nlp = spacy.load(model) # load existing s

我遵循Spacy的一个流行例子：

它们使用“创建_管道”和“添加_管道”来构建管道：

def main(model=None, output_dir=None, n_iter=100):
    """Load the model, set up the pipeline and train the entity recognizer."""
    if model is not None:
        nlp = spacy.load(model)  # load existing spaCy model
        print("Loaded model '%s'" % model)
    else:
        nlp = spacy.blank('en')  # create blank Language class
        print("Created blank 'en' model")

    # create the built-in pipeline components and add them to the pipeline
    # nlp.create_pipe works for built-ins that are registered with spaCy
    if 'ner' not in nlp.pipe_names:
        ner = nlp.create_pipe('ner')
        nlp.add_pipe(ner, last=True)
    # otherwise, get it so we can add labels
    else:
        ner = nlp.get_pipe('ner')

    # add labels
    for _, annotations in TRAIN_DATA:
        for ent in annotations.get('entities'):
            ner.add_label(ent[2])

    # get names of other pipes to disable them during training
    other_pipes = [pipe for pipe in nlp.pipe_names if pipe != 'ner']
    with nlp.disable_pipes(*other_pipes):  # only train NER
        optimizer = nlp.begin_training()
        for itn in range(n_iter):
            random.shuffle(TRAIN_DATA)
            losses = {}
            for text, annotations in TRAIN_DATA:
                nlp.update(
                    [text],  # batch of texts
                    [annotations],  # batch of annotations
                    drop=0.5,  # dropout - make it harder to memorise data
                    sgd=optimizer,  # callable to update weights
                    losses=losses)
            print(losses)

我想设置n_线程，因为nlp.pipe有文档记录。但是，按照他们的示例，如果没有显式调用nlp.pipe，则似乎不可能指定线程数。我是否丢失了某些内容，或者需要使用nlp.pipe重建