Python 跳过无模型-未声明列表
我已经定义了一个listPython 跳过无模型-未声明列表,python,list,definition,theano,Python,List,Definition,Theano,我已经定义了一个listX来获取文件的内容,并在Theano中为skip-through模型的预处理器提供数据。错误显示未定义X 请发布更多信息,比如错误跟踪,比如你甚至没有提到哪一行抛出了错误,它没有给出一行。它只是说X没有在模块中定义。 def encode(model, X, use_norm=True, verbose=True, batch_size=128, use_eos=False): """ Encode sentences in the list X. Ea
X
来获取文件的内容,并在Theano中为skip-through模型的预处理器提供数据。错误显示未定义X
请发布更多信息,比如错误跟踪,比如你甚至没有提到哪一行抛出了错误,它没有给出一行。它只是说X没有在模块中定义。
def encode(model, X, use_norm=True, verbose=True, batch_size=128, use_eos=False):
"""
Encode sentences in the list X. Each entry will return a vector
"""
# first, do preprocessing
X = preprocess(X)
# word dictionary and init
d = defaultdict(lambda : 0)
for w in model['utable'].keys():
d[w] = 1
ufeatures = numpy.zeros((len(X), model['uoptions']['dim']), dtype='float32')
bfeatures = numpy.zeros((len(X), 2 * model['boptions']['dim']), dtype='float32')
# length dictionary
ds = defaultdict(list)
captions = [s.split() for s in X]
for i,s in enumerate(captions):
ds[len(s)].append(i)
def preprocess(text):
"""
Preprocess text for encoder
"""
X = []
#For _chat.txt to list_______________________________________
f= open("/var/www/html/_chat.txt", "r")
file_list = f.readlines()
first_three = file_list[1:3]
X = [x.replace('\t',' ') for x in first_three]
X = [x.replace('\x00','') for x in X]
X = [x.replace(' \r\n','') for x in X]
#______________________________________________________________
sent_detector = nltk.data.load('tokenizers/punkt/english.pickle')
for t in text:
sents = sent_detector.tokenize(t)
result = ''
for s in sents:
tokens = word_tokenize(s)
result += ' ' + ' '.join(tokens)
X.append(result)
return X