Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/300.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/json/15.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 如何使用HolaVPN解决googletrans';s JSON解码错误:第1行第1列(字符0)_Python_Json_Nlp_Google Translate_Language Translation - Fatal编程技术网

Python 如何使用HolaVPN解决googletrans';s JSON解码错误:第1行第1列(字符0)

Python 如何使用HolaVPN解决googletrans';s JSON解码错误:第1行第1列(字符0),python,json,nlp,google-translate,language-translation,Python,Json,Nlp,Google Translate,Language Translation,我有一个18k行的越南语数据集,我正试图使用googletrans模块将其翻译成英语 from googletrans import Translator translator = Translator() def trans_text(df, text_field): df[text_field] = df[text_field].apply(translator.translate, src='vi', dest='en').apply(getattr, args=('text'

我有一个18k行的越南语数据集,我正试图使用googletrans模块将其翻译成英语

from googletrans import Translator
translator = Translator()

def  trans_text(df, text_field):
    df[text_field] = df[text_field].apply(translator.translate, src='vi', dest='en').apply(getattr, args=('text',))  
    return df
trans_text(df_train.sample(1), "question")


最后出现以下JSONDecode错误:


JSONDecodeError                           Traceback (most recent call last)
<ipython-input-21-d6791d78575e> in <module>()
     24     df[text_field] = df[text_field].apply(translator.translate, src='vi', dest='en').apply(getattr, args=('text',))
     25     return df
---> 26 trans_text(df_train.sample(1), "question")
     27 
     28 

<ipython-input-21-d6791d78575e> in trans_text(df, text_field)
     22 
     23 def  trans_text(df, text_field):
---> 24     df[text_field] = df[text_field].apply(translator.translate, src='vi', dest='en').apply(getattr, args=('text',))
     25     return df
     26 trans_text(df_train.sample(1), "question")

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/site-packages/pandas/core/series.py in apply(self, func, convert_dtype, args, **kwds)
   4198             else:
   4199                 values = self.astype(object)._values
-> 4200                 mapped = lib.map_infer(values, f, convert=convert_dtype)
   4201 
   4202         if len(mapped) and isinstance(mapped[0], Series):

pandas/_libs/lib.pyx in pandas._libs.lib.map_infer()

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/site-packages/pandas/core/series.py in f(x)
   4183 
   4184             def f(x):
-> 4185                 return func(x, *args, **kwds)
   4186 
   4187         else:

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/site-packages/googletrans/client.py in translate(self, text, dest, src)
    170 
    171         origin = text
--> 172         data = self._translate(text, dest, src)
    173 
    174         # this code will be updated when the format is changed.

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/site-packages/googletrans/client.py in _translate(self, text, dest, src)
     79         r = self.session.get(url, params=params)
     80 
---> 81         data = utils.format_json(r.text)
     82         return data
     83 

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/site-packages/googletrans/utils.py in format_json(original)
     60         converted = json.loads(original)
     61     except ValueError:
---> 62         converted = legacy_format_json(original)
     63 
     64     return converted

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/site-packages/googletrans/utils.py in legacy_format_json(original)
     52             text = text[:p] + states[j][1] + text[nxt:]
     53 
---> 54     converted = json.loads(text)
     55     return converted
     56 

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/json/__init__.py in loads(s, encoding, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, **kw)
    352             parse_int is None and parse_float is None and
    353             parse_constant is None and object_pairs_hook is None and not kw):
--> 354         return _default_decoder.decode(s)
    355     if cls is None:
    356         cls = JSONDecoder

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/json/decoder.py in decode(self, s, _w)
    337 
    338         """
--> 339         obj, end = self.raw_decode(s, idx=_w(s, 0).end())
    340         end = _w(s, end).end()
    341         if end != len(s):

/opt/anaconda3/envs/sam-pycaret/lib/python3.6/json/decoder.py in raw_decode(self, s, idx)
    355             obj, end = self.scan_once(s, idx)
    356         except StopIteration as err:
--> 357             raise JSONDecodeError("Expecting value", s, err.value) from None
    358         return obj, end

JSONDecodeError: Expecting value: line 1 column 1 (char 0)

JSONDecodeError回溯(最近一次调用)
在()
24 df[text_field]=df[text_field].apply(translator.translate,src='vi',dest='en').apply(getattr,args=('text',))
25返回df
--->26翻译文本(df_系列样本(1),“问题”)
27
28
在trans_文本中(df,文本_字段)
22
23 def传输文本(df,文本字段):
--->24 df[text_field]=df[text_field].apply(translator.translate,src='vi',dest='en').apply(getattr,args=('text',))
25返回df
26翻译文本(df_系列样本(1),“问题”)
/应用中的opt/anaconda3/envs/sam pycaret/lib/python3.6/site-packages/pandas/core/series.py(self、func、convert\u dtype、args、**kwds)
4198其他:
4199值=self.astype(对象)。\u值
->4200映射=lib.map\u推断(值,f,convert=convert\u数据类型)
4201
4202如果len(映射)和isinstance(映射[0],系列):
pandas/_libs/lib.pyx在pandas中。_libs.lib.map_infere()
/f(x)中的opt/anaconda3/envs/sam pycaret/lib/python3.6/site-packages/pandas/core/series.py
4183
4184 def f(x):
->4185返回函数(x,*args,**kwds)
4186
4187其他:
/翻译中的opt/anaconda3/envs/sam pycaret/lib/python3.6/site-packages/googletrans/client.py(self、text、dest、src)
170
171原点=文本
-->172数据=自译(文本、目的地、src)
173
174#此代码将在格式更改时更新。
/opt/anaconda3/envs/sam pycaret/lib/python3.6/site-packages/googletrans/client.py in_translate(self、text、dest、src)
79 r=self.session.get(url,params=params)
80
--->81 data=utils.format\u json(r.text)
82返回数据
83
/opt/anaconda3/envs/sam pycaret/lib/python3.6/site-packages/googletrans/utils.py格式为_json(原件)
60 converted=json.loads(原始)
61除值错误外:
--->62已转换=传统格式\u json(原件)
63
64返回转换
/传统格式的opt/anaconda3/envs/sam pycaret/lib/python3.6/site-packages/googletrans/utils.py
52 text=文本[:p]+状态[j][1]+文本[nxt:]
53
--->54 converted=json.loads(文本)
55返回转换
56
/加载中的opt/anaconda3/envs/sam pycaret/lib/python3.6/json/_init_uuuuuu.py(s、编码、cls、对象钩、解析浮点数、解析int、解析常量、对象对钩、**kw)
352 parse_int为无,parse_float为无,且
353 parse_常量为None且对象_pairs_hook为None且非kw):
-->354返回\u默认\u解码器。解码
355如果cls为无:
356 cls=JSONDecoder
/解码中的opt/anaconda3/envs/sam pycaret/lib/python3.6/json/decoder.py(self,s,_w)
337
338         """
-->339 obj,end=self.raw\u decode(s,idx=\u w(s,0.end())
340 end=_w(s,end).end()
341如果结束!=长度:
/原始解码中的opt/anaconda3/envs/sam pycaret/lib/python3.6/json/decoder.py(self、s、idx)
355 obj,end=自扫描一次(s,idx)
356除了停止迭代作为错误:
-->357将JSONDecodeError(“预期值”,s,err.value)从None提升
358返回obj,结束
JSONDecodeError:应为值:第1行第1列(字符0)

我理解这是由于我的IP被禁止而产生的。我寻找了一些方法来规避这一点,发现使用VPN是值得的。我已经安装了HolaVPN。但是,作为一个新手,我不确定我如何能够复制与for hola相同的步骤。任何关于清晰程序的提示都会非常有帮助。谢谢你u、

库发出请求,在不检查状态代码的情况下,假定请求成功:

     79         r = self.session.get(url, params=params)
     80 
---> 81         data = utils.format_json(r.text)
现在,Google可能出于任何原因不喜欢您的请求,并返回一条带有匹配状态代码(4xx,5xx)的错误消息。库仍然尝试将正文解析为JSON,但由于没有正文,因此无法工作,并引发与原始问题无关的
JSONDecodeError
。您看不到真正的原因

结论:
googletrans
库缺少关键的错误处理。您可能需要编辑它并自己添加错误处理