How to convert a Unicode-escaped string to plain text in Python


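Suppose I have a Unicode string (not real Unicode, but a string that looks like Unicode escape sequences). I want to get its UTF-8 variant. How can I do this in Python? For example, if I have a string like this: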

title = "\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8 == \\u10d8\\u10d4\\u10e0\\u10e3\\u10e1\\u10d0\\u10da\\u10d8\\u10db\\u10d8"
how can I get its UTF-8 variant (Georgian characters):

ისრაელი == იერუსალიმი

Simply put, I would like to have code like the following:

title = "\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8 == \\u10d8\\u10d4\\u10e0\\u10e3\\u10e1\\u10d0\\u10da\\u10d8\\u10db\\u10d8"
utfTitle = title.TurnToUTF()
print(utfTitle)
and I expect this code to output:

ისრაელი == იერუსალიმი


Here you go. Just use the decode method with the unicode_escape codec.

For Python 2.x:

title = "\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8 == \\u10d8\\u10d4\\u10e0\\u10e3\\u10e1\\u10d0\\u10da\\u10d8\\u10db\\u10d8"
utfTitle = title.decode('unicode_escape')
print(utfTitle)

# Output: ისრაელი == იერუსალიმი

For Python 3.x:

title = "\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8 == \\u10d8\\u10d4\\u10e0\\u10e3\\u10e1\\u10d0\\u10da\\u10d8\\u10db\\u10d8"
print(title.encode('ascii').decode('unicode-escape'))
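The question asks for the "UTF-8 variant", so it may help to separate two steps: decoding the escape text into a real str, and then encoding that str as UTF-8 bytes if bytes are what is needed. The following is a minimal sketch for Python 3; the variable names escaped, text and utf8_bytes are only illustrative.

```python
# Same literal as in the question: each "\\u10d8" is a backslash followed by "u10d8".
title = "\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8 == \\u10d8\\u10d4\\u10e0\\u10e3\\u10e1\\u10d0\\u10da\\u10d8\\u10db\\u10d8"

# Step 1: the escape text is pure ASCII, so turn it into bytes.
escaped = title.encode("ascii")

# Step 2: interpret the \uXXXX sequences, giving a str with real Georgian characters.
text = escaped.decode("unicode-escape")

# Step 3 (only if bytes are required): encode the str as UTF-8.
utf8_bytes = text.encode("utf-8")

print(text)
print(len(text), len(utf8_bytes))
```

In Python 3 a str is already Unicode text, so step 3 is only needed when writing to a file, socket, or other byte-oriented destination.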
You can use codecs to get rid of the doubled backslashes and use the string as intended.

Assuming title is a str, you need to encode the string first and then decode it back to Unicode (str).
If title is a bytes instance, you can decode it directly:

>>> t = title.decode('unicode-escape')
>>> t
'ისრაელი == იერუსალიმი'
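The two cases above (str and bytes) can be folded into one small helper. This is only a sketch: the function name unescape is hypothetical, and it assumes the input contains nothing but ASCII characters and escape sequences, so a str with other characters raises UnicodeEncodeError instead of being silently mangled.

```python
def unescape(value):
    """Decode \\uXXXX-style escape text given as either str or bytes."""
    if isinstance(value, str):
        # A str has no decode() in Python 3, so go through ASCII bytes first.
        value = value.encode("ascii")
    # bytes (or bytearray) can be decoded directly.
    return value.decode("unicode-escape")


from_str = unescape("\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8")
from_bytes = unescape(b"\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8")
print(from_str == from_bytes)
```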

Assuming the Unicode text is of type str, convert it using the decode method with unicode-escape:

title="\\u10d8\\u10e1\\u10e0\\u10d0\\u10d4\\u10da\\u10d8 == \\u10d8\\u10d4\\u10e0\\u10e3\\u10e1\\u10d0\\u10da\\u10d8\\u10db\\u10d8"

res1 = title.encode('utf-8')

res2 = res1.decode('unicode-escape')

print(res2)
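One caveat with encoding to UTF-8 before decoding: unicode-escape treats every non-escaped byte as Latin-1, so any real non-ASCII character already present in the string gets garbled. If the input might mix real characters with \uXXXX escapes, a regular-expression substitution is one alternative. The sketch below is an assumption-laden illustration, not part of the answers above: the name unescape_u is hypothetical, and it handles only four-digit \uXXXX escapes (no \n, \x.., \UXXXXXXXX, or surrogate pairs).

```python
import re

# Matches a literal backslash, "u", and exactly four hex digits.
_U_ESCAPE = re.compile(r"\\u([0-9a-fA-F]{4})")


def unescape_u(text):
    """Replace each \\uXXXX escape with its character; leave everything else alone."""
    return _U_ESCAPE.sub(lambda m: chr(int(m.group(1), 16)), text)


mixed = "caf\u00e9 \\u10d8\\u10e1"

# Real characters survive the regex approach.
good = unescape_u(mixed)

# The encode/decode round trip garbles the already-decoded "é".
garbled = mixed.encode("utf-8").decode("unicode-escape")

print(good == garbled)
```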

See, I think this can help you…
@MosheSlavin, it doesn't help.
@ozking This doesn't help: UnicodeEncodeError: 'cp932' codec can't encode character '\u10d8' in position 0: illegal multibyte sequence
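The UnicodeEncodeError quoted in the last comment most likely comes from print rather than from the decoding step: the conversion produced Georgian characters, but a console using cp932 has no way to represent them. The sketch below shows one way to check this and work around it, assuming Python 3.7 or later for sys.stdout.reconfigure; the helper names can_encode and printable are hypothetical.

```python
import sys


def can_encode(s, encoding):
    """Return True if every character of s is representable in encoding."""
    try:
        s.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


def printable(s, encoding):
    # Keep s as-is when possible; otherwise fall back to \uXXXX escapes.
    if can_encode(s, encoding):
        return s
    return s.encode(encoding, errors="backslashreplace").decode(encoding)


text = "\u10d8\u10e1\u10e0\u10d0\u10d4\u10da\u10d8"

# Ask stdout to emit UTF-8 when the stream supports it (Python 3.7+).
if hasattr(sys.stdout, "reconfigure"):
    sys.stdout.reconfigure(encoding="utf-8")

print(printable(text, sys.stdout.encoding or "ascii"))
```

Whether the characters then display correctly still depends on the terminal and its font; setting the PYTHONIOENCODING environment variable to utf-8 is another commonly used option.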