Python: extracting text with a regular expression while ignoring a pattern?

Tags: python, python-3.x

How can I extract the text around a pattern while dropping the pattern itself? An example is given below:

string = ('this is good < U+0097 > never end . < U+0093 > gift,<U+0094 > said . < U+0093 > test . < U+0093 > time , '
          'with,<U+0094 > said boys . gave answer , Naresh Hembrom , '
          'sitting crosslegged charpoy outside home , .')
Desired output:
string = ('this is good never end . gift, said. test. time, with, said boys. gave answer, '
          'Naresh Hembrom, sitting crosslegged charpoy outside home, .')

re.sub returns a string. Try:

import re
# Remove every '<...>' span: a '<', any characters other than '>', then '>'
new_str = re.sub(r'<[^>]*>', '', string)
print(new_str)
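
If the text might contain other angle-bracketed spans that should survive, a narrower pattern may be safer. The sketch below is only an illustration: the sample text and the name `placeholder` are made up for this example, and it assumes the unwanted tokens always look like `<U+XXXX>` with 4 to 6 hex digits and optional spaces inside the brackets.

```python
import re

# Hypothetical sample: one <U+XXXX> placeholder and one unrelated bracketed span.
text = 'a < U+0093 > b <keep me> c'

# Match only placeholders of the form <U+XXXX>, allowing spaces inside the brackets.
# The 4-6 hex digit range is an assumption about the data, not taken from the question.
placeholder = re.compile(r'<\s*U\+[0-9A-Fa-f]{4,6}\s*>')

result = placeholder.sub('', text)
print(result)
```

Here `<keep me>` is left in place because it does not start with `U+`, whereas the broader `<[^>]*>` pattern would remove it as well.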

Here is how I solved the problem:

import re

string = 'this is good < U+0097 > never end . < U+0093 > gift,<U+0094 > said . < U+0093 > test . < U+0093 > time , with,<U+0094 > said boys . gave answer , Naresh Hembrom , sitting crosslegged charpoy outside home , .'

regString = re.sub(r'<(.*?)>', '', string)
print(regString)
The only caveat is that the spacing will not be consistent. You can tweak this regex to get the spacing you need.
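
One possible way to tidy the spacing afterwards is sketched below. It is not part of the original answer: the helper name `strip_placeholders` is hypothetical, and removing the space before commas and periods is an assumption about the style you want (the desired output in the question is not fully consistent on this point, so the result will not match it character for character).

```python
import re

def strip_placeholders(text):
    """Remove <...> spans, then normalize the whitespace left behind."""
    # Remove every <...> span, as in the answer above.
    cleaned = re.sub(r'<[^>]*>', '', text)
    # Collapse the runs of whitespace created by the removals into single spaces.
    cleaned = re.sub(r'\s+', ' ', cleaned)
    # Assumption: no space is wanted before a comma or a period.
    cleaned = re.sub(r'\s+([,.])', r'\1', cleaned)
    return cleaned.strip()

sample = 'this is good < U+0097 > never end . < U+0093 > gift,<U+0094 > said .'
print(strip_placeholders(sample))
```

Keeping the cleanup as separate `re.sub` passes makes each rule easy to drop or adjust if your spacing requirements differ.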


Feel free to comment with what you need and I can help.
