Python regular expression does not match all the required characters

python regex


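I have some txt files that were produced from PDFs, and I want to add XML tags to them using a few Python scripts and regular expression patterns. Most of the time this works fine, but sometimes the expression does not match all of the characters it should. In a regex testing tool the same pattern works fine.

Here is the Python code: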

matchs = re.finditer("<UTop>[^<]+", string)
for m in matchs:
    tagend = m.end()
    string = string[:tagend] + "</UTop>" + string[tagend:]

Use the Unicode flag:
matchs = re.finditer("<UTop>[^<]+",string,re.UNICODE)
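As a quick check of what the flag changes, here is a minimal sketch, assuming Python 3 and a made-up sample string. A negated character class such as `[^<]` matches any character other than `<` whether or not `re.UNICODE` is set, so the two searches below should return the same text.

```python
import re

# Made-up sample for illustration; not taken from the asker's files.
sample = "<UTop>Ministerpräsident Winfried Kretschmann </Top>"
pattern = "<UTop>[^<]+"

# Same pattern, once without and once with the flag.
plain = re.search(pattern, sample).group(0)
with_flag = re.search(pattern, sample, re.UNICODE).group(0)

print(plain == with_flag)
```

If both results are identical on your own data as well, the flag is probably not what decides where the match ends.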

I tested it, and the result seems to be correct:
 # coding: utf-8
 import re
 text = "<Top>1. Regierungserklärung des Ministerpräsidenten<UTop>Ministerpräsident Winfried Kretschmann </Top>"
 # Append the closing tag directly after each "<UTop>..." run
 print(re.sub(r"(<UTop>[^<]+)", r"\g<1></UTop>", text))

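If you are trying to parse HTML with a regular expression, +1 for BeautifulSoup. There are also further details available on handling broken HTML input.

Thanks for your answer. Unfortunately, the Unicode flag does not solve the problem. The closing tag still lands in the middle of the name:
<Top>1. Regierungserklärung des Ministerpräsidenten<UTop>Ministerpräsident Winfried Krets</UTop>chmann </Top>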
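One explanation that fits this output: `re.finditer` scans the original string, so every `m.end()` is an offset into the text as it was before any tag was inserted. Each inserted `</UTop>` makes the working string 7 characters longer, and the tag above sits exactly 7 characters too early ("chmann " is 7 characters), which is what one earlier insertion would cause. The sketch below reproduces the effect and shows one possible fix, inserting from the last match backwards. The sample text and the function names are made up for illustration.

```python
import re

CLOSE = "</UTop>"
PATTERN = re.compile(r"<UTop>[^<]+")


def close_with_stale_offsets(text):
    # Mirrors the loop from the question: offsets come from the original
    # text, but each insertion goes into the already lengthened result.
    result = text
    for m in PATTERN.finditer(text):
        end = m.end()
        result = result[:end] + CLOSE + result[end:]
    return result


def close_in_reverse(text):
    # Insert from the last match backwards, so the offsets of the
    # matches still to be processed are not shifted.
    result = text
    for m in reversed(list(PATTERN.finditer(text))):
        end = m.end()
        result = result[:end] + CLOSE + result[end:]
    return result


# Hypothetical input with two <UTop> runs; the second one drifts.
sample = "<Top>1. A<UTop>Erster Redner </Top><Top>2. B<UTop>Winfried Kretschmann </Top>"
print(close_with_stale_offsets(sample))
print(close_in_reverse(sample))
```

A single `re.sub` call avoids the offset bookkeeping altogether, because the replacement is built in one pass over the original string.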




