Python/Regex：如果一行确实包含某个特殊字符，则拆分字符串_Python_Regex_Regex Negation_Regex Lookarounds

Python/Regex：如果一行确实包含某个特殊字符，则拆分字符串

python regex

Python/Regex：如果一行确实包含某个特殊字符，则拆分字符串,python,regex,regex-negation,regex-lookarounds,Python,Regex,Regex Negation,Regex Lookarounds,我试图拆分字符上的多行字符串，但仅当该行不包含：时。不幸的是，我找不到一种简单的方法来使用re.split（）对字符：进行负回溯，因为：可能发生在字符串前面的另一行中例如，我想在）上拆分下面的字符串字符串： Hello1 ( First : (), Second ) Hello2 ( First ) ['Hello1 (\nFirst : (),\nSecond', 'Hello2 (\nFirst \n'] 输出： Hello1 ( First : (), Second ) He

我试图拆分字符上的多行字符串，但仅当该行不包含

：

时。不幸的是，我找不到一种简单的方法来使用

re.split（）

对字符

：

进行负回溯，因为

：

可能发生在字符串前面的另一行中

例如，我想在

）

上拆分下面的字符串

字符串：

Hello1 (
First : (),
Second )

Hello2 (
First 
)

['Hello1 (\nFirst : (),\nSecond', 'Hello2 (\nFirst \n']

输出：

Hello1 (
First : (),
Second )

Hello2 (
First 
)

['Hello1 (\nFirst : (),\nSecond', 'Hello2 (\nFirst \n']

可以使用

Python

，尽管本机

re

模块并非“开箱即用”

第一种选择较新版本支持可变长度的查找，因此您可以使用

(?<=^[^:]+)\)
# pos. lookbehind making sure there's no : in that line

产生

['\nHello1 (\nFirst : (),\nSecond ', '\n\nHello2 (\nFirst \n', '']

第二种选择或者，您可以匹配有问题的行，并让它们在

（*跳过）（*失败）

之后失败：

^[^:\n]*:.*(*SKIP)(*FAIL)|\)
# match lines with at least one : in it
# let them fail
# or match )

再次在

Python

中：

import regex as re

data = """
Hello1 (
First : (),
Second )

Hello2 (
First 
)"""

pattern = re.compile(r'(?<=^[^:]+)\)', re.MULTILINE)

parts = pattern.split(data)
print(parts)

pattern2 = re.compile(r'^[^:\n]*:.*(*SKIP)(*FAIL)|\)', re.MULTILINE)
parts2 = pattern.split(data)
print(parts2)

看。

第三种选择好的，现在答案比之前想象的要长。在函数的帮助下，您甚至可以使用本机

re

模块来完成此操作。在这里，您需要先替换有问题的

）

，然后按替换项拆分：

def replacer(match):
    if match.group(1) is not None:
        return "SUPERMAN"
    else:
        return match.group(0)

pattern3 = re.compile(r'^[^:\n]*:.*|(\))', re.MULTILINE)
data = pattern3.sub(replacer, data)
parts3 = data.split("SUPERMAN")
print(parts3)