Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/regex/20.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 正则表达式将文本字符串拆分为dict as组_Python_Regex_String Split - Fatal编程技术网

Python 正则表达式将文本字符串拆分为dict as组

Python 正则表达式将文本字符串拆分为dict as组,python,regex,string-split,Python,Regex,String Split,我试图将正则表达式字符串输出作为一组文本添加到字典中,为此我使用了re.split() 我的字符串输出是: mpathag (36005076801b2014804000000000001cd) dm-7 Test ,2145 size=488G features='1 queue_if_no_path' hwhandler='0' wp=rw |-+- policy='service-time 0' prio=50 status=active | |- 12:0:1:3 sdas 66:19

我试图将正则表达式字符串输出作为一组文本添加到字典中,为此我使用了
re.split()

我的字符串输出是:

mpathag (36005076801b2014804000000000001cd) dm-7 Test ,2145
size=488G features='1 queue_if_no_path' hwhandler='0' wp=rw
|-+- policy='service-time 0' prio=50 status=active
| |- 12:0:1:3 sdas  66:192 active ready  running
| `- 13:0:1:3 sdbi  67:192 active ready  running
`-+- policy='service-time 0' prio=10 status=enabled
  |- 12:0:0:3 sdak  66:64  active ready  running
  `- 13:0:0:3 sdba  67:64  active ready  running
mpathz (36005076801b2014804000000000001c4) dm-0 Test ,2145
size=488G features='1 queue_if_no_path' hwhandler='0' wp=rw
|-+- policy='service-time 0' prio=50 status=active
| |- 12:0:0:0 sdah  66:16  active ready  running
| `- 13:0:0:0 sdax  67:16  active ready  running
 `-+- policy='service-time 0' prio=10 status=enabled
   |- 10:0:0:0 sdb   8:16   active ready  running
   `- 13:0:1:0 sdbf  67:144 active ready  running
下面是我的解析代码:

    output = "mpathag (36005076801b2014804000000000001cd) dm-7 Test ,2145\
    size=488G features='1 queue_if_no_path' hwhandler='0' wp=rw\
    |-+- policy='service-time 0' prio=50 status=active\
    | |- 12:0:1:3 sdas  66:192 active ready  running\
    | `- 13:0:1:3 sdbi  67:192 active ready  running\
    `-+- policy='service-time 0' prio=10 status=enabled\
      |- 12:0:0:3 sdak  66:64  active ready  running\
      `- 13:0:0:3 sdba  67:64  active ready  running\
    mpathz (36005076801b2014804000000000001c4) dm-0 Test ,2145\
    size=488G features='1 queue_if_no_path' hwhandler='0' wp=rw\
    |-+- policy='service-time 0' prio=50 status=active\
    | |- 12:0:0:0 sdah  66:16  active ready  running\
    | `- 13:0:0:0 sdax  67:16  active ready  running\
     `-+- policy='service-time 0' prio=10 status=enabled\
       |- 10:0:0:0 sdb   8:16   active ready  running\
       `- 13:0:1:0 sdbf  67:144 active ready  running"

    devices = re.split('mpath', output)
    for dev in devices:
       print dev
我的问题是re.split会从文本中删除模式。是否可以在不删除分隔符的情况下进行拆分

输出结果如下所示:

ag (36005076801b2014804000000000001cd) dm-7 Test ,2145    
size=488G features='1 queue_if_no_path' hwhandler='0' wp=rw    
|-+- policy='service-time 0' prio=50 status=active    
| |- 12:0:1:3 sdas  66:192 active ready  running    
| `- 13:0:1:3 sdbi  67:192 active ready  running    
`-+- policy='service-time 0' prio=10 status=enabled      
  |- 12:0:0:3 sdak  66:64  active ready  running      
  `- 13:0:0:3 sdba  67:64  active ready  running

如果模式包括捕获组,则不会排除该组:

>>> re.split(',', '1,2,3')  # without capturing group
['1', '2', '3']
>>> re.split('(,)', '1,2,3')  # with capturing group
['1', ',', '2', ',', '3']
>>> xs = re.split('(,)', '1,2,3')
>>> [part1+part2 for part1, part2 in zip(xs[1::2], xs[2::2])]
[',2', ',3']


当字符串没有新行时,如何获得格式良好的输出?
devices = re.split('(mpath)', output)
for part1, part2 in zip(devices[1::2], devices[2::2]):
    print part1 + part2