Python 2.7 从cURL输出中查找模式

Python 2.7 从cURL输出中查找模式,python-2.7,awk,sed,Python 2.7,Awk,Sed,我从cURL输出中获得以下数据。我只对数据键之后的数据和问题链接摘要感兴趣。请帮我做这个。这是一个参考输出 <div class="aui-group split-view"><div class="aui-item list-results-panel"><div class="list-panel"><div class="list-ordering"></div><div class="list-content">&l

我从cURL输出中获得以下数据。我只对
数据键之后的数据
问题链接摘要
感兴趣。请帮我做这个。这是一个参考输出

<div class="aui-group split-view"><div class="aui-item list-results-panel"><div class="list-panel"><div class="list-ordering"></div><div class="list-content"><ol class="issue-list"><li  data-id="95253" data-key="WXYZ-15288" title="b1-257-07.ch.abcde.com /grid/* 100%,"><a class="splitview-issue-link" data-issue-key="WXYZ-15288" href="/browse/WXYZ-15288"><img height="16" width="16" alt="Incident" title="Incident - " src="/secure/viewavatar?size=xsmall&amp;avatarId=11100&amp;avatarType=issuetype" />&nbsp;<span class="issue-link-key">WXYZ-15288</span><br/><span class="issue-link-summary">b1-257-07.ch.abcde.com /grid/* 100%,</span></a></li><li  data-id="95073" data-key="WXYZ-15238" title="restarting h4dn30.nm.abcde.com region server,"><a class="splitview-issue-link" data-issue-key="WXYZ-15238" href="/browse/WXYZ-15238"><img height="16" width="16" alt="Incident" title="Incident - " src="/secure/viewavatar?size=xsmall&amp;avatarId=11100&amp;avatarType=issuetype" />&nbsp;<span class="issue-link-key">WXYZ-15238</span><br/><span class="issue-link-summary">restarting h4dn30.nm.abcde.com region server,</span></a></li><li  data-id="95047" data-key="WXYZ-15230" title="Fwd: Alert: abcd-fgh-db1.nm.abcde.com/Mysql Uptime,"><a class="splitview-issue-link" data-issue-key="WXYZ-15230" href="/browse/WXYZ-15230"><img height="16" width="16" alt="Incident" title="Incident - " src="/secure/viewavatar?size=xsmall&amp;avatarId=11100&amp;avatarType=issuetype" />&nbsp;<span class="issue-link-key">WXYZ-15230</span><br/><span class="issue-link-summary">Fwd: Alert: abcd-fgh-db1.nm.abcde.com/Mysql Uptime,</span></a></li><li  data-id="94928" data-key="WXYZ-15194" title="Re: [CRITICAL ALERT! Reached Rule: XX_eml_slayer_label_production"><a class="splitview-issue-link" data-issue-key="WXYZ-15194" href="/browse/WXYZ-15194"><img height="16" width="16" alt="Incident" title="Incident - " src="/secure/viewavatar?size=xsmall&amp;avatarId=11100&amp;avatarType=issuetype" />&nbsp;<span class="issue-link-key">WXYZ-15194</span><br/><span class="issue-link-summary">Re: [CRITICAL] ALERT! Reached Rule: XX_eml_slayer_label_production</span></a></li><li  data-id="94925" data-key="WXYZ-15193" title="Re: [CRITICAL] ALERT! Breached Rule: Reached Rule: XX_eml_slayer_label_production"><a class="splitview-issue-link" data-issue-key="WXYZ-15193" href="/browse/WXYZ-15193"><img height="16" width="16" alt="Incident" title="Incident - " src="/secure/viewavatar?size=xsmall&amp;avatarId=11100&amp;avatarType=issuetype" />&nbsp;<span class="issue-link-key">WXYZ-15193</span><br/><span class="issue-link-summary">Re: [CRITICAL] ALERT! Reached Rule: XX_eml_slayer_label_production</span></a></li></ol></div></div><div class="end-of-stable-message-container"></div><div class="aui-group pagination-view count-pagination"><div class="refresh-container aui-item"><a href="#" class="refresh-table" title="Refresh results">Refresh results</a></div><div class="pagination-container aui-item"><div class="pagination" data-displayable-total="5"></div></div></div></div><div class="aui-item detail-panel navigator-issue-only"><div></div></div></div>
  • 请注意,整个输出显示在一行中。字段的数量可能会有所不同。

    这里有一种方法:

    awk -vRS="=" '/data-key/ {f=NR} f && NR==f+1 {print "data-key="$1}'
    data-key="WXYZ-15288"
    data-key="WXYZ-15238"
    data-key="WXYZ-15230"
    data-key="WXYZ-15194"
    data-key="WXYZ-15193"
    

    更新:

    awk -vRS="[<>]" -F\" '/data-key.*title/ {print $4,$6}' file
    WXYZ-15288 b1-257-07.ch.abcde.com /grid/* 100%,
    WXYZ-15238 restarting h4dn30.nm.abcde.com region server,
    WXYZ-15230 Fwd: Alert: abcd-fgh-db1.nm.abcde.com/Mysql Uptime,
    WXYZ-15194 Re: [CRITICAL ALERT! Reached Rule: XX_eml_slayer_label_production
    WXYZ-15193 Re: [CRITICAL] ALERT! Breached Rule: Reached Rule: XX_eml_slayer_label_production
    
    awk-vRS=“[]”-F\“'/data key.*title/{print$4,$6}文件
    WXYZ-15288 b1-257-07.ch.abcde.com/grid/*100%,
    WXYZ-15238重新启动h4dn30.nm.abcde.com区域服务器,
    WXYZ-15230固件:警报:abcd-fgh-db1.nm.abcde.com/Mysql正常运行时间,
    WXYZ-15194 Re:[严重警报!达到规则:XX\U eml\U杀手\U标签\U生产
    WXYZ-15193 Re:[严重]警报!违反规则:达到规则:XX\U eml\U slayer\U label\U生产
    
    您的预期输出是什么?@Avinash Raj:预期输出应该显示如下内容:
    WXYZ-15288 b1-257-07.ch.abcde.com/grid/*100%,
    请注意,后面的逗号可能并不总是在那里。谢谢,但这对我没有帮助。我通过运行
    awk'{for(i=1;i@rickydj然后发布您希望输入结果如何的示例输出。预期输出应该显示如下内容:
    WXYZ-15288 b1-257-07.ch.abcde.com/grid/*100%,
    请注意,结尾的逗号可能并不总是在那里。@rickydj仍然不清楚。有几个
    数据键
    和several
    发布链接摘要
    ,但仍然不清楚选择一个而不是另一个的规则。感谢Jotne。它可以工作。注意:awk-v不是mac上的重新组合开关。但它可以在我的服务器上工作。非常感谢。