Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/regex/16.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python:解析javascript生成的html字符串中的换行符_Python_Regex_String - Fatal编程技术网

Python:解析javascript生成的html字符串中的换行符

Python:解析javascript生成的html字符串中的换行符,python,regex,string,Python,Regex,String,我有一个特定的URL,它给了我一个大的JSON。我使用正则表达式提取特定属性的值并将其存储在Python字符串中。我捕获的这个值是JavaScript生成的HTML,看起来像 <ul class=\"ylist ylist-bordered search-results\">\n \n \n <li class=\"yloca-search-result\">\n

我有一个特定的URL,它给了我一个大的JSON。我使用正则表达式提取特定属性的值并将其存储在Python字符串中。我捕获的这个值是JavaScript生成的HTML,看起来像

<ul class=\"ylist ylist-bordered search-results\">\n        \n        \n                        <li class=\"yloca-search-result\">\n                        <div class=\"search-result\" data-key=\"ad_business:QaG0eB4HEXgkPIjOCw_3dA\">\n        <div class=\"biz-listing-large\">\n            <div class=\"main-attributes\">\n                <div class=\"media-block media-block--12\">\n                    <div class=\"media-avatar\">\n                                    <div class=\"photo-box pb-90s\">\n                <a href=\"/
也就是说,我想用实际的换行符替换任何一段空白和
'\n'
(可能和这里一样多)。我不知道该怎么做。我原以为任何普通的文本编辑器(我在Windows上使用的是Sublime)都会将\n转换为新行,但我得到的那一行正如您上面看到的


当我将Python变量写入文本文件并在编辑器中打开它时,如何处理存储上述第一行的Python变量,使其看起来像第二行?

如果您不想使用
re
,请使用以下方法:

x = '<ul class=\"ylist ylist-bordered search-results\">\n        \n        \n                        <li class=\"yloca-search-result\">\n                        <div class=\"search-result\" data-key=\"ad_business:QaG0eB4HEXgkPIjOCw_3dA\">\n        <div class=\"biz-listing-large\">\n            <div class=\"main-attributes\">\n                <div class=\"media-block media-block--12\">\n                    <div class=\"media-avatar\">\n                                    <div class=\"photo-box pb-90s\">\n                <a href=\"/'

vals = x.split('\n')
filtered_vals = [item.strip() for item in vals if item.strip() != ""]
for item in filtered_vals:
    print item
x='
    \n\n\n
  • \n\n\n\n
    x = '<ul class=\"ylist ylist-bordered search-results\">\n        \n        \n                        <li class=\"yloca-search-result\">\n                        <div class=\"search-result\" data-key=\"ad_business:QaG0eB4HEXgkPIjOCw_3dA\">\n        <div class=\"biz-listing-large\">\n            <div class=\"main-attributes\">\n                <div class=\"media-block media-block--12\">\n                    <div class=\"media-avatar\">\n                                    <div class=\"photo-box pb-90s\">\n                <a href=\"/'
    
    vals = x.split('\n')
    filtered_vals = [item.strip() for item in vals if item.strip() != ""]
    for item in filtered_vals:
        print item