Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/node.js/42.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Javascript 如何删除<;!doctype html>;从带有cheerio.js的文档_Javascript_Node.js_Web Scraping_Cheerio - Fatal编程技术网

Javascript 如何删除<;!doctype html>;从带有cheerio.js的文档

Javascript 如何删除<;!doctype html>;从带有cheerio.js的文档,javascript,node.js,web-scraping,cheerio,Javascript,Node.js,Web Scraping,Cheerio,我试图从cherio.js解析的html文档中删除和。有可能吗 <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html> <head></head> &

我试图从cherio.js解析的html文档中删除
。有可能吗

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

<html>
  <head></head>
  <body>
    <div>text</div>
  </body>
</html>

文本

您只需提取html即可。您所需要做的就是再次添加html标记

const cheerio = require('cheerio');

const html = `
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

<html>
  <head></head>
  <body>
    <div>text</div>
  </body>
</html>
`;
const $ = cheerio.load(html);
console.log($('html').html());
const cheerio=require('cheerio');
常量html=`
文本
`;
const$=cheerio.load(html);
log($('html').html());

好的,这很有意义。我只是在乱搞
gulpcheerio
库,这使得这一步不可能实现,但我注意到这是我能提供的唯一一个小包装。谢谢。或者你可以
const$outer=cheerio.load(html);const$=cheerio.load($outer('html').html());console.log($.html())