Linux 如何从多个网页下载文本到文件？_Linux_Bash_Console_Lynx - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/linux/26.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Linux 如何从多个网页下载文本到文件？_Linux_Bash_Console_Lynx - Fatal编程技术网

Linux 如何从多个网页下载文本到文件？

linux bash

Linux 如何从多个网页下载文本到文件？,linux,bash,console,lynx,Linux,Bash,Console,Lynx,我想下载一本波兰语词典。不幸的是，这个词包含了所有的屈折变化（不确定正确的英语单词是什么）。我发现这个命令 lynx --dump https://sjp.pl/slownik/lp.phtml?f_vl=2&page=1 > file.txt 可以下载单个词典网页。然后我将不得不以某种方式从文本块中只提取字典条目，但至少这是一个开始不幸的是，我是一个linux noob，不知道如何遍历所有3067页。未经测试，但使用GNU Parallel，您应该能够非常快速轻松地完成这项工

我想下载一本波兰语词典。不幸的是，这个词包含了所有的屈折变化（不确定正确的英语单词是什么）。我发现这个命令

lynx --dump https://sjp.pl/slownik/lp.phtml?f_vl=2&page=1 > file.txt

可以下载单个词典网页。然后我将不得不以某种方式从文本块中只提取字典条目，但至少这是一个开始

不幸的是，我是一个linux noob，不知道如何遍历所有3067页。

未经测试，但使用GNU Parallel，您应该能够非常快速轻松地完成这项工作

parallel -qk 'lynx --dump https://sjp.pl/slownik/lp.phtml?f_vl=2&page={}' ::: {1..3067} > file.txt
如果无效，请尝试删除单引号。如果这不起作用，请尝试在
&
前面加一个反斜杠。对不起，我现在没有办法测试
慢的方法是：

for ((i=1;i<3068;i++)) ; do lynx --dump ...page=$i done > file.txt
（（i=1；i file.txt）的
我发现使用
lynx…page=$I
只显示第一页，而不管
I
，我不理解，因为
https://...page=i
肯定会链接到第i页。实际上，其他程序，如curl或wget也会链接到第i页。使用
wgethttps://sjp.pl/slownik/lp.phtml?f_vl=2&page=200
will获取
…page=1
的内容，而粘贴
…page=200
时确实显示了第200页…我不明白。好吧，我发现我必须将链接放在引号中，因为lynx误解了“&”请再看一看答案，因为Ole很乐意添加
-q
选项来为我们处理报价。

[bash]相关文章推荐

如何在Bash中区分两条管道？ bash

Bash 从shell脚本的目录中选择随机文件的最佳方法 bash file shell random

Bash 从后面直到时间戳抓取一个巨大的日志文件 bash unix text

希望在bash中对ssh密码进行异常处理 bash shell

在bash上显示查找/替换多个文档的结果 bash shell replace sed

Bash 如何将ec2din的特定输出存储在单独的变量中 bash amazon-web-services

将命令行选项传递给bash中调用的脚本 bash shell

Bash脚本的行为因执行它的内容而异 bash shell

Bash 美元（…）和“…”之间有什么区别？ bash

如何使用bash在特定行后插入文本？ bash sed

使用bash脚本更新$push MongoDB bash mongodb

bash-处理文件名中的特殊字符 bash

Bash 在正则表达式中的字符类内使用括号 bash sed

将.bash_profile和.bashrc作为一个文件使用 bash

在EOF bash脚本中重用变量 bash variables

openssl pass参数通过bash安全吗？ bash shell security openssl

Bash 如何对“a”进行排序；周一YYYY日NUM；使用UNIX工具的时间？ bash shell

Bash脚本中的用户输入？ bash shell scripting

循环中的Bash变量名扩展 bash

如何在=登录Bash后删除任何内容？ bash awk

随机文章推荐

Ionic framework 实时重新加载不适用于ION serve命令 ionic-framework

Ionic framework Ionic框架-上传到Ionic View应用程序，没有获取最新的javascript文件，可能缓存仍然存在？ ionic-framework

Ionic framework 爱奥尼亚+；输入范围+；Internet Explorer=冻结滑块 ionic-framework

Ionic framework 当我需要在另一个具有Ionic框架的视图中使用swipeable视图时，如何实现路由？ ionic-framework

Ionic framework 对动态数据使用swiper？ ionic-framework

Ionic framework $ionicLoading.hide冻结所有ionSpinner'；在视图中显示动画 ionic-framework

Ionic framework 激活时，更改爱奥尼亚侧菜单中项目图标的颜色 ionic-framework

Ionic framework ionic back按钮不出现在选项卡式视图中 ionic-framework

Ionic framework 应用程序更新中的离子存储 ionic-framework

Ionic framework 如何动态更改ion导航栏的颜色？ ionic-framework

Ionic framework 实用程序CLI意外关闭（退出代码1）：Ionic ionic-framework ionic2

Ionic framework 如何设置IONIC4设置选项卡背景透明？ ionic-framework

Ionic framework Ionic3:无法在应用浏览器中声明 ionic-framework

Ionic framework 处理Ionic 4/Firebase断开连接的最佳方法是什么 ionic-framework google-cloud-firestore

Ionic framework 离子3中的引导锚HREF ionic-framework

Ionic framework 创建新的ionic项目时遇到错误 ionic-framework

Ionic framework Ionic 3 AlertController：单词在新行上拆分为两个单词 ionic-framework

Ionic framework Ionic Admob奖励服务器验证 ionic-framework

Ionic framework 从Play Store获取应用程序版本以更新设备中的应用程序。爱奥尼亚+；电容器 ionic-framework

Ionic framework 离子时间倒计时 ionic-framework

[linux]相关推荐

Tags

Biztalk Microservices Jestjs Jboss Fluent Nhibernate Ibm Mq Requirejs Charts Stream Silverlight 4.0 Apache Kafka Ruby Netsuite Date Redis Protocol Buffers Pascal Cryptography Java 8 Vector Map Keyboard Macos Stanford Nlp Ftp Stm32 Gcc Lucene Eclipse Plugin Actions On Google Polymer Cmake Reference Socket.io Blockchain Yii2 Cassandra Drupal String Ios4 Stripe Payments Scikit Learn Junit Service Opengl Vbscript Xcode Certificate Jquery Mobile Windbg Hash If Statement Apache Flex Hyperlink Silverstripe Facebook Graph Api Jasper Reports Project Management Firefox Addon Spring Boot Datatables Sharepoint 2013 Floating Point Automation Sapui5 Couchdb Ldap Joomla Sharepoint 2010 Webview Encoding Codeigniter Ckeditor Loops Configuration Activemq Redirect Twilio For Loop Collections Google Plus Discord.py Sonarqube Julia Menu Shell Algorithm Python Sphinx Automated Tests Big O Knockout.js Transactions Iis Regex Entity Framework 4 Random Python 2.7 Jpa Browser Path Ipad Clearcase Breeze Design Patterns Highcharts Windows Phone 8.1 Leaflet Common Lisp Zurb Foundation Push Notification Scheme Actionscript Tinymce Swing Calendar Puppet Flash Kubernetes Sip Azure Service Fabric Arduino Multithreading Grafana Django Ravendb Racket Wso2 Layout Arm Ant Wicket Nunit Domain Driven Design Asterisk Delphi Compression Oop Zend Framework2 Coding Style Asp.net Web Api Tsql Jquery Plugins Groovy Editor Azure Interface Uml Com Select Javafx Web Applications Syntax Identityserver4 Directx Asp.net Mvc Apache Storm Appium Angular Material Bazel Oracle10g Ms Word Applescript Jenkins Php Material Ui Iframe Networking Nosql Terminal Npm Api Llvm Rust Jwt Sencha Touch 2 Apache Flink Vim Documentation Airflow Cocoa Markdown Instagram Dynamic Rx Java Omnet++ Asp Classic Outlook Asp.net Core Ansible Mule Twitter Bootstrap 3 Selenium Webdriver Qml Amazon Web Services Meteor Smtp Microsoft Graph Api Tree Itext Safari Visual Studio 2012

Copyright © 2024. All Rights Reserved by - Fatal编程技术网