UNIX shell script: split a text file by the entries of the text file


shell unix


I am trying to analyze a huge text file (1.6 GB) whose data lines look like this:

20090118025859 -2.400000 78.100000 1023.200000 0.000000
20090118025900 -2.500000 78.100000 1023.200000 0.000000
20090118025901 -2.400000 78.100000 1023.200000 0.000000

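I don't even know how many lines there are, but I am trying to split the file by date. The number on the left is a timestamp (these lines, for example, are from January 18, 2009). How can I split this file into parts according to the date?

The number of entries per date varies, so using split with a constant line count won't work. All I know how to do is something like

grep "^20090118" file > data20090118.dat

but surely there is a way to do all the dates at once, right?

Thanks in advance,

Alex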

If the entries are in date order, this should work:

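date=20090101 # Change to the earliest date in the file
while IFS= read -rd $'\n' line
do
    # Compare the first 8 characters of the first field (YYYYMMDD) with $date
    if [ "$(echo "$line" | cut -d ' ' -f 1 | cut -c 1-8)" -eq $date ]
    then
        echo "$line" >> "$date.dat"
    else
        # Caveat: the line that triggers this branch is not written anywhere,
        # and a plain increment does not roll over month boundaries
        let date++
    fi
done < log.dat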


Using awk:
awk '{print > ("data" substr($1,1,8) ".dat")}' myfile
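With many distinct dates, an awk one-liner that never closes its output files may run into a limit on simultaneously open files in some awk implementations. The following is a minimal sketch of one way around that, assuming the input is sorted by timestamp as in the question. The names workdir, sample.log, day, prev and out are illustrative only and do not come from the original answer.

```shell
# Scratch directory and a small sample in the question's format (illustrative data).
workdir=$(mktemp -d)
cat > "$workdir/sample.log" <<'EOF'
20090118025859 -2.400000 78.100000 1023.200000 0.000000
20090118025900 -2.500000 78.100000 1023.200000 0.000000
20090119000001 -2.400000 78.100000 1023.200000 0.000000
EOF

# Write each line to data<YYYYMMDD>.dat, keeping only one output file open at a time.
awk -v dir="$workdir" '{
    day = substr($1, 1, 8)             # first 8 characters of the timestamp: YYYYMMDD
    if (day != prev) {                 # first line, or the date has changed
        if (prev != "") close(out)     # release the previous output file
        out = dir "/data" day ".dat"
        prev = day
    }
    print >> out                       # append, so a date that reappears later is not truncated
}' "$workdir/sample.log"
```

Because the sketch appends, running it twice against the same output directory would duplicate the data; removing old output files first avoids that.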

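The caveats are that there must be more than one record per day, and that the output files will contain blank lines:
uniq --all-repeated=separate -w8 file | csplit -s - '/^$/' '{*}'

We really should have an option for uniq to output the unique records as well.
Also, csplit should have an option to suppress the matched line.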
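Newer GNU coreutils releases appear to provide both of the options wished for here: uniq --group prints every line, including single-record groups, with an empty line between groups, and csplit --suppress-matched drops the matching separator lines. The sketch below assumes a GNU coreutils version that has these options (they are not part of POSIX). The names workdir, sample.log and the part prefix are illustrative.

```shell
# Scratch directory and a small sample in the question's format (illustrative data).
workdir=$(mktemp -d)
cat > "$workdir/sample.log" <<'EOF'
20090118025859 -2.400000 78.100000 1023.200000 0.000000
20090118025900 -2.500000 78.100000 1023.200000 0.000000
20090119000001 -2.400000 78.100000 1023.200000 0.000000
EOF

# -w8 compares only the first 8 characters (the date); --group separates
# the groups with an empty line. csplit cuts at each empty line,
# --suppress-matched omits that line from the output, and -f sets the
# output file prefix (part00, part01, ...).
uniq --group -w8 "$workdir/sample.log" |
    csplit -s --suppress-matched -f "$workdir/part" - '/^$/' '{*}'
```

The output files are numbered in order rather than named after their dates, so a rename step would still be needed if date-based names matter.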
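... also does not work, because there are spaces around the equals sign. The default delimiter for read is already a newline. Don't set the earliest date and increment it by 1; just check whether the date in the current line equals the last saved date. When it changes, update the saved value.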
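The suggestion in this comment, comparing each line's date with the last saved one instead of incrementing a counter, could be sketched in bash as follows. This is an illustration under the assumption that the input is sorted by timestamp. The names workdir, current and day, and the use of file descriptor 3, are choices made for the example rather than anything from the original thread.

```shell
# Scratch directory and a small sample with a skipped day (illustrative data).
workdir=$(mktemp -d)
cat > "$workdir/log.dat" <<'EOF'
20090118025859 -2.400000 78.100000 1023.200000 0.000000
20090118025900 -2.500000 78.100000 1023.200000 0.000000
20090120000001 -2.400000 78.100000 1023.200000 0.000000
EOF

current=""                                  # date of the file currently open
while IFS= read -r line
do
    day=${line:0:8}                         # first 8 characters: YYYYMMDD
    if [ "$day" != "$current" ]
    then
        current=$day                        # date changed: remember the new one
        exec 3>> "$workdir/$current.dat"    # point fd 3 at the new date's file
    fi
    printf '%s\n' "$line" >&3               # every line is written, including the first of a new date
done < "$workdir/log.dat"
exec 3>&-                                   # close the last output file
```

Skipped days and month boundaries need no special handling here, since the date is always taken from the line itself. For a file of 1.6 GB a line-by-line shell loop is likely to be much slower than awk.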



