Bash 在大型数据集中快速验证两个“单元格”是否匹配
我正在处理大量的数据,每次检查几百行,我想知道比较两组不同数据最有效的方法是什么 我要寻找的是找到如下差异: 来源1:Bash 在大型数据集中快速验证两个“单元格”是否匹配,bash,verify,Bash,Verify,我正在处理大量的数据,每次检查几百行,我想知道比较两组不同数据最有效的方法是什么 我要寻找的是找到如下差异: 来源1: site1.49729 site2.80124 /path/path/path/path site1.49730 site2.80125 /path/path/path/path site1.49734 site2.80126 /
site1.49729 site2.80124 /path/path/path/path
site1.49730 site2.80125 /path/path/path/path
site1.49734 site2.80126 /path/path/path/path
site1.49735 site2.80127 /path/path/path/path
site1.49736 site2.80128 /path/path/path/path
site1.49737 site2.80129 /path/path/path/path
site1.49738 site2.80131 /path/path/path/path
site1.49752 site2.80171 /path/path/path/path
来源2:
site1.49729 site2.80124 /path/path/path/path
site1.49730 site2.80125 /path/path/path/path
site1.49734 **site2.1234** /path/path/path/path
site1.49735 site2.80127 /path/path/path/path
site1.49736 site2.80128 /path/path/path/path
site1.49737 **site2.12345** /path/path/path/path
site1.49738 site2.80131 /path/path/path/path
site1.49752 site2.80171 /path/path/path/path
**site1.49735 site2.99999 /path/path/path/path**
差异突出与**
确保两个命令的第二列中的所有内容1不丢失,2与记录完全匹配的最有效方法是什么
关于从何处开始,您有什么想法吗?我建议只针对源1和源2运行一个diff。它将显示包含差异的行。将源1的内容放在s1.txt中,将源2的内容放在s2.txt中,然后运行命令: $diff-y s1.txt s2.txt
这将显示两个文件之间的差异。使用“diff”命令。它为您的案例生成如下输出:
< site1.49734 site2.80126 /path/path/path/path
---
> site1.49734 **site2.1234** /path/path/path/path
6c6
< site1.49737 site2.80129 /path/path/path/path
---
> site1.49737 **site2.12345** /path/path/path/path
8c8,9
< site1.49752 site2.80171 /path/path/path/path
\ No newline at end of file
---
> site1.49752 site2.80171 /path/path/path/path
> **site1.49735 site2.99999 /path/path/path/path**
有许多文本编辑器提供用于区分文件或查看差异的GUI,例如Notepad++非常感谢!我以前从未使用过并排diff,它非常适合于此!