仅限awk计数选择性组合:
希望从第3个字段到第5个字段读取并计算字段值==“TRUE” Input.txt仅限awk计数选择性组合:,awk,Awk,希望从第3个字段到第5个字段读取并计算字段值==“TRUE” Input.txt Locationx,Desc,A,B,C,Locationy ab123,Name1,TRUE,TRUE,TRUE,ab1234 ab123,Name2,TRUE,FALSE,TRUE,ab1234 ab123,Name2,FALSE,FALSE,TRUE,ab1234 ab123,Name1,TRUE,TRUE,TRUE,ab1234 ab123,Name2,TRUE,TRUE,TRUE,ab1234 ab123
Locationx,Desc,A,B,C,Locationy
ab123,Name1,TRUE,TRUE,TRUE,ab1234
ab123,Name2,TRUE,FALSE,TRUE,ab1234
ab123,Name2,FALSE,FALSE,TRUE,ab1234
ab123,Name1,TRUE,TRUE,TRUE,ab1234
ab123,Name2,TRUE,TRUE,TRUE,ab1234
ab123,Name3,FALSE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,ab1234
ab123,Name1,TRUE,TRUE,FALSE,ab1234
Locationx,Desc,INCOMING,OUTGOING,SMS,RECHARGE,DEBIT,DATA,Locationy
ab123,Name1,TRUE,TRUE,TRUE,FALSE,FALSE,FALSE,ab1234
ab123,Name2,TRUE,TRUE,FALSE,TRUE,TRUE,TRUE,ab1234
ab123,Name2,TRUE,TRUE,TRUE,TRUE,FALSE,FALSE,ab1234
ab123,Name1,TRUE,TRUE,TRUE,TRUE,FALSE,TRUE,ab1234
ab123,Name2,TRUE,TRUE,TRUE,TRUE,FALSE,TRUE,ab1234
ab123,Name3,FALSE,FALSE,FALSE,TRUE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,TRUE,TRUE,TRUE,TRUE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,TRUE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,TRUE,FALSE,FALSE,ab1234
ab123,Name1,TRUE,TRUE,FALSE,FALSE,FALSE,TRUE,ab1234
当从第3个字段到第5个字段读取标题时,即A、B、C只想生成唯一的组合和排列,如A、B、C、AB、AC、AB、ABC。
注:AA、BB、CC、BA等除外
如果“AB”组合计数考虑为“真”,则不应再次考虑“A”组合计数和“B”计数,以避免重复
示例#1
作品#1
示例2
作品2
例#3
作品3
期望输出:
Desc,A,B,C,AB,AC,BC,ABC
Name1,,,,1,,,2
Name2,,,1,,1,,1
Name3,1,,,2,,,
实际文件如下所示:
Input.txt
Locationx,Desc,A,B,C,Locationy
ab123,Name1,TRUE,TRUE,TRUE,ab1234
ab123,Name2,TRUE,FALSE,TRUE,ab1234
ab123,Name2,FALSE,FALSE,TRUE,ab1234
ab123,Name1,TRUE,TRUE,TRUE,ab1234
ab123,Name2,TRUE,TRUE,TRUE,ab1234
ab123,Name3,FALSE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,ab1234
ab123,Name1,TRUE,TRUE,FALSE,ab1234
Locationx,Desc,INCOMING,OUTGOING,SMS,RECHARGE,DEBIT,DATA,Locationy
ab123,Name1,TRUE,TRUE,TRUE,FALSE,FALSE,FALSE,ab1234
ab123,Name2,TRUE,TRUE,FALSE,TRUE,TRUE,TRUE,ab1234
ab123,Name2,TRUE,TRUE,TRUE,TRUE,FALSE,FALSE,ab1234
ab123,Name1,TRUE,TRUE,TRUE,TRUE,FALSE,TRUE,ab1234
ab123,Name2,TRUE,TRUE,TRUE,TRUE,FALSE,TRUE,ab1234
ab123,Name3,FALSE,FALSE,FALSE,TRUE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,TRUE,TRUE,TRUE,TRUE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,TRUE,FALSE,FALSE,ab1234
ab123,Name3,TRUE,TRUE,FALSE,TRUE,FALSE,FALSE,ab1234
ab123,Name1,TRUE,TRUE,FALSE,FALSE,FALSE,TRUE,ab1234
已经尝试了很多,没有什么是具体的,任何建议请
编辑:实际输入的所需输出:
Desc,INCOMING-OUTGOING-SMS-RECHARGE-DEBIT-DATA,OUTGOING-SMS-RECHARGE-DEBIT-DATA,INCOMING-SMS-RECHARGE-DEBIT-DATA,INCOMING-OUTGOING-RECHARGE-DEBIT-DATA,INCOMING-OUTGOING-SMS-RECHARGE-DATA,INCOMING-OUTGOING-SMS-RECHARGE-DEBIT,SMS-RECHARGE-DEBIT-DATA,OUTGOING-RECHARGE-DEBIT-DATA,OUTGOING-SMS-RECHARGE-DATA,OUTGOING-SMS-RECHARGE-DEBIT,INCOMING-RECHARGE-DEBIT-DATA,INCOMING-SMS-DEBIT-DATA,INCOMING-SMS-RECHARGE-DATA,INCOMING-SMS-RECHARGE-DEBIT,INCOMING-OUTGOING-DEBIT-DATA,INCOMING-OUTGOING-RECHARGE-DATA,INCOMING-OUTGOING-RECHARGE-DEBIT,INCOMING-OUTGOING-SMS-DATA,INCOMING-OUTGOING-SMS-DEBIT,INCOMING-OUTGOING-SMS-RECHARGE,RECHARGE-DEBIT-DATA,SMS-DEBIT-DATA,SMS-RECHARGE-DATA,SMS-RECHARGE-DEBIT,OUTGOING-RECHARGE-DATA,OUTGOING-RECHARGE-DEBIT,OUTGOING-SMS-DATA,OUTGOING-SMS-DEBIT,OUTGOING-SMS-RECHARGE,INCOMING-DEBIT-DATA,INCOMING-RECHARGE-DATA,INCOMING-RECHARGE-DEBIT,INCOMING-SMS-DATA,INCOMING-SMS-DEBIT,INCOMING-SMS-RECHARGE,INCOMING-OUTGOING-DATA,INCOMING-OUTGOING-DEBIT,INCOMING-OUTGOING-RECHARGE,INCOMING-OUTGOING-SMS,DEBIT-DATA,RECHARGE-DATA,RECHARGE-DEBIT,SMS-DATA,SMS-DEBIT,SMS-RECHARGE,OUTGOING-DATA,OUTGOING-DEBIT,OUTGOING-RECHARGE,OUTGOING-SMS,INCOMING-DATA,INCOMING-DEBIT,INCOMING-RECHARGE,INCOMING-SMS,INCOMING-OUTGOING,DATA,DEBIT,RECHARGE,SMS,OUTGOING,INCOMING
Name1,,,,,1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1,,,1,,,,,,,,,,,,,,,,,,,,,
Name2,,,,1,1,,,,,,,,,,,,,,,1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
Name3,1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2,,,,,,,,,,,,,,,,,,,1,,,
没有Perl和Python访问权限 我已经为您编写了一个
perl
脚本。从大小和注释中可以看出,完成这项工作非常简单
#!/usr/bin/perl
use strict;
use warnings;
use autodie;
use Algorithm::Combinatorics qw(combinations);
## change the file to the path where your file exists
open my $fh, '<', 'file';
my (%data, @new_labels);
## capture the header line in an array
my @header = split /,/, <$fh>;
## backup the header
my @fields = @header;
## remove first, second and last columns
@header = splice @header, 2, -1;
## generate unique combinations
for my $iter (1 .. +@header) {
my $combination = combinations(\@header, $iter);
while (my $pair = $combination->next) {
push @new_labels, "@$pair";
}
}
## iterate through rest of the file
while(my $line = <$fh>) {
my @line = split /,/, $line;
## identify combined labels that are true
my @is_true = map { $fields[$_] } grep { $line[$_] eq "TRUE" } 0 .. $#line;
## increment counter in hash map keyed at description and then new labels
++$data{$line[1]}{$_} for map { s/ /-/g; $_ } "@is_true";
}
## print the new header
print join ( ",", "Desc", map {s/ /-/g; $_} reverse @new_labels ) . "\n";
## print the description and counter values
for my $desc (sort keys %data){
print join ( ",", $desc, ( map { $data{$desc}{$_} //= "" } reverse @new_labels ) ) . "\n";
}
注意:请重新查看您的预期输出。从上面的脚本生成的输出可以看出,它几乎没有错误。我已经为您编写了一个
perl
脚本。从大小和注释中可以看出,完成这项工作非常简单
#!/usr/bin/perl
use strict;
use warnings;
use autodie;
use Algorithm::Combinatorics qw(combinations);
## change the file to the path where your file exists
open my $fh, '<', 'file';
my (%data, @new_labels);
## capture the header line in an array
my @header = split /,/, <$fh>;
## backup the header
my @fields = @header;
## remove first, second and last columns
@header = splice @header, 2, -1;
## generate unique combinations
for my $iter (1 .. +@header) {
my $combination = combinations(\@header, $iter);
while (my $pair = $combination->next) {
push @new_labels, "@$pair";
}
}
## iterate through rest of the file
while(my $line = <$fh>) {
my @line = split /,/, $line;
## identify combined labels that are true
my @is_true = map { $fields[$_] } grep { $line[$_] eq "TRUE" } 0 .. $#line;
## increment counter in hash map keyed at description and then new labels
++$data{$line[1]}{$_} for map { s/ /-/g; $_ } "@is_true";
}
## print the new header
print join ( ",", "Desc", map {s/ /-/g; $_} reverse @new_labels ) . "\n";
## print the description and counter values
for my $desc (sort keys %data){
print join ( ",", $desc, ( map { $data{$desc}{$_} //= "" } reverse @new_labels ) ) . "\n";
}
注意:请重新查看您的预期输出。从上面的脚本生成的输出可以看出,它几乎没有错误。我已经为您编写了一个
perl
脚本。从大小和注释中可以看出,完成这项工作非常简单
#!/usr/bin/perl
use strict;
use warnings;
use autodie;
use Algorithm::Combinatorics qw(combinations);
## change the file to the path where your file exists
open my $fh, '<', 'file';
my (%data, @new_labels);
## capture the header line in an array
my @header = split /,/, <$fh>;
## backup the header
my @fields = @header;
## remove first, second and last columns
@header = splice @header, 2, -1;
## generate unique combinations
for my $iter (1 .. +@header) {
my $combination = combinations(\@header, $iter);
while (my $pair = $combination->next) {
push @new_labels, "@$pair";
}
}
## iterate through rest of the file
while(my $line = <$fh>) {
my @line = split /,/, $line;
## identify combined labels that are true
my @is_true = map { $fields[$_] } grep { $line[$_] eq "TRUE" } 0 .. $#line;
## increment counter in hash map keyed at description and then new labels
++$data{$line[1]}{$_} for map { s/ /-/g; $_ } "@is_true";
}
## print the new header
print join ( ",", "Desc", map {s/ /-/g; $_} reverse @new_labels ) . "\n";
## print the description and counter values
for my $desc (sort keys %data){
print join ( ",", $desc, ( map { $data{$desc}{$_} //= "" } reverse @new_labels ) ) . "\n";
}
注意:请重新查看您的预期输出。从上面的脚本生成的输出可以看出,它几乎没有错误。我已经为您编写了一个
perl
脚本。从大小和注释中可以看出,完成这项工作非常简单
#!/usr/bin/perl
use strict;
use warnings;
use autodie;
use Algorithm::Combinatorics qw(combinations);
## change the file to the path where your file exists
open my $fh, '<', 'file';
my (%data, @new_labels);
## capture the header line in an array
my @header = split /,/, <$fh>;
## backup the header
my @fields = @header;
## remove first, second and last columns
@header = splice @header, 2, -1;
## generate unique combinations
for my $iter (1 .. +@header) {
my $combination = combinations(\@header, $iter);
while (my $pair = $combination->next) {
push @new_labels, "@$pair";
}
}
## iterate through rest of the file
while(my $line = <$fh>) {
my @line = split /,/, $line;
## identify combined labels that are true
my @is_true = map { $fields[$_] } grep { $line[$_] eq "TRUE" } 0 .. $#line;
## increment counter in hash map keyed at description and then new labels
++$data{$line[1]}{$_} for map { s/ /-/g; $_ } "@is_true";
}
## print the new header
print join ( ",", "Desc", map {s/ /-/g; $_} reverse @new_labels ) . "\n";
## print the description and counter values
for my $desc (sort keys %data){
print join ( ",", $desc, ( map { $data{$desc}{$_} //= "" } reverse @new_labels ) ) . "\n";
}
注意:请重新查看您的预期输出。从上面的脚本生成的输出可以看出,它几乎没有错误。下面是使用
awk
解决此问题的尝试:
script.awk的内容
执行:
现在,组合
函数中存在相当明显的错误。它不会递归打印所有组合。例如:对于A B C D
它将打印
A
B
C
AB
AC
ABC
但不是BC
以下是使用awk
解决此问题的尝试:
script.awk的内容
执行:
现在,组合
函数中存在相当明显的错误。它不会递归打印所有组合。例如:对于A B C D
它将打印
A
B
C
AB
AC
ABC
但不是BC
以下是使用awk
解决此问题的尝试:
script.awk的内容
执行:
现在,组合
函数中存在相当明显的错误。它不会递归打印所有组合。例如:对于A B C D
它将打印
A
B
C
AB
AC
ABC
但不是BC
以下是使用awk
解决此问题的尝试:
script.awk的内容
执行:
现在,组合
函数中存在相当明显的错误。它不会递归打印所有组合。例如:对于A B C D
它将打印
A
B
C
AB
AC
ABC
但不是BC
您希望从实际文件中获得什么输出?@jaypal Ji,也要编辑实际输入的所需输出。awk
将是太多的工作。如果您愿意,我可以编写一些perl
来工作。您希望实际文件的输出是什么?@jaypal Ji,也可以编辑实际输入的所需输出。awk
的工作量太大了。如果您愿意,我可以编写一些perl
来工作。您希望实际文件的输出是什么?@jaypal Ji,也可以编辑实际输入的所需输出。awk
的工作量太大了。如果您愿意,我可以编写一些perl
来工作。您希望实际文件的输出是什么?@jaypal Ji,也可以编辑实际输入的所需输出。awk
的工作量太大了。如果您愿意,我可以编写一些perl
,这将很有效。是CPAN上可用的模块。非常感谢Jaypal提供的良好支持,保留Rocks,让我尝试将Perl安装到我的系统中,目前我只能访问Bash和awk。。。是CPAN上可用的模块。非常感谢Jaypal提供的良好支持,保留Rocks,让我尝试将Perl安装到我的系统中,目前我只能访问Bash和awk。。。是CPAN上可用的模块。非常感谢Jaypal提供的良好支持,保留Rocks,让我尝试将Perl安装到我的系统中,目前我只能访问Bash和awk。。。是CPAN上可用的模块。非常感谢Jaypal提供的良好支持,保留Rocks,让我尝试将Perl安装到我的系统中,目前我只能访问Bash和awk。。。
$ awk -f script.awk file
Desc,A,B,C,A-B,A-C,A-B-C
Name1,,,,1,,2
Name2,,,1,,1,1
Name3,1,,,2,,
A
B
C
AB
AC
ABC