Awk 将数据列表中的序列提取到单独的行中_Awk

Awk 将数据列表中的序列提取到单独的行中

awk

Awk 将数据列表中的序列提取到单独的行中,awk,Awk,sample.txt确实有“制表符分隔列”，并且有分隔的分号，需要相应地从数字序列分割为重复值 cat sample.txt 2 2627 588;577 2 2629 566 2 2685 568-564 2 2771 573 2 2773 597 2 2779 533 2 2799 558 2 6919 726;740-742;777 2 7295 761;771-772 请注意，某些行可能具有倒序

sample.txt

确实有“制表符分隔列”，并且有分隔的分号，需要相应地从数字序列分割为重复值

cat sample.txt

2   2627    588;577
2   2629    566
2   2685    568-564
2   2771    573
2   2773    597
2   2779    533
2   2799    558
2   6919    726;740-742;777
2   7295    761;771-772

请注意，某些行可能具有倒序

568-564

通过使用前面的脚本，我成功地将其拆分，但未能从序列中提取（按破折号拆分）

请尝试以下内容（几分钟后将添加解释）

$awk'
开始{
FS=“（+）”#输入字段分隔符为空格或；
OFS=“，”#输出fs为逗号
}
{
对于（i=3；i$awk'
开始{
FS=“（+）”#输入字段分隔符为空格或；
OFS=“，”#输出fs为逗号
}
{
对于（i=3；i
#!/bin/sh
awk -F"\t" '{print $1}' $1 >> $2 && 
awk -F"\t" '{print $2}' $1 >> $2 && 
awk -F"\t" '{print $3}' $1 >> $2 &&
sed -i "s/^M//;s/;\r//g" $2

#!/bin/awk -f
BEGIN { FS=";"; recNr=1}
!NF { ++recNr; lineNr=0; next }
{ ++lineNr }
lineNr == 1 { next }
recNr == 1  { a[lineNr] = $0 }
recNr == 2  { b[lineNr] = $0 }
recNr == 3  {
    for (i=1; i<=NF; i++) {
        print a[lineNr] "," b[lineNr] "," $i
    }
}

2,2627,588
2,2627,577
2,2629,566
2,2685,564
2,2685,565
2,2685,566
2,2685,567
2,2685,568
2,2771,573
2,2773,597
2,2779,533
2,2799,558
2,6919,726
2,6919,740
2,6919,741
2,6919,742
2,6919,777
2,7295,761
2,7295,771
2,7295,772

awk '
BEGIN{
  OFS=","
}
{
  num=split($NF,array,";")
  for(i=1;i<=num;i++){
    if(array[i]~/-/){
      split(array[i],array2,"-")
      to=array2[1]>array2[2]?array2[1]:array2[2]
      from=array2[1]<array2[2]?array2[1]:array2[2]
      while(from<=to){
        print $1,$2,from++
      }
    }
    else{
      print $1,$2,array[i]
    }
    from=to=""
  }
}
'   Input_file

awk '                                                  ##Starting awk program from here.
BEGIN{                                                 ##Starting BEGIN section of code here.
  OFS=","                                              ##Setting OFS as comma here.
}
{
  num=split($NF,array,";")                             ##Splitting last field of line into an array named array with delimiter semi-colon here.
  for(i=1;i<=num;i++){                                 ##Starting a for loop from 1 to till value of num which is actually length of array created in previous step.
    if(array[i]~/-/){                                  ##Checking condition if array value with index i is having dash then do followong.
      split(array[i],array2,"-")                       ##Split value of array with index i to array2 here with delimiter -(dash) here.
      to=array2[1]>array2[2]?array2[1]:array2[2]       ##Creating to variable which will compare 2 elements of array2 and have maximum value out of them here.
      from=array2[1]<array2[2]?array2[1]:array2[2]     ##Creating from variable which will compare 2 elements of array2 and will have minimum out of them.
      while(from<=to){                                 ##Running while loop from variable from to till value of variable to here.
        print $1,$2,from++                             ##Printing 1st, 2nd fields with value of from variable and increasing from value with 1 each time it comes here.
      }
    }
    else{                                              ##Mention else part of if condition here.
      print $1,$2,array[i]                             ##Printing only 1st, 2nd fields along with value of array with index i here.
    }
    from=to=""                                         ##Nullifying variables from and to here.
  }
}
'  Input_file                                          ##Mentioning Input_file name here.

2,2627,588
2,2627,577
2,2629,566
2,2685,564
2,2685,565
2,2685,566
2,2685,567
2,2685,568
2,2771,573
2,2773,597
2,2779,533
2,2799,558
2,6919,726
2,6919,740
2,6919,741
2,6919,742
2,6919,777
2,7295,761
2,7295,771
2,7295,772

$ awk '
BEGIN {
    FS="( +|;)"             # input field separator is space or ;
    OFS=","                 # output fs is comma
}
{
    for(i=3;i<=NF;i++) {    # from the 3rd field to the end
        n=split($i,t,"-")   # split on - if any. below loop from smaller to greater
        if(n)               # in case of empty fields
            for(j=(t[1]<t[n]?t[1]:t[n]); j<=(t[1]<t[n]?t[n]:t[1]);j++)
                print $1,$2,j   # output
    }
}' file

2,2627,588
2,2627,577
2,2629,566
2,2685,564   <─┐
2,2685,565     │
2,2685,566     ├─ wrong order, from smaller to greater
2,2685,567     │
2,2685,568   <─┘
2,2771,573
2,2773,597
2,2779,533
2,2799,558
2,6919,726
2,6919,740
2,6919,741
2,6919,742
2,6919,777
2,7295,761
2,7295,771
2,7295,772