Bash: calculating average response time per hour

bash shell unix awk

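I am trying to calculate the average response time per hour from a log file that has millions of records. A log excerpt is shown below.

So far, I am trying to create a temporary file with one line per unique ID containing its start time and end time. Another script will then run over this temporary file to calculate the average response time for each hour. My script takes more than an hour just to create the temporary file.

Is there any way to do this faster, or a better script with a shorter execution time? Note: the UNIQIDs do not appear in order.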

log file format
2012-06-04 13:04:19,324 UNIQID1
2012-06-04 13:04:20,120 UNIQID1
2012-06-04 13:05:19,324 UNIQID2
2012-06-04 13:06:20,120 UNIQID2
2012-06-04 13:07:19,324 UNIQID3
2012-06-04 13:08:20,120 UNIQID3
2012-06-04 13:08:49,324 UNIQID4
2012-06-04 13:09:50,120 UNIQID4

Here is my code:

uids=`cat $i|grep "UNIQ" |sort -u` >> $log
for uid in ${uids}; do  
    count=`grep "$uid" test.log|wc -l`
    if [ "${count}" -ne "0" ]; then
        unique_uids[counter]="$uid"
        let counter=counter+1   
    fi   
done


echo ${unique_uids[@]}   
echo $counter  
echo " Unique No:" ${#unique_uids[@]}
echo "uid StartTime EndTime" > $log

for unique_uids in ${unique_uids[@]} ; do
    responseTime=`cat $i|grep "${unique_uids}" |awk '{split($2,Arr,":|,"); print Arr[1]*3600000+Arr[2]*60000+Arr[3]*1000+Arr[4]}'|sort -n`
    echo $unique_uids $responseTime >> $log
done

Thanks for your time.

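Some simple fixes:

You don't need the `cat` calls; just pass the file name as the last argument to `grep`.
You shouldn't save values to both a file and a variable; use whichever is faster. Often you don't need either, and a `while IFS= read -r date time id` loop may be faster.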

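The `read` loop suggested above could look roughly like the following. This is a minimal sketch rather than a drop-in replacement for the asker's script: it assumes bash 4+ (for associative arrays) and the `date time id` line layout from the question, and the function name `pair_times` is made up for illustration. It reads the log once instead of running `grep` once per ID.

```shell
#!/usr/bin/env bash
# Sketch: pair the first and second timestamp of every ID in one pass.
# Assumes bash 4+ and lines shaped like "2012-06-04 13:04:19,324 UNIQID1".

pair_times() {
    # Reads log lines on stdin and prints "id start_ms end_ms"
    # as soon as the second line for an ID has been seen.
    local -A start
    local day h m s ms id t
    # IFS=' :,' lets read split the time into hours, minutes, seconds, millis.
    while IFS=' :,' read -r day h m s ms id; do
        [[ -n $id ]] || continue          # skip blank or malformed lines
        # 10# forces base 10 so values such as "08" are not parsed as octal.
        t=$(( 10#$h * 3600000 + 10#$m * 60000 + 10#$s * 1000 + 10#$ms ))
        if [[ -z ${start[$id]+set} ]]; then
            start[$id]=$t                 # first sighting: remember the start
        else
            echo "$id ${start[$id]} $t"   # second sighting: emit the pair
            unset "start[$id]"
        fi
    done
}
```

It could be called as `pair_times < test.log > "$log"`. Times are milliseconds since midnight, as in the question's own `awk` expression, so a pair that spans midnight would need extra handling.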

BEGIN {
  # Load the IDs of interest; an empty value means "no start time seen yet".
  while ((getline uid < UIDFILE) > 0) {
    x[uid] = "";        # Awk keeps these in an associative array, so lookups are hashed
  }
  close(UIDFILE);
}


{
  r = $NF;                  # Extract the unique ID from the record into r
  if (r in x) {             # If the UID is something we are interested in, then ...
    ts = $1 " " $2;         # Concatenate the date and time fields
    hour = substr(ts, 1, 13);  # "YYYY-MM-DD HH", used to detect an hour change
    gsub(/[:-]/, " ", ts);  # Replace the : and - with spaces
    sub(/,.*/, "", ts);     # Remove the comma and the milliseconds after it
    # print ts, mktime(ts)  # If you want to see what mktime does

    if (x[r] == "")         # First time seeing this unique ID?
      x[r] = mktime(ts);    # Store the timestamp (whole seconds)
    else {                  # We're seeing it the second time
      now = mktime(ts);     # Keep track of the current log time
      rt = now - x[r];      # Compute the delta
      delete x[r];          # We don't need it any more
      # printf "Record <%s> has response time %f\n", r, rt;  # Print it out if you'd like
      if (num > 0 && hour != curhour) {  # Have we switched to a new hour?
        printf "%s: average response time = %f\n", curhour, hourrt / num;  # Dump the average
        num = hourrt = 0;
      }
      curhour = hour;
      hourrt += rt;         # Add it to this hour's total response time
      num++;                # And also keep track of how many records we have ending in this hour
    }
  }
}

END {                       # Dump the last, still-open hour
  if (num > 0)
    printf "%s: average response time = %f\n", curhour, hourrt / num;
}

gawk -v UIDFILE=name_of_uid_file -f scriptname.awk name_of_log_file
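As a variation on the script above, here is a sketch that needs no separate UID file and keeps millisecond precision. It is written under assumptions, not taken from the original answer: every ID appears exactly twice (start, then end), a pair crosses midnight at most once, and each response is counted in the hour in which it started (the script above counts it in the hour in which it ended). The function name `hourly_avg` is hypothetical. It uses only POSIX `awk` features, so `gawk` is not required.

```shell
#!/usr/bin/env bash
# Sketch: per-hour average response time in one awk pass, no UID file needed.
# Output columns: date, hour, average in ms, number of paired records.

hourly_avg() {
    awk '
    {
        split($2, t, /[:,]/)                      # t[1..4] = HH MM SS mmm
        ms = t[1]*3600000 + t[2]*60000 + t[3]*1000 + t[4]
        id = $3
        if (!(id in start)) {                     # first line for this ID
            start[id]  = ms
            bucket[id] = $1 " " t[1]              # "YYYY-MM-DD HH" of the start
        } else {                                  # second line: close the pair
            d = ms - start[id]
            if (d < 0) d += 86400000              # assumed: crossed midnight once
            sum[bucket[id]] += d
            cnt[bucket[id]]++
            delete start[id]
            delete bucket[id]
        }
    }
    END {
        for (h in sum)
            printf "%s %.1f %d\n", h, sum[h] / cnt[h], cnt[h]
    }' "$@" | sort
}
```

Running `hourly_avg test.log` on the eight sample lines from the question prints `2012-06-04 13 45796.0 4`: one response of 796 ms and three of 60796 ms, all starting in the 13:00 hour.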