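Information processing covers both database processing (SQL) and text processing. I am fairly familiar with SQL, so below I compare SQL with its text-processing equivalents, step by step:
#1. Query all the data - select * from ping;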
[dwapp@dw_testdb yuechaotian]$ cat ping.txt
64 bytes from 1.2.3.2: icmp_seq=0 ttl=60 time=0.655 ms
64 bytes from 1.22.3.2: icmp_seq=1 ttl=60 time=0.328 ms
64 bytes from 1.2.3.2: icmp_seq=2 ttl=60 time=0.292 ms
64 bytes from 1.22.3.2: icmp_seq=3 ttl=60 time=0.311 ms
64 bytes from 1.2.3.2: icmp_seq=4 ttl=60 time=0.316 ms
64 bytes from 1.2.3.2: icmp_seq=0 ttl=60 time=0.655 ms
64 bytes from 1.2.3.2: icmp_seq=0 ttl=60 time=0.655 ms
64 bytes from 1.22.3.2: icmp_seq=1 ttl=60 time=0.328 ms
64 bytes from 1.2.3.2: icmp_seq=2 ttl=60 time=0.292 ms
64 bytes from 1.22.3.2: icmp_seq=3 ttl=60 time=0.311 ms
64 bytes from 1.2.3.2: icmp_seq=4 ttl=60 time=0.316 ms
64 bytes from 1.22.3.2: icmp_seq=1 ttl=60 time=0.328 ms
64 bytes from 1.2.3.2: icmp_seq=2 ttl=60 time=0.292 ms
64 bytes from 1.22.3.2: icmp_seq=3 ttl=60 time=0.311 ms
64 bytes from 1.2.3.2: icmp_seq=4 ttl=60 time=0.316 ms
64 bytes from 1.22.3.2: icmp_seq=3 ttl=60 time=0.311 ms
64 bytes from 1.22.3.2: icmp_seq=3 ttl=60 time=0.311 ms
64 bytes from 1.2.3.2: icmp_seq=4 ttl=60 time=0.316 ms
64 bytes from 1.2.3.2: icmp_seq=4 ttl=60 time=0.316 ms
#2. Query columns 4 and 7 - select ip, time from ping;
[dwapp@dw_testdb yuechaotian]$ cat ping.txt | awk '{print $4"\t"$7}'
1.2.3.2: time=0.655
1.22.3.2: time=0.328
1.2.3.2: time=0.292
1.22.3.2: time=0.311
1.2.3.2: time=0.316
1.2.3.2: time=0.655
1.2.3.2: time=0.655
1.22.3.2: time=0.328
1.2.3.2: time=0.292
1.22.3.2: time=0.311
1.2.3.2: time=0.316
1.22.3.2: time=0.328
1.2.3.2: time=0.292
1.22.3.2: time=0.311
1.2.3.2: time=0.316
1.22.3.2: time=0.311
1.22.3.2: time=0.311
1.2.3.2: time=0.316
1.2.3.2: time=0.316
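The two columns above still carry the trailing colon on the IP and the `time=` prefix. A minimal sketch of stripping both inside awk, so the output looks more like plain column values; the temporary file and its three rows are only a stand-in for ping.txt:

```shell
# Sample rows in the same format as ping.txt (stand-in for the real file).
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
64 bytes from 1.2.3.2: icmp_seq=0 ttl=60 time=0.655 ms
64 bytes from 1.22.3.2: icmp_seq=1 ttl=60 time=0.328 ms
64 bytes from 1.2.3.2: icmp_seq=2 ttl=60 time=0.292 ms
EOF

# Remove the trailing ":" from field 4 and the "time=" prefix from field 7.
clean=$(awk '{ sub(/:$/, "", $4); sub(/^time=/, "", $7); print $4 "\t" $7 }' "$tmp")
printf '%s\n' "$clean"
rm -f "$tmp"
```

Passing the file name to awk directly also makes the leading `cat` unnecessary.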
#3. Query rows whose ip contains 22 - select ip, time from ping where ip like '%22%';
[dwapp@dw_testdb yuechaotian]$ cat ping.txt | awk '{print $4"\t"$7}' | grep 22
1.22.3.2: time=0.328
1.22.3.2: time=0.311
1.22.3.2: time=0.328
1.22.3.2: time=0.311
1.22.3.2: time=0.328
1.22.3.2: time=0.311
1.22.3.2: time=0.311
1.22.3.2: time=0.311
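One caveat: `grep 22` matches "22" anywhere on the line, not only in the ip column, so a row whose time happened to contain "22" would also pass. A sketch of restricting the match to field 4 with awk, which is closer to the SQL `where ip like '%22%'`; the `time=0.220` row is a made-up value added only to show the difference:

```shell
# Sample rows; the first one has a hypothetical time of 0.220 to trigger a false match.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
64 bytes from 1.2.3.2: icmp_seq=0 ttl=60 time=0.220 ms
64 bytes from 1.22.3.2: icmp_seq=1 ttl=60 time=0.328 ms
64 bytes from 1.2.3.2: icmp_seq=2 ttl=60 time=0.292 ms
EOF

# grep looks at the whole line, so the 0.220 row is returned as well.
by_grep=$(awk '{print $4"\t"$7}' "$tmp" | grep 22)
# Matching only field 4 keeps just the row whose ip contains 22.
by_field=$(awk '$4 ~ /22/ {print $4"\t"$7}' "$tmp")

printf 'grep:\n%s\n' "$by_grep"
printf 'awk on $4:\n%s\n' "$by_field"
rm -f "$tmp"
```

Replacing `~` with `!~` gives the negated condition, the counterpart of `not like`.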
#4. Query rows whose ip does not contain 22 - select ip, time from ping where ip not like '%22%';
[dwapp@dw_testdb yuechaotian]$ cat ping.txt | awk '{print $4"\t"$7}' | grep -v 22
1.2.3.2: time=0.655
1.2.3.2: time=0.292
1.2.3.2: time=0.316
1.2.3.2: time=0.655
1.2.3.2: time=0.655
1.2.3.2: time=0.292
1.2.3.2: time=0.316
1.2.3.2: time=0.292
1.2.3.2: time=0.316
1.2.3.2: time=0.316
1.2.3.2: time=0.316
#5. Sort the result by time - select ip, time from ping where ip like '%22%' order by time;
[dwapp@dw_testdb yuechaotian]$ cat ping.txt | awk '{print $4"\t"$7}' | grep 22 | sort
1.22.3.2: time=0.311
1.22.3.2: time=0.311
1.22.3.2: time=0.311
1.22.3.2: time=0.311
1.22.3.2: time=0.311
1.22.3.2: time=0.328
1.22.3.2: time=0.328
1.22.3.2: time=0.328
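Plain `sort` compares whole lines as text. It gives the right order here only because every remaining row has the same ip and the times have the same number of digits. A sketch of sorting on the time value itself, as a number; the rows with times 10.5 and 2.75 are made-up values chosen to show where a text sort would go wrong:

```shell
# Sample rows in the "ip<TAB>time=value" shape produced by the awk step.
rows=$(printf '1.22.3.2:\ttime=0.328\n1.2.3.2:\ttime=10.5\n1.22.3.2:\ttime=0.311\n1.2.3.2:\ttime=2.75\n')

# -t= splits each line at "=", and -k2,2n sorts on the part after it numerically.
# LC_ALL=C keeps "." as the decimal point regardless of the user's locale.
by_time=$(printf '%s\n' "$rows" | LC_ALL=C sort -t= -k2,2n)
printf '%s\n' "$by_time"
```

Adding `r` to the key (`-k2,2nr`) would correspond to `order by time desc`.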
#6. Merge duplicate records in the result - select distinct ip, time from ping where ip like '%22%' order by time;
[dwapp@dw_testdb yuechaotian]$ cat ping.txt | awk '{print $4"\t"$7}' | grep 22 | sort | uniq
1.22.3.2: time=0.311
1.22.3.2: time=0.328
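Note that `uniq` only collapses adjacent duplicate lines, which is why the `sort` has to come first. When no counting is needed, `sort -u` does both in one step. A small sketch on three sample rows that checks the two forms agree:

```shell
# Three sample rows with one duplicate that is not adjacent to its twin.
rows=$(printf '1.22.3.2:\ttime=0.328\n1.22.3.2:\ttime=0.311\n1.22.3.2:\ttime=0.328\n')

# Without sorting first, uniq sees no adjacent duplicates and keeps all three lines.
unsorted=$(printf '%s\n' "$rows" | uniq)
# sort | uniq and sort -u both reduce the input to the two distinct rows.
two_step=$(printf '%s\n' "$rows" | sort | uniq)
one_step=$(printf '%s\n' "$rows" | sort -u)

printf '%s\n' "$one_step"
```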
#7. Count the occurrences of each record - select count(*), ip, time from ping where ip like '%22%' group by ip, time order by time;
[dwapp@dw_testdb yuechaotian]$ cat ping.txt | awk '{print $4"\t"$7}' | grep 22 | sort | uniq -c
5 1.22.3.2: time=0.311
3 1.22.3.2: time=0.328
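The `sort | uniq -c` pair plays the role of `group by` plus `count(*)`. As an alternative sketch, the same grouping can be done in awk with an associative array, which does not need sorted input; the three sample rows are a stand-in for the filtered output:

```shell
# Sample rows in the "ip<TAB>time=value" shape produced by the earlier steps.
rows=$(printf '1.22.3.2:\ttime=0.328\n1.22.3.2:\ttime=0.311\n1.22.3.2:\ttime=0.328\n')

# n[line] counts how often each distinct line occurs (the "group by").
# The order of "for (k in n)" is unspecified, so the result is sorted by time afterwards.
counts=$(printf '%s\n' "$rows" \
  | awk '{ n[$0]++ } END { for (k in n) print n[k] "\t" k }' \
  | LC_ALL=C sort -t= -k2,2n)
printf '%s\n' "$counts"
```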
Reference:
http://oracle.chinaitlab.com/PLSQL/755199_2.html
This article comes from the ChinaUnix blog. To view the original, see: http://blog.chinaunix.net/u/30637/showart_1963673.html