分析apache日志,默认的 combined 格式,如下
220.181.108.91 - - [02/Sep/2015:11:24:48 +0800] "GET /Ultrasonic/policy.html HTTP/1.1" 200 40366 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0; +
http://www.baidu.com/search/spider.html)" 244 40932
68.180.229.57 - - [02/Sep/2015:11:24:47 +0800] "GET /indian/list-23.html HTTP/1.1" 200 38850 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp;
http://help.yahoo.com/help/us/ysearch/slurp)" 192 39452
字段分隔符是空格,但部分字段内部带空格,这种字段是引号括起来的。
请教awk怎么正确识别这些字段,或者有其它工具推荐吗?