« 天道酬勤 |
Blog首页
| Oracle7.3.4 Checkpoint行为研究 »
Google FeedFetcher的更新频率
检查网站的访问日志,偶然发现居然有Google FeedFetcher的访问日志,顺便计算一下Google的更新频率:
1.发现Google每2小时抓取一次index.xml文件,当然有更新时才会具体刷新内容:
grep FeedFetcher-Google access_log.20060217|awk '{print $4 " " $7 " " $12 }'|grep index.xml [17/Feb/2006:01:34:16 /blog/index.xml "FeedFetcher-Google; [17/Feb/2006:03:34:09 /blog/index.xml "FeedFetcher-Google; [17/Feb/2006:05:34:44 /blog/index.xml "FeedFetcher-Google; [17/Feb/2006:07:34:05 /blog/index.xml "FeedFetcher-Google; [17/Feb/2006:09:34:00 /blog/index.xml "FeedFetcher-Google;
|
2.Google同样每2小时抓取一次index.rdf文件更新
grep FeedFetcher-Google access_log.20060217|awk '{print $4 " " $7 " " $12 }'|grep index.rdf [17/Feb/2006:00:34:20 /blog/index.rdf "FeedFetcher-Google; [17/Feb/2006:02:34:25 /blog/index.rdf "FeedFetcher-Google; [17/Feb/2006:04:34:17 /blog/index.rdf "FeedFetcher-Google; [17/Feb/2006:06:34:12 /blog/index.rdf "FeedFetcher-Google;
|
实际上也就是每小时要访问一次.
顺便看了一下Google Reader,也有很多新颖之处,左侧边栏的动感及切换处理感觉很好,相信早晚有一天,Google这个Reader会被越来越多的人所认可和使用.
那么其他的Feed订阅站点呢?
FeedBurner每半小时访问一次:
grep FeedBurner access_log.20060217 |awk '{print $4 " " $7 " " $12 }' [17/Feb/2006:00:11:42 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:00:41:50 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:01:13:20 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:01:45:48 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:02:16:13 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:02:45:58 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:03:31:57 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:04:06:00 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:04:45:06 /blog/atom.xml "FeedBurner/1.0 [17/Feb/2006:05:15:41 /blog/atom.xml "FeedBurner/1.0
|
Bloglines是每半小时一次,但是每次都会依次抓取atom.xml,index.rdf,index.xml文件:
grep Bloglines access_log.20060217 |awk '{print $4 " " $7 " " $12 }' [17/Feb/2006:00:07:26 /blog/atom.xml "Bloglines/3.0-rho [17/Feb/2006:00:09:13 /blog/index.rdf "Bloglines/3.0-rho [17/Feb/2006:00:09:14 /blog/index.xml "Bloglines/3.0-rho [17/Feb/2006:00:35:52 /blog/atom.xml "Bloglines/3.0-rho [17/Feb/2006:00:37:35 /blog/index.rdf "Bloglines/3.0-rho [17/Feb/2006:00:37:38 /blog/index.xml "Bloglines/3.0-rho [17/Feb/2006:01:05:40 /blog/atom.xml "Bloglines/3.0-rho [17/Feb/2006:01:07:44 /blog/index.xml "Bloglines/3.0-rho [17/Feb/2006:01:07:44 /blog/index.rdf "Bloglines/3.0-rho
|
再看国内的狗狗:
grep "RSS Online" access_log.20060217 |awk '{print $4 " " $7 " " $12 }' [17/Feb/2006:01:28:47 /blog/index.xml "http://www.gougou.com [17/Feb/2006:01:52:54 /blog/index.xml "http://www.gougou.com [17/Feb/2006:02:12:27 /blog/index.xml "http://www.gougou.com [17/Feb/2006:02:12:27 /blog/index.xml "http://www.gougou.com [17/Feb/2006:02:26:01 /blog/index.xml "http://www.gougou.com [17/Feb/2006:02:46:46 /blog/index.xml "http://www.gougou.com [17/Feb/2006:03:07:12 /blog/index.xml "http://www.gougou.com [17/Feb/2006:03:19:24 /blog/index.xml "http://www.gougou.com [17/Feb/2006:03:37:24 /blog/index.xml "http://www.gougou.com
|
这个不太有规律,大约是20分钟左右.
历史上的今天...
>>
2012-02-20文章:
>>
2008-02-20文章:
By eygle on 2006-02-20 00:35 |
Comments (1) |
Web | 685 |
Google
sohu agent
Yahoo! Slurp
这两个都是大头。
google反而不太多,Google网站上说Googlebot 的平均访问频率不会高于数秒钟一次。