eygle.com   eygle.com
eygle.com eygle
eygle.com  
 

« 天道酬勤 | Blog首页 | Oracle7.3.4 Checkpoint行为研究 »

Google FeedFetcher的更新频率
modb.pro

检查网站的访问日志,偶然发现居然有Google FeedFetcher的访问日志,顺便计算一下Google的更新频率:

1.发现Google每2小时抓取一次index.xml文件,当然有更新时才会具体刷新内容:

grep FeedFetcher-Google access_log.20060217|awk '{print $4 " " $7 " " $12 }'|grep index.xml
[17/Feb/2006:01:34:16 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:03:34:09 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:05:34:44 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:07:34:05 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:09:34:00 /blog/index.xml "FeedFetcher-Google;

2.Google同样每2小时抓取一次index.rdf文件更新

grep FeedFetcher-Google access_log.20060217|awk '{print $4 " " $7 " " $12 }'|grep index.rdf
[17/Feb/2006:00:34:20 /blog/index.rdf "FeedFetcher-Google;
[17/Feb/2006:02:34:25 /blog/index.rdf "FeedFetcher-Google;
[17/Feb/2006:04:34:17 /blog/index.rdf "FeedFetcher-Google;
[17/Feb/2006:06:34:12 /blog/index.rdf "FeedFetcher-Google;

实际上也就是每小时要访问一次.

Google.Reader

顺便看了一下Google Reader,也有很多新颖之处,左侧边栏的动感及切换处理感觉很好,相信早晚有一天,Google这个Reader会被越来越多的人所认可和使用.

那么其他的Feed订阅站点呢?

FeedBurner每半小时访问一次:

grep FeedBurner access_log.20060217 |awk '{print $4 " " $7 " " $12 }'
[17/Feb/2006:00:11:42 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:00:41:50 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:01:13:20 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:01:45:48 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:02:16:13 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:02:45:58 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:03:31:57 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:04:06:00 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:04:45:06 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:05:15:41 /blog/atom.xml "FeedBurner/1.0 

Bloglines是每半小时一次,但是每次都会依次抓取atom.xml,index.rdf,index.xml文件:

grep Bloglines access_log.20060217 |awk '{print $4 " " $7 " " $12 }'
[17/Feb/2006:00:07:26 /blog/atom.xml "Bloglines/3.0-rho
[17/Feb/2006:00:09:13 /blog/index.rdf "Bloglines/3.0-rho
[17/Feb/2006:00:09:14 /blog/index.xml "Bloglines/3.0-rho
[17/Feb/2006:00:35:52 /blog/atom.xml "Bloglines/3.0-rho
[17/Feb/2006:00:37:35 /blog/index.rdf "Bloglines/3.0-rho
[17/Feb/2006:00:37:38 /blog/index.xml "Bloglines/3.0-rho
[17/Feb/2006:01:05:40 /blog/atom.xml "Bloglines/3.0-rho
[17/Feb/2006:01:07:44 /blog/index.xml "Bloglines/3.0-rho
[17/Feb/2006:01:07:44 /blog/index.rdf "Bloglines/3.0-rho 

再看国内的狗狗:

grep "RSS Online" access_log.20060217 |awk '{print $4 " " $7 " " $12 }'
[17/Feb/2006:01:28:47 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:01:52:54 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:12:27 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:12:27 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:26:01 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:46:46 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:03:07:12 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:03:19:24 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:03:37:24 /blog/index.xml "http://www.gougou.com 
这个不太有规律,大约是20分钟左右.
历史上的今天...
    >> 2012-02-20文章:
    >> 2008-02-20文章:
           弃我去者今日之日不可留

By eygle on 2006-02-20 00:35 | Comments (1) | Web | 685 |

1 Comment

sohu agent
Yahoo! Slurp
这两个都是大头。
google反而不太多,Google网站上说Googlebot 的平均访问频率不会高于数秒钟一次。


CopyRight © 2004~2020 云和恩墨,成就未来!, All rights reserved.
数据恢复·紧急救援·性能优化 云和恩墨 24x7 热线电话:400-600-8755 业务咨询:010-59007017-7040 or 7037 业务合作: marketing@enmotech.com