eygle.com   eygle.com
eygle.com  
 

« 天道酬勤 | Blog首页 | Oracle7.3.4 Checkpoint行为研究 »

Google FeedFetcher的更新频率

作者:eygle |【转载时请务必以超链接形式标明文章和作者信息及本声明
链接:

检查网站的访问日志,偶然发现居然有Google FeedFetcher的访问日志,顺便计算一下Google的更新频率:

1.发现Google每2小时抓取一次index.xml文件,当然有更新时才会具体刷新内容:

grep FeedFetcher-Google access_log.20060217|awk '{print $4 " " $7 " " $12 }'|grep index.xml
[17/Feb/2006:01:34:16 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:03:34:09 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:05:34:44 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:07:34:05 /blog/index.xml "FeedFetcher-Google;
[17/Feb/2006:09:34:00 /blog/index.xml "FeedFetcher-Google;

2.Google同样每2小时抓取一次index.rdf文件更新

grep FeedFetcher-Google access_log.20060217|awk '{print $4 " " $7 " " $12 }'|grep index.rdf
[17/Feb/2006:00:34:20 /blog/index.rdf "FeedFetcher-Google;
[17/Feb/2006:02:34:25 /blog/index.rdf "FeedFetcher-Google;
[17/Feb/2006:04:34:17 /blog/index.rdf "FeedFetcher-Google;
[17/Feb/2006:06:34:12 /blog/index.rdf "FeedFetcher-Google;

实际上也就是每小时要访问一次.

Google.Reader

顺便看了一下Google Reader,也有很多新颖之处,左侧边栏的动感及切换处理感觉很好,相信早晚有一天,Google这个Reader会被越来越多的人所认可和使用.

那么其他的Feed订阅站点呢?

FeedBurner每半小时访问一次:

grep FeedBurner access_log.20060217 |awk '{print $4 " " $7 " " $12 }'
[17/Feb/2006:00:11:42 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:00:41:50 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:01:13:20 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:01:45:48 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:02:16:13 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:02:45:58 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:03:31:57 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:04:06:00 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:04:45:06 /blog/atom.xml "FeedBurner/1.0
[17/Feb/2006:05:15:41 /blog/atom.xml "FeedBurner/1.0 

Bloglines是每半小时一次,但是每次都会依次抓取atom.xml,index.rdf,index.xml文件:

grep Bloglines access_log.20060217 |awk '{print $4 " " $7 " " $12 }'
[17/Feb/2006:00:07:26 /blog/atom.xml "Bloglines/3.0-rho
[17/Feb/2006:00:09:13 /blog/index.rdf "Bloglines/3.0-rho
[17/Feb/2006:00:09:14 /blog/index.xml "Bloglines/3.0-rho
[17/Feb/2006:00:35:52 /blog/atom.xml "Bloglines/3.0-rho
[17/Feb/2006:00:37:35 /blog/index.rdf "Bloglines/3.0-rho
[17/Feb/2006:00:37:38 /blog/index.xml "Bloglines/3.0-rho
[17/Feb/2006:01:05:40 /blog/atom.xml "Bloglines/3.0-rho
[17/Feb/2006:01:07:44 /blog/index.xml "Bloglines/3.0-rho
[17/Feb/2006:01:07:44 /blog/index.rdf "Bloglines/3.0-rho 

再看国内的狗狗:

grep "RSS Online" access_log.20060217 |awk '{print $4 " " $7 " " $12 }'
[17/Feb/2006:01:28:47 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:01:52:54 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:12:27 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:12:27 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:26:01 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:02:46:46 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:03:07:12 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:03:19:24 /blog/index.xml "http://www.gougou.com
[17/Feb/2006:03:37:24 /blog/index.xml "http://www.gougou.com 
这个不太有规律,大约是20分钟左右. -----
这篇 【Google FeedFetcher的更新频率】来自 www.eygle.com | CSDN技术网摘| del.icio.us|365Key

By eygle on 2006-02-20 00:35 | Comments (1) | Posted to Web | Edit |Pageviews:

相关文章 随机文章
  • 我的装修以及装修的生意
  • 记录一下站点(eygle.com)PR值升级到5
  • 从网易相册到Web服务说开去
  • 2007年 新的开端
  • 圣诞快乐与搜索引擎的力量
  • 梁启超之死
    sql.bsq与数据库的创建
    关于Oracle9i的Peeking of User-Defined Bind Variables
    案例:Move系统表DEPENDENCY$导致索引失效的数据库故障
    瑞典游记-忏悔篇
    网上相关主题:
    Google

    留言 (1)

    sohu agent
    Yahoo! Slurp
    这两个都是大头。
    google反而不太多,Google网站上说Googlebot 的平均访问频率不会高于数秒钟一次。

    Posted by: qqdao at February 22, 2006 9:43 AM

    发表留言:



    Remember Me?
    (输入验证码后方可评论,谢谢支持)



    CopyRight © 2004 eygle.com, All rights reserved.