Lewandowski, Dirk, Wahlig, Henry and Meyer-Bautor, Gunnar The Freshness of Web search engines’ databases. Journal of Information Science, 2005, vol. 31. (In Press) [Journal article (Unpaginated)]
Preview |
PDF
jis_preprint.pdf Download (580kB) | Preview |
English abstract
This study measures the frequency in which search engines update their indices. Therefore, 38 websites that are updated on a daily basis were analysed within a time-span of six weeks. The analysed search engines were Google, Yahoo and MSN. We find that Google performs best overall with the most pages updated on a daily basis, but only MSN is able to update all pages within a time-span of less than 20 days. Both other engines have outliers that are quite older. In terms of indexing patterns, we find different approaches at the different engines: While MSN shows clear update patterns, Google shows some outliers and the update process of the Yahoo index seems to be quite chaotic. Implications are that the quality of different search engine indices varies and not only one engine should be used when searching for current content.
Item type: | Journal article (Unpaginated) |
---|---|
Keywords: | search engines; Online Information Retrieval; World Wide Web; index quality; index freshness |
Subjects: | L. Information technology and library technology > LS. Search engines. |
Depositing user: | Dirk Lewandowski |
Date deposited: | 07 Sep 2005 |
Last modified: | 02 Oct 2014 12:01 |
URI: | http://hdl.handle.net/10760/6701 |
References
Downloads
Downloads per month over past year
Actions (login required)
![]() |
View Item |