Exploring the Academic Invisible Web
(2006) Exploring the Academic Invisible Web.
There is a more recent version of this eprint available.
Full text available as: |
Abstract
Purpose: To provide a critical review of Bergman’s 2001 study on the Deep Web. In addition, we bring a new concept into the discussion, the Academic Invisible Web (AIW). We define the Academic Invisible Web as consisting of all databases and collections relevant to academia but not searchable by the general-purpose internet search engines. Indexing this part of the Invisible Web is central to scientific search engines. We provide an overview of approaches followed thus far.
Design/methodology/approach: Discussion of measures and calculations, estimation based on infor-metric laws. Literature review on approaches for uncovering information from the Invisible Web.
Findings: Bergman’s size estimation of the Invisible Web is highly questionable. We demonstrate some major errors in the conceptual design of the Bergman paper. A new (raw) size estimation is given.
Research limitations/implications: The precision of our estimation is limited due to small sample size and lack of reliable data.
Practical implications: We can show that no single library alone will be able to index the Academic Invisible Web. We suggest collaboration to accomplish this task.
Originality/value: Provides library managers and those interested in developing academic search en-gines with data on the size and attributes of the Academic Invisible Web.
| Keywords: | Search engines, Worldwide Web, Indexing, Scholarly content, Digital library |
|---|---|
| Subjects: | L. Information technology and library technology. > LS. Search engines. L. Information technology and library technology. > LC. Internet, including WWW. |
| ID Code: | 6071 |
| Deposited By: | Lewandowski, Dirk |
| Deposited On: | 16 April 2006 |
| Alternative Locations: | http://www.durchdenken.de/lewandowski/doc/LHT_Preprint.pdf |
| All fields: | Show all fields |
Available Versions of this Item
- Exploring the Academic Invisible Web (deposited 16 April 2006) [Currently Displayed]
Archive Staff Only: edit this record

