Cataloger's Corner

                                   ...organizing library information!

Hidden Web / Invisible Web / Deep Web

 

The Invisible or Hidden Web consists of
    * 3/4 of the information on the internet  
    * Information not indexed by major search engines
    * Usually more current information
    * Business intranets
    * Academic single source institutions
    * Content-rich DATABASES!

Five types of Sites
    * Visible - crawled and indexed often by search engines
    * Opaque - not linked to other sites (homepage with no links)
    * Private - password protected, secure server (https://)
    * Proprietary - registration required (fee or pay for view)
    * Truly invisible - unsearchable file formats and databases

 

 

 

 

 


 

 


                                          

                                       

                                         Bergman, Michael K. "The Deep Web: Surfacing Hidden Value." Journal of Electronic Publishing, August 2001.

Deep Web Facts
    * More in-depth information than surface sites
    * Quality content 1,000 - 2,000 times greater than that of surface sites
    * Academic disciplines highly represented
    * Most content resides in subject-specific databases
    * 95% is publicly accessible information - not subject to fees or subscriptions

How to tell if a Web Site is Invisible - Examine the URL
    * Direct URLs point to specific web pages. Crawlers can follow these pages
    * Indirect URLs
            * Do not point to specific web pages
            * Contain server executed script information
            * Contain symbols (?, &, %. $. +) or words (cgi-bin or javascript)

Is a site indexed?
    1. Pick a URL.
    2. Place in the Google search box with the word info: before the URL with no spaces.
    3. If it displays, it is indexed by Google; if not, then it is not indexed.

To find Hidden Web Databases
    1. Use search engines and add the word "database" to search terms.
    2. Use ADVANCE search features
               
AlltheWeb  http://www.alltheweb.com/advanced?advanced=1&&q= 
               
Google        http://www.google.com/advanced_search?hl=en
    3. Use Specialty search engines
                
Teoma    http://www.teoma.com/
                
IxQuick   http://ixquick.com/ 
               
 Surfwax  http://www.surfwax.com 

For better searches
    * Be specific
    * Put the most important and most rare words first
    * Use 6-8 key-words in query to increase efficiency
    * Use phrases when possible to eliminate irrelevant material
    *
Use quotation marks around phrase
    * Use more than one search tool for your research
    * Read the instructions or help screens
    * Teachers - use "grade 4" rather than "4th grade" ;  use educator rather than teacher

Helping Students Achieve Better Search Results
        Web Site Evaluation
            * What is the purpose of the site? Is there bias?
            * Who developed/sponsors the site? Is there contact information?
            * What are goals/objectives of the developer/sponsor?
            * Is the information well-documented?
            * When was the site last updated?
        From Noodle Tools
               
Information Literacy: Search Strategies
                          
http://www.noodletools.com/debbie/literacies/information/5locate/adviceengine.html
        From Kathy Schrock
               
Teacher Helpers - Critical Evaluation Information
                           http://school.discovery.com/schrockguide/eval.html

Sample Invisible Web Sites
        Find Articles     http://www.findarticles.com
        Invisible Web    http://www.invisible-web.net/
        Scholarly Societies  http://www.scholary-societies.org/
        LRC Catalogs    http://www.libdex.com/webcats
        Philanthropy & Grant Information     http://lnp.fdncenter.org/finder

Sample Invisible Web Resources - Links
        Academic Info       http://www.academicinfo.net
        Complete Planet    http://www.completeplanet.com
        Beaucoup        http://www.beaucoup.com/
        LibrarySpot    http://www.libraryspot.com/


       

       
Much of the information for this web page came from Helen Teague & her 2005 TCEA presentation on the Invisible Web.
Additional information is available at her website: Oops Overnight Planning System, http://4oops.com

 

HOME

Library Information Services

 

Copyright

Page edited - 10/29/2008

SBISD Employee Email Search

Library Resources Page

Spring Branch Independent School District, Houston, Texas
The Library Information Services Department grants permission to print this web page for educational purposes providing that appropriate credit is included on the copies.