University of Illinois at Urbana-Champaign
Graduate School of Library and Information Science - Library and Information Science Library

UI Current LIS Clips

Web Searching

December, 2002 - Compiled, annotated and supplemented by Janet Eke
Updated 1/9/03 (see items noted in red below)

In This Issue:

Becoming an Efficient and Effective Web Searcher

  1. An approach to efficient and effective Web searching
  2. Understanding search engines and subject directories
  3. Beyond engines and directories: specialized tools and the Deep Web
  4. Formulating and formalizing Web searching strategies
  5. Evaluating Web resources
  6. More Web topics in future issues

1. An approach to efficient and effective Web searching 

  1. Based on workshops developed by Janet Eke, Pat Barlosky, and Cindy Kehoe, for the Ontario Library Association, the National Technological University, and Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign (1999-2002). 
  2. Cohen, Laura B. "The Web as a Research Tool: Teaching Strategies for Instructors." Choice Supplement to vol. 36 (1999):19-44.
The Web is difficult to search. Though the Web's hypertext point-and-click environment is simple, conducting research on the Web is not. Its chaotic architecture of linking lends itself well to surfing but not to efficient information finding. (Cohen, 1999.)

Information professionals must be good Web searchers. The Web is an information resource that cannot be ignored. We must know how to locate high quality Web content quickly and accurately.

We already have useful skills. Though some of the skills and knowledge needed to search effectively are unique to the Web environment, many others come out of understanding that we already have of the information structures of traditional print and online collections. We can apply what we already know to the Web.
Three Steps to Efficient and Effective Web Searching

The chart below suggests three "A, B, C's" of effective Web searching. The sections following discuss each component in more detail, drawing on library literature.
 
Step
Example
  • Learn what they are and how they work. 
  • Learn the advanced search features of your favorites.
  • Know where to go to keep current on changing features and new services. 
  • B. Know specialized search tools. Build a "Core Collection" of Web sites beyond general-purpose search tools like Google or Yahoo! 
  • Specialized tools may take you more directly to an authoritative source.
  • Specialized tools may help you locate valuable content not indexed by search engines (the Deep Web). 
  • Know good sources for general types of information, such as general reference sites and news sites. 
  • Know good sources for specific subject areas you search often. 
  • Know gateway sites to specialized databases such as Deep Web pathfinders and specialized directories. 
  • C. Formulate and formalize Web searching strategies. Analyze Web topics before you search. Consciously apply old and new search strategies to the Web. 
  • Approaches from traditional collections and Information Retrieval system searching can be applied to the Web; other strategies are Web-specific. 
  • Analysis can make the difference between Web searching success and failure. 
Ask
  • Is there a type of source that will be useful for this topic? For example: an encyclopedia entry, a news article, a business directory entry.
  • Who might be interested in or responsible for gathering information about this topic? For example: a trade association, a government agency, a research institution.
  • What is the best type of Web search tool to begin with? For example: a search engine, a subject directory, a specific site from my Core Collection.
  • Should I search the Web in the first place? 


2. Understanding Search Engines and Subject Directories 

  1. Barker, Joe. UC Berkeley - Teaching Library Internet Workshops Berkeley - Types of Search Tools (2001). http://www.lib.berkeley.edu/TeachingLib/Guides/Internet/ToolsTables.html
  2. Basch, Reva and Mary Ellen Bates. Researching Online for Dummies, 2nd edition. Chicago, IL: IDG Books Worldwide. 2000. 
  3. Cohen, Laura B. "The Web as a Research Tool: Teaching Strategies for Instructors." Choice Supplement to vol. 36 (1999):19-44. 
  4. Cohen, Laura B. "Finding it All on the Web: Search Tools and Strategies." Choice Special Issue to vol. 38 (2001):13-27. 
  5. Cohen, Laura. "Internet Tutorials - Checklist of Internet Research Tips " at the University of Albany Libraries (2002). http://library.albany.edu/internet/checklist.html
  6. Schlein, Alan M. Find It Online: The Complete Guide to Online Research, 2nd edition. Tempe, AZ: Facts on Demand Press, 2000.
The two main types of general-purpose Web search tools are search engines and subject directories. In form they can look almost identical, apparently offering very similar search capabilities via similar interfaces; however, in function they are distinctly different. Understanding how search engines and subject directories work and when to use them is an essential step to becoming an efficient and effective Web searcher.

Human-centered VS Machine-centered Services (sources 3, 5, 6)

What is a subject directory? - A subject directory is a database of titles and URLs of Web sites, compiled and organized into subject categories by humans. Directories are usually searchable and browsable, and entries may be annotated. Selectivity, evaluation, and quality of annotations vary between directories. Examples are Yahoo! http://www.yahoo.com, The Open Directory Project http://dmoz.org/, and Infomine http://infomine.ucr.edu.

There are two main types of subject directories. Cohen emphasizes that they have very different purposes and should be approached accordingly.

What is a search engine? - A search engine is a searchable database of words from Web pages compiled by a computer program called a crawler, robot, or spider. The spider program travels the Web by following links, and gathering words from individual pages it finds. These are collected in a giant index that can then be queried. Results are ranked by relevancy algorithms. Every search engine's method of searching is proprietary and the depth, breadth, and scope of its database is unique. Examples are Google http://www.google.com, AltaVista http://www.altavista.com, and AllTheWeb http://www.alltheweb.com.

Subject Directories VS Search Engines (sources 1, 2, 3, 4, 5)

Subject Directories 
Search Engines 
  • Human-compiled index of information about Web sites 
  • Organized into subject categories
  • Search terms are matched in category names, site names, and site annotations, NOT in the full-text of individual pages
  • Smaller than search engines
  • Contents MAY be carefully evaluated and annotated -- but not always, especially in the case of large commercial portals 
  • Computer-compiled index of words from individual Web pages 
  • NOT organized into subject categories; results are ranked by computer algorithm
  • Search terms are matched in the actual text of Web pages; in many engines every word in a page may be indexed
  • Much larger than subject directories; include more sites, and index them much more deeply
  • Contents are unevaluated by humans 

Use a subject directory:

Use a search engine: Converging Content: Engines with Directories and Directories with Engines

Most major search services have formed alliances to provide both engine-type search results and directory-type search results. This is why we see Web page search results in Yahoo!, and can browse a directory in Google. Google owns and maintains the engine database only; it gets its directory from The Open Directory Project. Similarly, Yahoo! owns and develops its directory; currently it gets its engine results from Google.

Searching and Selecting Tips

Keeping Current

To quickly learn the features of your favorites, or find out about the latest new engines and directories, use sites such as the following. Greg Notess, Danny Sullivan and Stephen Bell do the work for you with news, features charts, reviews and search strategies.

Daily Briefings

Added 1/9/03 - To get daily new about search engines, subscribe to the following:

Other Web Guides to Selecting Directories and Engines


3. Beyond Search Engines and Subject Directories: Specialized Tools and the Deep Web 

  1. Cohen, Laura. "Internet Tutorials - The Deep Web" at the University of Albany Libraries (2002) http://library.albany.edu/internet/deepweb.html 
  2. King, David. "Specialized Search Engines." Online 24(3) (May/June 2000): 67-74. 
  3. Price, Gary and Chris Sherman. "Exploring the Invisible Web: 7 essential strategies." Online 25(4) (July/August 2001): 32-34. 
  4. Sherman, Chris and Gary Price. "The Invisible Web." Searcher 9(6) (June 2001): 62-74.
  5. Smith, C. Brian. "Getting to Know the Invisible Web." Library Journal Netconnect (Summer 2001): 16-18. 
  6. Snow, Bonnie. "The Internet's Hidden Content and How to Fnd it." Online 24(3) (May/June 2000): 61-66. 
Don't Rely Only - Searchers should not rely exclusively on direct searches in general-purpose search engines and subject directories. These searches may...

What is the Invisible Web/Deep Web? (sources 1, 3, 4, 5)

The Deep Web consists of text pages, files, or other often high-quality authoritative Web content that general-purpose search engines cannot, due to technical limitations, or will not, due to deliberate choice, add to their databases. Sometimes also referred to as the "Invisible Web" or "dark matter." (Sherman and Price, 2001.)

Content may be excluded for two main types of reasons.

Vast Amounts of Deep Web Content - There are vast amounts of authoritative and current information that you simply cannot access using a search engine like Google or AltaVista. (Sherman and Price, 2001.) Examples of databases whose contents are not accessible via search engines are

Searching the Deep Web: Strategies (source 3)

Price and Sherman offer the following practical strategies for successfully navigating the Web's hidden territories.

Specialized Directories and Engines (source 2)

A specialized search engine focuses on a specific subject, geographic region, or computer file format. It indexes fewer pages, but these pages are more likely to be on-topic. Often the contents are further weeded by human subject specialists, who gather, rank and annotate pages.

Using a specialized search engine..

Examples of Specialized Engines
Examples of Specialized Directories Web Guides to Deep Web Gateways and Specialized Search Sites

4. Formulating and Formalizing Web Searching Strategies 

  1. Basch, Reva and Mary Ellen Bates. Researching Online for Dummies, 2nd edition. Chicago, IL: IDG Books Worldwide. 2000.
  2. Cohen, Laura. "Internet Tutorials" at the University of Albany Libraries (2002) http://library.albany.edu/internet/
  3. Drabenstott, Karen M. "Web Search Strategy." Online 25(4) (July/August 2001): 19-27.
  4. Paul, Nora in Alan M. Schlein, Find It Online: The Complete Guide to Online Research, 2nd edition. Tempe, AZ: Facts on Demand Press, 2000.
Reference Interview Yourself - Don't jump right in. Basch and Bates emphasize the importance of beginning a search by reference-interviewing yourself. Formulating a search strategy is critical to finding what you are looking for, and can save you hours of work. 

Formulating a Strategy 

Nora Paul recommends asking yourself specific questions about your topic and your resources. Selected Resources for Web Strategies

5. Evaluating Web Resources 

Kirk, Elizabeth. "Evaluating Information Found on the Internet." The Sheridan Libraries of the Johns Hopkins University. 2002. http://www.library.jhu.edu/researchhelp/general/evaluating/index.html

The system of evaluation by scholars, publishers and librarians that exists in traditional information collections does not exist on the Web. Anyone can write a Web page; excellent resources live side-by-side with the highly dubious. This means resources found on the Web must be carefully evaluated by the user. 

Evaluation Criteria - Elizabeth Kirk shows how criteria used to evaluate print information can be applied to Web content. Criteria and sample questions to consider are excerpted below.

  1. Authorship
  2. Publishing body
  3. Point of view or bias
  4. Referral to and/or knowledge of the literature
  5. Accuracy or verifiability of details
  6. Currency
Question what you find on the Web - If you find information that is "too good to be true," it probably is. Never use information you cannot verify. 

Web Guide to Sources on Evaluation


More Web Topics in Future Issues 

More topics about the Web in the Library will be covered in future issues of Current LIS Clips, including:

Send Us Feedback 

Do you have a favorite Web Searching tutorial or how-to resource that we should mention? Send us an e-mail. We'll add it to the resources in this issue.

Updated 1/9/03

Graduate School of Library and Information Science
University of Illinois at Urbana-Champaign
501 E. Daniel Street
Champaign, IL 61820-6211 USA
(217) 333-0734 voice (217) 244-3302 fax
http://www.lis.uiuc.edu
Current Clips Manager: Marianne Steadley

Library and Information Science Library
University of Illinois at Urbana-Champaign
1408 W. Gregory Drive, 306 Main Library
Urbana, IL 61801 USA
(217) 333-3804
http://www.library.uiuc.edu/lsx/
Librarian and Current Clips Editor: Susan Searing