DNN Designer

Login |  
Readings
opera fix
Print   Minimize 
opera fix
  • Modern Information Retrieval (Baeza) - Chapter 1
  • Harter, Chapter 1
  • Ellis, David. 1990. New Horizons in Information Retrieval, Chapters 1 and 3. London: The Library Association.
  • Nahl, Diane. 2004. Measuring the Affective Information Environment of Web Searchers. In: Proceedings of the American Society for Information Science and Technology, 41, 191-197. (available in UH electronic resources)
opera fix

Modern Information Retrieval (Baeza) - Chapter 1
opera fix
Print   Minimize 
opera fix
  • Information Retrieval - representation, storate, organization of, and access to information items.
  • Data Retrieval - retrieval of all objects which satisfy clearly defined conditions such as those in a regular expression or in a relational algebra expression.
  • The difference between IR and DR is that IR usually deals with natural language text which is not always well structured and could be semantically ambiguous.  A DR system deals with data that has a well defined structure and semantics.
  • Research in IR includes modeling, document classification and categorization, system architecture, user interfaces, data visualization, filtering, language, etc.
  • Finding useful information on the Web is frequently a tedious and difficult task.  The main obstacle is the absence of a well-defined underlying data model for the Web, which implies that information definition and structure is frequently of low quality.
  • User Task - to translate his information need into a query in the language provided by the system.
  • Browsing - a process of retrieving information, but one whose main objectives are not clearly defined in the beginning and whose purpose might change during the interaction with the system.
  • Combination of retrieval and browsing is not yet a well established approach and is not the dominant paradigm.
  • Logical view of document - from full-text to individual index terms and keywords.
  • Three main questions about IR:
    1. Which techniques will allow retrieval of high quality?
    2. Which techniques will yield faster indexes and smaller query response times?
    3. How will a better understanding of the user behavior affect the design and deployment of new information retrieval strategies?
opera fix
Online Information Retrieval (Harter) - Chapter 1
opera fix
Print   Minimize 
opera fix
  • Information Retrieval - to examine the items in the collection, one by one, accepting some items but rejecting most, until each item has been examined.
  • Information Retrieval System - a device interposed between a potential user of information and the information collection itself.
  • Early examples of IR systems: commercial airline reservation systems.
opera fix


New Horizons in Information Retrieval (Ellis) - Chapter 1 and 3
opera fix
Print   Minimize 
opera fix

Chapter 1 - Introduction

  • Cranfield I Test - evaluation of 4 indexing systems:
    1. UDC (Universal Decimal Classification)
    2. Alphabetical Subject Index
    3. Faceted Classification Scheme
    4. Uniterm System - most effective compared to the other 3 systems
  • Cranfield II Test - measure of effectiveness is explicitly relevance-based.
  • Recall = Relevant documents retrieved / Relevant documents in collection
  • Precision = Relevant documents retrieved / Documents retrieved
  • Experiments show inverse relationship between recall and precision.
  • Remaining problem for information retrieval research is the nature and reliability of relevance judgement.
  • If relevance was to be employed as a performance criterion, the form of relevance judgement employed had to be objective enough to serve as the basis of the measure of effectiveness.

Chapter 3 - Cognitive User Modeling

  • Information retrieval was to be facilitated by means of a "conversation" or "dialogue" between the user and the system which was meant to resemble the personal communication between human minds through conversation.
  • Information need must be seen as a dynamic entity, not as something which remains static or unchanged during the course of the search (e.g. berry-picking model).
  • Context graph - the importance of context in information retrieval.
  • GRUNDY Program - use of stereotype to filter novels for users.
opera fix
Measuring the Affective Information Environment of Web Searchers (Nahl)
opera fix
Print   Minimize 
opera fix

Three ways to evaluate searchability:

  • Cognitive
  • Affective
  • Sensorimotor

Affective Measures:

  • Acceptance (Acc) = Search engine support + ease
  • Affective Load (AL) = Uncertainty intensified by Time Pressure = U * TP
  • Evaluation (Ev) = Acceptance + Satisfaction = Acc + S
  • Expected Effort (Ex Eff) = Expected effort at beginning
  • Felt Effort (Felt Eff) = Felt effort at end
  • Optimism (Op) = Keep trying + Good search engines + Lots of Info
  • Relevance (Rel) = Rating of results
  • Satisfaction (S) = Worthwhile + Relevance = W + Rel
  • Self-Efficacy (SE) = Sure of success + Getting good at + Good luck
  • Task Completion Motivation (TCM) = Importance + Getting upset
  • Time Pressure (TP) = Expected length - Felt length
  • Uncertainty (U) = Irritation + Anxiety + Frustration + Rage
  • User Coping Skills (UCS) = Self-Efficacy + Optimism = SE + Op
  • Worthwhile (W) = Expected effort at start - Felt effort at end = Ex Eff - Felt Eff
opera fix

Copyright 2008 by WillWork.Org
Terms Of Use | Privacy Statement