 |

 |
| |
Modern Information Retrieval (Baeza) - Chapter 1
|
|
|
|  |  |
|
 |
|
|
- Information Retrieval - representation, storate, organization of, and access to information items.
- Data Retrieval - retrieval of all objects which satisfy clearly defined conditions such as those in a regular expression or in a relational algebra expression.
- The difference between IR and DR is that IR usually deals with natural language text which is not always well structured and could be semantically ambiguous. A DR system deals with data that has a well defined structure and semantics.
- Research in IR includes modeling, document classification and categorization, system architecture, user interfaces, data visualization, filtering, language, etc.
- Finding useful information on the Web is frequently a tedious and difficult task. The main obstacle is the absence of a well-defined underlying data model for the Web, which implies that information definition and structure is frequently of low quality.
- User Task - to translate his information need into a query in the language provided by the system.
- Browsing - a process of retrieving information, but one whose main objectives are not clearly defined in the beginning and whose purpose might change during the interaction with the system.
- Combination of retrieval and browsing is not yet a well established approach and is not the dominant paradigm.
- Logical view of document - from full-text to individual index terms and keywords.
- Three main questions about IR:
- Which techniques will allow retrieval of high quality?
- Which techniques will yield faster indexes and smaller query response times?
- How will a better understanding of the user behavior affect the design and deployment of new information retrieval strategies?
|
 |
|
|
|

 |
| |
Online Information Retrieval (Harter) - Chapter 1
|
|
|
|  |  |
|
 |
|
|
- Information Retrieval - to examine the items in the collection, one by one, accepting some items but rejecting most, until each item has been examined.
- Information Retrieval System - a device interposed between a potential user of information and the information collection itself.
- Early examples of IR systems: commercial airline reservation systems.
|
 |
|
|
|
|
|
 |

|
 |

 |
| |
New Horizons in Information Retrieval (Ellis) - Chapter 1 and 3
|
|
|
|  |  |
|
 |
|
|
Chapter 1 - Introduction
- Cranfield I Test - evaluation of 4 indexing systems:
- UDC (Universal Decimal Classification)
- Alphabetical Subject Index
- Faceted Classification Scheme
- Uniterm System - most effective compared to the other 3 systems
- Cranfield II Test - measure of effectiveness is explicitly relevance-based.
- Recall = Relevant documents retrieved / Relevant documents in collection
- Precision = Relevant documents retrieved / Documents retrieved
- Experiments show inverse relationship between recall and precision.
- Remaining problem for information retrieval research is the nature and reliability of relevance judgement.
- If relevance was to be employed as a performance criterion, the form of relevance judgement employed had to be objective enough to serve as the basis of the measure of effectiveness.
Chapter 3 - Cognitive User Modeling
- Information retrieval was to be facilitated by means of a "conversation" or "dialogue" between the user and the system which was meant to resemble the personal communication between human minds through conversation.
- Information need must be seen as a dynamic entity, not as something which remains static or unchanged during the course of the search (e.g. berry-picking model).
- Context graph - the importance of context in information retrieval.
- GRUNDY Program - use of stereotype to filter novels for users.
|
 |
|
|
|

|
|