Information Retrieval and Extraction
Fall 2003
Fridays, 9:10 AM ~ 12:00 PM
Instructor: Berlin Chen

Homework Webpage

Topic List and Schedule

9/12
 
  Break
 
 
 9/19
 
  Course Overview & Introduction
 
 
9/26
 
Retrieval Models (I) - Classic Retrieval Models: Boolean, Vector Space and Probabilistic Models
10/3



 
Retrieval Evaluation (I) - Measures

 
HW-1: Evaluation Measures (Due 10/17)
HW-2: Classic Retrieval Models (Due 10/31)  <See Homework Web Page>
10/10
 
Break
 

 
10/17

 
Retrieval Evaluation (II) - Reference Collections
Retrieval Models (II) - Fuzzy Set, Extended Boolean, Generalized Vector Space Models 
 
10/24

 
Retrieval Models (II) - Fuzzy Set, Extended Boolean, Generalized Vector Space Models
Query Operations (Query Expansion and Term Re-weighting)
 
10/31


 
Query Operations (Query Expansion and Term Re-weighting)

HW-3: Query Expansion and Term Re-weighting  (Due 11/21)  <See Homework Web Page>
 
 
11/7
 
Retrieval Models (IV) - HMM/N-gram-based, LSI, PLSA

 

11/14
 
Break
 
11/21
 
Retrieval Models (IV) - HMM/N-gram-based, LSI, PLSA

 
11/28

 
Retrieval Models (III) - Structural Retrieval Models and Browsing Models
Text Clustering Techniques
 
12/5

 
Text Clustering Techniques
Query Languages
 
12/12

 
Query Languages
Text Languages and Text Statistics

 
12/19
 
Workshop on Digital Archives Technology and Innovalue
 
12/26
 

 
Paper Presentation
劉成韋
 
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models (SIGIR 2003)
蔡明瑾
  A Comparative Study on Content-Based Music Genre Classification (SIGIR 2003)
郭榮芳
  LSI (I): Theory of LSI
顏永泰
  LSI (II): GTP (General Text Parser) Software for Text Mining
 
1/2
 
Paper Presentation
朱惠銘
 A System for New Event Detection (SIGIR 2003)
謝明晃
 Search Strategies in Content-Based Image Retrieval (SIGIR 2003)
張志豪
 Generalized vector spaces model in information retrieval (SIGIR 1985)
1/9
 

 
Indexing and Searching
Text Preprocessing, Text Compression

 
1/12
 

 
Final Exam
 

Textbook: 

1. R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval, Addison Wesley Longman, 1999.
 
2. W. B. Croft and J. Lafferty (Editors), Language Modeling for Information Retrieval, Kluwer Academic Publishers, July 2003.
 

References:
 
Books:

1. W. B. Frakes and R. Baeza-Yates, Information Retrieval: Data Structures & Algorithms,  Prentice-Hall, 1992.
2. A. Del Bimbo, Visual Information Retrieval, Morgan Kaufmann, 1999.
3. I. H. Witten, A. Moffat, and T. C. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images, Morgan Kaufmann Publishing, 1999.
4. C. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, MIT Press, 1999.
5. D. Jurafsky and J. H. Martin, Speech and Language Processing, Prentice-Hall, 2000.

Papers:

Grading:
     1. Final: 20%
     2. Presentations: 20%
     3. Homework: 20%
     4. Project: 25%
     5. Attendance/Other: 15%

    

Note: If you have any problems, please contact me directly or contact the TA.

TA: Roger Kuo (郭人瑋)  
       Tel: 2932-2411 ext 208
       Email: rogerkuo@csie.ntnu.edu.tw