This is truly a classic paper. Read it to appreciate Bush's great vision which has NOT yet completely realized. As a minimum, read everything starting from section 6.
The goal of reading these notes is to know about some basic concepts in probability, statistics, and information theory. You should read at least Section 3 of the estimation note and all of the information theory note except
for section 1.1.6. You should fully understand
the derivation of the maximum likelihood estimate for the binomial distribution, and most of the
contents in the information theory notes. If you can't understand these, you may want to consult
a textbook on probability and
statistics, and a book on information theory. Any book on these topics should be sufficient.
The goal is to know about the overall state of the art of statistical language models.
Your should try to read the whole paper, but don't worry about some
of the details that you can't understand. It's fine to skip some details.
This is a very good overview paper of IR, though it's a bit out of date and slightly biased toward empirically effective techniques. Your goal of reading it is to know about the general history of IR and a summary of IR techniques from empirical perspective. Read the whole paper.
Read at least Section 1 and Section 2 to know how to compute basic retrieval measures.
Read 8.3 and 8.4. Other sections should also be very interesting to read, though not required.
Read this entire review to get a good picture of all the retrieval models
Read the entire chapter 6 and Section 7.1.
Read 9.1.1. The rest of the chapter should also be very interesting to read if you want.
Optional reading. This is a nice evaluation of different weighting methods. Read it if you want to know about many variations of TF-IDF weighting and which variant is relatively more effective.
All optional. Read whatever you feel is useful to you.
Optional reading. Chapter 11 has a good introduction to "classic probabilistic models", which we didn't cover
in detail. Okapi was derived from this family of models with lots of heuristic modifications. Chapter 12 covers
the language modeling approach but not in-depth and may be hard to follow.
Read up to section 9.1. That is, skip 9.2 and everything after it. Focus on understanding the basic idea
of the query likelihood scoring method, the Dirichlet prior smoothing method, and the two-stage smoothing method.
The goal is to know the KL-divergence scoring formula
and how a mixture model can be used to do feedback.
Read the whole paper and try to understand how the mixture model works. Ignore the divergence minimization method.
Read the entire note.
Optional. Read it if you really want to understand the EM algorithm rigorously.
Read the whole paper. This is a classic paper about Google's PageRank algorithm. Your main goal is to understand the basics of the PageRank algorithm.
This article explains clearly how to use the Power Method to compute PageRank.
Read Section 1, Section 2.1-2.2. The goal is to know how memory-based algorithms work.
The main goal is to understand Section 3. You may want to read some other parts especially Sec 1 and Sec 2 to get some background.