Home > Computers > Software > Information Retrieval > Ranking > By Content
Classical IR Ranking based on document's content
http://www10.org/cdrom/papers/317/
This study evaluates the performance of a state-of-the-art keyword-based document ranking algorithm (coming out of TREC) on a popular web search task.
http://www.cs.ust.hk/~dlee/Papers/ir/ieee-sw-rank.ps.gz
It describes key issues in document ranking techniques based on the vector space model. Several TF*IDF variants are discussed. The cosine measure, recall and precision are introduced. [PS format]
http://goanna.cs.rmit.edu.au/~jz/fulltext/sigirforum98.pdf
Evaluation of many combinations of term frequency statistics, document frequency statistics and document length normalization.
http://isp.imm.dtu.dk/thor/projects/multimedia/textmining/
Description of boolean retrieval, vector space model, probabilistic retrieval, latent semantic indexing and other IR topics. An introduction to various classical ranking methods is also provided.
http://www.cs.berkeley.edu/~christos/ir.ps
Formal introduction to latent semantic indexing. [PS format]
http://www.is.informatik.uni-duisburg.de/bib/docs/Fuhr_92.html
Introduction to probabilistic models.
http://www.dcs.gla.ac.uk/Keith/Chapter.6/Ch.6.html
A Chapter in a book which introduces probabilistic retrieval.
http://www.dcc.uchile.cl/~rbaeza/iradsbook/irbook.html
"Ranking Algorithms" is chapter 14 in the Frakes and Baeza-Yates book. It gives a good discussion of the tradeoffs and choices among different term-weighting strategies.
Home > Computers > Software > Information Retrieval > Ranking > By Content
Thanks to DMOZ, which built a great web directory for nearly two decades and freely shared it with the web. About us