Scoring, term weighting and the vector space model
Dávid László
Introduction
- Boolean queries -> too many matching documents, so results must be ranked
- Parametric and zone indexes
- Weighting
- Vector space scoring
- Variants of term weighting
Parametric and zone indexes
- Metadata
- Fields - parametric indexes
- Zones
- Weighted zone scoring
- Learning weights
- The optimal weight g
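Weighted zone scoring can be sketched in a few lines. This is a minimal illustration, not a reference implementation: it assumes each zone is a plain string, that a zone "matches" only when every query term occurs in it (AND semantics), and that the zone weights sum to 1. The names `weighted_zone_score` and `optimal_g`, the sample document, and the counts passed to `optimal_g` are all made up for illustration. `optimal_g` uses the closed form from the usual two-zone textbook derivation, where for example `n10r` counts training examples that match the first zone only and were judged relevant.

```python
def weighted_zone_score(doc_zones, query_terms, weights):
    """Return sum_i g_i * s_i, where s_i is 1 if zone i matches the query."""
    score = 0.0
    for zone, g in weights.items():
        tokens = set(doc_zones.get(zone, "").lower().split())
        # Assumption: a zone matches only if it contains every query term.
        s = 1 if all(term in tokens for term in query_terms) else 0
        score += g * s
    return score


def optimal_g(n10r, n10n, n01r, n01n):
    """Two-zone optimal weight g from counts of training examples.

    n10* : first zone matches, second does not; n01* : the reverse.
    *r / *n : judged relevant / non-relevant.
    """
    return (n10r + n01n) / (n10r + n10n + n01r + n01n)


# Hypothetical weights and document, chosen only for illustration.
weights = {"title": 0.25, "author": 0.25, "body": 0.5}
doc = {
    "title": "vector space model",
    "author": "salton",
    "body": "the vector space model scores documents",
}

print(weighted_zone_score(doc, ["vector", "model"], weights))
print(optimal_g(n10r=3, n10n=1, n01r=2, n01n=2))
```

Title and body match the query while the author zone does not, so the score is the sum of the title and body weights.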
Term frequency and weighting
- Term frequency
- Document frequency
- Inverse document frequency
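The three quantities above combine into the tf-idf weight, tf-idf(t, d) = tf(t, d) * log(N / df(t)). The sketch below assumes whitespace tokenization, raw term counts for tf, and a base-10 logarithm; the function name `tf_idf` and the toy corpus are illustrative only.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: tf-idf weight} dict per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain the term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        tf = Counter(tokens)  # term frequency within this document
        weights.append(
            {term: tf[term] * math.log10(n_docs / df[term]) for term in tf}
        )
    return weights


# Toy corpus, invented for illustration.
docs = ["car insurance auto insurance", "best car", "auto car best"]
for doc_weights in tf_idf(docs):
    print(doc_weights)
```

A term that occurs in every document ("car" here) gets idf = log(1) = 0 and so contributes nothing, while a rare term that is repeated within one document ("insurance") gets the largest weight.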


The vector space model for scoring
- Dot products
- Cosine similarity

- Magnitude of the angle theta

- Queries as vectors
- Computing vector scores
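Dot products, cosine similarity, and query-as-vector scoring can be illustrated together. The sketch below represents each vector as a sparse `{term: weight}` dict and ranks documents by the cosine of the angle theta between the query and document vectors. The helper names (`dot`, `cosine`, `angle_degrees`, `rank`) and the sample weights are assumptions for illustration; a real system would score through an inverted index rather than looping over every document.

```python
import math


def dot(u, v):
    """Dot product of two sparse vectors stored as {term: weight}."""
    return sum(w * v.get(term, 0.0) for term, w in u.items())


def cosine(u, v):
    """Cosine of the angle between u and v; 0.0 if either is all zeros."""
    norm_u = math.sqrt(dot(u, u))
    norm_v = math.sqrt(dot(v, v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot(u, v) / (norm_u * norm_v)


def angle_degrees(u, v):
    """Angle theta between u and v, clamped against rounding error."""
    return math.degrees(math.acos(max(-1.0, min(1.0, cosine(u, v)))))


def rank(query_vec, doc_vecs, k):
    """Return the top-k (score, doc_id) pairs by cosine similarity."""
    scored = [(cosine(query_vec, vec), doc_id) for doc_id, vec in doc_vecs.items()]
    return sorted(scored, reverse=True)[:k]


# Made-up document and query weights.
doc_vecs = {
    "d1": {"car": 1.0, "insurance": 2.0},
    "d2": {"best": 1.0, "car": 1.0},
    "d3": {"best": 1.0},
}
query = {"insurance": 1.0, "car": 1.0}

for score, doc_id in rank(query, doc_vecs, k=2):
    print(doc_id, round(score, 3))
```

Because cosine similarity divides by both vector lengths, a long document does not outrank a short one merely by repeating terms; only the direction of the vector matters.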
Variant tf-idf functions


- Document and query weighting schemes
- Pivoted normalized document length
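A few of the variants can be written as small functions. This is a sketch under stated assumptions: sublinear tf scaling uses a base-10 log, maximum tf normalization uses a smoothing constant `a` (0.4 is a commonly quoted default), and pivoted normalization replaces the document's Euclidean length with a line of slope below 1 that crosses the identity line at the pivot. The function names and the example numbers are illustrative, and in practice the pivot and slope are tuned on relevance data.

```python
import math


def sublinear_tf(tf):
    """Sublinear scaling: 1 + log(tf) for tf > 0, else 0."""
    return 1.0 + math.log10(tf) if tf > 0 else 0.0


def max_tf_normalized(tf, tf_max, a=0.4):
    """Maximum tf normalization: a + (1 - a) * tf / tf_max."""
    return a + (1.0 - a) * tf / tf_max


def pivoted_length(length, pivot, slope):
    """Pivoted normalization factor: slope * length + (1 - slope) * pivot."""
    return slope * length + (1.0 - slope) * pivot


# Hypothetical values: pivot and slope would normally be fitted to data.
for length in (5.0, 10.0, 40.0):
    print(length, pivoted_length(length, pivot=10.0, slope=0.75))
```

With a slope below 1, documents shorter than the pivot are divided by a larger factor than their true length and documents longer than the pivot by a smaller one, which offsets the tendency of cosine normalization to favor short documents.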