Chapter 01
PMI and Its Variants
Start here. Learn how Pointwise Mutual Information reveals hidden word associations, and explore its many variants.
PMI, PPMI, NPMI, PMI², PMI^k, Shifted PMI
Beginner
Chapter 02
Co-occurrence & Association
Beyond PMI: statistical tests and measures that quantify how strongly words attract each other.
Log-Likelihood, Chi², Dice, Jaccard, t-score, z-score, Log-Dice, MI, Fisher's
Beginner
Chapter 04
Information Theory
Shannon's gift to NLP: entropy, cross-entropy, and divergence — the language of uncertainty.
Entropy, Cross-Entropy, KL Divergence, JS Divergence, Perplexity
Intermediate
Chapter 08
Readability Formulas
Is your text too complex? Six classic formulas that grade reading difficulty.
Flesch Reading Ease, Flesch-Kincaid, Gunning Fog, Coleman-Liau, SMOG, ARI
Beginner
Chapter 09
Lexical Diversity
How rich is a vocabulary? Measures from the simple type-token ratio to sophisticated statistical indices.
TTR, Hapax, Yule's K, Simpson's D, MTLD, MATTR, vocd-D
Intermediate
Chapter 11
IR Evaluation Metrics
How good is your search? Precision, recall, and ranking metrics that judge retrieval quality.
Precision@K, Recall@K, F1, MAP, NDCG
Intermediate