Language-Models

Browse posts by tag

April 24, 2026

Speech and Language Processing (3rd ed. draft)

Notes

The canonical NLP book, updated for the LLM era.

April 24, 2026

The Unreasonable Effectiveness of Recurrent Neural Networks

Notes

Seminal blog post demonstrating char-level RNN power. Shakespeare, LaTeX, kernel code generation.

December 17, 2025

The Unreasonable Effectiveness of Recurrent Neural Networks

Review

Seminal blog post demonstrating the power of character-level RNNs. Shows Shakespeare generation, Wikipedia generation, LaTeX generation, and Linux kernel code generation. The visualizations of LSTM cells are particularly illuminating.

December 3, 2025

Infinigram: Corpus-Based Language Modeling via Suffix Arrays with LLM Probability Mixing

December 3, 2025

Infinigram: Variable-Length N-grams via Suffix Arrays

A corpus-based language model using suffix arrays for O(m log n) pattern matching. The corpus is the model.

machine-learning NLP

October 7, 2025

Language Calculus: An Algebraic Framework for LLM Composition

A mathematical framework that treats language models as algebraic objects with compositional structure.

September 20, 2024

Neural Language Models: From RNNs to Transformers

The evolution of neural sequence prediction, and how it connects to classical methods

Machine Learning Deep Learning

June 15, 2024

Comparing Prediction Methods: CTW vs. N-grams vs. Neural LMs

The bias-data trade-off in sequential prediction: when to use CTW, n-grams, or neural language models.

Machine Learning Information Theory

August 15, 2016

N-gram Language Models

The classical approach to sequence prediction: counting and smoothing

Machine Learning Natural Language Processing