December 3, 2025
NLP
Browse posts by tag
December 3, 2025
Infinigram: Variable-Length N-grams via Suffix Arrays
A corpus-based language model using suffix arrays for O(m log n) pattern matching. The corpus is the model.
November 30, 2025
BTK: Bookmark Toolkit
A database-first bookmark manager with hierarchical tags, content caching, and NLP auto-tagging. Part of the Long Echo toolkit.
August 15, 2016
N-gram Language Models
The classical approach to sequence prediction: counting and smoothing