December 3, 2025 Infinigram: Corpus-Based Language Modeling via Suffix Arrays with LLM Probability Mixing
December 3, 2025 Infinigram: Variable-Length N-grams via Suffix Arrays A corpus-based language model using suffix arrays for O(m log n) pattern matching. The corpus is the model. machine-learning NLP