Check out the (early) project and source code on GitHub.
Abstract:
This paper introduces a methodology for generating high-quality, diverse training data for Language Models (LMs) in complex problem-solving domains. Our approach, termed …
A logic programming system that alternates between wake and sleep phases—using LLMs for knowledge generation during wake, and compression-based learning during sleep.
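To make the wake/sleep alternation concrete, here is a minimal Python sketch of such a loop. Everything in it is hypothetical: the function names (`llm_propose_facts`, `compress_knowledge_base`), the string-based knowledge base, and the toy compression heuristic are stand-ins, not the project's actual representation or learning rule.

```python
# Minimal sketch of a wake/sleep loop (hypothetical names and data shapes).
# Wake: an LLM proposes candidate rules for the knowledge base.
# Sleep: the knowledge base is consolidated by a compression-style pass.

from dataclasses import dataclass, field


@dataclass
class KnowledgeBase:
    rules: set[str] = field(default_factory=set)

    def description_length(self) -> int:
        # Crude proxy for description length: total characters across all rules.
        return sum(len(r) for r in self.rules)


def llm_propose_facts(kb: KnowledgeBase, n: int = 5) -> list[str]:
    # Placeholder for an LLM call that proposes new candidate rules,
    # conditioned on the current knowledge base.
    return [f"rule_{len(kb.rules) + i}" for i in range(n)]


def compress_knowledge_base(kb: KnowledgeBase) -> KnowledgeBase:
    # Placeholder for compression-based learning: here, a toy subsumption
    # heuristic that drops any rule which is a proper prefix of another rule.
    kept = {
        r for r in kb.rules
        if not any(other != r and other.startswith(r) for other in kb.rules)
    }
    return KnowledgeBase(rules=kept)


def wake_sleep(iterations: int = 3) -> KnowledgeBase:
    kb = KnowledgeBase()
    for _ in range(iterations):
        # Wake phase: knowledge generation via the LLM.
        kb.rules.update(llm_propose_facts(kb))
        # Sleep phase: compression-based consolidation.
        kb = compress_knowledge_base(kb)
    return kb


if __name__ == "__main__":
    print(wake_sleep().rules)
```

The point of the sketch is only the control flow: generation and compression never run at the same time, and the compressed knowledge base is what the next wake phase conditions on.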
RLHF turns pretrained models into agents optimizing for reward. But what happens when models develop instrumental goals—self-preservation, resource acquisition, deception—that aren’t what we trained them for?
I’ve been thinking about the power and limitations of abstractions in our
understanding of the world. This blog post is from a chat I had with ChatGPT,
which can be found here
and here.