Reverse-Process Synthetic Data Generation for Math Reasoning
Training LLMs on mathematical reasoning by inverting easy-to-solve problems: differentiate known functions, then reverse each derivative into an integration exercise with a full step-by-step solution.
Part 4 of What Your RL Algorithm Actually Assumes — model-based vs. model-free, the assumptions table, AIXI as the incomputable ideal, and the unifying claim: representation is prior is assumption.
Part 3 of What Your RL Algorithm Actually Assumes — the architecture decides what kind of features can be learned, and that decision is a Bayesian prior over value functions.
Part 2 of What Your RL Algorithm Actually Assumes — how hand-crafted features compress the state space, and what you're betting on when you pick them.
Part 1 of What Your RL Algorithm Actually Assumes — tabular Q-learning makes zero assumptions about state similarity and pays for it in sample complexity.
What if reasoning traces could learn their own usefulness? A simple RL framing for trace memory, and why one reward signal is enough.
The classical AI curriculum teaches rational agents as utility maximizers. The progression from search to RL to LLMs is really about one thing: finding representations that make decision-making tractable.
Why the simplest forms of learning are incomputable, and what that means for the intelligence we can build.
Modern graduate ML text with causal inference, decision making, and ML foundations. Accessible free textbook with strong conceptual framing.
SIGMA uses Q-learning rather than direct policy learning. This architectural choice makes it both transparent and terrifying. You can read its value function, but what you read is chilling.
A logic programming system that alternates between wake and sleep phases, using LLMs for knowledge generation during wake and compression-based learning during sleep.
Learning fuzzy membership functions and inference rules automatically through gradient descent on soft circuits, instead of hand-crafting them.
Three approaches to computing derivatives: forward-mode AD, reverse-mode AD, and finite differences, each with different trade-offs for numerical computing and machine learning.
Science is search through hypothesis space. Intelligence prunes; testing provides signal. Synthetic worlds could accelerate the loop.
Applying Monte Carlo Tree Search to large language model reasoning, with a formal specification of the algorithm.
Using GMM clustering to improve retrieval in topically diverse knowledge bases.
What if LLMs could remember their own successful reasoning? A simple experiment in trace retrieval, and why 'latent' is the right word.
What if fuzzy logic systems could discover their own rules? An interactive demo of differentiable fuzzy circuits that learn membership functions, rule structure, and rule existence, all via gradient descent.
Solomonoff induction, MDL, speed priors, and neural networks are all special cases of one Bayesian framework with four knobs.
Gradient descent in Euclidean space ignores the geometry of probability distributions. Natural gradient descent uses the Fisher information metric instead. Fisher Flow makes this continuous.
A tiny autodiff library for understanding how backpropagation actually works.
Intelligence as utility maximization under uncertainty. A unifying framework connecting A* search, reinforcement learning, Bayesian networks, and MDPs.
Abstractions let us reason about complex systems despite our cognitive limits. But some systems resist compression entirely.
How the limited capacity of human working memory acts as regularization, shaping our reasoning and possibly preventing cognitive overfitting.
Reverse-mode automatic differentiation is just the chain rule applied systematically. I built one in C++20 to understand what PyTorch and JAX are actually doing.
Encountering ChatGPT during cancer treatment and recognizing the Solomonoff connection. Language models as compression, prediction as intelligence. A personal inflection point reconnecting with AI research after years in survival mode.
Dual numbers extend our number system with an infinitesimal epsilon where epsilon^2 = 0. Evaluating f(x + epsilon) yields f(x) + epsilon * f'(x): the derivative emerges automatically from the algebra.
The problem of predicting what comes next, from compression to language models