April 1, 2026
I Spent $0.48 to Find Out When MCTS Actually Works for LLM Reasoning
Controlled experiments on constraint satisfaction problems. MCTS beats best-of-N only when blind sampling hits a ceiling and the verifier provides a gradient.
Browse posts by tag
Controlled experiments on constraint satisfaction problems. MCTS beats best-of-N only when blind sampling hits a ceiling and the verifier provides a gradient.