James, Witten, Hastie, Tibshirani
book
completed
Notes Best practical stats-based ML introduction ever written.
Russell & Norvig
book
completed
Notes The standard AI textbook. Search, logic, planning, learning, language.
Vaswani, Shazeer, Parmar, Uszkoreit, Jones, Gomez, Kaiser, Polosukhin
paper
completed
Notes Introduced the Transformer architecture. The paper that started everything.
Devlin, Chang, Lee, Toutanova
paper
completed
Notes Bidirectional pre-training via masked language modeling. Defined the pre-train/fine-tune paradigm.
Wei, Wang, Schuurmans, Bosma, Xia, Chi, Le, Zhou
paper
completed
Notes Step-by-step reasoning via prompting. Unlocked a new capability class.
Bai, Kadavath, Kundu, Askell, Kernion, Jones, Chen, et al.
paper
completed
Notes Self-critique and revision using principles instead of human labels.
Christiano, Leike, Brown, Martic, Legg, Amodei
paper
completed
Notes Foundational RLHF paper. Learning reward models from human comparisons.
Rafailov, Sharma, Mitchell, Ermon, Manning, Finn
paper
completed
Notes Bypasses reward modeling entirely. Simpler alignment, same results.
Dao, Fu, Ermon, Rudra, Ré
paper
completed
Notes IO-aware attention that is both faster and uses less memory. Essential infrastructure.
Child, Gray, Radford, Sutskever
paper
completed
Notes Sparse attention patterns for long-range dependencies. O(n√n) attention.
Douglas Hofstadter
book
completed
Notes Consciousness, self-reference, formal systems. Mind-altering.
Brown, Mann, Ryder, Subbiah, Kaplan, Dhariwal, Neelakantan, et al.
paper
completed
Notes 175B parameters. In-context learning emerges at scale. Changed the field.
Radford, Wu, Child, Luan, Amodei, Sutskever
paper
completed
Notes Showed large LMs can perform tasks zero-shot. Introduced the scaling intuition.
Touvron, Lavril, Izacard, Martinet, Lachaux, Lacroix, et al.
paper
completed
Notes Open-weight models competitive with GPT-3. Catalyzed the open-source LLM ecosystem.
Michael Nielsen
book
completed
Notes Visual, intuitive intro to neural nets. Build understanding from scratch.
Shazeer, Mirhoseini, Maziarz, Davis, Le, Hinton, Dean
paper
completed
Notes Mixture of Experts with learned gating. Conditional computation at scale.
Kevin P. Murphy
book
completed
Notes VAEs, flows, diffusion, GPs, causal inference — the advanced sequel.
Kevin P. Murphy
book
completed
Notes Comprehensive modern ML textbook with solid probabilistic foundations.
Yao, Zhao, Yu, Du, Shafran, Narasimhan, Cao
paper
completed
Notes Interleaving reasoning traces and actions. The prompting pattern behind most LLM agents.
Sutton & Barto
book
completed
Notes The RL bible. Bandits to policy gradients to planning.
Kaplan, McCandlish, Henighan, Brown, Chess, Child, Gray, Radford, Wu, Amodei
paper
completed
Notes Power-law relationships between compute, data, parameters, and loss. Empirical scaling science.
Jurafsky & Martin
book
completed
Notes The canonical NLP book, updated for the LLM era.
Richard McElreath
book
completed
Notes The modern Bayesian workflow explained with clarity and care.
Andrej Karpathy
blog
completed
Notes Seminal blog post demonstrating char-level RNN power. Shakespeare, LaTeX, kernel code generation.
Allen B. Downey
book
completed
Notes Bayesian statistics by writing code. Ideal computational on-ramp.
Allen B. Downey
book
completed
Notes Probability and statistics for programmers, learn by coding.
Schick, Dwivedi-Yu, Dessì, Raber, Lomeli, Zettlemoyer, Cancedda, Scialom
paper
completed
Notes LMs learning when and how to call external tools. Key step toward agentic LMs.
Hoffmann, Borgeaud, Mensch, Buchatskaya, Cai, Rutherford, et al.
paper
completed
Notes Showed most LLMs were undertrained. Optimal ratio of data to parameters.
Ouyang, Wu, Jiang, Almeida, Wainwright, Mishkin, Zhang, et al.
paper
completed
Notes RLHF applied to GPT-3. The bridge from raw LM to useful assistant.
Simon J. D. Prince
book
completed
Notes Modern DL explained cleanly without hype.
Ray Solomonoff
paper
queued
Notes Foundational paper on algorithmic probability and universal induction. Basis for AIXI.
Csaba Szepesvári
book
queued
Notes Free condensed RL theory book; rigorous and compact. Alternative formal RL resource.
Marcus Hutter, Elliot Catt, David Quarel
book
completed
Chapman & Hall/CRC
Notes Newer textbook presenting AIXI and related universal agent theory with updated structure; suitable after the core UAI book.
Stuart J. Russell, Peter Norvig
book
completed
4th (2020)
Notes Standard AI textbook covering search, logic, probabilistic reasoning, RL, multiagent systems, and more. Canonical comprehensive AI text.
David Barber
book
completed
Judea Pearl
book
completed
Ray Solomonoff
paper
queued
Notes Later work on convergence properties of Solomonoff induction.
David Silver
course
queued
YouTube / DeepMind
Notes Comprehensive lecture series covering RL foundations.
Ian Goodfellow, Yoshua Bengio, Aaron Courville
book
completed
François Chollet
book
completed
Catherine F. Higham, Desmond J. Higham
book
queued
Notes Perspective of deep learning from applied math. Bridges math with neural nets.
Aston Zhang, Zachary C. Lipton, Mu Li, Alexander J. Smola
book
queued
Notes Free open-source deep learning book with code and math integrated. Interactive deep learning resource with runnable code.
Douglas R. Hofstadter
book
queued
Notes Explores analogy and cognition via computational models. Classic work on analogy and cognition modeling.
Kevin P. Murphy
book
completed
Marcus Hutter
talk
completed
YouTube
Notes Technical talk touching on universal induction and AI foundations.
Marcus Hutter
lecture
completed
YouTube
Notes Lecture on AIXI and universal intelligence theory with deep insights.
Christopher M. Bishop
book
completed
Moritz Hardt, Benjamin Recht
book
queued
Notes Modern graduate ML text with causal inference, decision making, and ML foundations. Accessible free textbook with strong conceptual framing.
Daphne Koller, Nir Friedman
book
completed
Richard S. Sutton, Andrew G. Barto
book
completed
Notes Mathematical RL fundamentals (MDPs, value functions, dynamic programming, approximate methods). RL foundational text that bridges theory and practice.
Judea Pearl, Dana Mackenzie
book
completed
Trevor Hastie, Robert Tibshirani, Jerome Friedman
book
completed
Pedro Domingos
book
completed
Andrej Karpathy
paper
completed
Review Seminal blog post demonstrating the power of character-level RNNs. Shows Shakespeare generation, Wikipedia generation, LaTeX generation, and Linux kernel code generation. The visualizations of …
Marcus Hutter
book
completed
Springer (Texts in Theoretical Computer Science)
Notes Formal theory of universal agents combining Solomonoff induction and sequential decision theory; foundational for AGI theory.
Douglas Hofstadter
book
completed
Notes Classic exploration of self-reference, formal systems, and the nature of mind.