Favorite Reading

A Mind For Numbers: How to Excel at Math and Science by Barbara Oakley

Atomic Habits by James Clear

Learn Like a Pro: Science-Based Tools to Become Better at Anything by Barbara Oakley

Mindshift: Break Through Obstacles to Learning and Discover Your Hidden Potential by Barbara Oakley

Peak: Secrets from the New Science of Expertise by Anders Ericsson and Robert Pool

Why We Sleep: The New Science of Sleep and Dreams by Matthew Walker

Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future by Ashlee Vance

Steve Jobs by Walter Isaacson

Articles

What OpenAI Really Wants. Wired.

Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future - Life Inside the Gigafactory. Wired.

Reading

Ericsson, K. A., Krampe, R. T., & Tesch-Römer, C. (1993). The role of deliberate practice in the acquisition of expert performance. Psychological Review, 100(3), 363-406.

Duckworth, A. L., Peterson, C., Matthews, M. D., & Kelly, D. R. (2007). Grit: Perseverance and passion for long-term goals. Journal of Personality and Social Psychology, 92(6), 1087-1101.

Losch, S., Traut-Mattausch, E., Mühlberger, M. D., & Jonas, E. (2016). Comparing the Effectiveness of Individual Coaching, Self-Coaching, and Group Training: How Leadership Makes the Difference. Frontiers in Psychology, 7, 629.

MoE Pretraining Infrastructure @ NousResearch. DeepEP Expert Parallelism for TorchTitan: 33% faster than default (14,796 tok/s/GPU), near-linear scaling to 16 nodes (128 GPUs), 10 trillion tokens/month capacity at scale with fused kernels and optimized all-to-all communication.