Favorite Reading
A Mind for Numbers: How to Excel at Math and Science by Barbara Oakley
Atomic Habits by James Clear
Learn Like a Pro: Science-Based Tools to Become Better at Anything by Barbara Oakley and Olav Schewe
Mindshift: Break Through Obstacles to Learning and Discover Your Hidden Potential by Barbara Oakley
Peak: Secrets from the New Science of Expertise by Anders Ericsson and Robert Pool
Why We Sleep: The New Science of Sleep and Dreams by Matthew Walker
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future by Ashlee Vance
Steve Jobs by Walter Isaacson
Articles
What OpenAI Really Wants. Wired.
Life Inside the Gigafactory. Wired.
Reading
Ericsson, K. A., Krampe, R. T., & Tesch-Römer, C. (1993). The role of deliberate practice in the acquisition of expert performance. Psychological Review, 100(3), 363-406.
Duckworth, A. L., Peterson, C., Matthews, M. D., & Kelly, D. R. (2007). Grit: Perseverance and passion for long-term goals. Journal of Personality and Social Psychology, 92(6), 1087-1101.
Losch, S., Traut-Mattausch, E., Mühlberger, M. D., & Jonas, E. (2016). Comparing the Effectiveness of Individual Coaching, Self-Coaching, and Group Training: How Leadership Makes the Difference. Frontiers in Psychology, 7, 629.
MoE Pretraining Infrastructure @ NousResearch. DeepEP expert parallelism for TorchTitan: 33% faster than the default (14,796 tok/s/GPU), near-linear scaling to 16 nodes (128 GPUs), and capacity for 10 trillion tokens/month at scale, using fused kernels and optimized all-to-all communication.