This Blog contains a few useful stuff for regarding data science. They focus more on the technical side. My suggestion is, do not try to learn them all at once, human beings are more productive when we work on a single task.
Recommended Courses
- Andrew Ng’s ML Course CS229
- Columbia University’s EdX ML Course
- Hugo Larochelle’s Sherbrooke U Neural Networks Course
- Chris Manning & Richard Socher’s Stanford NLP with Deep Learning Lectures
- Google’s ML Crash Course
- Intro to Database - CMU
- CS Missing Semester About Tools - MIT
- Statistical Rethinking
- Introduction to Algorithms
Recommended Reading
- Machine Learning: The High Interest Credit Card of Technical Debt
- A Few Useful Things to Know about Machine Learning
- Rules of Machine Learning: Best Practices for ML Engineering
- Machine Learning Cheatsheet
- Statistical Rethinking
- Mathematics for Machine Learning
- Machine Learning Cheatsheets CS229
- Graphical Probability
- Python Algorithms Implementations
- Convolutional Neural Networks for Visual Recognition - Stanford CS231N
Textbooks
- Bishop - Pattern Recognition & Machine Learning
- Murphy - Machine Learning: A Probabilistic Perspective
- MacKay - Information Theory, Inference, and Learning Algorithms
- Sutton, Barto - Reinforcement Learning: An Introduction
- Goodfellow, Bengio, Courville - Deep Learning
- Jurafsky, Martin - Speech and Language Processing
- An Introduction to Statistical Learning
- The Elements of Statistical Learning
- Convex Optimization - Boyd
- Math For Machine Learning - UPenn
Major Academic Conferences
Interesting Algorithms/Implementations
- Annoy - Approximate Nearest Neighbor (Oh Yeah!) - Spotify
- Listing2Vec - Airbnb
- Uplift modeling and causal inference -Uber
- Machine Learning Pipeline
- Personalized Cuisine Filter
- Recurrent Unit Recommender
- Distill Pub
- Stitch Fix
Visualization
- Kepler - Uber
- Pyecharts - Baidu
- R2D3
- Plotly
- ShinyR
- Easy Website
- Sample Shiny Tool
- Flourish
- D3 Visualization
Experimentation
- Facebook - PlanOut
- Netflix
- Uber - Experimentation
- Airbnb
- Convoy - Bayesian A/B
- Experiment Design Tool R
- Non-Inferiority - StitchFix
- Simple Experiment Design in R
- Simpsons Paradox
- Experiment Pitfalls
Multiple Arms Bandit Models
- Thompson Sampling Tutorial
- Bernoulli Distribution Case
- Doordash Recommendation Case
- R contextual Package
- Python Application
- Optimizely Multi Bandit Introduction
- Introduction to Multi Bandit Arms
- Algorithms for the multi-armed bandit problem
Natural Language Processing
Chinese Machine Learning Resources
Bayesian
- Bayesian AB in R
- Bayesian Methods for Hackers
- Bayesian Introduction
- Bayesian Power Analysis
- abtest in R - A Bayesian Approach
Useful Stat Courses
- Probability at Duke
- Probability at UCLA
- Mathematical Statistics at Duke
- Mathematical Statistics at UCLA
- Optimization and Statistics
- BRMS
- Statistics for Applications - MIT
Data Science Interview
- 120 Data Science Interview Questions
- Data Science Interview at Top Tech Companies
- Basic Stats Models in R
- Statistical Test and GLM
- R and Machine Learning
Causal Inference
- Causal Impact
- Causal Net
- Causal ML - Uber
- DoWhy - MicroSoft
- Basic Notations - MIT
- CAUSAL INFERENCE: THE MIXTAPE