Dashboard/Week 5/Day 3
WEEK 5 · WEDNESDAY

Week 5 · Day 3 — Random Forests & Feature Importance

2.2 hours·14 sections
Section 1 of 140/14 done

Week 5 · Day 3 — Random Forests & Feature Importance

Theme: Machine Learning Tryouts 🤖⚽ Time: ~75 minutes Tools: Python, scikit-learn, pandas, matplotlib Goal: Upgrade to random forests, understand what stats actually predict wins, evaluate properly with cross-validation


Warm-Up: Why One Tree Isn't Enough

Yesterday's decision tree was good. But it has a weakness: it's overconfident.

Train it on one dataset, get one opinion. Change one match in your training data and the tree can look completely different. It's like asking one scout to judge every player — their biases dominate.

The fix: Ask 500 scouts. Each one studies a different random sample of games. Then take a vote.

That's a Random Forest.