review

Module 08 Self-Test

  1. Why is accuracy bad for imbalanced data? What metric do you use instead?
  2. What is SMOTE and what's the critical rule for using it?
  3. Explain the consecutive-days SQL trick in your own words.
  4. When do you use pivot_table vs melt in Pandas?
  5. CAP theorem: what do the three letters stand for?
  6. p = 0.08 at α = 0.05. Is the null hypothesis true?

Practice Questions

Q: Why is accuracy bad for imbalanced data? What metric do you use instead?
Q: What is SMOTE and what's the critical rule for using it?
Q: Explain the consecutive-days SQL trick in your own words.
Q: When do you use pivot_table vs melt in Pandas?
Q: CAP theorem: what do the three letters stand for?
Q: p = 0.08 at α = 0.05. Is the null hypothesis true?