<aside> 💡 These are the materials I personally have found useful (& not so useful), organized by skill areas.

I’m not taking any bootcamps. I’m trying to spend as little money as possible. All resources below but DataMasked are free.

I’m do not receive commissions by linking these. Just my honest opinions!

</aside>

Resources I Found Useful

Statistics & Machine Learning

Practical Statistics for Data Scientists Physical book. Try your local library! Got mine there for $free.99
Data Science Cheat Sheet https://github.com/aaronwangy/Data-Science-Cheatsheet/blob/main/Data_Science_Cheatsheet.pdf
Emma Ding: YouTube Videos & Free Resources on the Website https://www.emmading.com

Python

Pandas: mainly data wrangling https://github.com/guipsamora/pandas_exercises

SQL

Leetcode SQL50 https://leetcode.com/studyplan/top-sql-50/
Data Lemur https://datalemur.com/questions
StrataScratch https://www.stratascratch.com

Case Studies for Interview Prep

DataMasked: for Product Data Scientists https://datamasked.com
Interview Query https://www.interviewquery.com/questions?searchQuery=&searchQuestionTag=&searchCompany=&ordering=Recommended&orderingDirection=ASC&pageSize=100&page=0

Resources I Tried But Did Not Find Useful

<aside> 💡 Other people have recommended these resources, I just personally do not think I got much out of it. However, they may be helpful to you

</aside>

Statistics & Machine Learning

Resource Strength Why I Didn’t Find it Useful
Khan Academy Fun!
Gives foundational statistics and probability concepts Doesn't connect the concepts to Data Science. I felt they are better for taking an exam at school
Stanford: Introduction to Statistics Gives foundational statistics and probability concepts The course has breadth, which means it talks about some concepts I have not seen in practice interview questions, such as “sign test”.

It also doesn’t talk about assumptions of regressions, t-tests, z-tests, which I have seen in published interview questions.

It also goes deep into how to calculate certain things, such as the z-score. When in reality, Python can do that for you. |