Skip to content

Latest commit

 

History

History
302 lines (253 loc) · 8.86 KB

File metadata and controls

302 lines (253 loc) · 8.86 KB

Roadmap to learn Data Science

About

This is a repository which contains a structured list of resources which includes online MOOC's and free ebooks to assist beginners who have just started with Data Science. Please feel free to contribute to this repository and leave a ⭐ if you found it helpful.

Disclaimer

It is not necessary to complete all the courses mentioned below. If you feel that you are already proficient with some sections feel free to move to the next section.

1.0 Foundations


Intro To Programming [OPTIONAL]

Index Course Name Link Description
1. Introducion to Programming Link to the Course A brief overview of programminig(no prior programming language experience needed)
2. Introducion to Data Science Link to the Course This is a great mini course to get a birds eye view of every toipc of Data Science


Python Programming

Index Course Name Link Description
1. Python for Everybody Specialization Link to the Course This is for beginners who have never programmed in their life!
2. Python 3 Tutorial by Jose Portilla Link to the Course This is by far the best course to learn Complete Python 3 concepts!
3. Kaggle MicroCourse Link to the Course This is a great course to test your skills!!Look out for the Blackjack problem 😁
4. Python Exercises Link to the Exercises This is a great problem set for Python


Python Libraries

Index Course Name Link Description
1. Numpy Link to the Course A course covering the basics of the Numpy Library. Numpy is the fundamental package for scientific computing with python.
2. W3 Resources Numpy Exercises Link to the Exercises Amazing resource to practice Numpy functions
3. Pandas Link to the Course This is a video course covering the basics of the Pandas Library. Pandas is the fundamental package for data analysis in python.
4. Kaggle Pandas Course Link to the Course This is a course covering Pandas library with challenging exercises
5. W3 Resources Pandas Exercises Link to the Exercises Amazing resource to practice Pandas
6. Matplotlib Course Link to the Course A tutorial that focuses on using matplotlib to visualize data. Matplotlib is python library for creating static visualizations.
7. Kaggle Data Viz Course Link to the Course This is a complete course covering all the important visualization libraries with challenging exercises


Mathematical Foundations

Index Course Name Link Description
1. Overview Link to the Video Overview of Why Maths is needed in Data Science
2. Khan Academy-Linear Algebra Link to the Playlist Great playlist by Khan Academy to learn linear algebra
3. Linear Algebra for Machine Learning[OPTIONAL] Link to the Video Complete Linear Algebra Course
4. Statistics-Khan Academy Link to the Video Complete Statistics Course
5. Probability by Khan Academy Link to the Playlist Amazing resource to learn Probability from Khan Academy!
6. Learn Statistics Visually Link to the Playlist Learn Statistics in a fun way
7. Maths for Machine Learning[HIGHLY OPTIONAL] Link to the Course Specializtion to learn Maths for ML
8. Maths for Machine Learning Ebook[Reference book] Link to the Book PDF Use this as a Reference book
9. Choosing which statistical test to use - statistics help. Link to the Exercises A tutorial explaining seven statistical test and how to use them.


2.0 Data Analyst (To Be Completed)


SQL

Index Course Name Link Description
1. MySQL Tutorial for Beginners [Full Course][HIGHLY RECOMMENDED] Link to the Video Excellent for all beginners. Very well structure and with both theory AND exercises for practise. Also, seems pretty complete. -> +3 hours! :)
2. SQL For Data Science Tutorial | Learn SQL Database For Data Science Link to the Video This video can be used to get a general overview of how useful SQL can be, both in general and for ML. +/- 30 minutes
3. Database Normalization in SQL - 1NF, 2NF, 3NF, 4NF - SQL Link to the Course This video is very specific and deals with Normalization in databases. +/- 4à minutes
4. Logical Database Design and E-R Diagrams Link to the Video !Another specialized video, this time dealing with the concepts of design and diagrams. (seems closer to real business use) +/- 30 minutes


TABLEAU

  • To be Completed.



3.0 Data Science (To Be Completed)


Courses?

Index Course Name Link Description
1. Complete Python Bootcamp for Data Science Link to the Course A complete course covering all the important data science libraries


4.0 Machine Learning (To Be Completed)


Supervised Algorithms


Unsupervised Algorithms


Cluster Algorithms



5.0 Deep Learning (To Be Completed)


Artificial Neural Networks


Reinforcement Learning


Computer Vision




6.0 Data Engineering (To Be Completed)

Docker