joaofbravo/DataScience

Data Science

Project data sets

Data profiling
  • Dimensionality
  • Distribution
  • Granularity
  • Sparsity
  • Correlation
Data preparation
  • Missing-values imputation
  • Anomaly/outlier detection and removal
  • Discretization and dummification
  • Normalization
  • Balancing
Feature engineering
  • Feature selection
  • Feature extraction
  • Feature generation
Unsupervised learning
  • Pattern mining
  • Clustering
Supervised learning
  • Naïve Bayes
  • kNN
  • Decision tree
  • Random forest
  • Gradient boosting
  • Overfitting
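
As a rough illustration of two of the data preparation steps listed above (missing-values imputation and normalization), here is a minimal pure-Python sketch. The function names `impute_mean` and `min_max_normalize` and the sample column are hypothetical and are not taken from the course notebooks, which may well rely on pandas or scikit-learn instead.

```python
from statistics import mean


def impute_mean(values):
    """Replace None entries with the mean of the observed values (hypothetical helper)."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute a column with no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def min_max_normalize(values):
    """Rescale values linearly to the [0, 1] range (hypothetical helper)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map everything to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Made-up sample column with one missing value.
column = [2.0, None, 4.0, 6.0]
imputed = impute_mean(column)            # missing entry filled with 4.0
normalized = min_max_normalize(imputed)  # values rescaled to [0, 1]
```

Mean imputation and min-max scaling are only two of several possible choices; median imputation or z-score standardization would fit the same outline equally well.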

Extra lab: Social network analysis (SNA)

Extra lab: Time-series Analysis and Forecasting

Data sets
  • Profiling
  • Transformation
  • Forecasting
  • Motif discovery
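
The transformation and forecasting topics above can be sketched, under simplifying assumptions, with a moving-average smoother and a persistence (last-value) baseline. Both helper names and the toy series are hypothetical; the lab notebooks may use different methods and libraries.

```python
def moving_average(series, window):
    """Smooth a series with a simple moving average of the given window size."""
    if window < 1 or window > len(series):
        raise ValueError("window must be between 1 and len(series)")
    return [
        sum(series[i:i + window]) / window
        for i in range(len(series) - window + 1)
    ]


def persistence_forecast(series, horizon):
    """Baseline forecast: repeat the last observed value for `horizon` steps."""
    if not series:
        raise ValueError("series must not be empty")
    return [series[-1]] * horizon


# Made-up toy series for illustration only.
series = [1.0, 2.0, 3.0, 4.0, 5.0]
smoothed = moving_average(series, 3)
forecast = persistence_forecast(series, 2)
```

A persistence baseline is useful mainly as a reference point: a forecasting model that cannot beat it on held-out data is probably not capturing any real structure in the series.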

About

Data Science course - 2020

Languages

  • Jupyter Notebook 100.0%