A list of some useful Dataset to explore Machine Learning

Tabular Dataset

List of dataset
Number Nom Download link Type Industry Target Detail
1 churn csv Classification B2B churn Churn usecase
2 Forecast Energy France trainset csv Regression Energy TARGET Energy usecase
3 Forecast Energy France validation csv Regression Energy TARGET Energy usecase
4 DNS Attacks Origins csv Multiclassification Energy Class DNS usecase
5 Sales Forecasting csv Regression Retail Weekly_Sales  
6 Songs Hits csv Classification Retail target  
7 House pricing Regression - Trainset csv Regression Retail TARGET House usecase
8 House pricing Regression - Holdout csv Regression Retail TARGET House usecase
9 EDF Classification - Trainset csv Classification Energy TARGET  
10 EDF Classification - Holdout csv Classification Energy TARGET  
11 Sales Timeseries - Trainset csv Timeserie Retail Volume  
12 Sales Timeseries - Holdout csv Timeserie Retail Volume  
             
             

Images

Images Folders
Number Nom Download link Type Industry
1 youtube train You tube adds Trainset Images Ads
2 youtube train labels You tube adds Trainset Labels Images Labels Ads
3 youtube test You tube adds Testset Images Ads
4 youtube test Labels You tube adds Testset Labels Images Labels Ads
5 Cheezam Cheezam Images Images Ads
6 Cheezam Labels Cheezam Labels Images Labels Ads

NLP

NLP Folders
Number Nom Download link Type Industry
1 Netflix catalog Netflix movies with data NLP entertainment
2 French candidates multiclassif trainset sample Tweets of French Presidential Candidates 2022 ( train sample ) NLP politics
3 French candidates multiclassif holdout Tweets of French Presidential Candidates 2022 ( holdout ) NLP politics
         

Externals models

Externals Models
Number Nom Download link type format
1 classication model Files Classification onnx