Machine Learning/Datasets

From Noisebridge
< Machine Learning(Difference between revisions)
Jump to: navigation, search
m
Line 18: Line 18:
 
*[http://www.sci.usq.edu.au/staff/dunn/Datasets/applications/health/fev.html Smoking and Respiratory Function]
 
*[http://www.sci.usq.edu.au/staff/dunn/Datasets/applications/health/fev.html Smoking and Respiratory Function]
 
**How does smoking affect lung capacity?
 
**How does smoking affect lung capacity?
 +
 +
'''Time Series'''
 +
*[http://robjhyndman.com/tsdldata/data/ausgundeaths.dat Gun-related Deaths in Australia]
 +
**"Deaths from gun-related homicides and suicides and non-gun-related homicides and suicides. Australia: 1915-2004.
 +
Source: Neill and Leigh (2007)."
  
 
'''Clustering'''
 
'''Clustering'''

Revision as of 23:33, 14 March 2011

This page describes in detail the datasets used for the NBML Course.

Classification

  • MNIST Handwritten Digits
    • Classify handwritten digits using this dataset, a very popular one with lots of training examples.
  • Heart Disease
    • Predict whether a person will have heart disease based on a subset of 76 factors.
  • Census Income
    • Try to predict whether a person has an income greater than or less than 50k

Regression

Time Series

Source: Neill and Leigh (2007)."

Clustering

Personal tools