Machine Learning/Datasets

From Noisebridge
Revision as of 00:07, 15 March 2011 by Mschachter (talk | contribs)
Jump to navigation Jump to search

Machine learning is a vast field and there are many different types of problems to be solved. If you find a dataset interesting, try to categorize it (or add a new category) and add it to the links below.

Classification

  • MNIST Handwritten Digits
    • Classify handwritten digits using this dataset, a very popular one with lots of training examples.
  • Heart Disease
    • Predict whether a person will have heart disease based on a subset of 76 factors.
  • Census Income
    • Try to predict whether a person has an income greater than or less than 50k

Regression

Time Series


Clustering


Text Data

  • Enron Emails
    • Search through Enron's publicly accessible emails.