Machine Learning/Datasets

From Noisebridge
Line 30:
*[http://robjhyndman.com/tsdldata/annual/globtp.dat Changes in Global Air Temperature 1880-1985]
**"Surface air temperature change for the globe, 1880-1985, Temperature change actually means temperature against an arbitrary zero point. From James Hansen and Sergej Lebedeff, "Global Trends of Measured Surface Air Temperature", `Journal of Geophysical Research`, Vol. 92, No. D11, pages 13,345-13,372, November 20, 1987."
+ *[http://robjhyndman.com/tsdldata/data/earthq.dat Number of Earthquakes per Year 1900-1988 (>= 7.0)]
+ **"Source: National Earthquake Information Center. Different lists will give different numbers depending on the formula used for calculating the magnitude."

===Clustering===

Revision as of 00:01, 15 March 2011

Machine learning is a vast field with many different types of problems to solve. If you find an interesting dataset, add it to the links below under the category that fits it best, or add a new category if none does.

Classification

  • MNIST Handwritten Digits
    • Classify handwritten digits (0-9) using this very popular dataset of 60,000 training and 10,000 test images.
  • Heart Disease
    • Predict whether a patient has heart disease from a subset of the 76 recorded attributes.
  • Census Income
    • Predict whether a person's annual income is above or below $50K.
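
Before training a model on any of the classification datasets above, it can help to know how well the trivial strategy of always guessing the most common class would do. The following is a minimal sketch of that baseline, assuming the labels have already been loaded into a Python list. The function name `majority_baseline` and the label strings are hypothetical and are not taken from any of the linked datasets.

```python
from collections import Counter


def majority_baseline(labels):
    """Return the most common label and the accuracy of always predicting it."""
    if not labels:
        raise ValueError("labels must not be empty")
    counts = Counter(labels)
    # most_common(1) yields a one-element list of (label, count) pairs.
    label, count = counts.most_common(1)[0]
    return label, count / len(labels)


# Made-up labels for illustration only; real data would come from a dataset file.
example_labels = [">50K", "<=50K", "<=50K", "<=50K"]
best_label, baseline_accuracy = majority_baseline(example_labels)
print(best_label, baseline_accuracy)
```

A classifier is only interesting if it beats this number, which matters most when the classes are imbalanced.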

Regression

Time Series
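
Annual series such as the temperature and earthquake files listed in the revision above are distributed as plain `.dat` text files. The sketch below shows one way to read such a file once it has been downloaded, under the assumption (not verified against the actual files) that values are whitespace-separated numbers and that any line containing a non-numeric token is a header or description. The helper names `parse_series` and `first_differences` and the sample text are hypothetical.

```python
def parse_series(text):
    """Collect numbers from every line that consists only of numeric tokens."""
    values = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        try:
            row = [float(token) for token in tokens]
        except ValueError:
            # Assumed to be a header or description line, so it is skipped.
            continue
        values.extend(row)
    return values


def first_differences(values):
    """Return the year-over-year change between consecutive observations."""
    return [later - earlier for earlier, later in zip(values, values[1:])]


# Made-up sample text for illustration; it is not the content of a real file.
sample_text = """Hypothetical description line
13 14 8
10 16
"""
series = parse_series(sample_text)
changes = first_differences(series)
print(series)
print(changes)
```

Differencing is a common first step for series with a trend, since many forecasting methods assume the values fluctuate around a stable level.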


Clustering
