Machine Learning Meetup Notes: 2010-05-26

From Noisebridge
Revision as of 22:09, 26 May 2010 by ThomasLotze (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.
  • Andy gave overview of where we're at with KDD data
  • Mike S gave presentation:
    • Gaussian Mixture Models
    • k-means clustering
    • very basic expectation-maximization
  • Brainstorming session on how to reduce skill set column
    • Tom tried to quantify opportunity per skills per row as high dimensional vector
  • Brainstorming on how to reduce other data and compute new features for the KDD Dataset
    • Tom will apply k-means clustering of skills (or steps), for data reduction
    • Andy will compute new features: unique step/problem id, student IQ (avg. correct), step challenge/difficulty (avg correct), step complexity (# skills required)
    • Mike will use self-organizing maps to reduce skills
    • Paul will visualize/summarize the data, to provide understanding and insight
    • Mike will set up an FTP server for people to transfer their enormous datasets
    • Theo will use some Weka classifiers to produce a classification method for the data