How PayPal Makes Merchants Smarter through Data Mining zite.to/1kdtCDq
RapidMiner 6 adds application wizards, better visualization, ease of use zite.to/19MbgkQ
Some Data Mining Tutorials for the Retail Industry zite.to/13NzH5k
Personal Data Mining zite.to/1353Zyq
CLiPS (Computational Linguistics & Psycholinguistics) has released a new Python module for web mining called Pattern.
Quoting from the CLiPS site:
It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics) and data visualization (graph networks).
Visit their site to get the download.
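To make the "tf-idf + cosine similarity" part of that description a little more concrete, here is a minimal sketch in plain Python. It does not use the module's own API; the function names (tokenize, tf_idf_vectors, cosine_similarity) and the sample sentences are made up for illustration, and the weighting is one common textbook variant (term frequency times log(N / document frequency)), not necessarily the one the module implements.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical helper: lowercase the text and keep alphabetic runs only.
    return re.findall(r"[a-z]+", text.lower())


def tf_idf_vectors(docs):
    # Build one sparse tf-idf vector (a dict of term -> weight) per document.
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term occur?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid dividing by zero on empty documents
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    # Cosine of the angle between two sparse vectors; 0.0 if either is all zeros.
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Made-up sample documents, purely for illustration.
    docs = [
        "data mining finds patterns in data",
        "text mining finds patterns in text",
        "sailing boats race across the bay",
    ]
    vecs = tf_idf_vectors(docs)
    print(cosine_similarity(vecs[0], vecs[1]))
    print(cosine_similarity(vecs[0], vecs[2]))
```

The first two sample sentences share several terms, so they should score higher than the first and third, which share none. Terms that appear in every document get a weight of zero under this variant, which is why common words drop out of the comparison.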
Check out Mallet, a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
Kaggle has hosted several data mining competitions similar to the Netflix Prize, but it recently announced a big new one. It’s called the Heritage Health Prize, and the prize has been set at $3M. The goal of the contest is to predict when a person will need to go to the hospital before they actually make a visit. Here’s some more info from O’Reilly Radar. And here is Anthony Goldbloom of Kaggle announcing the contest at the Strata Conference…
Check out the 2010 INFORMS Data Mining Contest. Participants are challenged to predict stock prices at five-minute intervals. Visit the site to download the training data set. The submission deadline is October 10th, 2010.
Part of BMW Oracle’s upper hand in the most recent America’s Cup may have come from its use of data mining. The boat and all its sensors can generate 2,500 data points 10 times per second. Check out this article from the Oracle Data Mining and Analytics blog to read the rest.
This Boing Boing article questions whether the US military may be gathering data from unsuspecting teens and using it for data mining exercises to improve recruiting.