Datasets
Here are some of the datasets that have been used in our presentations and hack sessions. They are recommended for study and for use in demonstrations.
| Source: Description | Link (Contributor) |
|---|---|
| UC Irvine Machine Learning datasets: shellfish size vs. weight, income vs. demographics, etc., etc. | Search (Hannes) |
| Stack Overflow: Q&A about Open Data | Subscribe or Search (John) |
| Criteo: Anonymized web click logs similar to prior Kaggle competition | 23 files, ~1 TB (Hobs) |
| Yahoo: Anonymized web click logs (request access with e-mail from .edu TLD) | Register with your .edu e-mail address (Hobs) |
