A collection of small datasets (flowingdata.com)
6 points by cdr6934 3450 days ago | 2 comments


2 points by izyda 3448 days ago | link

Seems pretty cool - thanks for sharing!

One disadvantage of this, though, compared to, say, the default data sets provided in R (like Iris) or the data sets in the UCI Machine Learning Repository (http://archive.ics.uci.edu/ml/), is that there are no widely accepted benchmarks for these data sets. So if you are creating a new ML algorithm or just practicing the application of some method, it's hard to compare your performance against others'.

Nonetheless, there are use cases where this could be very valuable; thanks again for sharing!

-----

1 point by cdr6934 3450 days ago | link

As a data mangler, it's always good to have a varied array of data to work on.

-----



