DataTaunew | comments | leaders | submitlogin
What to do with “small” data? (medium.com)
13 points by D33B 3122 days ago | 2 comments


2 points by marksimi 3121 days ago | link

One of my favorite posts on small data: http://sumsar.net/blog/2014/10/tiny-data-and-the-socks-of-ka...

-----

2 points by debrouwere 3120 days ago | link

Cool from an educational point of view, but also severely overkill: you can get the posterior distribution of how many socks someone has, given that they just ended up with 11 out of 11 unmatched socks, with SciPy and a little loop:

  posterior = [hypergeom.pmf(11, M=n_pairs*2, n=n_pairs, N=11) * poisson.pmf(n_pairs, mu=20) for n_pairs in range(50)]
  
You don't need Metropolis or Approximate Bayesian Computation because there's maybe only 50 plausible answers, meaning you can just run through them all.

-----




RSS | Announcements