DataTaunew | comments | leaders | submitlogin
1 point by aqokaq 3130 days ago | link | parent

Nice! All combinations from 1 to 320 words from a corpus of ~350k English language words. Fun use of Python's itertools, a a cool logo to boot!

Here's a quick game: choose a random page. See how many words you can define or have heard of. Is the global mean > 33%? How does it correlate to socio-economic markers such as education level?

And another game: choose a random page. Find the longest n-gram that makes contextual sense. For example: "ventilated achondritic unenthroned turkess" ;)




RSS | Announcements