Tag Archives: Ngram


Search me: what Mad Men and brave moles can do for historical records

Posted on by Ville Miettinen

… is not something that people said in the 1960s ).

This study was made possible with Google’s Ngram viewer. As we discussed about a year ago, Ngram allows you to chart how many times a different combination of words or letters has appeared in Google Books’ huge corpus of 5 million texts published between 1500 and 2008.

Along with the many fascinating results, the great thing about this project is that it has managed to get the media interested in Ngram again. My own Ngram-like …

Tags: crowdsourcing Don Draper Downton Abbey google History Mad Men microtask National Library of Finland Ngram Optical character recognition

, , , , , , , , , | Leave a comment

Summer Blockbuster, in cinemas now: The Document Processing Knight Rises

Posted on by Tommaso De Benetti

As regular readers of this blog know, there is nothing we like more than discussing strange and new types of crowdsourcing. From weird music-related experiments to the incidence of expressions such as “I need to” during the Mad Men era , we try to keep you informed with what is going on across our industry.

Every now and then, however, we use this forum to talk …

Tags: crowdsourcing Don Draper Downton Abbey google History Mad Men microtask National Library of Finland Ngram Optical character recognition

, , , , , , , , , | Leave a comment

Google Ngrams: in the beginning was the word search

Posted on by Ville Miettinen

… turn this literary data-mountain into something “useful and accessible”. The result is Google Ngram viewer : a tool that searches and graphs the frequency of words contained in over 5 million books. Basically you type in a word and get back a pretty-yet-educational chart of say religion vs science or drinking habits through the ages. As Aiden and Michel enthusiastically demonstrated in a recent TED talk, it’s surprisingly addictive.

So, where’s the crowdsourcing angle in all …

, , , , , , , , , , | 2 Comments