google competition

6 February 2002

google programming contest:

google is providing a selection of about 900,000 web pages … your mission is to write a program that does something interesting with the data, in such a way that it would scale to a web-sized collection of documents. part of your job is to convince us of why your program is interesting and why it will scale; other than that, you’re free to implement whatever strikes your fancy. there must be a way to figure out the ultimate googlewhack here, that is, the two most common words that appear only once on any web page.