Do the best you can until you know better. Then when you know better, do better. – Maya Angelou

Mr. Walker's Classroom Blog

Wayback Machine: Now with 240,000,000,000 URLs

Wayback Machine: Now with 240,000,000,000 URLs

Today we updated the Wayback Machine with much more data and some code improvements.  Now we cover from late 1996 to December 9, 2012 so you can surf the web as it was up until a month ago.  Also, we have gone from having 150,000,000,000 URLs to having 240,000,000,000 URLs, a total of about 5 petabytes of data.   (Want a humorous description of a petabyte?  start at 28:55)  This database is queried over 1,000 times a second by over 500,000 people a day helping make archive.org the 250th most popular website.

live 2012 election coverageOver the past year we archived tons of pages about the United States 2012 presidential election.  You can revisit theNew York Times live coverage page from election day, the campaign sites of Republican hopefuls like Newt Gingrich andRon Paul, and mini-scandals like Romney’s car elevator or using aspirin as contraceptives.  The Wayback record of the 2008 election was recently used by the Sunlight Foundation to contrast how Obama’s team dealt with disclosing inauguration donors then vs. now, so hopefully the 2012 election content will prove just as useful in the future.

city of heroes siteThe prolific volunteers of Archive Teamspent a lot of time this year archiving web sites on the verge of disappearing and then contributing those records to Internet Archive.  City of Heroes (including theboards with years of posts), Fortune Cityand Splinder were all saved from the proverbial wood chipper.

The updated version does have at least one known issue – there is a small amount of older content missing from the index, and it will take us another month or two to sort out that problem.  In the mean time, you can still visit the previous version of the Wayback with that content.


Comments

Leave a Reply