Common Crawl - Open Repository of Web Crawl Data

Interesting public dataset.