tl;dr you can download the 376 GB compressed (1.6 TB uncompressed) crawl of 115 million websites with this torrent or you can see a 285 MB sample file here.

http://blog.meanpath.com/meanpath-january-2014-index/