by agencies on 7/5/21, 2:02 AM with 0 comments
Common crawl is bigger but I've also read still is not good enough to use as input for good web search.
Did the trec web track produce anything useful?
[1] http://www-personal.umich.edu/~kevynct/trec-web-2014/