A collection of processed CommonCrawl data as part of the BigBanyanTree initiative. Each dataset is extracted from a random 1% sample of the data.
-
big-banyan-tree/BBT_CommonCrawl_2018
Viewer • Updated • 61.5M • 528 • 3 -
big-banyan-tree/BBT_CommonCrawl_2019
Viewer • Updated • 55.8M • 145 • 2 -
big-banyan-tree/BBT_CommonCrawl_2020
Viewer • Updated • 46.9M • 67 • 2 -
big-banyan-tree/BBT_CommonCrawl_2021
Viewer • Updated • 48.5M • 2.83k • 2