| cc_warc | Provides WARC paths for commoncrawl.org |
| rcpp_read_warc_sample | Loads the sample warc file in Rcpp |
| sparkwarc | sparkwarc |
| spark_rcpp_read_warc | Reads a WARC File into using Rcpp |
| spark_read_warc | Reads a WARC File into Apache Spark |
| spark_read_warc_sample | Loads the sample warc file in Spark |
| spark_warc_sample_path | Retrieves sample warc path |