HadoopConcatGz · A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz
web2warc · web2warc
archivespark · archivespark
Advertisement
Top Dependency Usages