| Class | Description |
|---|---|
| WARCFileReader |
Reads
WARCRecords from a WARC file, using Hadoop's filesystem APIs. |
| WARCFileWriter |
Writes
WARCRecords to a WARC file, using Hadoop's filesystem APIs. |
| WARCInputFormat |
Hadoop InputFormat for mapreduce jobs ('new' API) that want to process data in WARC files.
|
| WARCInputFormat.WARCReader | |
| WARCOutputFormat |
Hadoop OutputFormat for mapreduce jobs ('new' API) that want to write data to WARC files.
|
| WARCRecord |
Immutable implementation of a record in a WARC file.
|
| WARCRecord.Header |
Contains the parsed headers of a
WARCRecord. |
| WARCWritable |
A mutable wrapper around a
WARCRecord implementing the Hadoop Writable interface. |
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.