Class | Description |
---|---|
WARCFileReader |
Reads
WARCRecord s from a WARC file, using Hadoop's filesystem APIs. |
WARCFileWriter |
Writes
WARCRecord s to a WARC file, using Hadoop's filesystem APIs. |
WARCInputFormat |
Hadoop InputFormat for mapreduce jobs ('new' API) that want to process data in WARC files.
|
WARCInputFormat.WARCReader | |
WARCOutputFormat |
Hadoop OutputFormat for mapreduce jobs ('new' API) that want to write data to WARC files.
|
WARCRecord |
Immutable implementation of a record in a WARC file.
|
WARCRecord.Header |
Contains the parsed headers of a
WARCRecord . |
WARCWritable |
A mutable wrapper around a
WARCRecord implementing the Hadoop Writable interface. |
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.