@InterfaceAudience.Public public class TableInputFormat extends TableInputFormatBase implements org.apache.hadoop.mapred.JobConfigurable
Modifier and Type | Field and Description |
---|---|
static String | COLUMN_LIST: space-delimited list of columns |
private static org.slf4j.Logger | LOG |
Constructor and Description |
---|
TableInputFormat() |
Modifier and Type | Method and Description |
---|---|
void | configure(org.apache.hadoop.mapred.JobConf job) |
protected void | initialize(org.apache.hadoop.mapred.JobConf job): Handle subclass-specific setup. |
void | validateInput(org.apache.hadoop.mapred.JobConf job) |
Methods inherited from class TableInputFormatBase: closeTable, getRecordReader, getSplits, getTable, initializeTable, setInputColumns, setRowFilter, setTableRecordReader
private static final org.slf4j.Logger LOG
public static final String COLUMN_LIST
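Because COLUMN_LIST holds a space-delimited list of columns, a caller has to split that string before the individual columns can be used. The following is a minimal sketch of one way to do so, not the class's actual implementation: `ColumnListSketch`, `parseColumns`, and the `family:qualifier` sample names are hypothetical, and UTF-8 encoding of the names is an assumption.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper; illustrates splitting a space-delimited column list. */
final class ColumnListSketch {

    /** Returns one UTF-8 byte array per column name, skipping empty tokens. */
    static byte[][] parseColumns(String columnList) {
        List<byte[]> columns = new ArrayList<>();
        // Split on runs of whitespace so repeated spaces do not yield empty names.
        for (String name : columnList.trim().split("\\s+")) {
            if (!name.isEmpty()) {
                columns.add(name.getBytes(StandardCharsets.UTF_8));
            }
        }
        return columns.toArray(new byte[0][]);
    }

    public static void main(String[] args) {
        // Sample value only; real column names depend on the table schema.
        byte[][] cols = parseColumns("info:name info:age stats:");
        for (byte[] col : cols) {
            System.out.println(new String(col, StandardCharsets.UTF_8));
        }
    }
}
```

Splitting on whitespace runs rather than on a single space is a design choice of this sketch; it tolerates accidental double spaces in the configured value.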
public TableInputFormat()
public void configure(org.apache.hadoop.mapred.JobConf job)
Specified by: configure in interface org.apache.hadoop.mapred.JobConfigurable
protected void initialize(org.apache.hadoop.mapred.JobConf job) throws IOException
Description copied from class: TableInputFormatBase
Handle subclass-specific setup. TableInputFormatBase.getRecordReader(InputSplit, JobConf, Reporter) and TableInputFormatBase.getSplits(JobConf, int) both call TableInputFormatBase.initialize(JobConf) as a convenient centralized location for retrieving the necessary configuration information and calling TableInputFormatBase.initializeTable(Connection, TableName).
Subclasses should implement initialize so that it is safe to call multiple times. The current TableInputFormatBase implementation relies on a non-null table reference to decide whether an initialize call is needed, but this behavior may change in the future. In particular, it is critical that initializeTable not be called multiple times, since doing so leaks Connection instances.
Overrides: initialize in class TableInputFormatBase
Throws: IOException
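The requirement that initialize be safe to call repeatedly, while initializeTable runs only once, can be modelled without any HBase dependency. The sketch below uses stand-in types throughout: `FakeConnection`, `InputFormatBaseSketch`, and `GuardedInputFormat` are hypothetical names, a plain String stands in for the job configuration, and the exception thrown on a second initializeTable call is a choice of this sketch to make the leak visible, not documented HBase behavior.

```java
import java.util.concurrent.atomic.AtomicInteger;

/** Stand-in for a Connection; counts how many instances were ever opened. */
final class FakeConnection {
    static final AtomicInteger OPENED = new AtomicInteger();

    FakeConnection() {
        OPENED.incrementAndGet();
    }
}

/** Hypothetical model of the base-class contract; NOT the HBase class. */
abstract class InputFormatBaseSketch {
    private FakeConnection connection;
    private String tableName;

    /** Subclass hook, invoked by every entry point. */
    protected abstract void initialize(String configuredTable);

    /** Sketch-only choice: fail loudly instead of silently leaking. */
    protected void initializeTable(FakeConnection conn, String name) {
        if (connection != null) {
            throw new IllegalStateException("initializeTable called twice");
        }
        connection = conn;
        tableName = name;
    }

    // Both entry points funnel through initialize before doing any work.
    int getSplits(String configuredTable) {
        initialize(configuredTable);
        return 1;
    }

    String getRecordReader(String configuredTable) {
        initialize(configuredTable);
        return "reader for " + tableName;
    }
}

/** Subclass whose initialize keeps its own guard, so repeat calls are no-ops. */
final class GuardedInputFormat extends InputFormatBaseSketch {
    private boolean initialized;

    @Override
    protected void initialize(String configuredTable) {
        if (initialized) {
            return; // already set up; do not open another connection
        }
        initializeTable(new FakeConnection(), configuredTable);
        initialized = true;
    }

    public static void main(String[] args) {
        GuardedInputFormat format = new GuardedInputFormat();
        format.getSplits("my_table");
        System.out.println(format.getRecordReader("my_table"));
        System.out.println("connections opened: " + FakeConnection.OPENED.get());
    }
}
```

Keeping the guard in the subclass, rather than relying on the base class's null check, is one way to stay safe if the base behavior changes as the description warns.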
public void validateInput(org.apache.hadoop.mapred.JobConf job) throws IOException
Throws: IOException
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.