@InterfaceAudience.Public public class TableInputFormat extends TableInputFormatBase implements org.apache.hadoop.mapred.JobConfigurable
| Modifier and Type | Field and Description |
|---|---|
| static String | COLUMN_LIST: space delimited list of columns |
| Constructor and Description |
|---|
| TableInputFormat() |
| Modifier and Type | Method and Description |
|---|---|
| void | configure(org.apache.hadoop.mapred.JobConf job) |
| protected void | initialize(org.apache.hadoop.mapred.JobConf job): Handle subclass specific set up. |
| void | validateInput(org.apache.hadoop.mapred.JobConf job) |
Methods inherited from class TableInputFormatBase: closeTable, getRecordReader, getSplits, getTable, initializeTable, setInputColumns, setRowFilter, setTableRecordReader
public static final String COLUMN_LIST
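Because COLUMN_LIST holds a space delimited list of columns, a job typically has to split that value before the columns can be used. The following is a minimal sketch of that step using only the Java standard library. The class name `ColumnListSketch`, the helper `parseColumns`, and the sample column names are hypothetical, and the conversion to UTF-8 byte arrays is an assumption about how a caller might prepare the columns, not a statement of what this class does internally.

```java
import java.nio.charset.StandardCharsets;

public class ColumnListSketch {

    /** Splits a space delimited column list into one UTF-8 byte array per column. */
    static byte[][] parseColumns(String columnList) {
        String trimmed = columnList.trim();
        if (trimmed.isEmpty()) {
            // No columns configured: return an empty array rather than one empty name.
            return new byte[0][];
        }
        String[] names = trimmed.split("\\s+");
        byte[][] columns = new byte[names.length][];
        for (int i = 0; i < names.length; i++) {
            columns[i] = names[i].getBytes(StandardCharsets.UTF_8);
        }
        return columns;
    }

    public static void main(String[] args) {
        // Hypothetical column names, used only for illustration.
        byte[][] columns = parseColumns("info:name info:email");
        for (byte[] column : columns) {
            System.out.println(new String(column, StandardCharsets.UTF_8));
        }
    }
}
```

Splitting on runs of whitespace rather than a single space makes the sketch tolerant of accidental double spaces in the configured value.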
public TableInputFormat()
public void configure(org.apache.hadoop.mapred.JobConf job)
Specified by: configure in interface org.apache.hadoop.mapred.JobConfigurable
protected void initialize(org.apache.hadoop.mapred.JobConf job) throws IOException
Description copied from class: TableInputFormatBase
Handle subclass specific set up. TableInputFormatBase.getRecordReader(InputSplit, JobConf, Reporter) and TableInputFormatBase.getSplits(JobConf, int)
each call TableInputFormatBase.initialize(JobConf), which serves as a convenient centralized location for
retrieving the necessary configuration information and calling
TableInputFormatBase.initializeTable(Connection, TableName).
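As a rough illustration of this call pattern, the sketch below models an initialize override that reads its configuration, opens a single connection, and tolerates being called from more than one entry point. It deliberately uses stand-in types so that it runs with the Java standard library alone: `FakeConnection`, `BaseFormat`, `MyFormat`, and the key `example.table.name` are all hypothetical and are not HBase or Hadoop classes or settings. The subclass keeps its own flag instead of depending on how the base class decides whether initialization is needed.

```java
import java.util.HashMap;
import java.util.Map;

public class IdempotentInitializeSketch {

    /** Stand-in for a connection (not an HBase class); counts how many were opened. */
    static final class FakeConnection {
        static int opened = 0;
        FakeConnection() { opened++; }
    }

    /** Stand-in for the base class: both entry points call initialize first. */
    static class BaseFormat {
        private FakeConnection connection;
        private String tableName;

        protected void initializeTable(FakeConnection connection, String tableName) {
            this.connection = connection;
            this.tableName = tableName;
        }

        protected void initialize(Map<String, String> conf) {
            // Subclasses override this to perform their set up.
        }

        public String getSplits(Map<String, String> conf) {
            initialize(conf);
            return "splits for " + tableName;
        }

        public String getRecordReader(Map<String, String> conf) {
            initialize(conf);
            return "reader for " + tableName;
        }
    }

    /** Subclass whose initialize is safe to call multiple times. */
    static class MyFormat extends BaseFormat {
        private boolean initialized = false;

        @Override
        protected void initialize(Map<String, String> conf) {
            if (initialized) {
                return; // Already set up: do not open a second connection.
            }
            // Hypothetical configuration key, used only in this sketch.
            String table = conf.get("example.table.name");
            initializeTable(new FakeConnection(), table);
            initialized = true;
        }
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put("example.table.name", "users");

        MyFormat format = new MyFormat();
        System.out.println(format.getSplits(conf));
        System.out.println(format.getRecordReader(conf));
        System.out.println("connections opened: " + FakeConnection.opened);
    }
}
```

The flag is set only after initializeTable returns, so a failed attempt in this sketch can be retried on the next call.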
Subclasses should implement their initialize call such that it is safe to call multiple times.
The current TableInputFormatBase implementation relies on a non-null table reference to decide
if an initialize call is needed, but this behavior may change in the future. In particular, it
is critical that initializeTable not be called multiple times since this will leak Connection
instances.
Overrides: initialize in class TableInputFormatBase
Throws: IOException
public void validateInput(org.apache.hadoop.mapred.JobConf job) throws IOException
Throws: IOException
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.