Class TsvImporterTextMapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,ImmutableBytesWritable,org.apache.hadoop.io.Text>
org.apache.hadoop.hbase.mapreduce.TsvImporterTextMapper
@Public
public class TsvImporterTextMapper
extends org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,ImmutableBytesWritable,org.apache.hadoop.io.Text>
Write table content out to map output files.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
-
Field Summary
Modifier and Type    Field    Description
private org.apache.hadoop.mapreduce.Counter    badLineCount
private static final org.slf4j.Logger    LOG
private boolean    logBadLines
private ImportTsv.TsvParser    parser
private String    separator    Column separator
private boolean    skipBadLines    Should skip bad lines
-
Constructor Summary
-
Method Summary
Modifier and Type    Method    Description
protected void    doSetup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>.Context context)    Handles common parameter initialization that a subclass might want to leverage.
org.apache.hadoop.mapreduce.Counter    getBadLineCount()
boolean    getSkipBadLines()
void    incrementBadLineCount(int count)
void    map(org.apache.hadoop.io.LongWritable offset, org.apache.hadoop.io.Text value, org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>.Context context)    Convert a line of TSV text into an HBase table row.
protected void    setup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>.Context context)    Handles initializing this class with objects specific to it (i.e., the parser).
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
cleanup, run
-
Field Details
-
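LOG
private static final org.slf4j.Logger LOG
-
separator
private String separator
Column separator
-
skipBadLines
private boolean skipBadLines
Should skip bad lines
-
badLineCount
private org.apache.hadoop.mapreduce.Counter badLineCount
-
logBadLines
private boolean logBadLines
-
parser
private ImportTsv.TsvParser parser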
-
-
Constructor Details
-
TsvImporterTextMapper
public TsvImporterTextMapper()
-
-
Method Details
-
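getSkipBadLines
public boolean getSkipBadLines()
-
getBadLineCount
public org.apache.hadoop.mapreduce.Counter getBadLineCount()
-
incrementBadLineCount
public void incrementBadLineCount(int count)
-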
setup
protected void setup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>.Context context)
Handles initializing this class with objects specific to it (i.e., the parser). Common initialization that might be leveraged by a subclass is done in doSetup. Hence a subclass may choose to override this method and call doSetup as well before handling its own custom params.
Overrides:
setup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>
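The override pattern described above can be sketched in plain Java. This is a minimal, self-contained illustration and not HBase code: SketchMapper, CustomSketchMapper, SetupSketchDemo, the Map<String, String> standing in for the Hadoop job configuration, and the custom.prefix key are all hypothetical. The importtsv.* key names and the Base64-encoded separator are assumptions about what the common initialization reads; verify them against the ImportTsv constants in your HBase version.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the mapper; a Map plays the role of the job configuration.
class SketchMapper {
    String separator;
    boolean skipBadLines;
    boolean logBadLines;
    boolean parserReady;

    // Class-specific initialization: run the common part, then build the parser.
    protected void setup(Map<String, String> conf) {
        doSetup(conf);
        parserReady = true; // stands in for constructing the TSV parser
    }

    // Common initialization that subclasses can reuse.
    protected void doSetup(Map<String, String> conf) {
        String encoded = conf.get("importtsv.separator"); // assumed key name
        separator = (encoded == null)
            ? "\t"
            : new String(Base64.getDecoder().decode(encoded), StandardCharsets.UTF_8);
        // Assumed key names and defaults for the two bad-line flags.
        skipBadLines = Boolean.parseBoolean(conf.getOrDefault("importtsv.skip.bad.lines", "true"));
        logBadLines = Boolean.parseBoolean(conf.getOrDefault("importtsv.log.bad.lines", "false"));
    }
}

// A subclass that overrides setup, reuses doSetup, then reads its own parameter.
class CustomSketchMapper extends SketchMapper {
    String prefix;

    @Override
    protected void setup(Map<String, String> conf) {
        doSetup(conf);                                   // common parameters first
        prefix = conf.getOrDefault("custom.prefix", ""); // hypothetical custom parameter
    }
}

class SetupSketchDemo {
    // Builds a configuration with a Base64-encoded comma separator and runs setup.
    static CustomSketchMapper configure() {
        Map<String, String> conf = new HashMap<>();
        conf.put("importtsv.separator",
            Base64.getEncoder().encodeToString(",".getBytes(StandardCharsets.UTF_8)));
        conf.put("custom.prefix", "row-");
        CustomSketchMapper mapper = new CustomSketchMapper();
        mapper.setup(conf);
        return mapper;
    }

    public static void main(String[] args) {
        CustomSketchMapper m = configure();
        System.out.println(m.separator + " " + m.skipBadLines + " " + m.prefix);
    }
}
```

In this sketch the subclass replaces setup entirely, so the parent's parser construction does not run; a subclass written this way is responsible for whatever parsing it needs.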
-
doSetup
protected void doSetup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>.Context context)
Handles common parameter initialization that a subclass might want to leverage.
-
map
public void map(org.apache.hadoop.io.LongWritable offset, org.apache.hadoop.io.Text value, org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>.Context context) throws IOException
Convert a line of TSV text into an HBase table row.
Overrides:
map in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, org.apache.hadoop.io.Text>
Throws:
IOException
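As a rough illustration of the per-line work, the sketch below locates the row key in a delimited line and emits the key together with the untouched line, either counting or rejecting malformed lines depending on a skip flag. It is plain Java with hypothetical names (LineMapperSketch, mapLine, rowKeyIndex, runDemo, strictModeRejects), it uses a list in place of the Hadoop Context, and it treats a line with too few columns or an empty key as bad, which is an assumption rather than the parser's documented rule.

```java
import java.io.IOException;
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical stand-in for the mapper; `output` plays the role of Context.write.
class LineMapperSketch {
    final String separator;
    final int rowKeyIndex;
    final boolean skipBadLines;
    long badLineCount = 0;
    final List<Map.Entry<String, String>> output = new ArrayList<>();

    LineMapperSketch(String separator, int rowKeyIndex, boolean skipBadLines) {
        this.separator = separator;
        this.rowKeyIndex = rowKeyIndex;
        this.skipBadLines = skipBadLines;
    }

    // Emits (row key, original line), or handles the line as bad.
    void mapLine(long offset, String line) throws IOException {
        // Quote the separator so it is matched literally; -1 keeps trailing empty columns.
        String[] columns = line.split(Pattern.quote(separator), -1);
        boolean bad = rowKeyIndex >= columns.length || columns[rowKeyIndex].isEmpty();
        if (bad) {
            if (skipBadLines) {
                badLineCount++; // count the line and move on
                return;
            }
            throw new IOException("Bad line at offset " + offset + ": " + line);
        }
        output.add(new SimpleEntry<>(columns[rowKeyIndex], line));
    }

    // Maps one good line and one line with an empty row key, skipping bad lines.
    static LineMapperSketch runDemo() {
        LineMapperSketch mapper = new LineMapperSketch("\t", 0, true);
        try {
            mapper.mapLine(0L, "r1\ta\tb");
            mapper.mapLine(7L, "\ta\tb");
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
        return mapper;
    }

    // Returns true if a mapper that does not skip bad lines throws for this line.
    static boolean strictModeRejects(String line) {
        try {
            new LineMapperSketch("\t", 0, false).mapLine(0L, line);
            return false;
        } catch (IOException e) {
            return true;
        }
    }
}
```

Keeping the whole line as the emitted value leaves column-level parsing to a later stage, which is one plausible reason for a mapper whose output value type is Text.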
-