Class HRegionPartitioner<KEY,VALUE>
java.lang.Object
org.apache.hadoop.mapreduce.Partitioner<ImmutableBytesWritable,VALUE>
org.apache.hadoop.hbase.mapreduce.HRegionPartitioner<KEY,VALUE>
- Type Parameters:
KEY - The type of the key.
VALUE - The type of the value.
- All Implemented Interfaces:
org.apache.hadoop.conf.Configurable
@Public
public class HRegionPartitioner<KEY,VALUE>
extends org.apache.hadoop.mapreduce.Partitioner<ImmutableBytesWritable,VALUE>
implements org.apache.hadoop.conf.Configurable
Partitions the output keys into groups of keys. Keys are grouped according to the regions that
currently exist, so that each reducer fills a single region and the load is distributed.
This class is not suitable as a partitioner when creating HFiles for incremental bulk loads,
because the region spread will likely change between the time of HFile creation and load time. See
BulkLoadHFiles and
Bulk Load.
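The bulk-load caveat above can be illustrated with a small, JDK-only sketch. It is not the HBase implementation: `RegionGroupingSketch`, `layout`, and `regionOf` are hypothetical names, and `String` row keys stand in for HBase's byte-array keys. The sketch assumes the first region has an empty start key. It shows that the same row key falls into a different group once the regions have split, which is why a grouping computed at file-creation time may no longer match the table at load time.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

final class RegionGroupingSketch {
    /** Builds a start-key to region-index map (hypothetical helper, not an HBase API). */
    static NavigableMap<String, Integer> layout(String... startKeys) {
        String[] sorted = startKeys.clone();
        Arrays.sort(sorted);
        NavigableMap<String, Integer> regions = new TreeMap<>();
        for (int i = 0; i < sorted.length; i++) {
            regions.put(sorted[i], i); // regions are numbered in start-key order
        }
        return regions;
    }

    /** Returns the index of the region whose start key is the greatest one <= rowKey. */
    static int regionOf(NavigableMap<String, Integer> regions, String rowKey) {
        Map.Entry<String, Integer> entry = regions.floorEntry(rowKey);
        // Assumption: keys that sort before every start key go to the first region.
        return entry == null ? 0 : entry.getValue();
    }

    public static void main(String[] args) {
        NavigableMap<String, Integer> beforeSplit = layout("", "m");
        NavigableMap<String, Integer> afterSplit = layout("", "g", "m", "s");
        System.out.println("before split: " + regionOf(beforeSplit, "t"));
        System.out.println("after split:  " + regionOf(afterSplit, "t"));
    }
}
```

In this sketch the grouping is only as fresh as the start keys it was built from, which mirrors the reason the class is described as unsuitable for incremental bulk loads.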
-
Field Summary
Fields (modifier and type, then field name)
private org.apache.hadoop.conf.Configuration conf
private Connection connection
private RegionLocator locator
private static final org.slf4j.Logger LOG
private byte[][] startKeys
-
Constructor Summary
Constructors
HRegionPartitioner()
-
Method Summary
Methods (modifier and type, then method, then description)
org.apache.hadoop.conf.Configuration getConf()
    Returns the current configuration.
int getPartition(ImmutableBytesWritable key, VALUE value, int numPartitions)
    Gets the partition number for a given key (hence record) given the total number of partitions, i.e. the number of reduce tasks for the job.
void setConf(org.apache.hadoop.conf.Configuration configuration)
    Sets the configuration.
-
Field Details
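-
LOG
private static final org.slf4j.Logger LOG
-
conf
private org.apache.hadoop.conf.Configuration conf
-
connection
private Connection connection
-
locator
private RegionLocator locator
-
startKeys
private byte[][] startKeys
-
-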
Constructor Details
-
HRegionPartitioner
public HRegionPartitioner()
-
-
Method Details
-
getPartition
Gets the partition number for a given key (hence record) given the total number of partitions, i.e. the number of reduce tasks for the job. Typically a hash function on all or a subset of the key.
- Specified by:
getPartition in class org.apache.hadoop.mapreduce.Partitioner<ImmutableBytesWritable,VALUE>
- Parameters:
key - The key to be partitioned.
value - The entry value.
numPartitions - The total number of partitions.
- Returns:
The partition number for the key.
- See Also:
Partitioner.getPartition(java.lang.Object, java.lang.Object, int)
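As a rough illustration of the contract above, the following JDK-only sketch maps a row key to a partition number using sorted region start keys. It is a sketch under stated assumptions, not the class's actual code: `StartKeyPartitionSketch`, `regionIndex`, `partitionFor`, and `bytes` are hypothetical names, the region lookup is replaced by a linear scan over the start keys, and folding surplus regions with a modulo when there are fewer partitions than regions is an assumption of this example.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

final class StartKeyPartitionSketch {
    /** Index of the last start key that is <= rowKey in unsigned byte order. */
    static int regionIndex(byte[][] sortedStartKeys, byte[] rowKey) {
        int index = 0;
        for (int i = 0; i < sortedStartKeys.length; i++) {
            if (Arrays.compareUnsigned(sortedStartKeys[i], rowKey) > 0) {
                break; // every later start key is larger as well
            }
            index = i;
        }
        return index;
    }

    /** Maps a row key to a partition number in the range [0, numPartitions). */
    static int partitionFor(byte[][] sortedStartKeys, byte[] rowKey, int numPartitions) {
        int region = regionIndex(sortedStartKeys, rowKey);
        // Assumption of this sketch: with fewer partitions than regions,
        // surplus regions are folded back into range with a modulo.
        return region % numPartitions;
    }

    /** Encodes a string as UTF-8 bytes (illustrative helper). */
    static byte[] bytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Three regions; the first region has an empty start key.
        byte[][] startKeys = { bytes(""), bytes("g"), bytes("p") };
        for (String row : new String[] { "apple", "grape", "zebra" }) {
            System.out.println(row + " -> " + partitionFor(startKeys, bytes(row), 3));
        }
    }
}
```

With as many partitions as regions, each partition number corresponds to exactly one region, which matches the goal of each reducer filling a single region.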
-
getConf
Returns the current configuration.
- Specified by:
getConf in interface org.apache.hadoop.conf.Configurable
- Returns:
The current configuration.
- See Also:
Configurable.getConf()
-
setConf
Sets the configuration. This is used to determine the start keys for the given table.
- Specified by:
setConf in interface org.apache.hadoop.conf.Configurable
- Parameters:
configuration - The configuration to set.
- See Also:
Configurable.setConf(org.apache.hadoop.conf.Configuration)
-