@InterfaceAudience.Public public class CellCounter extends org.apache.hadoop.conf.Configured implements org.apache.hadoop.util.Tool
The counter lists the following stats for a given table:

1. Total number of rows in the table
2. Total number of column families (CFs) across all rows
3. Total qualifiers across all rows
4. Total occurrence of each CF
5. Total occurrence of each qualifier
6. Total number of versions of each qualifier
7. Total size of serialized cells of each CF
8. Total size of serialized cells of each qualifier
9. Total size of serialized cells across all rows

The cellcounter accepts optional parameters: a user-supplied row/family/qualifier string to use in the report; a regex-based or prefix-based row filter that restricts the count to a subset of the table's rows; and a start time and/or end time that limits the count to a time range.
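As a rough illustration of how the optional parameters above might be passed, the sketch below assembles an argument array for the tool. The positional order (table name, output directory, report string, row filter) and the `--starttime=` / `--endtime=` flag spellings are assumptions taken from the tool's usage message as commonly documented, so check them against your HBase version. `CellCounterArgs`, the table name, the output path, and the timestamp are hypothetical and not part of HBase.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not part of HBase) that assembles CellCounter arguments. */
final class CellCounterArgs {

    private CellCounterArgs() {
    }

    /**
     * Builds the argument array in the assumed order:
     * tableName, outputDir, reportSeparator, rowFilter, then time-range flags.
     * A rowFilter starting with '^' is assumed to be read as a regex, anything
     * else as a row prefix. Pass null for startTime or endTime to omit that flag.
     */
    static String[] build(String tableName, String outputDir,
                          String reportSeparator, String rowFilter,
                          Long startTime, Long endTime) {
        List<String> args = new ArrayList<>();
        args.add(tableName);
        args.add(outputDir);
        args.add(reportSeparator);
        args.add(rowFilter);
        if (startTime != null) {
            args.add("--starttime=" + startTime); // assumed flag spelling
        }
        if (endTime != null) {
            args.add("--endtime=" + endTime); // assumed flag spelling
        }
        return args.toArray(new String[0]);
    }

    public static void main(String[] ignored) {
        // Hypothetical values: table "t1", regex row filter, start time only.
        String[] args = build("t1", "/tmp/cellcounter-out", "|", "^row-1.*",
                1577836800000L, null);
        System.out.println(String.join(" ", args));
        // With HBase and Hadoop on the classpath, the array would typically be
        // handed to the tool through Hadoop's ToolRunner, for example:
        // int exitCode = ToolRunner.run(HBaseConfiguration.create(), new CellCounter(), args);
    }
}
```

The sketch requires the report string and row filter whenever it is used, so that the positional arguments stay unambiguous. Whether the time-range flags can be given without them is not covered here.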
Constructor and Description |
---|
CellCounter() |
Modifier and Type | Method and Description |
---|---|
static org.apache.hadoop.mapreduce.Job | createSubmittableJob(org.apache.hadoop.conf.Configuration conf, String[] args) Sets up the actual job. |
static void | main(String[] args) Main entry point. |
int | run(String[] args) |
public CellCounter()
public static org.apache.hadoop.mapreduce.Job createSubmittableJob(org.apache.hadoop.conf.Configuration conf, String[] args) throws IOException

Parameters:
conf - The current configuration.
args - The command line parameters.

Throws:
IOException - When setting up the job fails.

public int run(String[] args) throws Exception

Specified by:
run in interface org.apache.hadoop.util.Tool

Throws:
Exception
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.