Package org.apache.hadoop.hbase.filter
Class Filter
java.lang.Object
org.apache.hadoop.hbase.filter.Filter
- Direct Known Subclasses:
FilterBase, FilterWrapper
Interface for row and column filters directly applied within the regionserver. A filter can expect the following call sequence:
- reset(): reset the filter state before filtering a new row.
- filterAllRemaining(): true means row scan is over; false means keep going.
- filterRowKey(Cell): true means drop this row; false means include.
- filterCell(Cell): decides whether to include or exclude this Cell. See Filter.ReturnCode.
- transformCell(Cell): if the Cell is included, let the filter transform the Cell.
- filterRowCells(List): allows direct modification of the final list to be submitted.
- filterRow(): last chance to drop the entire row based on the sequence of filter calls. E.g.: filter a row if it doesn't contain a specified column.
When implementing your own filters, consider inheriting FilterBase to help you reduce boilerplate.
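The call sequence above can be sketched without any HBase dependency. The block below is a simplified illustration, not the regionserver's actual scan loop: `DemoCell`, `DemoFilter`, `SkipTmpRows` and `CallSequenceDemo` are hypothetical stand-ins that only mirror the method names listed above, and the `ReturnCode` enum carries just two of the real return codes.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for an HBase Cell: row key, column qualifier, value.
final class DemoCell {
    final String row;
    final String qualifier;
    final String value;

    DemoCell(String row, String qualifier, String value) {
        this.row = row;
        this.qualifier = qualifier;
        this.value = value;
    }
}

// Hypothetical stand-in that mirrors the Filter method names with pass-through defaults.
abstract class DemoFilter {
    enum ReturnCode { INCLUDE, SKIP } // only a subset of the real return codes

    void reset() {}
    boolean filterAllRemaining() { return false; }
    boolean filterRowKey(DemoCell firstRowCell) { return false; }
    ReturnCode filterCell(DemoCell c) { return ReturnCode.INCLUDE; }
    DemoCell transformCell(DemoCell c) { return c; }
    void filterRowCells(List<DemoCell> cells) {}
    boolean filterRow() { return false; }
}

// Example filter: drops every row whose key starts with "tmp".
final class SkipTmpRows extends DemoFilter {
    @Override
    boolean filterRowKey(DemoCell firstRowCell) {
        return firstRowCell.row.startsWith("tmp");
    }
}

final class CallSequenceDemo {
    // Applies the filter to each row in the documented order and returns the surviving cells.
    static List<DemoCell> scan(List<List<DemoCell>> rows, DemoFilter f) {
        List<DemoCell> out = new ArrayList<>();
        for (List<DemoCell> row : rows) {
            f.reset();                                   // 1. new row, fresh state
            if (f.filterAllRemaining()) break;           // 2. scan is over
            if (row.isEmpty() || f.filterRowKey(row.get(0))) continue; // 3. drop row by key
            List<DemoCell> kept = new ArrayList<>();
            for (DemoCell c : row) {
                if (f.filterCell(c) == DemoFilter.ReturnCode.INCLUDE) { // 4. per-cell decision
                    kept.add(f.transformCell(c));        // 5. transform included cells
                }
            }
            f.filterRowCells(kept);                      // 6. edit the final list
            if (!f.filterRow()) out.addAll(kept);        // 7. last chance to veto the row
        }
        return out;
    }
}
```

Subclassing `DemoFilter` and overriding a single method, as `SkipTmpRows` does, is the same boilerplate-saving idea the description attributes to FilterBase.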
-
Nested Class Summary
-
Field Summary
-
Constructor Summary
-
Method Summary
| Modifier and Type | Method | Description |
| --- | --- | --- |
| (package private) abstract boolean | areSerializedFieldsEqual(Filter other) | Concrete implementers can signal a failure condition in their code by throwing an IOException. |
| abstract boolean | filterAllRemaining() | If this returns true, the scan will terminate. |
| Filter.ReturnCode | filterCell(Cell c) | A way to filter based on the column family, column qualifier and/or the column value. |
| abstract boolean | filterRow() | Last chance to veto row based on previous filterCell(Cell) calls. |
| abstract void | filterRowCells(List<Cell> kvs) | Chance to alter the list of Cells to be submitted. |
| abstract boolean | filterRowKey(Cell firstRowCell) | Filters a row based on the row key. |
| abstract Cell | getNextCellHint(Cell currentCell) | If the filter returns the match code SEEK_NEXT_USING_HINT, then it should also tell which is the next key it must seek to. |
| abstract boolean | hasFilterRow() | Primarily used to check for conflicts with scans (such as scans that do not read a full row at a time). |
| abstract boolean | isFamilyEssential(byte[] name) | Check that the given column family is essential for the filter to check the row. |
| boolean | isReversed() | |
| static Filter | parseFrom(byte[] pbBytes) | Concrete implementers can signal a failure condition in their code by throwing an IOException. |
| abstract void | reset() | Reset the state of the filter between rows. |
| void | setReversed(boolean reversed) | Alter the reversed scan flag. |
| abstract byte[] | toByteArray() | TODO: JAVADOC Concrete implementers can signal a failure condition in their code by throwing an IOException. |
| abstract Cell | transformCell(Cell v) | Give the filter a chance to transform the passed Cell. |
-
Field Details
-
reversed
-
-
Constructor Details
-
Filter
public Filter()
-
-
Method Details
-
reset
Reset the state of the filter between rows. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
-
filterRowKey
Filters a row based on the row key. If this returns true, the entire row will be excluded. If false, each KeyValue in the row will be passed to filterCell(Cell) below. If filterAllRemaining() returns true, then filterRowKey(Cell) should also return true. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Parameters:
firstRowCell - The first cell coming in the new row
- Returns:
- true to remove the entire row, false to include the row (maybe).
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
-
filterAllRemaining
If this returns true, the scan will terminate. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Returns:
- true to end scan, false to continue.
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
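As a rough illustration of ending a scan early, the sketch below counts accepted rows and reports the scan as finished once a limit is reached. `RowLimitFilter` is a hypothetical class that only borrows the method names from this page; row keys are plain strings here, which is an assumption made to keep the example dependency-free.

```java
// Hypothetical row-limit filter; the names mirror Filter, but this is not the HBase class.
final class RowLimitFilter {
    private final int maxRows;
    private int rowsAccepted; // spans rows, so reset() must not clear it

    RowLimitFilter(int maxRows) {
        this.maxRows = maxRows;
    }

    // No per-row state to clear in this filter.
    void reset() {}

    // True once enough rows have been accepted: the scan can stop.
    boolean filterAllRemaining() {
        return rowsAccepted >= maxRows;
    }

    // Kept consistent with filterAllRemaining(): once the scan is over, every row is dropped.
    boolean filterRowKey(String rowKey) {
        return filterAllRemaining();
    }

    // Called at the end of each row; counts the row and keeps it.
    boolean filterRow() {
        rowsAccepted++;
        return false;
    }
}
```

The counter deliberately survives `reset()`, because `reset()` is called between rows while the limit applies to the whole scan.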
-
filterCell
A way to filter based on the column family, column qualifier and/or the column value. Return code is described below. This allows filters to filter only a certain number of columns, then terminate without matching every column. If filterRowKey returns true, filterCell needs to be consistent with it. filterCell can assume that filterRowKey has already been called for the row. If your filter returns ReturnCode.NEXT_ROW, it should return ReturnCode.NEXT_ROW until reset() is called, just in case the caller calls for the next row. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Parameters:
c - the Cell in question
- Returns:
- code as described below
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
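The requirement that NEXT_ROW stays in effect until reset() can be illustrated with a small sketch. `FirstNCellsFilter` is hypothetical: it accepts the first few cells of a row and then keeps answering NEXT_ROW for that row. Cells are represented by their qualifier string and the nested enum holds only two of the real return codes, both assumptions made for brevity.

```java
// Hypothetical filter that keeps the first 'limit' cells of each row.
final class FirstNCellsFilter {
    enum ReturnCode { INCLUDE, NEXT_ROW } // subset of the real return codes

    private final int limit;
    private int seen; // per-row state, cleared by reset()

    FirstNCellsFilter(int limit) {
        this.limit = limit;
    }

    void reset() {
        seen = 0;
    }

    ReturnCode filterCell(String qualifier) {
        if (seen >= limit) {
            // Stays NEXT_ROW for every further call until reset() runs.
            return ReturnCode.NEXT_ROW;
        }
        seen++;
        return ReturnCode.INCLUDE;
    }
}
```

Because the decision depends only on the counter, repeated calls after the limit is hit give the same answer, which is the consistency the description asks for.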
-
transformCell
Give the filter a chance to transform the passed Cell. If the Cell is changed, a new Cell object must be returned. NOTICE: Filters are evaluated on the server side, so the returned Cell must be an ExtendedCell, although it is marked as IA.Private.
- Parameters:
v - the Cell in question
- Returns:
- the changed Cell
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
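The rule that a changed Cell must be returned as a new object can be sketched as follows. `ValueCell` and `StripValueFilter` are hypothetical stand-ins (the real return type would have to be an ExtendedCell, as noted above); the example only shows the copy-on-change pattern.

```java
// Hypothetical immutable stand-in for a Cell.
final class ValueCell {
    final String qualifier;
    final String value;

    ValueCell(String qualifier, String value) {
        this.qualifier = qualifier;
        this.value = value;
    }
}

// Hypothetical filter that returns cells with their values stripped.
final class StripValueFilter {
    ValueCell transformCell(ValueCell v) {
        if (v.value.isEmpty()) {
            return v; // nothing to change, so the same object is returned
        }
        return new ValueCell(v.qualifier, ""); // changed, so a new object is returned
    }
}
```

The input cell is never mutated, so other holders of the original reference still see the original value.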
-
filterRowCells
Chance to alter the list of Cells to be submitted. Modifications to the list will carry on. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Parameters:
kvs - the list of Cells to be filtered
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
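Since modifications to the list carry on, a filter can edit it in place. The sketch below is a hypothetical `DropEmptyValuesFilter` that removes cells with empty values; each cell is modeled as a two-element `String[]` of qualifier and value, which is purely an assumption for illustration.

```java
import java.util.List;

// Hypothetical filter that edits the row's cell list in place.
final class DropEmptyValuesFilter {
    // Each entry is {qualifier, value}; entries with an empty value are removed.
    void filterRowCells(List<String[]> kvs) {
        kvs.removeIf(kv -> kv[1].isEmpty());
    }

    // Signals that this filter relies on filterRowCells(List).
    boolean hasFilterRow() {
        return true;
    }
}
```

The caller sees the shortened list afterwards, so the list passed in must be modifiable.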
-
hasFilterRow
Primarily used to check for conflicts with scans (such as scans that do not read a full row at a time).
- Returns:
- True if this filter actively uses filterRowCells(List) or filterRow().
-
filterRow
Last chance to veto the row based on previous filterCell(Cell) calls. A filter that wishes to exclude a row when, for example, a certain column is missing needs to retain state across those calls and then return the appropriate value here. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Returns:
- true to exclude row, false to include row.
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
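The "retain state, then decide" pattern described for filterRow() might look like the sketch below. `RequireColumnFilter` is hypothetical, cells are reduced to their qualifier string, and the return code of `filterCell` is omitted for brevity (every cell is treated as included).

```java
// Hypothetical filter that excludes rows lacking a required column.
final class RequireColumnFilter {
    private final String requiredQualifier;
    private boolean found; // per-row state, cleared by reset()

    RequireColumnFilter(String requiredQualifier) {
        this.requiredQualifier = requiredQualifier;
    }

    void reset() {
        found = false;
    }

    // Records whether the required column was seen; the return code is omitted here.
    void filterCell(String qualifier) {
        if (qualifier.equals(requiredQualifier)) {
            found = true;
        }
    }

    // This filter depends on filterRow(), so it reports that.
    boolean hasFilterRow() {
        return true;
    }

    // True excludes the row: the required column never showed up.
    boolean filterRow() {
        return !found;
    }
}
```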
-
getNextCellHint
If the filter returns the match code SEEK_NEXT_USING_HINT, then it should also tell which is the next key it must seek to. After receiving the match code SEEK_NEXT_USING_HINT, the QueryMatcher would call this function to find out which key it must next seek to. Concrete implementers can signal a failure condition in their code by throwing an IOException. NOTICE: Filters are evaluated on the server side, so the returned Cell must be an ExtendedCell, although it is marked as IA.Private.
- Returns:
- The KeyValue to seek to next. Returns null if the filter is not sure which key to seek to next.
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
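The interplay between SEEK_NEXT_USING_HINT and getNextCellHint can be sketched with sorted qualifier strings standing in for cells. Everything here is hypothetical: `SeekToQualifierFilter` mirrors the two method names, and `SeekDemo.scanRow` is a much simplified stand-in for the matcher loop, written only to show that cells before the hint are never handed to the filter.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical filter over sorted qualifier strings; not the real HBase classes.
final class SeekToQualifierFilter {
    enum ReturnCode { INCLUDE, SEEK_NEXT_USING_HINT, NEXT_ROW }

    private final String target;

    SeekToQualifierFilter(String target) {
        this.target = target;
    }

    ReturnCode filterCell(String qualifier) {
        int cmp = qualifier.compareTo(target);
        if (cmp < 0) return ReturnCode.SEEK_NEXT_USING_HINT; // before target: ask for a seek
        if (cmp == 0) return ReturnCode.INCLUDE;
        return ReturnCode.NEXT_ROW; // past target: nothing more to find in this row
    }

    // Returns the key to seek to, or null when the filter has no useful hint.
    String getNextCellHint(String currentQualifier) {
        return currentQualifier.compareTo(target) < 0 ? target : null;
    }
}

final class SeekDemo {
    // Simplified matcher loop; records every qualifier handed to the filter in 'examined'.
    static List<String> scanRow(List<String> sorted, SeekToQualifierFilter f, List<String> examined) {
        List<String> included = new ArrayList<>();
        int i = 0;
        while (i < sorted.size()) {
            String q = sorted.get(i);
            examined.add(q);
            SeekToQualifierFilter.ReturnCode code = f.filterCell(q);
            if (code == SeekToQualifierFilter.ReturnCode.INCLUDE) {
                included.add(q);
                i++;
            } else if (code == SeekToQualifierFilter.ReturnCode.SEEK_NEXT_USING_HINT) {
                String hint = f.getNextCellHint(q);
                i++; // always move past the current cell
                // With a hint, jump to the first qualifier that is not before it.
                while (hint != null && i < sorted.size() && sorted.get(i).compareTo(hint) < 0) {
                    i++;
                }
            } else {
                break; // NEXT_ROW: done with this row
            }
        }
        return included;
    }
}
```

With qualifiers `a, b, c, m, n` and target `m`, the loop hands only `a`, `m` and `n` to the filter; `b` and `c` are skipped by the seek.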
-
isFamilyEssential
Check that the given column family is essential for the filter to check the row. Most filters always return true here. But some could have more sophisticated logic which could significantly reduce the scanning process by not even touching columns until we are 100% sure that their data is needed in the result. Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
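A filter whose decision depends on a single column family could answer isFamilyEssential as in the sketch below. `SingleFamilyCheckFilter` is hypothetical and shows only the comparison; how the scanner uses the answer to defer loading other families is not modeled here.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical filter whose row decision depends on one column family only.
final class SingleFamilyCheckFilter {
    private final byte[] checkedFamily;

    SingleFamilyCheckFilter(String family) {
        this.checkedFamily = family.getBytes(StandardCharsets.UTF_8);
    }

    // Only the family this filter inspects is essential; others can wait
    // until the row is known to be needed.
    boolean isFamilyEssential(byte[] name) {
        return Arrays.equals(name, checkedFamily);
    }
}
```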
-
toByteArray
TODO: JAVADOC Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Returns:
- The filter serialized using pb
- Throws:
IOException - in case an I/O or a filter-specific failure needs to be signaled.
-
parseFrom
Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Parameters:
pbBytes - A pb serialized Filter instance
- Returns:
- An instance of Filter made from bytes
- Throws:
DeserializationException - if an error occurred
-
areSerializedFieldsEqual
Concrete implementers can signal a failure condition in their code by throwing an IOException.
- Returns:
- true if and only if the fields of the filter that are serialized are equal to the corresponding fields in other. Used for testing.
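The relationship between toByteArray, parseFrom and areSerializedFieldsEqual is a round trip: serialize, deserialize, compare the serialized fields. The sketch below illustrates that with a hypothetical `PrefixLimitFilter`. It uses `DataOutputStream` as a stand-in encoding rather than protobuf, and `parseFrom` throws a plain `IOException` instead of `DeserializationException`; both are assumptions made to keep the example self-contained.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;

// Hypothetical filter with two serialized fields; not protobuf-based like the real one.
final class PrefixLimitFilter {
    private final String prefix;
    private final int limit;

    PrefixLimitFilter(String prefix, int limit) {
        this.prefix = prefix;
        this.limit = limit;
    }

    // Writes the fields in a fixed order.
    byte[] toByteArray() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeUTF(prefix);
            out.writeInt(limit);
        }
        return bytes.toByteArray();
    }

    // Reads the fields back in the same order they were written.
    static PrefixLimitFilter parseFrom(byte[] pbBytes) throws IOException {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(pbBytes))) {
            String prefix = in.readUTF();
            int limit = in.readInt();
            return new PrefixLimitFilter(prefix, limit);
        }
    }

    // Compares only the fields that toByteArray() writes.
    boolean areSerializedFieldsEqual(PrefixLimitFilter other) {
        return other != null && prefix.equals(other.prefix) && limit == other.limit;
    }
}
```

A test for such a filter would typically assert that `parseFrom(f.toByteArray())` yields an instance for which `areSerializedFieldsEqual` returns true.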
-
setReversed
Alter the reversed scan flag.
- Parameters:
reversed - flag
-
isReversed
-