@InterfaceAudience.Private public class LruBlockCache extends Object implements ResizableBlockCache, HeapSize
HeapSize
,
memory-bound using an LRU eviction algorithm, and concurrent: backed by a
ConcurrentHashMap
and with a non-blocking eviction thread giving
constant-time cacheBlock(org.apache.hadoop.hbase.io.hfile.BlockCacheKey, org.apache.hadoop.hbase.io.hfile.Cacheable, boolean, boolean)
and getBlock(org.apache.hadoop.hbase.io.hfile.BlockCacheKey, boolean, boolean, boolean)
operations.
Contains three levels of block priority to allow for
scan-resistance and in-memory families
HColumnDescriptor.setInMemory(boolean)
(An
in-memory column family is a column family that should be served from memory if possible):
single-access, multiple-accesses, and in-memory priority.
A block is added with an in-memory priority flag if
HColumnDescriptor.isInMemory()
,
otherwise a block becomes a single access
priority the first time it is read into this block cache. If a block is accessed again while
in cache, it is marked as a multiple access priority block. This delineation of blocks is used
to prevent scans from thrashing the cache adding a least-frequently-used
element to the eviction algorithm.
Each priority is given its own chunk of the total cache to ensure fairness during eviction. Each priority will retain close to its maximum size, however, if any priority is not using its entire chunk the others are able to grow beyond their chunk size.
Instantiated at a minimum with the total size and average block size. All sizes are in bytes. The block size is not especially important as this cache is fully dynamic in its sizing of blocks. It is only used for pre-allocating data structures and in initial heap estimation of the map.
The detailed constructor defines the sizes for the three priorities (they
should total to the maximum size
defined). It also sets the levels that
trigger and control the eviction thread.
The acceptable size
is the cache size level which triggers the eviction
process to start. It evicts enough blocks to get the size below the
minimum size specified.
Eviction happens in a separate thread and involves a single full-scan of the map. It determines how many bytes must be freed to reach the minimum size, and then while scanning determines the fewest least-recently-used blocks necessary from each of the three priorities (would be 3 times bytes to free). It then uses the priority chunk sizes to evict fairly according to the relative sizes and usage.
Modifier and Type | Field and Description |
---|---|
static long |
CACHE_FIXED_OVERHEAD |
Constructor and Description |
---|
LruBlockCache(long maxSize,
long blockSize)
Default constructor.
|
LruBlockCache(long maxSize,
long blockSize,
boolean evictionThread)
Constructor used for testing.
|
LruBlockCache(long maxSize,
long blockSize,
boolean evictionThread,
org.apache.hadoop.conf.Configuration conf) |
LruBlockCache(long maxSize,
long blockSize,
boolean evictionThread,
int mapInitialSize,
float mapLoadFactor,
int mapConcurrencyLevel,
float minFactor,
float acceptableFactor,
float singleFactor,
float multiFactor,
float memoryFactor,
boolean forceInMemory)
Configurable constructor.
|
LruBlockCache(long maxSize,
long blockSize,
org.apache.hadoop.conf.Configuration conf) |
Modifier and Type | Method and Description |
---|---|
void |
cacheBlock(BlockCacheKey cacheKey,
Cacheable buf)
Cache the block with the specified name and buffer.
|
void |
cacheBlock(BlockCacheKey cacheKey,
Cacheable buf,
boolean inMemory,
boolean cacheDataInL1)
Cache the block with the specified name and buffer.
|
static long |
calculateOverhead(long maxSize,
long blockSize,
int concurrency) |
void |
clearCache()
Clears the cache.
|
boolean |
containsBlock(BlockCacheKey cacheKey)
Whether the cache contains block with specified cacheKey
|
boolean |
evictBlock(BlockCacheKey cacheKey)
Evict block from cache.
|
protected long |
evictBlock(LruCachedBlock block,
boolean evictedByEvictionProcess)
Evict the block, and it will be cached by the victim handler if exists &&
block may be read again later
|
int |
evictBlocksByHfileName(String hfileName)
Evicts all blocks for a specific HFile.
|
Cacheable |
getBlock(BlockCacheKey cacheKey,
boolean caching,
boolean repeat,
boolean updateCacheMetrics)
Get the buffer of the block with the specified name.
|
BlockCache[] |
getBlockCaches() |
long |
getBlockCount()
Returns the number of blocks currently cached in the block cache.
|
long |
getCurrentSize()
Returns the occupied size of the block cache, in bytes.
|
Map<DataBlockEncoding,Integer> |
getEncodingCountsForTest() |
long |
getFreeSize()
Returns the free size of the block cache, in bytes.
|
long |
getMaxSize()
Get the maximum size of this cache.
|
CacheStats |
getStats()
Get counter statistics for this cache.
|
long |
heapSize() |
Iterator<CachedBlock> |
iterator() |
void |
logStats() |
void |
setMaxSize(long maxSize)
Sets the max heap size that can be used by the BlockCache.
|
void |
setVictimCache(BlockCache handler) |
void |
shutdown()
Shutdown the cache.
|
long |
size()
Returns the total size of the block cache, in bytes.
|
String |
toString() |
protected long |
updateSizeMetrics(LruCachedBlock cb,
boolean evict)
Helper function that updates the local size counter and also updates any
per-cf or per-blocktype metrics it can discern from given
LruCachedBlock |
public LruBlockCache(long maxSize, long blockSize)
All other factors will be calculated based on defaults specified in this class.
maxSize
- maximum size of cache, in bytesblockSize
- approximate size of each block, in bytespublic LruBlockCache(long maxSize, long blockSize, boolean evictionThread)
public LruBlockCache(long maxSize, long blockSize, boolean evictionThread, org.apache.hadoop.conf.Configuration conf)
public LruBlockCache(long maxSize, long blockSize, org.apache.hadoop.conf.Configuration conf)
public LruBlockCache(long maxSize, long blockSize, boolean evictionThread, int mapInitialSize, float mapLoadFactor, int mapConcurrencyLevel, float minFactor, float acceptableFactor, float singleFactor, float multiFactor, float memoryFactor, boolean forceInMemory)
maxSize
- maximum size of this cache, in bytesblockSize
- expected average size of blocks, in bytesevictionThread
- whether to run evictions in a bg thread or notmapInitialSize
- initial size of backing ConcurrentHashMapmapLoadFactor
- initial load factor of backing ConcurrentHashMapmapConcurrencyLevel
- initial concurrency factor for backing CHMminFactor
- percentage of total size that eviction will evict untilacceptableFactor
- percentage of total size that triggers evictionsingleFactor
- percentage of total size for single-access blocksmultiFactor
- percentage of total size for multiple-access blocksmemoryFactor
- percentage of total size for in-memory blockspublic void setMaxSize(long maxSize)
ResizableBlockCache
setMaxSize
in interface ResizableBlockCache
maxSize
- The max heap size.public void cacheBlock(BlockCacheKey cacheKey, Cacheable buf, boolean inMemory, boolean cacheDataInL1)
It is assumed this will NOT be called on an already cached block. In rare cases (HBASE-8547) this can happen, for which we compare the buffer contents.
cacheBlock
in interface BlockCache
cacheKey
- block's cache keybuf
- block bufferinMemory
- if block is in-memorycacheDataInL1
- public void cacheBlock(BlockCacheKey cacheKey, Cacheable buf)
cacheBlock
in interface BlockCache
cacheKey
- block's cache keybuf
- block bufferprotected long updateSizeMetrics(LruCachedBlock cb, boolean evict)
LruCachedBlock
cb
- evict
- public Cacheable getBlock(BlockCacheKey cacheKey, boolean caching, boolean repeat, boolean updateCacheMetrics)
getBlock
in interface BlockCache
cacheKey
- block's cache keycaching
- true if the caller caches blocks on cache missesrepeat
- Whether this is a repeat lookup for the same block
(used to avoid double counting cache misses when doing double-check locking)updateCacheMetrics
- Whether to update cache metrics or notpublic boolean containsBlock(BlockCacheKey cacheKey)
cacheKey
- public boolean evictBlock(BlockCacheKey cacheKey)
BlockCache
evictBlock
in interface BlockCache
cacheKey
- Block to evictpublic int evictBlocksByHfileName(String hfileName)
This is used for evict-on-close to remove all blocks of a specific HFile.
evictBlocksByHfileName
in interface BlockCache
protected long evictBlock(LruCachedBlock block, boolean evictedByEvictionProcess)
block
- evictedByEvictionProcess
- true if the given block is evicted by
EvictionThreadpublic long getMaxSize()
public long getCurrentSize()
BlockCache
getCurrentSize
in interface BlockCache
public long getFreeSize()
BlockCache
getFreeSize
in interface BlockCache
public long size()
BlockCache
size
in interface BlockCache
public long getBlockCount()
BlockCache
getBlockCount
in interface BlockCache
public void logStats()
public CacheStats getStats()
Includes: total accesses, hits, misses, evicted blocks, and runs of the eviction processes.
getStats
in interface BlockCache
public long heapSize()
public static long calculateOverhead(long maxSize, long blockSize, int concurrency)
public Iterator<CachedBlock> iterator()
iterator
in interface Iterable<CachedBlock>
iterator
in interface BlockCache
public void shutdown()
BlockCache
shutdown
in interface BlockCache
public void clearCache()
public Map<DataBlockEncoding,Integer> getEncodingCountsForTest()
public void setVictimCache(BlockCache handler)
public BlockCache[] getBlockCaches()
getBlockCaches
in interface BlockCache
Copyright © 2007-2016 The Apache Software Foundation. All Rights Reserved.