Class FeedbackAdaptiveRateLimiter

java.lang.Object
  org.apache.hadoop.hbase.quotas.RateLimiter
    org.apache.hadoop.hbase.quotas.FeedbackAdaptiveRateLimiter

@Private @Evolving public class FeedbackAdaptiveRateLimiter extends RateLimiter
An adaptive rate limiter that dynamically adjusts its behavior based on observed usage patterns to achieve stable, full utilization of configured quota allowances while managing client contention.

Core Algorithm: This rate limiter divides time into fixed refill intervals (configurable via hbase.quota.rate.limiter.refill.interval.ms; the default is one refill per TimeUnit of the RateLimiter). At the beginning of each interval, a fresh allocation of resources becomes available based on the configured limit. Clients consume resources as they make requests. When resources are exhausted, clients must wait for the next refill, or until enough resources become available again.
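
The Java sketch below illustrates the interval-refill mechanics described above. It is not the actual implementation of this class; the class name, field names (refillIntervalMs, available, limit), and structure are assumptions chosen for clarity.

// Illustrative sketch of interval-based refill, not the actual HBase implementation.
class IntervalRefillSketch {
  private final long refillIntervalMs; // e.g. hbase.quota.rate.limiter.refill.interval.ms
  private final long limit;            // resources granted per interval
  private long available;              // resources left in the current interval
  private long nextRefillTimeMs;

  IntervalRefillSketch(long limit, long refillIntervalMs) {
    this.limit = limit;
    this.refillIntervalMs = refillIntervalMs;
    this.available = limit;
    this.nextRefillTimeMs = System.currentTimeMillis() + refillIntervalMs;
  }

  synchronized boolean tryConsume(long amount, long nowMs) {
    if (nowMs >= nextRefillTimeMs) {
      // New interval: a fresh allocation becomes available.
      available = limit;
      nextRefillTimeMs = nowMs + refillIntervalMs;
    }
    if (amount <= available) {
      available -= amount;
      return true;
    }
    return false; // caller must wait for the next refill
  }
}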

Adaptive Backpressure: When multiple threads compete for limited resources (contention), this limiter detects the contention and applies increasing backpressure by extending wait intervals. This prevents thundering herd behavior where many threads wake simultaneously and compete for the same resources. The backoff multiplier increases by a small increment (see FEEDBACK_ADAPTIVE_BACKOFF_MULTIPLIER_INCREMENT) per interval when contention occurs, and decreases (see FEEDBACK_ADAPTIVE_BACKOFF_MULTIPLIER_DECREMENT) when no contention is detected, converging toward optimal throughput. The multiplier is capped at a maximum value (see FEEDBACK_ADAPTIVE_MAX_BACKOFF_MULTIPLIER) to prevent unbounded waits.
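
A minimal sketch of the backoff-multiplier feedback loop, assuming the per-interval adjustment described above. The increment, decrement, and cap values here are illustrative placeholders, not the actual defaults behind the FEEDBACK_ADAPTIVE_* settings.

// Hypothetical sketch of the backoff-multiplier feedback loop.
class BackoffMultiplierSketch {
  private static final double INCREMENT = 0.05;      // illustrative value
  private static final double DECREMENT = 0.05;      // illustrative value
  private static final double MAX_MULTIPLIER = 4.0;  // illustrative cap
  private double backoffMultiplier = 1.0;

  // Called once per refill interval with whether contention was observed.
  void onIntervalEnd(boolean contentionDetected) {
    if (contentionDetected) {
      backoffMultiplier = Math.min(MAX_MULTIPLIER, backoffMultiplier + INCREMENT);
    } else {
      backoffMultiplier = Math.max(1.0, backoffMultiplier - DECREMENT);
    }
  }

  // Scales a base wait so that contending threads spread out their retries.
  long scaleWait(long baseWaitMs) {
    return (long) (baseWaitMs * backoffMultiplier);
  }
}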

Contention is detected when getWaitInterval(long, long, long) is called with insufficient available resources (i.e., amount > available), indicating a thread needs to wait for resources. If this occurs more than once in a refill interval, the limiter identifies it as contention requiring increased backpressure.
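
The contention signal can be pictured as follows. This is an illustrative sketch, not the class's actual getWaitInterval(long, long, long) implementation; only the amount > available check and the "more than once per interval" rule come from the description above, and the field and method names are assumptions.

// Sketch of contention detection: more than one "must wait" call within a
// single refill interval is treated as contention.
class ContentionDetectorSketch {
  private final long refillIntervalMs;
  private int waitsThisInterval = 0;

  ContentionDetectorSketch(long refillIntervalMs) {
    this.refillIntervalMs = refillIntervalMs;
  }

  // Mirrors the decision point in getWaitInterval(limit, available, amount).
  synchronized long getWaitInterval(long limit, long available, long amount) {
    if (amount <= available) {
      return 0; // enough resources are available; no wait required
    }
    waitsThisInterval++;     // a thread had to wait: potential contention
    return refillIntervalMs; // wait roughly until the next refill
  }

  // Called when a new refill interval begins.
  synchronized boolean resetAndCheckContention() {
    boolean contention = waitsThisInterval > 1;
    waitsThisInterval = 0;
    return contention;
  }
}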

Oversubscription for Full Utilization: In practice, synchronization overhead and timing variations often prevent clients from consuming exactly their full allowance, resulting in consistent under-utilization. This limiter addresses this by tracking utilization via an exponentially weighted moving average (EWMA). When average utilization falls below the target range (determined by FEEDBACK_ADAPTIVE_UTILIZATION_ERROR_BUDGET), the limiter gradually increases the oversubscription proportion (see FEEDBACK_ADAPTIVE_OVERSUBSCRIPTION_INCREMENT), allowing more resources per interval than the base limit. Conversely, when utilization exceeds the target range, oversubscription is decreased (see FEEDBACK_ADAPTIVE_OVERSUBSCRIPTION_DECREMENT). Oversubscription is capped (see FEEDBACK_ADAPTIVE_MAX_OVERSUBSCRIPTION) to prevent excessive bursts while still enabling consistent full utilization.
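
Below is a sketch of the utilization EWMA and the oversubscription adjustment, assuming utilization is measured as the fraction of the base allowance consumed per interval. The smoothing factor, error budget, step sizes, and cap are illustrative stand-ins for the FEEDBACK_ADAPTIVE_* settings, not their real defaults.

// Sketch of the utilization EWMA and oversubscription feedback loop.
class OversubscriptionSketch {
  private static final double ALPHA = 0.2;          // EWMA smoothing factor (assumed)
  private static final double ERROR_BUDGET = 0.05;  // tolerated deviation from full utilization (assumed)
  private static final double OVERSUB_INCREMENT = 0.01;
  private static final double OVERSUB_DECREMENT = 0.01;
  private static final double MAX_OVERSUBSCRIPTION = 0.5;

  private double utilizationEwma = 1.0;  // fraction of the base allowance consumed
  private double oversubscription = 0.0;

  // Called at the end of each refill interval with the observed utilization.
  void onIntervalEnd(double observedUtilization) {
    utilizationEwma = ALPHA * observedUtilization + (1 - ALPHA) * utilizationEwma;
    if (utilizationEwma < 1.0 - ERROR_BUDGET) {
      // Consistently under target: hand out a little more per interval.
      oversubscription = Math.min(MAX_OVERSUBSCRIPTION, oversubscription + OVERSUB_INCREMENT);
    } else if (utilizationEwma > 1.0 + ERROR_BUDGET) {
      // Above the target range (exact bounds assumed here): pull back.
      oversubscription = Math.max(0.0, oversubscription - OVERSUB_DECREMENT);
    }
  }

  long effectiveLimit(long baseLimit) {
    return Math.round(baseLimit * (1.0 + oversubscription));
  }
}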

Example Scenario: Consider a quota of 1000 requests per second with a 1-second refill interval. Without oversubscription, clients might typically achieve only 950 req/s due to coordination delays. This limiter would detect the under-utilization and gradually increase oversubscription, allowing slightly more resources per interval; this compensates for the inefficiency and yields stable throughput closer to the configured quota. If multiple threads simultaneously try to consume resources and repeatedly wait, the backoff multiplier increases their wait times, spreading out their retry attempts and reducing wasted CPU cycles.
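
The arithmetic behind this scenario can be made concrete with a small, self-contained example. The 5% shortfall, 2% error budget, 1% increment, and ten-interval horizon are assumptions chosen purely for illustration.

// Worked numbers for the scenario above; all constants are illustrative.
public class ScenarioExample {
  public static void main(String[] args) {
    long baseLimit = 1000;            // configured quota: 1000 req/s
    double utilizationEwma = 0.95;    // clients only reach ~950 req/s
    double errorBudget = 0.02;        // assumed tolerance below full utilization
    double increment = 0.01;          // assumed oversubscription step
    double maxOversubscription = 0.5; // assumed cap
    double oversubscription = 0.0;

    // Each interval of under-utilization nudges oversubscription upward.
    for (int interval = 0; interval < 10; interval++) {
      if (utilizationEwma < 1.0 - errorBudget) {
        oversubscription = Math.min(maxOversubscription, oversubscription + increment);
      }
    }
    long effectiveLimit = Math.round(baseLimit * (1.0 + oversubscription));
    // ~1100 resources per interval are now available, so realized throughput
    // can approach the configured 1000 req/s despite coordination overhead.
    System.out.println("Effective limit after 10 intervals: " + effectiveLimit);
  }
}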

Configuration Parameters: The behavior described above is tuned via FEEDBACK_ADAPTIVE_BACKOFF_MULTIPLIER_INCREMENT, FEEDBACK_ADAPTIVE_BACKOFF_MULTIPLIER_DECREMENT, FEEDBACK_ADAPTIVE_MAX_BACKOFF_MULTIPLIER, FEEDBACK_ADAPTIVE_UTILIZATION_ERROR_BUDGET, FEEDBACK_ADAPTIVE_OVERSUBSCRIPTION_INCREMENT, FEEDBACK_ADAPTIVE_OVERSUBSCRIPTION_DECREMENT, and FEEDBACK_ADAPTIVE_MAX_OVERSUBSCRIPTION, while the refill interval is set with hbase.quota.rate.limiter.refill.interval.ms.
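
As a hedged example, the refill interval could be tuned through the standard HBase configuration as sketched below. Only the hbase.quota.rate.limiter.refill.interval.ms key is taken from this documentation; the 1000 ms value is an example, and the FEEDBACK_ADAPTIVE_* names above refer to class constants rather than the key shown here.

// Illustrative configuration snippet; the value used is an example only.
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class QuotaTuningExample {
  public static void main(String[] args) {
    Configuration conf = HBaseConfiguration.create();
    // Refill once per second (example value; the default is one refill per
    // TimeUnit of the RateLimiter, per the description above).
    conf.setLong("hbase.quota.rate.limiter.refill.interval.ms", 1000L);
    System.out.println(conf.get("hbase.quota.rate.limiter.refill.interval.ms"));
  }
}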

This algorithm converges toward stable operation where: (1) wait intervals are just long enough to prevent excessive contention, and (2) oversubscription is just high enough to achieve consistent full utilization of the configured allowance.