AutoscalingSettings

class baseten.client.managementapi.AutoscalingSettings(*, min_replica, max_replica, autoscaling_window, scale_down_delay, concurrency_target, target_utilization_percentage, target_in_flight_tokens=None, max_scale_down_rate=None)

Bases: BaseModel

Parameters:
  • min_replica (int)
  • max_replica (int)
  • autoscaling_window (int | None)
  • scale_down_delay (int | None)
  • concurrency_target (int)
  • target_utilization_percentage (int | None)
  • target_in_flight_tokens (int | None)
  • max_scale_down_rate (float | None)

model_config = {}

Configuration for the model. It should be a dictionary conforming to pydantic.config.ConfigDict.