UpdateAutoscalingSettings

class baseten.client.managementapi.UpdateAutoscalingSettings(*, min_replica=None, max_replica=None, autoscaling_window=None, scale_down_delay=None, concurrency_target=None, target_utilization_percentage=None, target_in_flight_tokens=None, max_scale_down_rate=None)

Bases: BaseModel

Parameters:
  • min_replica (int | None)

  • max_replica (int | None)

  • autoscaling_window (int | None)

  • scale_down_delay (int | None)

  • concurrency_target (int | None)

  • target_utilization_percentage (int | None)

  • target_in_flight_tokens (int | None)

  • max_scale_down_rate (MaxScaleDownRate | None)

model_config = {'extra': 'forbid'}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].