AutoscalingSettings¶
- class baseten.client.managementapi.AutoscalingSettings(*, min_replica, max_replica, autoscaling_window, scale_down_delay, concurrency_target, target_utilization_percentage, target_in_flight_tokens=None, max_scale_down_rate=None)¶
Bases:
BaseModel- Parameters:
- model_config = {}¶
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].