ModelTRTQuantizationConfiguration

class baseten.client.modelconfig.ModelTRTQuantizationConfiguration(*, calib_size=1024, calib_dataset='abisee/cnn_dailymail', calib_max_seq_length=1536, **extra_data)

Bases: BaseModel

Parameters:
  • calib_size (int | None)

  • calib_dataset (str | None)

  • calib_max_seq_length (int | None)

  • extra_data (Any)

model_config = {'extra': 'allow'}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].