ModelTRTLLMPluginConfiguration¶
- class baseten.client.modelconfig.ModelTRTLLMPluginConfiguration(*, paged_kv_cache=True, use_paged_context_fmha=True, use_fp8_context_fmha=False, **extra_data)¶
Bases:
BaseModel- Parameters:
- model_config = {'extra': 'allow'}¶
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].