nip.language_model_server.types.VllmStartRequest#
- class nip.language_model_server.types.VllmStartRequest(*, model_name: str, quantization: Literal['bitsandbytes', 'none'] = 'none')[source]#
A request to start the vLLM server with a specific model.
Attributes
__fields_set__model_computed_fieldsmodel_configConfiguration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
model_extraGet extra fields set during validation.
model_fieldsmodel_fields_setReturns the set of fields that have been explicitly set on this model instance.
model_nameThe name of the model to be served by the vLLM server.
quantizationThe quantization method to use for the model.
Methods