nip.language_model_server.types.LmTrainingConfig

Contents

nip.language_model_server.types.LmTrainingConfig#

class nip.language_model_server.types.LmTrainingConfig(*, model_name: str, method: ~typing.Literal['dpo'], dpo_config: ~nip.language_model_server.types.LmDpoTrainingConfig = <factory>, training_lora_config: ~nip.language_model_server.types.LmLoraAdapterConfig | None = None, model_already_lora_strategy: ~typing.Literal['reuse', 'stack'] = 'reuse')[source]#

Configuration for training a language model with the language model server.

Attributes

__fields_set__

model_computed_fields

model_config

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_extra

Get extra fields set during validation.

model_fields

model_fields_set

Returns the set of fields that have been explicitly set on this model instance.

model_name

The name of the model to be trained, typically a Hugging Face identifier.

method

The method to be used for training.

dpo_config

Configuration specific to DPO training.

training_lora_config

Configuration for the LoRA adapter to use when training.

model_already_lora_strategy

Strategy for handling models that are already LoRA-adapted.

Methods