nip.utils.types.DpoDatasetItem#
- class nip.utils.types.DpoDatasetItem[source]#
A single item in a DPO dataset.
This is used for training language models with Direct Preference Optimization (DPO).
Attributes
input
The input chat history for the DPO dataset item.
preferred_output
The preferred part of the preference pair.
non_preferred_output
The non-preferred part of the preference pair.
Methods