nip.utils.types.DpoDatasetItem#
- class nip.utils.types.DpoDatasetItem[source]#
A single item in a DPO dataset.
This is used for training language models with Direct Preference Optimization (DPO).
Attributes
inputThe input chat history for the DPO dataset item.
preferred_outputThe preferred part of the preference pair.
non_preferred_outputThe non-preferred part of the preference pair.
Methods