nip.parameters.trainers#
Parameters for the various ML trainers.
Classes
|
Common parameters for PPO trainers. |
|
Additional parameters for the Expert Iteration (EI) trainer. |
|
Additional parameters for Multi-Agent LLM Training (MALT) [MSD+24]. |
|
Additional parameters for the REINFORCE trainer. |
|
Additional parameters common to all RL trainers. |
|
Additional parameters for running agents in isolation. |
|
Additional parameters for SPG [FCR20] and its variants. |
|
Additional parameters for the text-based RL trainers. |
Additional parameters for the vanilla PPO trainer. |