ppo_ic.py#

PPO image classification experiment using a grid of hyperparameters.

This script runs through a grid of hyperparameters, specified in the param_grid dict. If the MULTIPROCESS variable is set to True, the experiments are run using a pool of workers (specified by the --num-workers command line argument). Otherwise, the experiments are run sequentially.

Additional settings, like whether to log to W&B can be set via command line arguments. Run the script with the --help flag to see all available arguments.

scripts/ppo_ic.py#

Run hyperparameter experiments in parallel

usage: scripts/ppo_ic.py [-h] [-d] [-q] [--run-infix RUN_INFIX] [--use-wandb]
                         [--wandb-project WANDB_PROJECT]
                         [--wandb-entity WANDB_ENTITY] [--tag TAG]
                         [--gpu-num GPU_NUM] [--ignore-cache] [--no-pretrain]
                         [--num-workers NUM_WORKERS]
                         [--max-tasks-per-child MAX_TASKS_PER_CHILD]
                         [--num-skip NUM_SKIP] [--dataset-on-device]
                         [--enable-efficient-attention]

-h, --help#: show this help message and exit

-d, --debug#: Print debug messages

-q, --quiet#: Print less output (set log level to WARNING)

--run-infix <run_infix>#: The string to add in the middle of the run ID

--use-wandb#: Whether to use W&B to log the experiment

--wandb-project <wandb_project>#: The name of the W&B project to use

--wandb-entity <wandb_entity>#: The name of the W&B entity to use

--tag <tag>#: An optional tag for the W&B run

--gpu-num <gpu_num>#: The (0-based) number of the GPU to use

--ignore-cache#: Ignore the dataset and model cache and rebuild from scratch.

--no-pretrain#: Don’t pretrain the agents, regardless of the hyperparameters

--num-workers <num_workers>#: The number of workers to use for multiprocessing

--max-tasks-per-child <max_tasks_per_child>#: The maximum number of tasks each worker can run before being replaced

--num-skip <num_skip>#: The number of initial tasks to skip. Useful to resume an experiment

--dataset-on-device#: Store the whole dataset on the device (needs more GPU memory).

--enable-efficient-attention#: Enable efficient attention scaled dot product backend (may be buggy).

ppo_ic.py

Contents

ppo_ic.py#

scripts/ppo_ic.py#