duo_ai.utils.evaluation¶
Functions¶
|
Retrieve or compute evaluation statistics for a test environment and expert agent. |
Module Contents¶
- duo_ai.utils.evaluation.get_test_eval_info(env_suite, env_name, test_env, evaluator)[source]¶
Retrieve or compute evaluation statistics for a test environment and expert agent.
If statistics for the given environment are missing, this function evaluates the expert agent on the test environment, saves the results, and returns the summary.
- Parameters:
env_suite (str) – Name of the environment suite (e.g., ‘procgen’).
env_name (str) – Name of the specific environment (e.g., ‘coinrun’).
test_env (object) – Test environment instance, expected to have ‘expert’, ‘base_env’, and ‘num_envs’ attributes.
evaluator (object) – Evaluator instance with an ‘eval’ method for running evaluation.
- Returns:
info – Dictionary of evaluation statistics for the test environment and expert agent.
- Return type:
dict
Examples
>>> info = get_test_eval_info('procgen', 'coinrun', test_env, evaluator) >>> print(info['reward_mean'])