Skip to content

Conversation

@MischaPanch
Copy link
Collaborator

@MischaPanch MischaPanch commented Oct 16, 2025

Refactoring of the scripts, reducing parametrization of HL scripts to the minimum, restored same default config as in v0.5.0 (except for mujoco task versions, which have been bumped from 3 to 4).
Various improvements in the rliable eval code are done.
Also, this PR adds the possibility to run and evaluate multiple experiments directly from an ExperimentBuilder. This possibility is used to establish a benchmarking script that will run multiple scripts in parallel in tmux sessions, evaluate them with rliable and aggregate the stats such that they can be displayed in the benchmarking section in the logs.

This concludes most of the preparations for establishing easily reproducible benchmarking runs.

@MischaPanch MischaPanch requested a review from opcode81 October 16, 2025 08:59
Conflicts:
	poetry.lock
	pyproject.toml
	tianshou/highlevel/experiment.py
@opcode81 opcode81 changed the base branch from master to dev-v2 October 24, 2025 14:24
@MischaPanch MischaPanch changed the base branch from dev-v2 to master October 25, 2025 13:39
@MischaPanch MischaPanch changed the title Benchmarking - part 1 Benchmarking Oct 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants