So far in the Lightning refactor we have removed the manual training and validation code, but the evaluation on the test set in `eval.py` still remains.