(fix) Aider bench: logname fix; improve test calling instruction #3666

tobitege · 2024-08-30T12:53:53Z

Short description of the problem this fixes or functionality that this introduces. This may be used for the CHANGELOG

Fixes potential logfile name if model has : character in it. Improve instruction on how to call the (optional) test case to prevent timeouts.

Give a summary of what the PR does, explaining any non-trivial design decisions

Any colon : in a model name, like nousresearch/hermes-3-llama-3.1-405b:extended prevents logging to file as there won't be a folder created.
If the use of the unit test is enabled (env var option), the LLM's have all different ideas on how to execute that test, which often resulted in timeouts. The instruction to the LLM has been improved to use the same format as the bench would use itself,
e.g. python -m unittest {instance.instance_name}_test.py

xingyaoww

LGTM!

logname fix; improve test calling instruction

d98a6e9

tobitege added enhancement New feature or request evaluation Related to running evaluations with OpenHands labels Aug 30, 2024

tobitege requested a review from xingyaoww August 30, 2024 12:53

Merge branch 'main' into tobitege/aider-bench-logfix

34faace

xingyaoww approved these changes Aug 30, 2024

View reviewed changes

tobitege merged commit dbb671a into main Aug 30, 2024

tobitege deleted the tobitege/aider-bench-logfix branch August 30, 2024 15:15

RajWorking pushed a commit to RajWorking/OpenHands that referenced this pull request Sep 2, 2024

logname fix; improve test calling instruction (All-Hands-AI#3666)

ac8db0c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(fix) Aider bench: logname fix; improve test calling instruction #3666

(fix) Aider bench: logname fix; improve test calling instruction #3666

tobitege commented Aug 30, 2024

xingyaoww left a comment

(fix) Aider bench: logname fix; improve test calling instruction #3666

(fix) Aider bench: logname fix; improve test calling instruction #3666

Conversation

tobitege commented Aug 30, 2024

xingyaoww left a comment

Choose a reason for hiding this comment