fix: Issue where CodeAct agent was trying to log cost on local LLM and throwing Undefined Model exception out of litellm #1666

Merged

Conversation

bartshappee (Contributor)

Fixes an error introduced in #1612 that occurs when using a local model.

[screenshot of the error]

@rbren please review this PR, thanks.

rbren (Collaborator) left a comment

Thanks for the fix!

Instead of this implementation, what if we changed the completion_cost function in llm.py to return 0.00 when working with local models?

bartshappee (Contributor, Author)

> Thanks for the fix!
>
> Instead of this implementation, what if we changed the completion_cost function in llm.py to return 0.00 when working with local models?

Is there a better way than checking 'localhost' not in self.base_url to know whether the model is local?
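
One hypothetical alternative, sketched purely for illustration (the helper name and the provider-prefix list are assumptions, not anything in this PR), is to look at the model identifier's provider prefix in addition to the URL:

    # Illustrative only: prefix litellm uses for a locally-served backend
    # (this tuple is an assumption and is not exhaustive).
    LOCAL_PREFIXES = ('ollama/',)

    def is_local_model(model: str, base_url: str | None) -> bool:
        # Treat loopback endpoints as local, regardless of model name.
        if base_url and ('localhost' in base_url or '127.0.0.1' in base_url):
            return True
        return model.startswith(LOCAL_PREFIXES)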


rbren commented May 9, 2024

Ah, I didn't realize we were just exposing the completion_cost function directly from LiteLLM.

What I'd suggest is adding a new function to the LLM class in llm.py:

def completion_cost(self, response):
    try:
        return litellm.completion_cost(completion_response=response)
    except Exception:
        return 0.0
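
In that design, the agent's step would call the wrapper instead of litellm directly; a hedged sketch of the call site (the self.llm attribute path is an assumption based on the traceback later in this thread):

    # Inside CodeActAgent.step(), roughly (exact merged code may differ):
    cur_cost = self.llm.completion_cost(response)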

bartshappee (Contributor, Author)

I added the localhost check so that, when the model is not local, there is at least a warning in the log; otherwise we would log $0 even when a remote model has a real cost but is simply unsupported by litellm's costing module.
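
A minimal sketch combining the two ideas above, assuming the method lives on the LLM class in llm.py (the merged code may differ; the logger setup is a stand-in):

    import logging

    import litellm

    logger = logging.getLogger(__name__)

    class LLM:
        # Sketch of the one relevant method only.
        def __init__(self, base_url=None):
            self.base_url = base_url

        def completion_cost(self, response):
            # Local models cost nothing; skip the pricing lookup entirely.
            if self.base_url and 'localhost' in self.base_url:
                return 0.0
            try:
                return litellm.completion_cost(completion_response=response)
            except Exception:
                # Remote model missing from litellm's pricing table:
                # warn instead of crashing the agent loop.
                logger.warning('Cost calculation not supported for this model.')
                return 0.0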

codecov-commenter commented May 9, 2024

Codecov Report

Attention: Patch coverage is 75.00000%, with 6 lines in your changes missing coverage. Please review.

❗ No coverage uploaded for pull request base (main@0cf94a2).

Files                  Patch %   Lines
opendevin/llm/llm.py   66.66%    6 Missing ⚠️


Additional details and impacted files
@@           Coverage Diff           @@
##             main    #1666   +/-   ##
=======================================
  Coverage        ?   62.82%           
=======================================
  Files           ?       96           
  Lines           ?     3857           
  Branches        ?        0           
=======================================
  Hits            ?     2423           
  Misses          ?     1434           
  Partials        ?        0           


rbren (Collaborator) left a comment

Thanks for the fix!

rbren (Collaborator) left a comment

Looks like there's a slight issue when base_url isn't set.
[screenshot taken 2024-05-09 at 10:13:59 PM]

    try:
        cost = litellm_completion_cost(completion_response=response)
        return cost
    except Exception:
        logger.warning('Cost calculation not supported for this model.')
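
A hedged sketch of the guard that would fix this (the PR's later "Missing None Check" commit presumably does something along these lines):

    # base_url is None when no custom endpoint was configured,
    # so check for None before using the 'in' operator.
    if self.base_url is not None and 'localhost' in self.base_url:
        return 0.0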
Contributor

We should simply ignore the exception if 'Model not in model_prices_and_context_window.json' in str(e): when we are running local models we don't need to worry about the cost itself, and otherwise there would be huge numbers of warnings.
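
A minimal sketch of that suggestion (matching on the exception message is brittle, since litellm's wording could change; variable names mirror the excerpt above):

    try:
        cost = litellm_completion_cost(completion_response=response)
    except Exception as e:
        if 'Model not in model_prices_and_context_window.json' in str(e):
            cost = 0.0  # unpriced (likely local) model: skip cost tracking
        else:
            raise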

bartshappee (Contributor, Author)

I've reached out to the LiteLLM team about how to handle this correctly. The whole point of this code is to not log a warning when running local models, and to warn only when running remote models that are unknown from a pricing perspective.

SmartManoj (Contributor) commented May 10, 2024

Got it. Then, for the time being, we could gate this exception behind a config or environment variable (something like MISC), so that users who care can still surface the error while everyone else avoids both a missed warning and a flood of repeated warnings.
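
A hedged sketch of that opt-in gate (the name RAISE_COST_ERRORS is purely illustrative; neither it nor MISC is an existing OpenDevin setting):

    import os

    try:
        cost = litellm_completion_cost(completion_response=response)
    except Exception:
        # Opt in to the hard failure only when explicitly requested.
        if os.environ.get('RAISE_COST_ERRORS', '').lower() == 'true':
            raise
        logger.warning('Cost calculation not supported for this model.')
        cost = 0.0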

bartshappee (Contributor, Author)

The code change that broke local models already adds one log line per step; this just puts a warning before that line, so it doesn't drastically increase logging.

@bartshappee bartshappee requested a review from rbren May 10, 2024 15:15
@rbren rbren merged commit 78cd2e5 into All-Hands-AI:main May 10, 2024

rbren commented May 10, 2024

Thanks for this!

xingyaoww pushed a commit that referenced this pull request May 10, 2024
* fix: Issue where CodeAct agent was trying to log cost on local LLM and throwing Undefined Model exception out of litellm (#1666)

* Review Feedback

* Missing None Check

* Review feedback and improved error handling

---------

Co-authored-by: Robert Brennan <accounts@rbren.io>
simplesisu commented May 11, 2024

The error is still there unfortunately, at least for me:

21:55:24 - opendevin:ERROR: agent_controller.py:150 - Error in loop
Traceback (most recent call last):
  File "/app/opendevin/controller/agent_controller.py", line 145, in _run
    finished = await self.step(i)
               ^^^^^^^^^^^^^^^^^^
  File "/app/opendevin/controller/agent_controller.py", line 261, in step
    action = self.agent.step(self.state)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/agenthub/codeact_agent/codeact_agent.py", line 231, in step
    cur_cost = completion_cost(completion_response=response)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/.venv/lib/python3.12/site-packages/litellm/utils.py", line 4453, in completion_cost
    raise e
  File "/app/.venv/lib/python3.12/site-packages/litellm/utils.py", line 4439, in completion_cost
    ) = cost_per_token(
        ^^^^^^^^^^^^^^^
  File "/app/.venv/lib/python3.12/site-packages/litellm/utils.py", line 4254, in cost_per_token
    raise litellm.exceptions.NotFoundError(  # type: ignore
litellm.exceptions.NotFoundError: Model not in model_prices_and_context_window.json. You passed model=ollama/llama3:8b-instruct-q8_0. Register pricing for model - https://docs.litellm.ai/docs/proxy/custom_pricing

21:55:24 - opendevin:INFO: agent_controller.py:193 - Setting agent state from AgentState.RUNNING to AgentState.ERROR


I am running the latest commit.
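
As the traceback's own pointer (https://docs.litellm.ai/docs/proxy/custom_pricing) suggests, one documented workaround is to register a price for the local model with litellm; a hedged sketch (the zero costs and entry keys follow litellm's model_prices_and_context_window.json format, but verify against litellm's docs):

    import litellm

    # Register a zero-cost entry so completion_cost() stops raising
    # NotFoundError for this locally-served model.
    litellm.register_model({
        'ollama/llama3:8b-instruct-q8_0': {
            'input_cost_per_token': 0.0,
            'output_cost_per_token': 0.0,
            'litellm_provider': 'ollama',
            'mode': 'chat',
        }
    })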

SmartManoj (Contributor)

@simplesisu Which commit?

File "/app/agenthub/codeact_agent/codeact_agent.py", line 231, in step

This line was changed in this PR.

neubig pushed a commit that referenced this pull request May 15, 2024
* fix: Issue where CodeAct agent was trying to log cost on local LLM and throwing Undefined Model exception out of litellm (#1666)