Skip to content
#

judge

Here are 141 public repositories matching this topic...

Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the available tool, methods, repo, code etc to detect hallucination, LLM evaluation, grading and much more.

  • Updated Jul 10, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the judge topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the judge topic, visit your repo's landing page and select "manage topics."

Learn more