October 11, 2024

Langfuse LLM-as-a-judge now supports any (tool-calling) LLM

Langfuse Evals rely on tool calling to get reliable results from the judge model. Previously, only OpenAI models were supported. With this update, you can use any LLM that supports tool calling when setting up an LLM-as-a-judge evaluator.

When creating an evaluator, you can now select any model that supports tool calls and for which you have an LLM API key in Langfuse. On evaluator creation, Langfuse tests the model with a sample run to confirm that it works as expected.
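To illustrate why tool calling helps here, the following is a minimal sketch, not Langfuse's actual implementation. It assumes the judge model is asked to answer through a tool whose arguments carry a score and a reasoning string, and that a sample run simply checks whether the model returns arguments matching that shape. The tool name `submit_evaluation`, the field names, and the helpers `parse_judge_tool_call` and `sample_run` are all hypothetical.

```python
import json

# Hypothetical tool definition in the JSON-Schema style that most
# tool-calling APIs accept. The name and fields are assumptions.
SCORE_TOOL = {
    "name": "submit_evaluation",
    "description": "Report the evaluation result.",
    "parameters": {
        "type": "object",
        "properties": {
            "score": {"type": "number"},
            "reasoning": {"type": "string"},
        },
        "required": ["score", "reasoning"],
    },
}


def parse_judge_tool_call(arguments_json: str) -> dict:
    """Validate the JSON arguments of a judge model's tool call."""
    args = json.loads(arguments_json)  # raises ValueError on non-JSON text
    missing = [k for k in SCORE_TOOL["parameters"]["required"] if k not in args]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    score = args["score"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(score, bool) or not isinstance(score, (int, float)):
        raise ValueError("score must be a number")
    if not isinstance(args["reasoning"], str):
        raise ValueError("reasoning must be a string")
    return {"score": float(score), "reasoning": args["reasoning"]}


def sample_run(call_model) -> bool:
    """Mimic a pre-flight check: True if the model yields a valid tool call.

    `call_model` is a stand-in for a real provider call. It receives the
    tool definition and returns the tool-call arguments as a JSON string.
    """
    try:
        parse_judge_tool_call(call_model(SCORE_TOOL))
        return True
    except (ValueError, TypeError):
        return False
```

A model that replies with free text instead of a structured tool call fails this check, while one that fills in the tool arguments passes:

```python
print(sample_run(lambda tool: '{"score": 0.8, "reasoning": "Answer is grounded."}'))
print(sample_run(lambda tool: "The answer looks good to me."))
```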

Learn more

Langfuse Evaluations
