DeepEval

Evaluation & TestingFreemiumVerifiedOpen Source

Open-source LLM evaluation framework with 50+ research-backed metrics for testing AI applications. Differentiates with native Pytest integration, multi-modal support, and advanced techniques like G-Eval, DAG, and QAG scoring. Best for teams building CI/CD pipelines to regression-test LLM-powered applications.

Visit Website GitHub Pricing Used in 23 stacks →

Price

From $0/ per month

License: Apache-2.0