DeepEval
Evaluation & Testing | Freemium | Verified | Open Source
Open-source LLM evaluation framework with 50+ research-backed metrics for testing AI applications. It stands out for its native Pytest integration, multi-modal support, and advanced scoring techniques such as G-Eval, DAG, and QAG. Best for teams building CI/CD pipelines to regression-test LLM-powered applications.
Price
From $0 per month
License: Apache-2.0