Set up evaluation of AI agents with tool call validation, correctness checks, and task completion using Dokimos
Scaffold a new Evaluator implementation following Dokimos conventions
Create evaluation datasets in JSON, CSV, or JSONL format for Dokimos
Scaffold eval-driven tests using dokimos-junit with @DatasetSource and Assertions.assertEval()
Scaffold a Dokimos Experiment with dataset, task, evaluators, and reporter
Set up evaluation of LangChain4j applications and RAG pipelines using Dokimos