Benchmark

Precision/recall/faithfulness benchmark runner — gates CI on configurable thresholds; supports custom query files and optional faithfulness scoring via --with-faithfulness

comparisonconcepterrorfaqnotequickstartreferencetasktiptroubleshootingwarning
comparisonconcepterrorfaqnotequickstartreferencetasktiptroubleshootingwarning