ai-statistics
Statistically sane analysis methods for comparing AI model and prompt performance.
Statistical analysis methods for comparing prompt and model performance in LLM evaluations.