AI Model Research

Original research from the meo-benchmark project: the Effective Value metric and its depth-driven rank inversion, an epistemic-integrity (sycophancy) study, a negative result on cheap-ensemble fusion, and a bias-controlled multi-LLM-as-judge methodology.

Based on the meo-benchmark preprint (Zenodo 10.5281/zenodo.20586608). See the methodology for how scores are produced.