// GLOSSARY · METHODOLOGY

Inter-Rater Reliability.

The statistical measure of agreement between independent evaluators scoring the same performance against the same rubric.

Inter-rater reliability (IRR) is the formal measure of whether a scoring system is consistent across people. High IRR means the rubric and the calibration practice are producing the same score for the same performance, regardless of who is observing.
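
As an illustration of how agreement between two evaluators can be quantified, the sketch below computes Cohen's kappa, one widely used IRR statistic that corrects raw agreement for the agreement expected by chance. The entry does not say which statistic OCTAAR uses, so the choice of kappa, the function name `cohen_kappa`, and the sample scores are all assumptions made for this example.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same items on a categorical rubric.

    Illustrative sketch only: the function name and inputs are hypothetical,
    not part of any OCTAAR interface.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")

    n = len(rater_a)

    # Observed agreement: share of items where both raters gave the same score.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: probability that the raters match if each assigned
    # scores independently at their own observed category frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    p_chance = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    if p_chance == 1.0:
        # Both raters used one identical category for every item; kappa is
        # formally undefined here, so this sketch reports full agreement.
        return 1.0

    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical rubric scores (1 to 3) from two evaluators on eight performances.
scores_a = [3, 2, 3, 1, 2, 3, 1, 2]
scores_b = [3, 2, 2, 1, 2, 3, 1, 3]
print(round(cohen_kappa(scores_a, scores_b), 3))  # 0.619
```

In this sample the evaluators agree on 6 of 8 performances (0.75 raw agreement), but kappa is lower because some of that agreement would be expected by chance alone.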

OCTAAR treats IRR as an operational metric, not a research artifact. It is monitored across the cycle, surfaced as a leading indicator, and used to trigger evaluator recalibration before drift becomes a finding.
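
One way to picture IRR as a leading indicator is a simple check over a series of agreement scores taken at successive checkpoints in a cycle. The sketch below is not OCTAAR's actual rule: the function name `recalibration_due`, the 0.70 floor, and the two-consecutive-drops trigger are hypothetical values chosen only to show the idea of acting on a downward trend before agreement falls below an acceptable level.

```python
def recalibration_due(agreement_history, floor=0.70, max_consecutive_drops=2):
    """Decide whether evaluators should be recalibrated.

    Illustrative sketch only: the thresholds and the rule itself are
    assumptions, not values taken from the OCTAAR methodology.

    agreement_history: agreement scores (for example kappa values), ordered
    from the oldest checkpoint to the most recent one.
    """
    if not agreement_history:
        return False

    # Hard trigger: the latest score is already below the acceptable floor.
    if agreement_history[-1] < floor:
        return True

    # Leading-indicator trigger: count how many checkpoints in a row, ending
    # at the latest one, scored lower than the checkpoint before them.
    drops = 0
    for previous, current in zip(agreement_history, agreement_history[1:]):
        drops = drops + 1 if current < previous else 0

    return drops >= max_consecutive_drops


# Hypothetical checkpoint series.
print(recalibration_due([0.85, 0.84, 0.86]))  # False: stable, above the floor
print(recalibration_due([0.88, 0.82, 0.76]))  # True: two drops in a row
print(recalibration_due([0.80, 0.65]))        # True: latest score below floor
```

The second case is the one that matters for early warning: every score is still above the floor, yet the trend alone is enough to prompt recalibration.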

// ALSO KNOWN AS

IRR, Inter-observer reliability, Inter-rater agreement

// DOMAIN

Cross-domain

// LAST UPDATED

May 19, 2026 · OCTAAR Methodology Team
