Every Gaussia metric supports two statistical computation modes. You pass a StatisticalMode instance when running a metric to control how scores are aggregated.
Returns a single value — the weighted mean of all interaction scores.
from gaussia.statistical import FrequentistModefrom gaussia.metrics.context import Contextresults = Context.run( MyRetriever, model=model, statistical_mode=FrequentistMode(),)for r in results: print(f"Context awareness: {r.context_awareness:.3f}") # context_awareness_ci_low and context_awareness_ci_high are None
Returns a mean with a credible interval, computed via bootstrap resampling.
from gaussia.statistical import BayesianModeresults = Context.run( MyRetriever, model=model, statistical_mode=BayesianMode( mc_samples=5000, # Number of Monte Carlo samples ci_level=0.95, # 95% credible interval ),)for r in results: print(f"Context awareness: {r.context_awareness:.3f}") print(f"95% CI: [{r.context_awareness_ci_low:.3f}, {r.context_awareness_ci_high:.3f}]")