v6.1 PRO FINAL — Scientific Foundation

Evidence-Based Architecture

CouncilIA operationalizes research from multi-agent systems and decision science into a structured deliberation platform. Our approach is scientifically grounded, not scientifically proven.

hub
Pillar 01

Multi-Agent Deliberation

Shaikh et al. (PLOS 2025)

Multi-instance LLM deliberation achieves 97% accuracy in high-stakes clinical exams. CouncilIA extends this by replacing identical instances with specialized personas for cognitive diversity.

balance
Pillar 02

Adversarial Reasoning

Ellemers et al. (PNAS 2020)

Inspired by research on 'Adversarial Alignment' where structured conflict improves theory-building. CouncilIA operationalizes this through intentional role tension and forced challenges.

groups
Pillar 03

Independent Perspectives

Distributed Multi-Agent Systems

Based on game-theoretic principles where independent agents optimize domain-specific criteria. This improves decision robustness by surfacing risks that single-model outputs miss.

Technical Spec v6.1
refresh
Pillar 04

Iterative Deliberation

3-Round Optimization

Empirical data suggests diminishing returns beyond a limited round count. CouncilIA adopts a strict Thesis → Antithesis → Synthesis protocol for maximum efficiency.

Technical Spec v6.1
shield_person
Pillar 05

Human-AI Governance

Amershi et al. (CHI 2019)

Implementation of Microsoft Research G11 (Explainability) and G17 (Human Control) guidelines. AI structures decisions, but humans remain final and accountable.

Audit-Ready Protocol

Ready to transform opinions into auditable decision documents?

Start Session

Technical Implementation

Deliberation Trace System

CouncilIA produces a structured decision trace: argument isolation, evidence mapping (RAG), and refinement logs. This enables complete auditability by regulatory bodies.

VaR-Inspired Risk Modeling

Decision uncertainty is quantified through agent disagreement and evidence gaps. High Dissent = High Uncertainty = Mandatory Human Review.

Decision Metrics

Dissent Range
0–100

Variance between expert outputs

Consensus Strength
%

Arguments surviving all rounds

Evidence Density
Cites

Citations per critical claim

Audit Score
0–100

Adherence to regulatory RAG

The 3-Round Protocol

RoundPurposeOutput
R1: ThesisIndependent expert evaluationDomain-specific analysis + Evidence
R2: AntithesisStructured adversarial critiqueUnrefuted risks + Direct challenges
R3: SynthesisEvidence-based refinementDecision document + Action plan

System Limitations

  • Dependent on input quality (Garbage In → Structured Garbage Out)
  • Does not guarantee correctness; only de-risks process
  • Does not replace domain experts or regulatory sign-off
  • Cannot validate outcome without empirical testing

What We Do Not Claim

  • ❌ AI replaces human decision-making
  • ❌ Universally proven accuracy
  • ❌ Elimination of risk

"We structure decisions. Humans remain accountable."