top of page
Search

Stress-Testing Meta-Research with AI Duels

By Zuzana Irsova and Tomas Havranek (meta-analysis.cz)


Many of us now use large language models for meta-analysis tasks like coding, short summaries, or quick checks. Where they become genuinely valuable for research, though, is not in producing a single clean answer. It is in creating structured disagreement: two models pushing on each other’s reasoning, with a researcher steering the process.


That is the idea behind the Research Audit Protocol (v1.7). It is a structured, human-in-the-loop workflow that coordinates ChatGPT and Gemini in a deliberate “duel.” The goal is not AI approval. The goal is to generate the kinds of counterexamples, boundary conditions, and missing assumptions that a single model (and often a single human pass) would not surface. (Of course, advanced users can substitute Claude for the auditor role, or run the same workflow via an API-based multi-agent setup.)


Why duels work A normal chat is optimized for flow. A duel is optimized for scrutiny:


  • Anchor (ChatGPT): Start with a full, file-grounded assessment before seeing any critique.

  • Duel (Gemini): Probe hard for identification problems, hidden assumptions, and failure modes—and force specificity.

  • Synthesis (You + ChatGPT): Map the disagreement (or convergence) and record what changed and why, so the final view is auditable.


A MAER-Net Case Study: WAIVE vs. MAIVE As a proof of concept for meta-method work, we applied the protocol to an audit of the proposed WAIVE idea against the MAIVE framework. The duel did not just restate the two approaches. It forced us to pin down the key tension that matters for applied meta-analysis: when does downweighting “suspiciously precise” results reduce spurious precision, and when might it also penalize genuinely informative studies?


In other words, it pushed us to state the boundary conditions clearly—the kind of slow thinking that improves methods before they hit peer review.


Resources

 
 
 

Recent Posts

See All
MAER-Net Blog - Introduction

Introduction to the MAER-Net Blog Content overview How to create a new account for the MAER-Net Blog. Access your profile . Subscribe to...

 
 
 
MAIVE Is Now on CRAN

We are happy to share that MAIVE  (Meta-Analysis Instrumental Variable Estimator) is now available on CRAN : install.packages("MAIVE") MAIVE is a simple tool that helps detect when study results look

 
 
 
bottom of page