The QI Index

Research Agenda

QI as a dataset for the study of knowledge itself

~300 episodes. 20+ years. Thousands of factual claims. Two hosts. A rotating panel of comedians and experts. All against the backdrop of massive changes in how information is produced and consumed. This is a research instrument.

Methodology Note

All research dimensions are designed to produce SymSys-grade, peer-review-quality insights. The transcript processing pipeline captures data for all dimensions from the start — following the principle of maximum extraction, deferred interpretation. The cost of extracting a data point you don't use is nearly zero. The cost of realizing 200 episodes in that you needed something you didn't capture is enormous.

5.1active

47 data points

Taxonomy of Epistemic Failure Modes

“Why do people believe wrong things, and are the reasons classifiable?”

Every incorrect or outdated claim is tagged with a failure mode from a developing taxonomy — from compression loss to folk etymology to cultural specificity misread as universality.

SymSys Connections

Cognitive SciencePhilosophy of ScienceComputational LinguisticsSemiotics

Planned Visualization

Treemap / sunburst filling in as episodes are processed

47/100 pts

5.2active

23 data points

Knowledge Half-Life by Domain

“How fast do different categories of knowledge become obsolete?”

Testing Arbesman's Half-Life of Facts thesis with a novel methodology — testing factual claims against current consensus rather than using citation data.

SymSys Connections

ScientometricsBayesian EpistemologyPhilosophy of LanguageEvolutionary Epistemology

Planned Visualization

Kaplan-Meier survival curves by domain

23/100 pts

5.3active

15 data points

The Internet Effect

“Has the nature of what people get wrong changed as the information environment changed?”

QI premiered in 2003. The show's run maps onto Wikipedia, social media, smartphones, and AI-generated content. Tracking shifts in misconception types across information eras.

SymSys Connections

Media EcologyMemeticsNetwork ScienceHCIAI & Information Integrity

Planned Visualization

Timeline showing misconception type composition shifting across eras

15/100 pts

5.4active

34 data points

The Landscape of Informal Knowledge

“What predicts who knows what, beyond formal credentials?”

Mapping where panelists' accuracy surprises — when comedians outperform experts, when personal background trumps education.

SymSys Connections

Situated CognitionEcological RationalityEpistemics of TestimonyCognitive Diversity

Planned Visualization

Panelist x domain heatmap filling in over time

34/100 pts

5.5active

42 data points

The Fry-Toksvig Transition

“How does a moderator shape collective knowledge production?”

A natural experiment: comparing how the same show functions under two different hosts across multiple dimensions.

SymSys Connections

Discourse AnalysisSocial PsychologyConversation AnalysisFraming Effects

Planned Visualization

Split-screen dashboard comparing both eras

42/100 pts

5.6active

28 data points

The Confidence-Accuracy Relationship

“Are panelists who sound more confident actually more accurate?”

Testing the Dunning-Kruger effect in a naturalistic setting with calibration plots per panelist.

SymSys Connections

Cognitive PsychologyDecision ScienceBehavioral Economics

Planned Visualization

Calibration plots (confidence vs. accuracy) per panelist

28/100 pts

5.7emerging

12 data points

The 'Nobody Knows' Category

“What characterizes questions that genuinely have no answer?”

Cataloguing the boundaries of human knowledge — the things we don't know, categorized.

SymSys Connections

Formal EpistemologyPhilosophy of ScienceDecision Theory

Planned Visualization

A growing inventory of the things we don't know, categorized

12/100 pts

Emerging Research

New research questions are added as patterns emerge from the data. The extraction pipeline is designed for maximum granularity so that unforeseen research directions are supported by data already collected.