← All Experiments

Social Media Feature Analysis

Papers 166/167 · Substrate: Epidemiological (CDC YRBS + PISA 2022) · Status: 12/14 PASS

613,744
Students
80
Countries
R²=0.80
Persistent Sadness
5.6×
Girl:Boy Ratio
13
Verifiable Features

The Circularity Break

The Void Framework's platform scoring (N=1,344) uses the framework's own rubric — a circularity concern. Papers 166/167 break this by replacing the rubric with verifiable facts about platform design: does the platform have an algorithmic feed? Autoplay? Opaque recommendations? These are binary/ordinal features anyone can check. The outcomes come from external health datasets (CDC YRBS, PISA 2022) that the framework had no role in collecting.

Same dimensional structure (Opacity, Reactivity, Coupling). Different operationalization. Independent data. No rubric. If the same structure predicts outcomes here, it's not circular.

The 13 Features

FeatureDimensionType
Algorithmic feedOpacityBinary
AutoplayReactivityBinary
Opaque recommendationOpacityBinary
Infinite scrollCouplingBinary
Like counts visibleCouplingBinary
Follower counts visibleCouplingBinary
Push notificationsReactivityBinary
Default public profileOpacityBinary
Ephemeral contentOpacityBinary
Video-first formatReactivityBinary
Direct messaging (minors)CouplingBinary
Content creation toolsCouplingOrdinal
Engagement gamificationReactivityOrdinal

Result: Feature-weighted exposure R²=0.80 for persistent sadness (CDC YRBS, 7 waves). Cross-national replication: r=−0.648 in Western Europe (p=0.017), survives GDP control. Girls 5.6× more affected in 91% of countries (p<0.000001). Opacity features dominate (O avg R²=0.549 > R 0.493 > α 0.375). opaque_recommendation alone: R²=0.938 for female teen sadness.

Framework Prediction vs Result

The Void Framework predicted that opacity features would dominate harm outcomes. The data confirmed: Opacity > Reactivity > Coupling. The single most predictive feature — opaque_recommendation — is the purest opacity measure: the platform decides what you see and doesn't tell you why. R²=0.938 from one binary feature.

Litigation Relevance

This methodology is Daubert-qualified: verifiable features (any expert can check), large sample (613K students), cross-national replication (80 countries), external outcomes (CDC/PISA), and specific causal mechanism (opacity → explaining-away penalty → drift → harm). 14 predictions tested, 12 confirmed, 12/12 kill conditions survived.

The $6B+ social media litigation wave needs exactly this: a methodology that connects specific, checkable design choices to specific, measurable health outcomes, with a causal mechanism that doesn't require proving intent.

Caveats: Ecological inference (country-level exposure → individual outcomes) remains a limitation, though the PISA bridge (individual social media use → outcomes) provides within-country validation. 2023 pullback is attributed to COVID amplifier effect (LOO without 2021 R²=0.926). Monte Carlo robustness: 98.2% of 10K feature perturbations maintain R²>0.7.

Papers on Zenodo · Code on GitHub · All Experiments