INFO 230  ·  Cultural Analytics  ·  Spring 2026

Who Appears With Whom: A Network Analysis of
the Marvel Universe

Maddie MacDonald  ·  UC Berkeley Statistics, MA

Can the structure of a comic book universe reveal sixty years of editorial decisions? This project applies a full network analysis pipeline to the Marvel Universe Social Network (a dataset of 6,439 heroes and 96,104 co-appearances across 12,651 comic issues) to find out.
Hi Professor Tim! Similar to project 1, the full analysis, code, and write-up live in the notebook below. The interactive community visualizations are linked separately.

Sixty years of Marvel comics, 6,439 heroes, and 96,104 co-appearances later...the network structure tells a surprisingly coherent story. Looking at who co-appears with whom, Captain America and Spider-Man came out on top as the most connected heroes, which is exactly what you'd expect. The more interesting findings came from digging deeper.

An algorithm with zero prior Marvel knowledge sorted heroes into communities that almost perfectly match the actual teams, e.g., X-Men together, Avengers together, Fantastic Four together, and Defenders together. Notably, Hawkeye turned out to be one of the biggest surprises, not the most connected hero by any means, but consistently one of the top bridge characters in the network, linking parts of the Marvel Universe that wouldn't otherwise be connected.

Moreover, Spider-Man beat out Captain America for most comic issues appeared in overall, while Captain America co-appeared with the broadest range of unique heroes, a subtle but real distinction between Marvel's most prolific solo character and its most collaborative one. The one metric that broke down in an interesting way was eigenvector centrality, which got dominated by obscure 1940s heroes rather than modern icons, so it turns out being densely connected in the early days of Marvel can inflate your "prestige score" in ways that have nothing to do with actual cultural prominence. That finding alone is a good reminder that network metrics are only as meaningful as the data behind them, and that 60 years of publishing history collapsed into a single static graph will always carry the fingerprints of the era it started in. The full discussion, limitations, and ideas for further study are in the notebook.

01

Community detection worked almost too well. An algorithm with no Marvel knowledge sorted heroes into groups that almost perfectly match the actual team rosters: X-Men, Avengers, Fantastic Four, and Defenders all emerged as distinct communities.

02

Hawkeye is a bigger bridge than his fame suggests. Despite not being one of the most connected heroes, he consistently ranked near the top for betweenness centrality, linking otherwise separate corners of the network.

03

Spider-Man vs. Captain America tells two different stories. Spider-Man appeared in more total issues; Captain America co-appeared with the broadest range of unique heroes. Most prolific solo character vs. most collaborative one.

04

Eigenvector centrality broke down in a revealing way. Obscure 1940s Golden Age heroes dominated the weighted version, showing how early-era density can distort prestige metrics in ways that don't reflect actual cultural prominence.

One of the most fun parts of this project was getting to actually interact with the networks rather than just look at static plots. The pyvis visualizations below let you drag nodes around and explore connections, and the Cytoscape exports via NDEx take it a step further with a more polished network viewer. Neither set is directly referenced in the formal analysis, they're more of a "here's what the data actually looks like when you can play with it" bonus. Hope you enjoy exploring them as much as I did!

PyVis — Please allow a moment to load

X-Men — Top 40 Heroes

The most densely connected community in the network, anchored by Wolverine, Cyclops, and Storm

Avengers — Top 40 Heroes

Centered on Captain America and Iron Man, the two highest degree heroes in the full network

Spider-Man's World — Top 40 Heroes

Spider-Man's extended cast including Daredevil, Mary Jane, and J. Jonah Jameson

Defenders and Mystic Heroes — Top 40 Heroes

The loosest community in the network, reflecting the Defenders' informal team structure

Cytoscape via NDEx — no account needed to view

Defenders and Mystic Heroes

20 nodes, 59 edges

Avengers

20 nodes, 186 edges

Fantastic Four

20 nodes, 129 edges

Spider-Man's World

20 nodes, 125 edges

Thor and Cosmic Heroes

20 nodes, 101 edges

X-Men

20 nodes, 189 edges