I work with data to see the past.

I am a PhD candidate at the Leiden Centre for Digital Humanities and the Leiden Institute for Area Studies, reading tens of thousands of Korean texts with computer vision and language models. My work maps how colonial-Korean print culture looked and read, across its type and its advertisements. I ask what these methods reveal and what they obscure.

Much of this material survives in large digitized collections that resist conventional close viewing, and my work develops computational methods to make them tractable as historical sources.

With Steven Denney, I read South Korea's national-history textbooks with language models, following how they narrate the nation across more than a century of rewriting.

Latest

All talks →

Research

  • Printshop typography

    Because colonial-Korean printshops worked from the same imported type, their pages look alike, and an interpretable vision model tells four Seoul shops apart not by separate typefaces but by how often each leans on shared letterforms.

    Interpretable computer vision
  • Newspaper advertising

    A detector finds every classified advertisement and a vision–language model reads them at scale, tracing the print trade through the ads it placed.

    Computer vision · language models
  • Textbooks & nationalism

    Across every South Korean national-history textbook from 1895 on, sentence embeddings track how the language of the nation shifts each time the books are rewritten.

    Language models · with Steven Denney
All publications →

Dissertation

Typography and advertising in colonial Korea

My dissertation takes up two related problems in the material history of colonial Korea, both of which require working across very large collections of digitized print. The two case studies share a common methodological concern: how to move between close viewing and close reading of visual sources without losing sight of either.

Typography & Printshops

The first case study reads the typographic output of four printing houses in colonial Seoul: Sinmungwan, Taedong Inswaeso, Hansŏng Tosŏ, and Chosŏn Inswae. All four bought matrices from the same Japanese foundries and set to the same point sizes, so their pages converge on a narrow shared vocabulary of forms. I treat that convergence as a method rather than an obstacle. Against so uniform a background, the minute ways a shop weights a stroke or finishes a serif become more legible rather than less, the tension the chapter calls the standardisation paradox.

To recover those traces across roughly 57,000 page images, I train a multiple-instance-learning model whose attention scores are the classification itself, so we can identify which patches of a page the model relies on. What surfaces is differentiation through density rather than distinct fingerprints: the four shops share one typographic space, and each concentrates in different parts of it, as in the rendering of the iŭng (ㅇ) or the heavy initial serifs that mark Hansŏng Tosŏ. These signatures track the printer that set the page, not the publisher that commissioned it.

Advertising & Consumer Culture

The second case study examines the rise of mass advertising in colonial-era newspapers as both a visual and a social phenomenon. Using computer vision, I segment and extract hundreds of thousands of advertisements from newspaper page images. Large language models then classify and annotate them with structured metadata, which makes it possible to treat advertising as quantifiable historical evidence.

The dataset is the point: it lets questions about consumer culture, visual persuasion, and commercial networks be posed across an entire corpus rather than through scattered examples alone.

Positions & Education

Current

  • 2022—

    PhD Candidate — Leiden University

    Digital Humanities & Korean Studies

  • Researcher — Leiden HumAN

    Humanities & AI Network, Leiden University

  • Managing Editor — Korean Histories

    Editorial and production coordination

  • Board Member, Research Planning (연구기획 이사)KADH

    Korean Association for Digital Humanities (한국디지털인문학협의회)

  • Web Developer — AKSE

    Full-stack; Association for Korean Studies in Europe

Previous

Education

  • 2021–22

    M.A. East Asian Studies (Korea specialization)

    Leiden University

  • 2019–21

    B.A. Korean Studies

    Leiden University

Grants & fellowships

  • 2026

    Overseas Korean Studies Grant

    Academy of Korean Studies (한국학중앙연구원) · KS-2026-R-094 · with Steven Denney (PI)

  • 2024–25

    Junior Research Fellowship

    Kyujanggak Institute for Korean Studies, Seoul National University

Teaching