Crowd-sourced Chinese genealogies as data for demographic and economic history

CEPH-affiliated Author(s):

Abstract: This paper evaluates the usefulness of crowd-sourced Chinese genealogical data for quantitative research in demography and economic history. I first examine whether genealogies — despite well-known selection biases — produce demographic patterns consistent with established historical knowledge of China. Comparisons with existing studies show that aggregate population-growth trends and sex ratios over time align reasonably well with established demographic and historical findings, suggesting that genealogies, though selective, capture coherent and interpretable patterns. Building on these plausibility checks, the paper argues that the main value of genealogical data lies in their scalability and temporal depth, particularly as crowd-sourced digitization vastly expands the number of available records. These features make genealogies well suited to analyses that leverage variation across regions and over time, an approach that is central in modern economic history.

Keywords: Crowd-sourced genealogies, China, migration, sex ratios

JEL classification: J11, J13, N1, N35

Cite this article: Melanie Meng Xue, Crowd-sourced Chinese genealogies as data for demographic and economic history, Explorations in Economic History, Volume 99, 2026, 101734, ISSN 0014-4983, https://doi.org/10.1016/j.eeh.2025.101734