[CogSci25] Word Embeddings Track Social Group Changes Across 70 Years in China

Abstract

Language encodes societal beliefs about social groups through word patterns. While computational methods like word embeddings enable quantitative analysis of these patterns, studies have primarily examined gradual shifts in Western contexts. We present the first large-scale computational analysis of Chinese state-controlled media (1950-2019) to examine how revolutionary social transformations are reflected in official linguistic representations of social groups. Using diachronic word embeddings at multiple temporal resolutions, we find that Chinese representations differ significantly from Western counterparts, particularly regarding economic status, ethnicity, and gender. These representations show distinct evolutionary dynamics: while stereotypes of ethnicity, age, and body type remain remarkably stable across political upheavals, representations of gender and economic classes undergo dramatic shifts tracking historical transformations. This work advances our understanding of how officially sanctioned discourse encodes social structure through language while highlighting the importance of non-Western perspectives in computational social science.

Publication
In Proceedings of the Annual Meeting of the Cognitive Science Society
Yuxi Ma (Yuki)
Ph.D. '24

My research interests include psychology-inspired AI research to understand and model human behavior and cognition, as well as investigating machine creativity and its applications in art.

Yongqian Peng
Tong Class '21

My research interests include AI + psychology, human-computer interaction, and computer vision.

Yixin Zhu
Assistant Professor

I build humanlike AI.
