Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+
Published in Language Resources and Evaluation Conference (LREC) 2026, 2026
This paper extends URIEL+ by adding script vectors, integrating Glottolog to expand language coverage, and broadening lineage-based imputation. These additions reduce sparsity, increase language coverage, and make URIEL+ more complete for multilingual and low-resource language research.
Recommended citation: Mason Shipton, York Hay Ng, Aditya Khan, Phuong Hanh Hoang, Xiang Lu, A. Seza Dogruoz, and En-Shiun Annie Lee. 2026. Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+. In Proceedings of the Language Resources and Evaluation Conference (LREC) 2026.
Download Paper
