publications

2024

  1. arXiv 2024
    LangSAMP: Language-Script Aware Multilingual Pretraining
    Yihong Liu, Haotian Ye, Chunlan Ma, and 2 more authors
    arXiv preprint arXiv:2409.18199, 2024
  2. COLING 2025
    How Transliterations Improve Crosslingual Alignment
    Yihong Liu, Mingyang Wang, Amir Hossein Kargaran, and 6 more authors
    arXiv preprint arXiv:2409.17326, 2024
  3. EMNLP 2024
    SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation
    Raoyuan Zhao, Abdullatif Köksal, Yihong Liu, and 3 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  4. arXiv 2024
    Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts
    Chunlan Ma, Yihong Liu, Haotian Ye, and 1 more author
    arXiv preprint arXiv:2407.02320, Nov 2024
  5. EMNLP 2024
    Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
    Orgest Xhelili, Yihong Liu, and Hinrich Schuetze
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  6. COLING 2025
    TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
    Yihong Liu, Chunlan Ma, Haotian Ye, and 1 more author
    arXiv preprint arXiv:2405.09913, Nov 2024
  7. ACL 2024
    TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
    Yihong Liu, Chunlan Ma, Haotian Ye, and 1 more author
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  8. Insights 2024
    MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer
    Haotian Ye, Yihong Liu, Chunlan Ma, and 1 more author
    In Proceedings of the Fifth Workshop on Insights from Negative Results in NLP, Jun 2024
  9. NAACL 2024
    OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
    Yihong Liu, Peiqin Lin, Mingyang Wang, and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2024, Jun 2024

2023

  1. EMNLP 2023
    Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs
    Yihong Liu, Haotian Ye, Leonie Weissweiler, and 2 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
  2. arXiv 2023
    A study of conceptual language similarity: comparison and evaluation
    Haotian Ye, Yihong Liu, and Hinrich Schütze
    arXiv preprint arXiv:2305.13401, Dec 2023
  3. ACL 2023
    A Crosslingual Investigation of Conceptualization in 1335 Languages
    Yihong Liu, Haotian Ye, Leonie Weissweiler, and 4 more authors
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
  4. IWSLT 2023
    On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss
    Yihong Liu, Alexandra Chronopoulou, Hinrich Schütze, and 1 more author
    In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), Jul 2023

2022

  1. ACL 2022
    Flow-Adapter Architecture for Unsupervised Machine Translation
    Yihong Liu, Haris Jabbar, and Hinrich Schuetze
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022

2021

  1. Journal
    A label-oriented loss function for learning sentence representations
    Yihong Liu, Wei Guan, Dongxu Lu, and 1 more author
    Computer Speech & Language, May 2021