Pengxiang Cheng

I am a research scientist and team lead in the AI group at Bloomberg, where I manage the Core NLP team, focusing on developing libraries and frameworks for NLP and LLM applications and training foundation models with financial domain knowledge.

Prior to joining Bloomberg, I obtained my Ph.D. in Computer Science at UT Austin, working on natural language understanding and computational semantics with Dr. Katrin Erk. I completed my undergraduate studies at Tsinghua University, majoring in Automation and Economics.

Publications

An Alternative to FLOPS Regularization to Effectively Productionize SPLADE-doc.

Aldo Porco, Dhruv Mehra, Igor Malioutov, Karthik Radhakrishnan, Moniba Keymanesh, Daniel Preoțiuc-Pietro, Sean MacAvaney, and Pengxiang Cheng. SIGIR, 2025.

Details

Unsupervised Contrast-Consistent Ranking with Language Models.

Niklas Stoehr, Pengxiang Cheng, Jing Wang, Daniel Preoțiuc-Pietro, and Rajarshi Bhowmik. EACL, 2024.

Details

Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning.

Genta Winata, Lingjue Xie, Karthik Radhakrishnan, Shijie Wu, Xisen Jin, Pengxiang Cheng, Mayank Kulkarni, and Daniel Preoțiuc-Pietro. ACL Findings, 2023.

Details

Dataless Knowledge Fusion by Merging Weights of Language Models.

Xisen Jin, Xiang Ren, Daniel Preoțiuc-Pietro, and Pengxiang Cheng. ICLR, 2023.

Code Details

Attending to Entities for Better Text Understanding.

Pengxiang Cheng and Katrin Erk. AAAI, 2020.

Poster Details

The UTexas System for TAC 2019 SM-KBP Task 3: Hypothesis Detection with Graph Convolutional Networks.

Pengxiang Cheng, Alex Tomkovich, Eric Holgate, Su Wang, and Katrin Erk. TAC, 2019.

Poster

Implicit Argument Prediction as Reading Comprehension.

Pengxiang Cheng and Katrin Erk. AAAI, 2019.