publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. ICML 2026
    AG-REPA: Causal Layer Selection for Representation Alignment in Audio Flow Matching
    Pengfei ZHANG, Tianxin Xie, Minghao Yang, and Li Liu
    In The Forty-Third International Conference on Machine Learning, 2026
  2. ICLR 2026
    Resp-Agent: An Agent-Based System for Multimodal Respiratory Sound Generation and Disease Diagnosis
    Pengfei ZHANG, Tianxin Xie, Minghao Yang, and Li Liu
    In The Fourteenth International Conference on Learning Representations, 2026
  3. AAAI 2026
    Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards
    Linghan Fang, Tianxin Xie, and Li Liu
    Proceedings of the AAAI Conference on Artificial Intelligence, Mar 2026

2025

  1. arXiv
    PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation
    Tianxin Xie, Wentao Lei, Kai Jiang, Guanjie Huang, Pengfei Zhang, Chunhui Zhang, Fengji Ma, Haoyu He, Han Zhang, Jiangshan He, Jinting Wang, Linghan Fang, Lufei Gao, Orkesh Ablet, Peihua Zhang, Ruolin Hu, Shengyu Li, Weilin Lin, Xiaoyang Feng, Xinyue Yang, Yan Rong, Yanyun Wang, Zihang Shao, Zelin Zhao, Chenxing Li, Shan Yang, Wenfu Wang, Meng Yu, Dong Yu, and Li Liu
    arXiv preprint arXiv:2512.23994, 2025
  2. EMNLP 2025
    Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey
    Tianxin Xie, Yan Rong, Pengfei Zhang, Wenwu Wang, and Li Liu
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
  3. arXiv
    EmoSteer-TTS: Fine-Grained and Training-Free Emotion-Controllable Text-to-Speech via Activation Steering
    Tianxin Xie, Shan Yang, Chenxing Li, Dong Yu, and Li Liu
    arXiv preprint arXiv:2508.03543, 2025
  4. ICASSP 2025
    Inter- and Intra-Sentence Cuer-Invariant Representation Learning for Generalizable Cued Speech Recognition
    Tianxin Xie and Li Liu
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  5. TPAMI
    Natural Adversarial Mask for Face Identity Protection in Physical World
    Tianxin Xie, Hu Han, Shiguang Shan, and Xilin Chen
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025