Tianxin Xie

I am a PhD student in the AI Thrust, Information Hub, at the Hong Kong University of Science and Technology (Guangzhou), advised by Prof. Li Liu. I received my B.E. degree in Computer Science from Hunan Normal University and my M.S. degree in Computer Science from the University of Chinese Academy of Sciences, where I was advised by Prof. Hu Han.

My research focuses on text-to-speech, speech LLMs, audio generation and reasoning, and multi-modal learning.

Email: txie151[at]connect.hkust-gz.edu.cn / tianxin.xie[at]outlook.com

Google Scholar / ORCID / GitHub

news

Apr 07, 2026 We have released PhyAVBench, a text-to-audio-video (T2AV) benchmark that contains 11,605 newly recorded videos. The dataset will soon be released to the public.
Jan 27, 2026 One paper accepted by ICLR 2026.
Nov 10, 2025 I have started my internship at Tencent AI Lab.
Nov 08, 2025 One paper accepted by AAAI 2026.
Aug 21, 2025 One paper accepted by EMNLP 2025.
Dec 21, 2024 One paper accepted by ICASSP 2025.
Dec 13, 2024 One paper accepted by TPAMI.
Dec 09, 2024 We released a comprehensive survey of controllable speech synthesis on arXiv.
Jul 01, 2024 I graduated from the Institute of Computing Technology, Chinese Academy of Sciences, and will be pursuing my PhD at HKUST (GZ).

selected publications

  1. arXiv
    PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation
    Tianxin Xie, Wentao Lei, Kai Jiang, and 8 more authors
    arXiv preprint arXiv:2512.23994, 2025
  2. EMNLP 2025
    Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey
    Tianxin Xie, Yan Rong, Pengfei Zhang, and 2 more authors
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
  3. arXiv
    EmoSteer-TTS: Fine-Grained and Training-Free Emotion-Controllable Text-to-Speech via Activation Steering
    Tianxin Xie, Shan Yang, Chenxing Li, and 2 more authors
    arXiv preprint arXiv:2508.03543, 2025
  4. ICASSP 2025
    Inter- and Intra-Sentence Cuer-Invariant Representation Learning for Generalizable Cued Speech Recognition
    Tianxin Xie and Li Liu
    In Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  5. TPAMI
    Natural Adversarial Mask for Face Identity Protection in Physical World
    Tianxin Xie, Hu Han, Shiguang Shan, and 1 more author
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025