Xiang Li 李希昂

Ph.D. Candidate

Department of Electrical and Computer Engineering,
Carnegie Mellon University,
Pittsburgh, PA.

Email: xl6@andrew.cmu.edu

I am a final-year PhD at Carnegie Mellon University (CMU). I am advised by Prof. Bhiksha Raj. My current research is about audio profiling, image/video synthesis and robust video segmentation. Before that, I spent one year at Microsoft Research, Asia working on video segmentation as a research intern. I received my B.Eng in Electrical and Electronic Engineering from Huazhong University of Science and Technology, where I was advised by Prof. Wei Wang.

I will be on the job market starting from Spring 2025!

News

[06/2024] One first-author paper accepted to ECCV 2024.
[06/2024] One co-author paper accepted to InterSpeech 2024.
[05/2024] Two papers (one first-author) accepted to ICML 2024.
[03/2024] One paper accepted to NAACL 2024.
[02/2024] One first-author paper accepted to CVPR 2024.
[12/2023] One paper accepted to ICASSP 2024.
[10/2023] One first-author paper accepted to EMNLP 2023.
[09/2023] One first-author paper accepted to NeurIPS 2023.
[06/2023] One first-author paper accepted to ICCV 2023.
[06/2023] One first-author paper accepted to ACM MM 2023.

Publications

ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li, Kai Qiu, Hao Chen, Jason Kuen, Zhe Lin, Rita Singh, Bhiksha Raj
preprint
[paper ] [code]

Slight Corruption in Pre-training Data Makes Better Diffusion Models
Hao Chen, Yujin Han, Diganta Misra, Xiang Li, Kai Hu, Difan Zou, Masashi Sugiyama, Jindong Wang, Bhiksha Raj
preprint
[paper ]

Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
Kai Hu, Weichen Yu, Tianjun Yao, Xiang Li, Wenhe Liu, Lijun Yu, Yining Li, Kai Chen, Zhiqiang Shen, Matt Fredrikson
preprint
[paper ]

R^2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj
ECCV 2024
[paper] [code (coming)]

Evaluating and Improving Continual Learning in Spoken Language Understanding
Muqiao Yang, Xiang Li, Umberto Cappellazzo, Shinji Watanabe, Bhiksha Raj
InterSpeech 2024
[paper]

Completing Visual Objects via Bridging Generation and Segmentation
Xiang Li, Yinpeng Chen, Chung-Ching Lin, Rita Singh, Bhiksha Raj, Zicheng Liu
ICML 2024
[paper]

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition
Xiang Li, Jinglu Wang, Xiaohao Xu, Rita Singh, Yan Lu, Bhiksha Raj
CVPR 2024
[paper] [code]

AutoPRM: Self-supervised Fine-grained Feedback for Multi-Step Reasoning via Controllable Question Decomposition
Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao
NAACL 2024 (short version at ICLR 2024 R2-FM workshop)
[paper]

A General Framework for Learning from Weak Supervision
Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj
ICML 2024
[paper]

Customizable Perturbation Synthesis for Robust SLAM Benchmarking
Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang
preprint
[paper] [code]

A Closer Look at Reinforcement Learning-based Automatic Speech Recognition
Fan Yang, Muqiao Yang, Xiang Li, Yuxuan Wu, Zhiyuan Zhao, Bhiksha Raj, Rita Singh
Computer Speech & Language
[paper]

Improving Continual Learning of Acoustic Scene Classification via Mutual Information Optimization
Muqiao Yang, Umberto Cappellazzo, Xiang Li, Shinji Watanabe, Bhiksha Raj
ICASSP 2024
[paper]

Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text
Xiang Li, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Rita Singh, Bhiksha Raj
EMNLP 2023
[paper]

PaintSeg: Training-free Segmentation via Painting
Xiang Li, Chung-Ching Lin, Yinpeng Chen, Jinglu Wang, Zicheng Liu, Bhiksha Raj
NeurIPS 2023
[paper] [code]

Rethinking Voice-Face Correlation: A Geometry View
Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj
ACM Multimedia, 2023
[paper] [code]

Robust Referring Video Object Segmentation with Cyclic Structural Consensus
Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Yan Lu, Bhiksha Raj
ICCV, 2023
[paper] [project page] [code]

The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features
Liao Qu*, Xianwei Zou*, Xiang Li*, Wendong Yan, Rita Singh, Bhiksha Raj
InterSpeech, 2023
[paper] [code]

Self-supervised Multi-Modal Video Forgery Attack Detection
Chenhui Zhao, Xiang Li, Rabhi Younes
WCNC, 2023
[paper] [code]

Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj
AAAI, 2023
[paper] [visualization]

Forgery Attack Detection in Surveillance Video Streams Using Wi-Fi Channel State Information
Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang
IEEE Transactions on Wireless Communication
[paper]

Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation
Xiang Li, Jinglu Wang, Xiao Li, Yan Lu
AAAI, 2022
[paper]

Video Instance Segmentation by Instance Flow Assembly
Xiang Li, Jinglu Wang, Xiao Li, Yan Lu
IEEE Transactions on Multimedia
[paper]

Towards Cross-Modal Forgery Detection and Localization on Live Surveillance Videos
Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang
IEEE INFOCOM, 2021
[paper]

Predicting Spatial Visualization Problems’ Difficulty Level from Eye-Tracking Data
Xiang Li, Rabih Younes, Diana Bairaktarova, Qi Guo
Sensors, 2020
[paper]

ActivityGAN: Generative Adversarial Networks for Data Augmentation in Sensor-Based Human Activity Recognition
Xiang Li, Jinqi Luo, Rabih Younes
DLHAR workshop @ Ubicomp (Best Paper Award), 2020
[paper]

Toward Data Augmentation and Interpretation in Sensor-Based Fine-Grained Hand Activity Recognition
Jinqi Luo, Xiang Li, Rabih Younes
ML4HAR workshop @ IJCAI, 2020
[paper]

Academic Activities

Reviewer

Conference: AAAI, ICCV, ECCV, CVPR, EMNLP, NAACL, ACL, NeurIPS

Journal: TIP
Mentorship

Kai Qiu: MS at CMU (ongoing).

Liao Qu: MS at CMU. Now MLE at Tiktok.

Xianwei Zou: MS at CMU. Now PhD at UC Santa Barbara (UCSB).

Chenhui Zhao: MS at Duke. Now PhD at Duke University.

Zhaorun Chen: MS at Purdue. Now PhD at University of Chicago.

Fan Yang: MS at CMU. Now PhD at Ohio State University (OSU).