Haoxing Chen's Homepage

About Me

I am an Artificial Intelligence Research Fellow with the DeepLearning Lab at Ant Research Institute, which is led by CTO Zhengyu He. My research benefits from collaboration with esteemed colleagues including Dr. Jianguo Li, Dr. Yaohui Li, Dr. Zhangxuan Gu, Dr. Yan Hong, and Scientist Zhuoer Xu.

I pursued my M.E. in the Automation and Artificial Intelligence Group at Nanjing University (NJU), where I was mentored by Prof. Chunlin Chen and Prof. Huaxiong Li. I obtained my B.E. from Southeast University (SEU). Currently, I focus on foundation model architecture and representation learning.

News

[April 2026] We released our first dLLM-based unified model, LLaDA2.0-Uni.
[April 2026] One paper on "AI-Generated video detection" is accepted to SCIS 2026.
[Mar. 2026] I am honored to be selected for the Doctoral/Master’s Thesis Incentive Program by the Chinese Institute of Electronics.
[Feb. 2026] One paper on "test-time scaling UniDLLM" is accepted to CVPR 2026.
[Jan. 2026] One paper on "Reasoning LLM" is accepted to ICLR 2026.
[Aug. 2025] One paper on "Test-time adaptation" is accepted to EMNLP 2025.
[July 2025] Two papers accepted to ACM MM 2025.
[July 2025] One paper accepted to IEEE TCSVT.
[May 2025] One paper on "Vision mamba" is accepted to ICML 2025.
[Mar. 2025] Joined the DeepLearning Lab at Ant Research Institute for AGI research.
[Feb. 2025] One paper accepted to CVPR 2025.
[Dec. 2024] One paper accepted to AAAI 2025 as Oral.
[July 2024] One paper accepted to Pattern Recognition.
[July 2024] Showcased at WAIC.
[July 2024] One paper accepted to ECCV 2024.
[Dec. 2023] Two papers accepted to ICASSP 2024.
[Sep. 2023] One paper accepted to NeurIPS 2023.
[Aug. 2023] Won 2nd place (2/717) in AFAC Competition.
[July 2023] One paper accepted to ACM MM 2023 as Oral.
[April 2023] One paper accepted to ICML 2023.
[Mar. 2023] Won 3rd place (3/1267) in ICDAR Competition.
[Feb. 2023] One paper accepted to CVPR 2023.
[Jan. 2023] One paper accepted to SCIS 2023.
[July 2022] One paper accepted to ACM MM 2022.

Research Interests

I work in the field of Large Language Models (LLMs) and Unified Multimodal Large Models (UMMs). Currently, I focus on the following research topics:

Foundation Model Architecture

Exploring next-generation Large Language Models (LLMs) and Multimodal Large Models (MLLMs) with enhanced efficiency and unified architectures. My research focuses on architectural innovations to enable parameter-efficient scaling, dynamic computation, and cross-modal unification, supporting diverse real-world applications.

Cross-Modal Reasoning

Focusing on enhancing cognitive and interactive capabilities in complex scenarios, with an emphasis on Interleaved Reasoning and Interleaved Generation. By strengthening the deep fusion of multi-source information (text, images, etc.), I aim to enable models to perform rigorous logical deduction on mixed-modality inputs and generate high-quality, semantically coherent content with interleaved text and images, achieving advanced intelligent interaction closer to human intuition.

Experiences

AI Researcher | AGI Center, Ant Research Institute

Mar 2025 – Present

AI Researcher | Tiansuan Lab, Ant Group

May 2022 – Mar 2025

Master Student | Nanjing University

Sep 2020 – June 2023. Advisors: Prof. Chunlin Chen and Prof. Huaxiong Li

Undergraduate Student | Southeast University

Sep 2016 – June 2020

Selected Publications (Google Scholar, DBLP)

Tech Report Publications

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Haoxing Chen, Yi Xin, Qi Qin, et al.

arXiv 2604.20796, 2026.

Paper Models Code

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Haoyuan Wu^#, Haoxing Chen^#*, Xiaodong Chen^#, Zhanchao Zhou^#, Tieyuan Chen^#, et al.

arXiv 2508.07785, 2025.

Paper Models Code

MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks

Mingsong Li, Lin Liu, Hongjun Wang, Haoxing Chen, et al.

arXiv 2509.14638, 2025.

Paper Datasets

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Yi Xin, Qi Qin, ... Haoxing Chen, ... et al.

arXiv 2510.06308, 2025.

Paper Models Code

Top Conference / Journal Publications

DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark

Haoxing Chen, Yan Hong, Zizheng Huang, et al.

Sci. China Inf. Sci. 2026 CCF-A IF: 7.6

Paper Code

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Yi Xin, Siqi Luo, Qi Qin, Haoxing Chen, et al.

CVPR 2026 CCF-A

Paper

DND: Boosting Large Language Models with Dynamic Nested Depth

Tieyuan Chen, Xiaodong Chen, Haoxing Chen, Zhenzhong Lan, Weiyao Lin, Jianguo Li.

ICLR 2026

Paper

Dynamic Model-bank Test-time Adaptation for Automatic Speech Recognition

Yanshuo Wang, Yanghao Zhou, Yukang Lin, Haoxing Chen, et al.

EMNLP 2025 CCF-B

Paper

Towards Explainable Fake Image Detection with Multi-Modal Large Language Models

Yikun Ji, Yan Hong, Jiahui Zhan, Haoxing Chen, et al.

ACM MM 2025 CCF-A

Paper

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Yukang Lin, Yan Hong, ... Haoxing Chen, et al.

ACM MM 2025 CCF-A

Paper

Conditional Prototype Rectification Prompt Learning

Haoxing Chen, Yaohui Li, Zizheng Huang, et al.

IEEE TCSVT 2025 CCF-B IF: 11.1

Paper Code

Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training

Zizheng Huang, Haoxing Chen, et al.

ICML 2025 CCF-A

Paper Code

Efficient Transfer Learning for Video-language Foundation Models

Haoxing Chen, Zizheng Huang, Yan Hong, et al.

CVPR 2025 CCF-A

Paper

WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection

Yan Hong, Jianming Feng, Haoxing Chen, et al.

AAAI 2025 CCF-A Oral

Data

Learning Latent Disentangled Embeddings and Graphs for Multi-view Clustering

Chao Zhang, Haoxing Chen, Huaxiong Li, Chunlin Chen.

Pattern Recognition 2024 CCF-B IF: 7.5

Paper

ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image

Yan Hong, Yuxuan Duan, Bo Zhang, Haoxing Chen, et al.

ECCV 2024 CCF-B

Segment Anything Model Meets Image Harmonization

Haoxing Chen, Yaohui Li, Zhangxuan Gu, et al.

ICASSP 2024 CCF-B

Paper arXiv BibTeX

DiffusionInst: Diffusion Model for Instance Segmentation

Zhangxuan Gu, Haoxing Chen, et al.

ICASSP 2024 CCF-B Oral

Paper arXiv Code Code (Ant)

DiffUTE: Universal Text Editing Diffusion Model

Haoxing Chen, Zhuoer Xu, Zhangxuan Gu, et al.

NeurIPS 2023 CCF-A

Paper arXiv Code Video

Hierarchical Dynamic Image Harmonization

Haoxing Chen, Zhangxuan Gu, Yaohui Li, et al.

ACM MM 2023 CCF-A Oral

Paper arXiv Code

Model-Aware Contrastive Learning: Towards Escaping the Dilemmas

Zizheng Huang^#, Haoxing Chen^#*, et al.

ICML 2023 CCF-A

Paper arXiv Code

Sparse Spatial Transformers for Few-Shot Learning

Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen.

Sci. China Inf. Sci. 2023 CCF-A IF: 8.8

Paper SCIS Link Code

Mobile User Interface Element Detection Via Adaptively Prompt Tuning

Zhangxuan Gu, Zhuoer Xu, Haoxing Chen, et al.

CVPR 2023 CCF-A

Paper Code

Transductive Aesthetic Preference Propagation for Personalized Image Aesthetics Assessment

Yaohui Li, Yuzhe Yang, Huaxiong Li, Haoxing Chen, et al.

ACM MM 2022 CCF-A

Paper BibTeX

Awards

2026, Doctoral/Master’s Thesis Incentive Program by the Chinese Institute of Electronics.
2023, 2nd place (2/717) in AFAC Financial Data Verification Competition.
2023, Nanjing University (NJU) Outstanding Graduates.
2023, 3rd place (3/1267) in ICDAR Detecting Tampered Text in Images Competition.
2022, Chinese National Scholarship.
2019, Meritorious Prize in the Mathematical Contest in Modeling (MCM).
2018, First Prize of Jiangsu Province in the National Mathematical Modelling Competition.
2018, National Special Award of the 8th Education Robot Competition of China (ERCC).

Services

ICLR'25, ICCV'25, CVPR'25, ICME'25, ICML'24/25, NeurIPS'24, WACV'24, ACM MM'23/24, AAAI'23/25, PAKDD'22, ICPR'22, Reviewer
IEEE TIP/TCYB/TMM/TNNLS/TCSVT, Reviewer