We are seeking self-motivated PhD students or outstanding master students for research internships focused on Foundation Model Architecture. Feel free to contact us.

About Me

I am an Artificial Intelligence Research Fellow with DeepLearning Lab at Ant Research Institute, which is under the leadership of CTO Zhengyu He. My research benefits from collaboration with esteemed colleagues including Dr. Jianguo Li, Dr. Yaohui Li, Dr. Zhangxuan Gu, Dr. Yan Hong, and Scientist Zhuoer Xu.

I pursued my M.E. in Automation and Artificial Intelligence Group at Nanjing University (NJU), where I was mentored by Prof. Chunlin Chen and Prof. Huaxiong Li. My B.E. was obtained at Southeast University (SEU). Currently, I focus on Foundation model architecture and Representation learning.

News

  • [Feb. 2026] One paper on "test-time scaling UniDLLM" is accepted to CVPR 2026.
  • [Jan. 2026] One paper on "Reasoning LLM" is accepted to ICLR 2026.
  • [Aug. 2025] One paper on "Test-time adaptation" is accepted to EMNLP 2025.
  • [July 2025] Two papers accepted to ACM MM 2025.
  • [July 2025] One paper accepted to IEEE TCSVT.
  • [May 2025] One paper on "Vision mamba" is accepted to ICML 2025.
  • [Mar. 2025] Joined the DeepLearning Lab at Ant Research Institute for AGI research.
  • [Feb. 2025] One paper accepted to CVPR 2025.
  • [Dec. 2024] One paper accepted to AAAI 2025 as Oral.
  • [July 2024] One paper accepted to Pattern Recognition.
  • [July 2024] Showcased at WAIC.
  • [July 2024] One paper accepted to ECCV 2024.
  • [Dec. 2023] Two papers accepted to ICASSP 2024.
  • [Sep. 2023] One paper accepted to NeurIPS 2023.
  • [Aug. 2023] Won 2nd place (2/717) in AFAC Competition.
  • [July 2023] One paper accepted to ACM MM 2023 as Oral.
  • [April 2023] One paper accepted to ICML 2023.
  • [Mar. 2023] Won 3rd place (3/1267) in ICDAR Competition.
  • [Feb. 2023] One paper accepted to CVPR 2023.
  • [Jan. 2023] One paper accepted to SCIS 2023.
  • [July 2022] One paper accepted to ACM MM 2022.

Research Interest

I work in the field of few-shot learning, image generation, self-supervised learning, computer vision and machine learning. Currently, I focus on the following research topics:

Foundation Model Architecture

Exploring next-generation large language models (LLMs) and multimodal large models (MLLMs) with enhanced efficiency and unified architectures. His work investigates architectural innovations that enable parameter-efficient scaling, dynamic computation, and cross-modal unification while maintaining strong generalization capabilities.

Representation Learning

Crafting highly transferable representations — compact yet expressive abstractions that leap across tasks, modalities, and datasets with minimal adaptation. By unifying self-supervised objectives and multimodal fusion, he seeks representations that encode universal concepts, enabling rapid zero-shot or few-shot mastery of new domains.

Experiences

AI Researcher | AGI Center, Ant Research Institute

Mar 2025 – Present

AI Researcher | Tiansuan Lab, Ant Group

May 2022 – Mar 2025

Master Student | Nanjing University

Sep 2020 – June 2023. Advisor: Prof. Chunlin Chen and Prof. Huaxiong Li

Undergraduate Student | Southeast University

Sep 2016 – June 2020

Selected Publications Google Scholar DBLP

Tech Report Publications

GroveMoE
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts
Haoyuan Wu#, Haoxing Chen#*, Xiaodong Chen#, Zhanchao Zhou#, Tieyuan Chen#, et al.
arXiv 2508.07785, 2025.
MultiEdit
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks
Mingsong Li, Lin Liu, Hongjun Wang, Haoxing Chen, et al.
arXiv 2509.14638, 2025.
Lumina-DiMOO
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
Yi Xin, Qi Qin, ... Haoxing Chen, ... et al.
arXiv 2510.06308, 2025.

Top Conference / Journal Publications

dMLLM-TTS
dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Yi Xin, Siqi Luo, Qi Qin, Haoxing Chen, et al.
CVPR 2026 CCF-A
DND
DND: Boosting Large Language Models with Dynamic Nested Depth
Tieyuan Chen, Xiaodong Chen, Haoxing Chen, Zhenzhong Lan, Weiyao Lin, Jianguo Li.
ICLR 2026
EMNLP
Dynamic Model-bank Test-time Adaptation for Automatic Speech Recognition
Yanshuo Wang, Yanghao Zhou, Yukang Lin, Haoxing Chen, et al.
EMNLP 2025 CCF-B
Explainable AIGC
Towards Explainable Fake Image Detection with Multi-Modal Large Language Models
Yikun Ji, Yan Hong, Jiahui Zhan, Haoxing Chen, et al.
ACM MM 2025 CCF-A
InterAnimate
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation
Yukang Lin, Yan Hong, ... Haoxing Chen, et al.
ACM MM 2025 CCF-A
CPR
Conditional Prototype Rectification Prompt Learning
Haoxing Chen, Yaohui Li, Zizheng Huang, et al.
IEEE TCSVT 2025 CCF-B IF: 11.1
ShuffleMamba
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training
Zizheng Huang, Haoxing Chen, et al.
ICML 2025 CCF-A
MSTA
Efficient Transfer Learning for Video-language Foundation Models
Haoxing Chen, Zizheng Huang, Yan Hong, et al.
CVPR 2025 CCF-A
WildFake
WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection
Yan Hong, Jianming Feng, Haoxing Chen, et al.
AAAI 2025 CCF-A Oral
PR2024
Learning Latent Distangled Embeddings and Graphs for Multi-view Clustering
Chao Zhang, Haoxing Chen, Huaxiong Li, Chunlin Chen.
Pattern Recognition 2024 CCF-B IF: 7.5
ComFusion
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image
Yan Hong, Yuxuan Duan, Bo Zhang, Haoxing Chen, et al.
ECCV 2024 CCF-B
SRIN
Segment Anything Model Meets Image Harmonization
Haoxing Chen, Yaohui Li, Zhangxuan Gu, et al.
ICASSP 2024 CCF-B
DiffusionInst
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu, Haoxing Chen, et al.
ICASSP 2024 CCF-B Oral
DiffUTE
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen, Zhuoer Xu, Zhangxuan Gu, et al.
NeurIPS 2023 CCF-A
HDNet
Hierarchical Dynamic Image Harmonization
Haoxing Chen, Zhangxuan Gu, Yaohui Li, et al.
ACM MM 2023 CCF-A Oral
MACL
Model-Aware Contrastive Learning: Towards Escaping the Dilemmas
Zizheng Huang#, Haoxing Chen#*, et al.
ICML 2023 CCF-A
SSFormer
Sparse Spatial Transformers for Few-Shot Learning
Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen.
Sci. China Inf. Sci. 2023 CCF-A IF: 8.8
APT
Mobile User Interface Element Detection Via Adaptively Prompt Tuning
Zhangxuan Gu, Zhuoer Xu, Haoxing Chen, et al.
CVPR 2023 CCF-A
TAPP
Transductive Aesthetic Preference Propagation for Personalized Image Aesthetics Assessment
Yaohui Li, Yuzhe Yang, Huaxiong Li, Haoxing Chen, et al.
ACM MM 2022 CCF-A

Awards

  • 2023, 2nd place (2/717) in AFAC Financial Data Verification Competition.
  • 2023, Nanjing University (NJU) Outstanding Graduates.
  • 2023, 3rd place (3/1267) in ICDAR Detecting Tampered Text in Images Competition.
  • 2022, Chinese National Scholarship.
  • 2019, Meritorious Prize in the Mathematical Contest In Modeling (MCM).
  • 2018, First Prize of Jiangsu Province in the National Mathematical Modelling Competition.
  • 2018, National Special Award of the 8th Education Robot Competition Of China (ERCC).

Services

  • ICLR'25, ICCV'25, CVPR'25, ICME'25, ICML'24/25, NeurIPS'24, WACV'24, ACM MM'23/24, AAAI'23/25, PAKDD'22, ICPR'22, Reviewer
  • IEEE Trans on TIP/TCYB/TMM/TNNLS/TCSVT, Reviewer