Yanqi Dai's Homepage

About me

Capable and versatile multimodal intelligence.

I am a fourth-year Ph.D. candidate in the Gaoling School of Artificial Intelligence at Renmin University of China, advised by Prof. Zhiwu Lu. I was a visiting student in the College of Computing and Data Science at Nanyang Technological University, advised by Prof. Hanwang Zhang. Before my doctoral studies, I received my B.E. from the School of Software at Dalian University of Technology.

My research interests lie in large multimodal models, reinforcement learning for reasoning, and multi-task learning. Through my research, I aim to develop more capable and versatile large multimodal models.

Large
Multimodal
Models

Data & Task Balancing

RL for Reasoning

Multimodal Role-Playing Agents

Visual Instruction Tuning

Selected papers

Research highlights.

ICLR 2026MathForge

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Yanqi Dai, Yuxiang Ji, Xiao Zhang, Yong Wang*, Xiangxiang Chu, Zhiwu Lu*

Hugging Face Daily Papers #2 of the day.

Paper Code Data News

WWW 2026 · OralVisATB

Adaptive Task Balancing for Visual Instruction Tuning via Inter-Task Contribution and Intra-Task Difficulty

Yanqi Dai, Yong Wang, Zebin You, Dong Jing, Xiangxiang Chu, Zhiwu Lu*

Paper Code News

ICLR 2025MMRole

MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents

Yanqi Dai, Huanran Hu, Lei Wang, Shengjie Jin, Xu Chen*, Zhiwu Lu*

Paper Code Data News

Technical ReportAwaker2.5-VL

Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts

Jinqiang Long†, Yanqi Dai†, Guoxing Yang, Hongpeng Lin, Nanyi Fei, Yizhao Gao, Zhiwu Lu

Paper Code News

UAI 2023IGB

Improvable Gap Balancing for Multi-Task Learning

Yanqi Dai, Nanyi Fei, Zhiwu Lu*

Paper Code

† Equal contribution. * Corresponding author.

Experience

From research labs to foundation-model teams.

May 2026 — Present · Beijing

Algorithm Intern, VLM Pretraining Team in Base Group · Zhipu AI

Mar 2025 — Sep 2025 · Beijing

Research Intern, Machine Learning Team · Amap-Alibaba Group

Jun 2023 — Feb 2025 · Beijing

Algorithm Intern, Model Team · Metabrain AGI

Selected honors & awards

Recognition.

Excellent Scholarship for Postgraduate StudyRenmin University of China · 2025

Excellent Scholarship for Postgraduate StudyRenmin University of China · 2023

Scientific Research FundRenmin University of China · 2023

First-Class Academic Scholarship for Postgraduate StudyRenmin University of China · 2023

Outstanding GraduateLiaoning Province · 2022

Toly Bread Alumni ScholarshipDalian University of Technology · 2020 / 2019

First Prize National Mathematics Competition for College Students2019