CV
Research Interest
- Visual Document Retrieval, focusing on enhancing retrieval performance through visual perception and reasoning capabilities, as well as achieving low storage costs via token compression.
- Reinforcement Learning of Multimodal Large Language Models, focusing on aligning visual and textual representations through reinforcement learning to improve multimodal understanding and generation capabilities.
Education
University of Science and Technology of China (USTC)
Hefei, China
M.Eng. in Electronic and Information Engineering
Sept. 2025 - Jun. 2028
- Research Focus: Visual Document Retrieval, Multimodal Large Language Models
- Supervisors: Prof. Fuli Feng and Prof. Fengbin Zhu(NUS).
Zhejiang Gongshang University
Hangzhou, China
B.Sc. in Software Engineering
Sept. 2020 - Jun. 2024
- Honors: Zhejiang Provincial Government Scholarship
- Research Focus: Dual Self-Attention Mechanisms
- Supervisors: Prof. Hua Zhang
Internship
Douyin Research Lab, ByteDance
Shenzhen, China
Algorithm research intern
Mar. 2026 - Now
- Research Focus: Agent
Skills
- Programming & Tools: Python, Java, PyTorch, Linux, Git
- Languages: Chinese