CV

Research Interests

  • Visual Document Retrieval: improving retrieval performance through visual perception and reasoning, and reducing storage costs via token compression.
  • Reinforcement Learning for Multimodal Large Language Models: aligning visual and textual representations through reinforcement learning to improve multimodal understanding and generation.

Education

University of Science and Technology of China (USTC) Hefei, China
M.Eng. in Electronic and Information Engineering Sept. 2025 - Jun. 2028
Zhejiang Gongshang University Hangzhou, China
B.Sc. in Software Engineering Sept. 2020 - Jun. 2024
  • Honors: Zhejiang Provincial Government Scholarship
  • Research Focus: Dual Self-Attention Mechanisms
  • Supervisor: Prof. Hua Zhang

Internship

Douyin Research Lab, ByteDance Shenzhen, China
Algorithm Research Intern Mar. 2026 - Present
  • Research Focus: Agents

Skills

  • Programming & Tools: Python, Java, PyTorch, Linux, Git
  • Languages: Chinese