About Me
I am currently a Ph.D. student in the Department of Electrical and Computer Engineering at the University of Toronto supervised by Prof. Baochun Li. Prior to starting my Ph.D., I worked as a research assistant in Professor Chau Yuen’s Lab. I received my B.Sc. and M.S. degrees in Information and Communication Engineering from Xidian University, Xi’an, China. My research focuses on distributed computing, multimodal machine learning, and decision-making, as well as on the intersections among these fields.
Research Focus Currently, my research focuses on enhancing the reasoning capabilities of large language models (LLMs) and vision-language models (VLMs) through prompting, fine-tuning, and post-training methods. One of my primary research projects involves integrating reinforcement learning (RL) and multi-agent RL algorithms to develop advanced reasoning-oriented LLMs and VLMs.
Research Application I am exploring how to apply LLMs and VLMs to address practical scientific problems in telecommunications. As an initial exploration, I have organized a dataset aimed at reliable multimodal process supervision for VLMs in telecommunications.
Academic Projects
- dmmrl: A lightweight platform for developing and evaluating RL-driven reasoning mechanisms in LLMs and VLMs.
- llmpebase: A platform for developing reasoning mechanisms in large language models.
- vggbase: A platform for designing and assessing image-text alignment methods.
- personalized fl: Implementations of diverse personalized federated learning (FL) methods.
- ssl fl: Implementations of self-supervised learning methods within the federated paradigm.
Research Highlights
- Perception: capturing explicit relationships for visual-language data in the world.
- Reasoning: building sequential decision-making algorithms for reliable problem-solving.
- Efficiency: enabling faster and secure perception and reasoning in large-scale networks.
- AI for Science: applying generative models to practical problems in telecommunications.