Zhengxu Yu's CV
- Phone: +44 7852 446689
- Email: yuzxfred@gmail.com
- Website: zhengxuyu.github.io
- LinkedIn: https://www.linkedin.com/in/yuzhengxu
- GitHub: https://github.com/zhengxuyu
Intro
I am currently a AI Algorithm Expert at Alibaba Group. I obtained my Ph.D. from Zhejiang University in 2021, advised by Prof. Deng Cai and Prof. Xiaofei He. Previously, I obtained my Master’s degree from University of Surrey, advised by Prof. H Lilian Tang.
My research interests focus on Reinforcement Learning, Large language Model, Computer Vision to achieving embodied AGI. My current research goal is to exploring LLM reasoning capability and forming LLM agent to solve general real-world tasks.
Previously, I has published eleven papers on top-tier peer-reviewed Artificial Intelligence international conferences and journals.
Experience
Alibaba Group, Algorithm Expert
- Apr 2021 – present
- Hangzhou, China
- Developing Reinforcement Learning based self-improvement algorithms to train large language models (LLMs) to achieve superhuman performance in various tasks. Achieved state-of-the-art performance in several benchmarks like AIME MATH benchmark within same LLM parameter scale.
- Appling post-trained LLM models to build LLM Agent, and apply in real-world applications, such as Optimization Problem in Operations Research.
- Leading cross-functional teams to deliver algorithm and applications to client.
- Mentoring research interns and junior researchers.
Damo Academy, Alibaba Group, Research Intern
- Jan 2018 – Apr 2021
- Hangzhou, China
- Proposed multi-agent reinforcement learning methods to facilitate the coordination of multiple agents in cooperative and competitive scenarios.
- Proposed several optimization methods to improve the generalization ability of deep neural networks in computer vision tasks.
- Proposed Generative Adversarial Network (GAN) based synthetic data generating model for augmenting training data in computer vision tasks.
- Proposed several Deep Graph Neural Network (GNN) models for stochastic modeling tasks in dynamic systems.
Education
Zhejiang University, Ph.D. in Computer Science
- Sept 2017 – Mar 2021
- Research Interests: Machine Learning, Computer Vision, Generative Model, Data Mining
University of Surrey, M.Sc. in Information Systems
- Sept 2015 – Nov 2016
- Research Interests: Machine Learning, Computer Vision, Data Mining
Jilin University, B.Sc. in Communication Engineering
- Sept 2011 – June 2015
Technologies
- Languages & Technologies: Python, PyTorch, Pandas, LangChain, vllm, ray, deepspeed
Selected Publications
Progressive Transfer Learning (10.1109/TIP.2022.3141258)
- 2022
- Yu, Z., Jin, Z., Wei, L., Huang, J., Cai, D., He, X., Hua, X.S.
- IEEE Transactions on Image Processing (TIP)
Urban Traffic Light Control via Active Multi-agent Communication and Supply-Demand Modeling (10.1109/TKDE.2021.3130258)
- 2021
- Guo, X.+, Yu, Z.+, Wang, P., Jin, Z., Huang, J., Cai, D., He, X., Hua, X.S., (+Co-first author)
- IEEE Transactions on Knowledge and Data Engineering
MaCAR: Urban Traffic Light Control via Active Multi-agent Communication and Action Rectification (10.24963/IJCAI.2020/345)
- 2020
- Yu, Z., Liang, S., Wei, L., Jin, Z., Huang, J., Cai, D., He, X., Hua, X.S.
- IJCAI
Progressive Transfer Learning for Person Re-identification (10.24963/ijcai.2019/586)
- 2019
- Yu, Z., Jin, Z., Wei, L., Guo, J., Huang, J., Cai, D., He, X., Hua, X.S.
- IJCAI-2019