Zhengxu Yu's CV

Phone: +44 7852 446689
Email: yuzxfred@gmail.com
Website: zhengxuyu.github.io
LinkedIn: https://www.linkedin.com/in/yuzhengxu
GitHub: https://github.com/zhengxuyu

Intro

I am currently a AI Algorithm Expert at Alibaba Group. I obtained my Ph.D. from Zhejiang University in 2021, advised by Prof. Deng Cai and Prof. Xiaofei He. Previously, I obtained my Master’s degree from University of Surrey, advised by Prof. H Lilian Tang.

My research interests focus on Reinforcement Learning, Large language Model, Computer Vision to achieving embodied AGI. My current research goal is to exploring LLM reasoning capability and forming LLM agent to solve general real-world tasks.

Previously, I has published eleven papers on top-tier peer-reviewed Artificial Intelligence international conferences and journals.

Experience

Alibaba Group, Algorithm Expert

Apr 2021 – present
Hangzhou, China
Developing Reinforcement Learning based self-improvement algorithms to train large language models (LLMs) to achieve superhuman performance in various tasks. Achieved state-of-the-art performance in several benchmarks like AIME MATH benchmark within same LLM parameter scale.
Appling post-trained LLM models to build LLM Agent, and apply in real-world applications, such as Optimization Problem in Operations Research.
Leading cross-functional teams to deliver algorithm and applications to client.
Mentoring research interns and junior researchers.

Damo Academy, Alibaba Group, Research Intern

Jan 2018 – Apr 2021
Hangzhou, China
Proposed multi-agent reinforcement learning methods to facilitate the coordination of multiple agents in cooperative and competitive scenarios.
Proposed several optimization methods to improve the generalization ability of deep neural networks in computer vision tasks.
Proposed Generative Adversarial Network (GAN) based synthetic data generating model for augmenting training data in computer vision tasks.
Proposed several Deep Graph Neural Network (GNN) models for stochastic modeling tasks in dynamic systems.

Education

Zhejiang University, Ph.D. in Computer Science

Sept 2017 – Mar 2021
Research Interests: Machine Learning, Computer Vision, Generative Model, Data Mining

University of Surrey, M.Sc. in Information Systems

Sept 2015 – Nov 2016
Research Interests: Machine Learning, Computer Vision, Data Mining

Jilin University, B.Sc. in Communication Engineering

Sept 2011 – June 2015

Technologies

Languages & Technologies: Python, PyTorch, Pandas, LangChain, vllm, ray, deepspeed

Selected Publications

Progressive Transfer Learning (10.1109/TIP.2022.3141258)

2022
Yu, Z., Jin, Z., Wei, L., Huang, J., Cai, D., He, X., Hua, X.S.
IEEE Transactions on Image Processing (TIP)

Urban Traffic Light Control via Active Multi-agent Communication and Supply-Demand Modeling (10.1109/TKDE.2021.3130258)

2021
Guo, X.+, Yu, Z.+, Wang, P., Jin, Z., Huang, J., Cai, D., He, X., Hua, X.S., (+Co-first author)
IEEE Transactions on Knowledge and Data Engineering

MaCAR: Urban Traffic Light Control via Active Multi-agent Communication and Action Rectification (10.24963/IJCAI.2020/345)

2020
Yu, Z., Liang, S., Wei, L., Jin, Z., Huang, J., Cai, D., He, X., Hua, X.S.
IJCAI

Progressive Transfer Learning for Person Re-identification (10.24963/ijcai.2019/586)

2019
Yu, Z., Jin, Z., Wei, L., Guo, J., Huang, J., Cai, D., He, X., Hua, X.S.
IJCAI-2019