Biography

I am currently a second-year master’s student at Tsinghua Shenzhen International Graduate School, Tsinghua University, affiliated with the Intelligent Computing Lab. My supervisor is Prof. Xiu Li, and I have received a lot of research guidance from my senior fellow student Jiafei Lyu. My main research focus on Reinforcement Learning, especially on Agent Exploration, Multi-Agent Reinforcement Learning and RLHF of Large Models. I am proficient and interested in using mathematical theory to optimize reinforcement learning methods. Currently, I am undertaking an internship at Tencent Technology Engineering Group (TEG), Machine Learning Platform Department, researching the effective fine-tuning of Large Language Models using RLHF.

If you believe I am a good fit for your position, please don’t hesitate to contact me at yk22@mails.tsinghua.edu.cn

News

  • 2024.5 The paper “Exploration and Anti-Exploration with Distributional Random Network Distillation” is accepted by ICML 2024

  • 2024.4 The work “BATON: Aligning Text-to-Audio Model with Human Preference Feedback” is accepted by IJCAI 2024.

  • 2024.2 The paper “Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model” is accepted by CVPR 2024

  • 2024.2 The work “BATON: Aligning Text-to-Audio Model with Human Preference Feedback” is on Arxiv.

  • 2024.1 My work “Exploration and Anti-Exploration with Distributional Random Network Distillation” is on Arxiv.

  • 2023.11 The paper “Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model” is selected as HuggingFace daily paper

  • 2023.11 My work “Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model” is on Arxiv.

Publications

Educations

2022-present: Pursuing Master’s Degree in Artificial Intelligence at Tsinghua Shenzhen International Graduate School, Tsinghua University.

2018-2022: Bachelor’s Degree in Automation from the Department of Electronic Information Engineering, Xi’an Jiaotong University.

Honors & Awards

2023.10: AI bank scholarship of Tsinghua University

2022.10: Outstanding graduates of Xi’an Jiaotong University.

2021.10: School-level scholarship of Xi’an Jiaotong University.

2021.10: School-level outstanding student of Xi’an Jiaotong University.

2020.10: School-level scholarship of Xi’an Jiaotong University.

2020.10: School-level outstanding student of Xi’an Jiaotong University.

2019.10: School-level scholarship of Xi’an Jiaotong University.

2019.10: School-level outstanding student of Xi’an Jiaotong University.

Competitions

Math:

  • First Prize in the National Finals of the 13th Chinese Mathematics Competitions (CMC), ranking 7th nationwide.

  • third prize in the National Finals of the 3rd Hua Jiao Cup Mathematics Competition.

  • Second Prize at the provincial level in the 12th Chinese Mathematics Competitions (CMC).

Computer Science:

  • 1st place in the “Gorgewalk_v2” environment of the Tencent Honor of Kings AI Challenge.

  • Top 20 in the “hok_1v1” environment of the Tencent Honor of Kings AI Challenge.

  • Top 20 in the RLChina AI Challenge - RenYin Winter Season.

  • Honorable Mention Award in the 2021 Mathematical Contest in Modeling (MCM).

Internships

Tencent TEG, Machine Learning Platform Department

Reseach on efficient RLHF-based fine-tuning methods for improving LLM performance.

Parametrix Technology Company

Research on how to enhance AIGC performance using RLHF and developing baseline algorithms code for the Kaggle Lux competition.

Teaching

Teaching Assistant of the course Introduction to Reinforcement Learning instructed by Prof. Xiu Li, Spring 2024.