Avatar
Weirui Ye
Slow down and do the best better.

About Me

I am a Ph.D student in Institute for Interdisciplinary Information Science (IIIS) at Tsinghua University, advised by Prof. Yang Gao. I received my bachelor degree from the School of Software in Tsinghua University, under the supervision of Prof. Mingsheng Long and Prof. Feng Xu .

My interest lies in sample efficient policy learning, include Model-based RL, large models for decision-making, and robot learning etc.

Publications

    EfficientZero V2: Mastering Discrete and
    Continuous Control with Limited Data

    Shengjie Wang*, Shaohuai Liu*, Weirui Ye*, Jiacheng You, Yang Gao
    arXiv preprint 2024   [Arxiv]



    Seer: Language Instructed Video Prediction
    with Latent Diffusion Models

    Xianfan Gu, Chuan Wen, Weirui Ye, Jiaming Song, Yang Gao
    ICLR 2024   [Arxiv]



    Foundation Reinforcement Learning: towards Embodied
    Generalist Agents with Foundation Prior Assistance

    Weirui Ye, Yunsheng Zhang, Mengchen Wang, Shengjie Wang, Xianfan Gu,
    Pieter Abbeel, Yang Gao
    arXiv preprint 2023   [Arxiv]   [Website]



    Real-time Scheduling of Renewable Power Systems
    through Planning-based Reinforcement Learning

    Shaohuai Liu*, Jinbo Liu*, Weirui Ye, ... , Fangchun Di*, Yang Gao*
    arXiv preprint 2023   [Arxiv]



    Become a Proficient Player with Limited Data
    through Watching Pure Videos

    Weirui Ye*, Yunsheng Zhang*, Pieter Abbeel, Yang Gao
    ICLR 2023   [PDF]



    SpeedyZero: Mastering Atari with
    Limited Data and Time

    Yixuan Mei*, Jiaxuan Gao*, Weirui Ye, Shaohuai Liu, Yang Gao, Yi Wu
    ICLR 2023   [PDF]



    Spending Thinking Time Wisely:
    Accelerating MCTS with Virtual Expansions

    Weirui Ye, Pieter Abbeel, Yang Gao
    NeurIPS 2022   [PDF]



    Planning for Sample Efficient Imitation Learning
    Zhao-Heng Yin, Weirui Ye, Qifeng Chen, Yang Gao
    NeurIPS 2022   [Arxiv]



    Mastering Atari Games with Limited Data
    Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao
    NeurIPS 2021   [Website]   [Arxiv]   [PDF]   [Code]



    Simultaneous Learning of Pivots and Representations for
    Cross-domain Sentiment Classification

    Liang Li, Weirui Ye, Mingsheng Long, Yateng Tang, Jin Xu, Jianmin Wang
    AAAI 2020   [PDF]



    Transferable Attention for Domain Adaptation
    Ximei Wang, Liang Li, Weirui Ye, Mingsheng Long, Jianmin Wang
    AAAI 2019   [PDF]

Experiences

Ph.D. Student of Computer Science
2020 - Present
Second Bachelor of Economics
Bachelor of Software Engineering