Yiping Wang 王宜平

alt text 

Yiping Wang
Ph.D student
Paul G. Allen School of Computer Science & Engineering,
University of Washington
Email: ypwang61@cs.washington.edu

Google Scholar / Twitter / Github / LinkedIn

About me

I'm a second-year Ph.D. student in Paul G. Allen School of Computer Science & Engineering from University of Washington. I feel very fortunate to have worked under the guidance of Prof. Simon Shaolei Du since 2022 summer.

My main research interest broadly spread across machine learning theory and foundation models. For the theortical part, I care about understanding the foundations of deep learning and representation learning, especially the training dynamics of the basic components like Transformer. For the empirical part, I am keen on developing efficient algorithms with strong theoretical guarantees or insightful observations. Currently, in this aspect, I'm working on data selection/scheduling for multi-modal pretraining and improving inference efficiency of LLM. I'm also working on some projects related to video generation. In addition, I have always held a strong enthusiasm for understanding the essence of intelligence and exploring the cross-cutting areas of mathematics, physics, and AGI, such as using LLMs for mathematical proof and seeking scientific truth.

I'm grateful to all my collaborators and mentors along the way. I'm priviledged to be working closely with Dr. Yuandong Tian since 2023 spring. Besides, I'm also having intern at Microsoft started from June 2024, fortunate to be advised by Yelong Shen and Shuohang Wang. During my undergraduate, I was fortunate to work closely with Prof. Huaxiu Yao and Prof. Linjun Zhang.

Previously, I studied Computer Science and Mathematics in Zhejiang University, got an honors degree from Chu Kochen Honors College.

News

  • 12/2024: Releasing a new video generation benchmark StoryEval!

  • 12/2024: Attending NeurIPS 2024 in Vancouver and presenting our CLIPLoss paper!

  • 09/2024: Attending MoDL 2024 in New York sponsored by Simons Foundation, and presenting our CLIPLoss poster!

  • 09/2024: Our CLIPLoss paper is accepted by NeurIPS 2024 as spotlight!

  • 06/2024: Started my internship at Microsoft!

  • 01/2024: One paper (JoMA) is accepted by ICLR 2024!

  • 12/2023: Attended NeurIPS 2023 in New Orleans!

  • 09/2023: One paper (Scan&Snap) is accepted by NeurIPS 2023!

  • 09/2023: Become a husky in UW!

Research directions and Selected Papers


Data Selection Algorithm

We studied how to efficiently select data for multimodal pretraining tasks, drawing inspiration from both empirical observations and theoretical insights.

alt text 

CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning
Yiping Wang*, Yifang Chen*, Wendan Yan, Alex Fang, Wenjing Zhou, Kevin Jamieson, Simon Shaolei Du
NeurIPS 2024 (Spotlight)
[Arxiv] [Code] [Poster] [Twitter] [Previous Versions]

tl;dr: We design simple but efficient data selection methods for CLIP pretraining, and get new SOTA in DataComp benchmark.


Video Generation Evaluation

We explore the common issues existing in the current top video generative models.

alt text 

Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
Yiping Wang, Xuehai He, Kuan Wang, Luyao Ma, Jianwei Yang, Shuohang Wang, Simon Shaolei Du, Yelong Shen
preprint
[Arxiv] [Code] [Twitter] [Website]

tl;dr: Current top video generative models can not present multi-event stories like "How to Put an Elephant in a Refrigerator".


Theory of Transformer Dynamics

We attempted to analyze the training dynamics of transformers in a mathematical way.

alt text 

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Yuandong Tian, Yiping Wang, Beidi Chen, Simon Shaolei Du
NeurIPS 2023 (Oral presentation @ ICML2023-HiDL)
[Arxiv] [Poster] [Twitter]

tl;dr: We analyze the 1-layer transformer with next token prediction loss, and rigorously prove its training process.

alt text 

JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention
Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Shaolei Du
ICLR 2024
[Arxiv] [Twitter]

tl;dr: We analyze the training dynamics of multilayer transformer, characterizing the role of self-attention and MLP nonlinearity.