Long Zhao
Research Scientist @ Google DeepMind

Brief Bio. I am a Senior Research Scientist at Google DeepMind. Before joining Google, I obtained my Ph.D. in Computer Science from Rutgers University in 2022, advised by Distinguished Professor Dimitris N. Metaxas.

My current research interests lie primarily in (1) large vision foundation models (e.g., video-language models, multimodal models, world models), (2) generative models (e.g., diffusion models, visual autoregressive models), (3) self-supervised representation learning (e.g., contrastive learning, mask modeling), and (4) contextualized machine perception (e.g., recognition, detection, segmentation, localization).

340 Main Street
Los Angeles, CA 90291
Google LAX
Email: longzh [at] google [dot] com
[Curriculum Vitae]

Background

Education

Sept. 2016 - Jan. 2022
Rutgers, The State University of New Jersey - New Brunswick. Piscataway, NJ
Ph.D. in Computer Science
Sept. 2012 - Jun. 2015
Tongji University. Shanghai, China
M.S. in Software Engineering
Sept. 2008 - Jul. 2012
Tongji University. Shanghai, China
B.Eng. in Software Engineering
  • GPA: 4.56/5.0, which is equivalent to 90.6 on 100 basis

Experience

May 2024 - Present
Applied Research, Google DeepMind. Los Angeles, CA, USA
Research Scientist.
  • Video Understanding (e.g., Multimodal Models, World Models)
  • Video Generation (e.g., Diffusion Models, Visual Autoregressive Models)
  • Human-Centric Perception (e.g., Recognition, Detection)
Nov. 2021 - May 2024
Perception Team, Google Research. Los Angeles, CA, USA
Research Scientist.
Dec. 2020 - May 2021
Brain Team, Google Research. Mountain View, CA, USA
Student Researcher. Host: Dr. Han Zhang
  • Boosting Transformers for High-Resolution Image Generation [NeurIPS'21]
  • Improving Efficiency and Interpretability for Vision Transformers [AAAI'22 (Oral)]
May 2020 - Dec. 2020
Mobile Vision Team, Google Research. Los Angeles, CA, USA
Research Intern & Student Researcher. Host: Dr. Ting Liu
  • View-Disentangled Human Pose Representation Learning [CVPR'21 (Oral)]
  • View-Invariant, Occlusion-Robust Probabilistic Pose Embedding [IJCV'21]
Sept. 2016 - Present
Computer Science Department, Rutgers University. Piscataway, NJ, USA
Research Assistant. Supervised by Prof. Dimitris N. Metaxas
Dec. 2013 - Nov. 2014
Visual Computing Group, Microsoft Research Asia (MSRA). Beijing, China
Research Intern. Mentor: Dr. Yichen Wei
  • Generic Object Proposal Generation [CVPR'15]
  • Salient Object Detection [ACCV'14]
  • Won the award of excellence in "MSRA Stars of Tomorrow Internship Program"

Selected Publications

(* indicates equal contributions. Please check Google Scholar for the full list of my publications.)