Yunzhi Lin

Machine Learning Engineer

TikTok

About

I am a Machine Learning Engineer at TikTok, focusing on large-scale video understanding and live recommendation system [View my CV].

Previously, I earned my Ph.D. in Electrical and Computer Engineering from the Institute for Robotics and Intelligent Machines at Georgia Tech, under the guidance of Patricio A. Vela. I completed my B.E. in Automation at Southeast University in 2018, where I was advised by Wenze Shao and Yangang Wang.

In 2017, I worked as a research intern at the Applied Nonlinear Control Lab, University of Alberta, advised by Alan Lynch. Later, I joined the NVIDIA Learning and Perception Research group as a research intern from May 2020 to May 2021 and again from May 2022 to December 2022, under the mentorship of Stan Birchfield, while collaborating closely with Jonathan Tremblay, Stephen Tyree, and Bowen Wen. Most recently, I was a research intern with the Meta FAIR Accel Ego-HowTo team from May 2023 to November 2023, where I was advised by Kevin Liang and Matt Feiszli. My past research has focused on deep learning, computer vision, and robotics, with a particular emphasis on 3D perception and robotic system design.

Industry Experience

Machine Learning Engineer, 08/2024 - Present

TikTok
Research Intern in FAIR Accel, 05/2023 - 11/2023

Meta
Research Intern in Learning and Perception Research Group, 05/2022 - 12/2022 & 05/2020 - 05/2021

NVIDIA

Education

Ph.D. in Electrical and Computer Engineering, 2024

Georgia Institute of Technology
M.S. in Electrical and Computer Engineering, 2020

Georgia Institute of Technology
B.E. in Automation, 2018

Southeast University

Featured Publications

Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation

A parallelized optimization method based on fast Neural Radiance Fields (NeRF) for estimating 6-DoF target poses.

Yunzhi Lin, Thomas Müller, Jonathan Tremblay, Bowen Wen, Stephen Tyree, Alex Evans, Patricio A. Vela, Stan Birchfield

ICRA 2023

Preprint Code Project

Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

A single-stage, category-level 6-DoF pose estimation algorithm that simultaneously detects and tracks instances of objects within a known category.

Yunzhi Lin, Jonathan Tremblay, Stephen Tyree, Patricio A. Vela, Stan Birchfield

ICRA 2022

Preprint Code Project

Single-Stage Keypoint-based Category-level Object Pose Estimation from an RGB Image

A single-stage, keypoint-based approach for category-level object pose estimation that operates on unknown object instances within a known category using a single RGB image as input.

Yunzhi Lin, Jonathan Tremblay, Stephen Tyree, Patricio A. Vela, Stan Birchfield

ICRA 2022

Preprint Code Project

Multi-view Fusion for Multi-level Robotic Scene Understanding

A system for multi-level scene awareness for robotic manipulation, including three types of information: 1) a point cloud representation of all the surfaces in the scene, for the purpose of obstacle avoidance. 2) the rough pose of unknown objects from categories corresponding to primitive shapes (e.g., cuboids and cylinders), and 3) full 6-DoF pose of known objects.

Yunzhi Lin, Jonathan Tremblay, Stephen Tyree, Patricio A. Vela, Stan Birchfield

IROS 2021

Preprint PDF Dataset Project Video

Using Synthetic Data and Deep Networks to Recognize Primitive Shapes for Object Grasping

A segmentation-based architecture proposed to decompose objects into multiple primitive shapes from monocular depth input for robotic manipulation.

Yunzhi Lin, Chao Tang, Fujen Chu, Patricio A. Vela

ICRA 2020

Preprint PDF Code Project Video

Yunzhi Lin

Machine Learning Engineer

TikTok

About

Industry Experience

Education

Featured Publications

Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation

Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

Single-Stage Keypoint-based Category-level Object Pose Estimation from an RGB Image

Multi-view Fusion for Multi-level Robotic Scene Understanding

Using Synthetic Data and Deep Networks to Recognize Primitive Shapes for Object Grasping

Contact