Robotics

Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation

A parallelized optimization method based on fast Neural Radiance Fields (NeRF) for estimating 6-DoF target poses.

Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

A single-stage, category-level 6-DoF pose estimation algorithm that simultaneously detects and tracks instances of objects within a known category.

SGL: Symbolic Goal Learning in a Hybrid, Modular Framework for Human Instruction Following

This paper investigates robot manipulation based on human instruction with ambiguous requests. The intent is to compensate for imperfect natural language via visual observations. Early symbolic methods, based on manually defined symbols, built …

Single-Stage Keypoint-based Category-level Object Pose Estimation from an RGB Image

A single-stage, keypoint-based approach for category-level object pose estimation that operates on unknown object instances within a known category using a single RGB image as input.

Multi-view Fusion for Multi-level Robotic Scene Understanding

A system for multi-level scene awareness for robotic manipulation, including three types of information: 1) a point cloud representation of all the surfaces in the scene, for the purpose of obstacle avoidance. 2) the rough pose of unknown objects from categories corresponding to primitive shapes (e.g., cuboids and cylinders), and 3) full 6-DoF pose of known objects.

A Joint Network for Grasp Detection Conditioned on Natural Language Commands

We consider the task of grasping a target object based on a natural language command query. Previous work primarily focused on localizing the object given the query, which requires a separate grasp detection module to grasp it. The cascaded …

Using Synthetic Data and Deep Networks to Recognize Primitive Shapes for Object Grasping

A segmentation-based architecture proposed to decompose objects into multiple primitive shapes from monocular depth input for robotic manipulation.