Pose estimation of primitive-shaped objects from a depth image using superquadric representation

Ryo Hachiuma, Hideo Saito

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)


This paper presents a method for estimating the six Degrees of Freedom (6DoF) pose of texture-less primitive-shaped objects from depth images. As the conventional methods for object pose estimation require rich texture or geometric features to the target objects, these methods are not suitable for texture-less and geometrically simple shaped objects. In order to estimate the pose of the primitive-shaped object, the parameters that represent primitive shapes are estimated. However, these methods explicitly limit the number of types of primitive shapes that can be estimated. We employ superquadrics as a primitive shape representation that can represent various types of primitive shapes with only a few parameters. In order to estimate the superquadric parameters of primitive-shaped objects, the point cloud of the object must be segmented from a depth image. It is known that the parameter estimation is sensitive to outliers, which are caused by the miss-segmentation of the depth image. Therefore, we propose a novel estimation method for superquadric parameters that are robust to outliers. In the experiment, we constructed a dataset in which the person grasps and moves the primitive-shaped objects. The experimental results show that our estimation method outperformed three conventional methods and the baseline method.

Original languageEnglish
Article number2927
JournalApplied Sciences (Switzerland)
Issue number16
Publication statusPublished - 2020 Aug


  • Depth image
  • Pose estimation
  • Primitive shapes
  • Superquadrics

ASJC Scopus subject areas

  • Materials Science(all)
  • Instrumentation
  • Engineering(all)
  • Process Chemistry and Technology
  • Computer Science Applications
  • Fluid Flow and Transfer Processes


Dive into the research topics of 'Pose estimation of primitive-shaped objects from a depth image using superquadric representation'. Together they form a unique fingerprint.

Cite this