Physically plausible object pose refinement in cluttered scenes

  • Estimating the 6-DoF pose of objects from images is a fundamental task in computer vision and a prerequisite for downstream tasks like augmented reality or robotic grasping applications. This task becomes particularly challenging in cluttered scenes, when many objects are present in the image in close proximity and occlude one another. However, the close proximity between objects also provides additional cues about the objects, as objects in physically plausible scenes do not intersect one another and thus occluding objects constrain the ones they occlude. We present a novel approach for utilizing this information in 6-DoF object pose refinement of known objects. Our formulation extends RAFT-based pose refinement to reduce penetrations between objects to a large degree and leads to more plausible object poses with less penetrations. We evaluate our approach quantitatively and qualitatively on two benchmark datasets, demonstrate improvements over baselines, and will make the source codeEstimating the 6-DoF pose of objects from images is a fundamental task in computer vision and a prerequisite for downstream tasks like augmented reality or robotic grasping applications. This task becomes particularly challenging in cluttered scenes, when many objects are present in the image in close proximity and occlude one another. However, the close proximity between objects also provides additional cues about the objects, as objects in physically plausible scenes do not intersect one another and thus occluding objects constrain the ones they occlude. We present a novel approach for utilizing this information in 6-DoF object pose refinement of known objects. Our formulation extends RAFT-based pose refinement to reduce penetrations between objects to a large degree and leads to more plausible object poses with less penetrations. We evaluate our approach quantitatively and qualitatively on two benchmark datasets, demonstrate improvements over baselines, and will make the source code of our approach publicly available to foster future research in this area.show moreshow less

Export metadata

Statistics

Number of document requests

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Michael Strecke, Joerg StuecklerGND
Frontdoor URLhttps://opus.bibliothek.uni-augsburg.de/opus4/127301
ISBN:9783031851865OPAC
ISBN:9783031851872OPAC
ISSN:0302-9743OPAC
ISSN:1611-3349OPAC
Parent Title (English):Pattern Recognition: 46th DAGM German Conference, DAGM GCPR 2024, Munich, Germany, September 10–13, 2024, proceedings, part II
Publisher:Springer
Place of publication:Cham
Editor:Daniel Cremers, Zorah Lähner, Michael Moeller, Matthias Nießner, Björn Ommer, Rudolph Triebel
Type:Conference Proceeding
Language:English
Year of first Publication:2025
Publishing Institution:Universität Augsburg
Release Date:2026/01/12
First Page:218
Last Page:233
Series:Lecture Notes in Computer Science ; 15298
DOI:https://doi.org/10.1007/978-3-031-85187-2_14
Institutes:Fakultät für Angewandte Informatik
Fakultät für Angewandte Informatik / Institut für Informatik
Fakultät für Angewandte Informatik / Institut für Informatik / Professur für Intelligente Perzeption in technischen Systemen
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik