Robust scene interpretation by means of machine vision is a key factor in various new applications in robotics. Part of this problem is the efficient recognition and classification of previously known three-dimensional (3D) shapes in arbitrary scenes. So far, heavily constrained conditions have been utilized, or otherwise solutions have not been achieved in real time.
With the availability of ever faster computers and 3D-sensing technology (real-time stereo processing, laser range-scanner, etc.), more general approaches become feasible. They allow for weaker scene restrictions and hence facilitate new scenarios. Fundamental to visual object recognition are descriptions of general free-form shapes.
In computer graphics, surface meshes are a popular description of free forms. They are also useful for recognition purposes and the Internet makes them accessible to everybody for testing and comparing algorithms. A major drawback, however, is their large memory requirement. Furthermore, surface meshes are defined with respect to a global coordinate system. Thus time consuming registration is necessary to align the object of interest to the frame of the referenced object model before matching is possible. The same problems apply to voxel-based descriptions of shape.
Representations based on superquadrics, generalized cylinders, and splines all suffer from a great sensibility to noise and outliers in the sensed data. A significant effort is required to obtain a robust fit procedure and to select the model order so as to avoid over-fitting.
It is most desirable to develop a shape representation that
We compress the distribution of the 4D features into a histogram. A database of histograms, one per object, is sampled in a training phase. During recognition, sensed surface data, as may be acquired by stereo vision, a laser range-scanner, etc., are processed and compared to the stored histograms. We evaluate the match quality by the following six criteria:
For each match criterion, we have investigated:
The recognition-rate curves are shown in the figure for a database of 20 objects, along with an illustrative example of an object as stored in the database (top left), and the same object degraded by noise (top center) and by missing data (top right).
All experiments show that Kullback-Leibler and likelihood matching yield robust recognition rates and outperform the other criteria. Hence, our novel 4D feature histogram demonstrates high representation capacity for free-form recognition.
Eric Wahl, Ulrich Hillenbrand, and Gerd Hirzinger. Surflet-Pair-Relation Histograms: A Statistical 3D-Shape representation for Rapid Classification. 3-D Digital Imaging and Modeling — 3DIM 2003, IEEE Computer Society Press, pp. 474-481.
Eric Wahl and Gerd Hirzinger. Cluster-Based Point Cloud Analysis for Rapid Scene Interpretation. 27th Annual Meeting of the German Association of Pattern Recognition DAGM 2005, Vienna.
Eric Wahl and Gerd Hirzinger. A Method for Fast Search of Variable Regions on Dynamic 3D Point Clouds. 27th Annual Meeting of the German Association of Pattern Recognition DAGM 2005, Vienna.