These slides:
https://s.ntnu.no/irobot
F for fullscreen
SPACE to advance
ESC to zoom out
Slides progress downward,
then to next column
A MsC by
Peder Bergebakken Sundt
Panda Emika 7-DoF robot arm
With
Intel Realsense 3D vision
A MsC by
Peder Bergebakken Sundt
Affected by
point order,
no topology.
Scales poorly
Either limited
topologically or
self-intersecting
... all map poorly to neural networks!
Explicit 3D shape representations
Points
Voxels
Meshes
A new concept, first explored in 2019
"We have many names for the things we love:"
Park JJ, Florence P, Straub J, Newcombe R, Lovegrove S. DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA: IEEE; 2019, p. 165–74. https://doi.org/10.1109/CVPR.2019.00025.
"We have many names for the things we love:"
Park JJ, Florence P, Straub J, Newcombe R, Lovegrove S. DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA: IEEE; 2019, p. 165–74. https://doi.org/10.1109/CVPR.2019.00025.
A new concept, first explored in 2019
Query
coordinates
Value at
coordinate
Trained on (x,y,r,g,b) tuples
Signed Distance Functions (SDF)
Implicit Isosurface
Metric Constraint
Moving to 3D...
Signed Distance Functions (SDF)
Signed Distance Functions (SDF)
Train it on (x, y) tuples
Signed Distance Functions (SDF)
Marching Cubes
+
Train it on (x, y) tuples
3D models provided by
Signed Distances
Gradients
Positive SDF
Negative SDF
⬤
⬤
Free-space
Near-surface
⬤
⬤
Mesh
Gradients
Positive SDF
Negative SDF
⬤
⬤
Free-space
Near-surface
⬤
⬤
(Cosine similarity)
Mesh
Loss
Signed Distances
Requires us to compute
the derivative of the network itself
ReLU-based
Sinusoidal
SIREN
ReLU-based
Ground-
Truth
ReLU
ReLU
w/gradients
SIREN
w/gradients
Embedding more than one shape
Auto-encoders map poorly
to learning implicit functions...
... and treat the latent vectors as learnable parameters!
-> Just skip the encoder!
Keep a database
of codes per object
Add a regularizing cost to each latent code in the auto-decoder database:
Pulls each code (z) towards 0, and
incentivise a spherical distribution
(Reconstruction loss)
(Latent code regularization)
and generalization.
We need:
We need:
Then we train with random transformations,
and discover the pose via gradient decent at test time
We sample SDF points from two distributions:
The these two distributions
are balanced 90% / 10%
Hit
Miss
Camera
⬤
⬤
⬤
Positive
Negative
⬤
⬤
T-SNE of learned latent vectors
Shape completion:
(start)
(end)
From the YCB data- and object set
Positive SDF
Negative SDF
Unit sphere
Reconstruction
volume
⬤
⬤
⬤
⬤
Ground-
Truth
To be solved...
Questions?
Tips?
https://s.ntnu.no/irobot
This is is the basis for a paper in the works.