A novel neural ray field for
representing 3D shapes
Peder Bergebakken Sundt
Theoharis Theoharis
s.ntnu.no/marf
Lambertian
Single network evaluation per pixel.
(Including the normal)
Major speedup for inverse-rendering, and analysis-by-synthesis (Renders in realtime!)
MARFs improve multi-view consistency.
(Not guaranteed in ray fields)
we also get for essentially free
With our novel shape representation
Local Thickness
Unsupervised
Part Segmentation
"Approximate" Normals
We also get for essentially free:
Topological Skeleton
With our novel shape representation
Local Thickness
Subsurface Scattering
With a backward pass we show
that one can also compute
(i.e. not realtime)
True Normals
Curvature
With a backward pass we show
that one can also compute:
Despite the network
having no second derivative
(i.e. not realtime)
"Approximate" Normals
Curvature
With a backward pass we show
that one can also compute:
Despite the network
having no second derivative
"Approximate" Normals
True Normals
(i.e. not realtime)
Curvature
Anisotropic shading
(Using principal curvatures)
MARFs can represent multiple shapes
in a shared latent space
Latent space traversal
Learned from only 20 shapes
Medial atoms?
Ray Fields?
Neural Fields?
Medial atoms
Ray Fields
Neural Fields
MARFs
discrete 3D shape representations
Let's look at the shortcomings of
No topology,
lossy surface reconstruction.
Scales poorly
Fixed topology
or self-intersecting
... map poorly to neural networks!
Discrete 3D shape representations
Points
Voxels
Meshes
Functions mapping spatiotemporal coordinates to some quantity
... represented with a neural network
Functions mapping spatiotemporal coordinates to some quantity
\(\Rightarrow\) Determine the volume rendering ray integral
Differentiable Ray-Marching
3D Cartesian radiance field
\(\Rightarrow\)
Differentiable Ray-Marching
4 main strategies
4 main strategies
(Offline) Tabulation
Discretization artifacts (aliasing)
Many small networks
Trade compute for memory,
loss of global shape prior
Directly predict
the ray integral
100x speedup,
Learn a distance field
instead of density
10x speedup,
no transparency
No ray marching!
but difficult
Directly predict
the ray integral
100x speedup,
No ray marching!
but difficult
Plücker Coordinate
No singularities
or discontinuities
No singularities
or discontinuities
Plücker Coordinate
Two prior works use this ray encoding
Plücker Coordinate
Intersection Fields
(PRIF)
Networks
Light Field
Primary Ray
Both struggle with discontinuities!
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
B.Y. Feng, Y. Zhang, D. Tang, R. Du, A. Varshney, PRIF: Primary Ray-Based Implicit Function, in: S. Avidan, G. Brostow, M. Cissé, G.M. Farinella, T. Hassner (Eds.), Computer Vision – ECCV 2022, Springer Nature Switzerland, Cham, 2022: pp. 138–155. https://doi.org/10.1007/978-3-031-20062-5_9.
Intersection Fields
(PRIF)
Lipschitz bound!
Fuzzy edges
Requires filtering
Primary Ray
Both struggle with discontinuities!
Networks
Light Field
Intersection Fields
(PRIF)
Networks
Lipschitz bound
Light Field
Primary Ray
Intersection Fields
(PRIF)
Networks
Lipschitz bound
Light Field
Primary Ray
PRIF
Ours
We need to represent 3D shapes
Figures adapted from
D. Rebain, K. Li, V. Sitzmann, S. Yazdani, K.M. Yi, A. Tagliasacchi, Deep Medial Fields, ArXiv:2106.03804 [Cs]. (2021).
We need to represent 3D shapes
as a sum of spheres
Figures adapted from
D. Rebain, K. Li, V. Sitzmann, S. Yazdani, K.M. Yi, A. Tagliasacchi, Deep Medial Fields, ArXiv:2106.03804 [Cs]. (2021).
A. Tagliasacchi, T. Delame, M. Spagnuolo, N. Amenta, A. Telea, 3D Skeletons: A State-of-the-Art Report, Computer Graphics Forum. 35 (2016) 573–597. https://doi.org/10.1111/cgf.12865.
Set of maximally inscribed spheres
Ridges in distance transform
Points with two+ nearest neighbors
Local axis of reflectional symmetry
Defining
A. Tagliasacchi, T. Delame, M. Spagnuolo, N. Amenta, A. Telea, 3D Skeletons: A State-of-the-Art Report, Computer Graphics Forum. 35 (2016) 573–597. https://doi.org/10.1111/cgf.12865.
Set of maximally inscribed spheres
Ridges in distance transform
Points with two+ nearest neighbors
Local axis of reflectional symmetry
Defining
A. Tagliasacchi, T. Delame, M. Spagnuolo, N. Amenta, A. Telea, 3D Skeletons: A State-of-the-Art Report, Computer Graphics Forum. 35 (2016) 573–597. https://doi.org/10.1111/cgf.12865.
Set of maximally inscribed spheres
Ridges in distance transform
Points with two+ nearest neighbors
Local axis of reflectional symmetry
D. Rebain, B. Angles, J. Valentin, N. Vining, J. Peethambaran, S. Izadi, A. Tagliasacchi, LSMAT Least Squares Medial Axis Transform, Computer Graphics Forum. 38 (2019) 5–18. https://doi.org/10.1111/cgf.13599.
Defining
A. Tagliasacchi, T. Delame, M. Spagnuolo, N. Amenta, A. Telea, 3D Skeletons: A State-of-the-Art Report, Computer Graphics Forum. 35 (2016) 573–597. https://doi.org/10.1111/cgf.12865.
Set of maximally inscribed spheres
Ridges in distance transform
Points with two+ nearest neighbors
Local axis of reflectional symmetry
D. Rebain, B. Angles, J. Valentin, N. Vining, J. Peethambaran, S. Izadi, A. Tagliasacchi, LSMAT Least Squares Medial Axis Transform, Computer Graphics Forum. 38 (2019) 5–18. https://doi.org/10.1111/cgf.13599.
Defining
Fit for iteration
Medial Atom
Ray Field
(maximally inscribed sphere)
We propose
The Medial Atom Ray Field
Text
Medial Atom
Ray Field
We propose
The Medial Atom Ray Field
Medial Atom
Ray Field
s
We propose
The Medial Atom Ray Field
Medial Atom
Ray Field
We propose
The Medial Atom Ray Field
Medial Atom
Ray Field
We propose
The Medial Atom Ray Field
Supervising
Supervising
Training Data
Supervising
We show the normal can be computed by
differentiating the network w.r.t. the ray,
Training Data
Supervising
We show the normal can be computed by
differentiating the network w.r.t. the ray,
but its much cheaper to draw a line between the atom center and the intersection point!
Supervising
Training Data
Supervising
Supervising
Supervising
Constant positive
pressure on radius
Supervising
Supervising
Supervising
Supervising
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
Supervising
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
Supervising
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
PRIF
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
PRIF
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
PRIF
MARF
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
PRIF
MARF
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
MARF
Observation: a point should not change w.r.t. a infinitesimal change in incident viewing angle
PRIF
Light Fields
MARF
(Diffuse only)
Supervising
Supervising
Supervising
Supervising
Supervising
Supervising
Supervising
Supervising
Supervising
Supervising
MARF
Ray Intersection Field
Any
Supervising
MARF
Ray Intersection Field
Any
Supervising
MARF
Ray Intersection Field
Any
Supervising
Ours
Ground
Truth
Ours
Ground
Truth
Ours
Ground
Truth
Ours
w/axis
Ground
Truth
Ours
PRIF
w/axis
Ground
Truth
Ours
PRIF
w/axis
Ground
Truth
Ours
PRIF
w/axis
Ground
Truth
Questions?
Thank you
s.ntnu.no/marf
s.ntnu.no/marf
skipped slides
Ray Fields
6 DoF
3D Rigid Bodies
6 DoF
6 DoF
3 DoF
6 DoF
3 DoF
3 DoF
Compared to rigid bodies
Compared to rigid bodies
Rays loose two DoF
Rays are invariant to:
Compared to rigid bodies
Rays loose two DoF
Rotation about ray
Rotation about ray
Rays are invariant to:
Compared to rigid bodies
Rays loose two DoF
2-Sphere
Rotation about ray
Translation along ray
Rays are invariant to:
Compared to rigid bodies
Rays loose two DoF
2-Sphere
Rotation about ray
Translation along ray
Rays are invariant to:
(assuming no start)
Compared to rigid bodies
Rays loose two DoF
2-Sphere
Somehow
orthogonal
to ray
Rotation about ray
Translation along ray
Rays are invariant to:
(assuming no start)
Compared to rigid bodies
Rays loose two DoF
2-Sphere
Somehow
orthogonal
to ray
"Hairy ball theorem"
says no
Without singularities
or discontinuities?
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
4 DoF!
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
V. Sitzmann, S. Rezchikov, W.T. Freeman, J.B. Tenenbaum, F. Durand, Light field networks: Neural scene representations with single-evaluation rendering, in: ArXiv, 2021.
B.Y. Feng, Y. Zhang, D. Tang, R. Du, A. Varshney, PRIF: Primary Ray-Based Implicit Function, in: S. Avidan, G. Brostow, M. Cissé, G.M. Farinella, T. Hassner (Eds.), Computer Vision – ECCV 2022, Springer Nature Switzerland, Cham, 2022: pp. 138–155. https://doi.org/10.1007/978-3-031-20062-5_9.
B.Y. Feng, Y. Zhang, D. Tang, R. Du, A. Varshney, PRIF: Primary Ray-Based Implicit Function, in: S. Avidan, G. Brostow, M. Cissé, G.M. Farinella, T. Hassner (Eds.), Computer Vision – ECCV 2022, Springer Nature Switzerland, Cham, 2022: pp. 138–155. https://doi.org/10.1007/978-3-031-20062-5_9.
Fit surface
B.Y. Feng, Y. Zhang, D. Tang, R. Du, A. Varshney, PRIF: Primary Ray-Based Implicit Function, in: S. Avidan, G. Brostow, M. Cissé, G.M. Farinella, T. Hassner (Eds.), Computer Vision – ECCV 2022, Springer Nature Switzerland, Cham, 2022: pp. 138–155. https://doi.org/10.1007/978-3-031-20062-5_9.