-
公开(公告)号:US11967015B2
公开(公告)日:2024-04-23
申请号:US17145232
申请日:2021-01-08
Applicant: Apple Inc.
Inventor: Qi Shan , Joshua Susskind , Aditya Sankar , Robert Alex Colburn , Emilien Dupont , Miguel Angel Bautista Martin
CPC classification number: G06T15/205 , G06N3/08 , G06T3/60
Abstract: The subject technology provides a framework for learning neural scene representations directly from images, without three-dimensional (3D) supervision, by a machine-learning model. In the disclosed systems and methods, 3D structure can be imposed by ensuring that the learned representation transforms like a real 3D scene. For example, a loss function can be provided which enforces equivariance of the scene representation with respect to 3D rotations. Because naive tensor rotations may not be used to define models that are equivariant with respect to 3D rotations, a new operation called an invertible shear rotation is disclosed, which has the desired equivariance property. In some implementations, the model can be used to generate a 3D representation, such as mesh, of an object from an image of the object.