Abstract:
In various embodiments, a training application generates a trained encoder that automatically generates shape embeddings having a first size and representing three-dimensional (3D) geometry shapes. First, the training application generates a different view activation for each of multiple views associated with a first 3D geometry based on a first convolutional neural network (CNN) block. The training application then aggregates the view activations to generate a tiled activation. Subsequently, the training application generates a first shape embedding having the first size based on the tiled activation and a second CNN block. The training application then generates multiple re-constructed views based on the first shape embedding. The training application performs training operation(s) on at least one of the first CNN block and the second CNN block based on the views and the re-constructed views to generate the trained encoder.
Abstract:
A system and technique for capturing a workflow history and video of an electronic document are disclosed. Events generated by an application while modifying an electronic document are stored on a web server as metadata. In addition, a captured digital image or frames of captured digital video that reflect the state of the document at the time the event was generated are also stored on the web server. The metadata is associated with one or more portions of the document and with the captured digital image or frames of captured digital video.