-
Publication No.: US10984245B1
Publication date: 2021-04-20
Application No.: US16286377
Application date: 2019-02-26
Applicant: Facebook, Inc.
Inventor: Du Le Hong Tran, Kaiming He, Heng Wang, Matthew Dan Feiszli, Lorenzo Torresani
Abstract: In one embodiment, a method includes receiving a request for information associated with a video, determining the information associated with the video by processing the video using a machine-learning model which is based on a convolutional neural network comprising a plurality of layers, wherein at least one of the plurality of layers comprises one or more building blocks, wherein at least one of the one or more building blocks comprises a first filter configured to perform a three-dimensional (3D) pointwise convolutional operation and a second filter configured to perform a three-dimensional (3D) groupwise convolutional operation, and outputting the information associated with the video in response to the request.
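The building block in this abstract pairs a 3D pointwise filter with a 3D groupwise filter. A minimal sketch of why such a pairing can be cheaper than a single full 3D convolution, counting weights only: the channel count (64), the 3x3x3 kernel, the choice of one group per channel, and the helper name `conv3d_weight_count` are all illustrative assumptions and are not taken from the patent.

```python
def conv3d_weight_count(c_in, c_out, kernel, groups=1):
    """Count the weights (bias ignored) of a 3D convolution.

    With `groups` > 1, each output channel only sees c_in / groups
    input channels, which is what shrinks the weight count.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    kt, kh, kw = kernel
    return (c_in // groups) * c_out * kt * kh * kw


# Assumed sizes, chosen only for illustration.
channels = 64

# One full 3x3x3 convolution mixing all channels and all positions.
full = conv3d_weight_count(channels, channels, (3, 3, 3))

# Pointwise filter: 1x1x1 kernel, mixes channels only.
pointwise = conv3d_weight_count(channels, channels, (1, 1, 1))

# Groupwise filter: 3x3x3 kernel, here with one group per channel
# (an assumption; the abstract does not fix the number of groups).
groupwise = conv3d_weight_count(channels, channels, (3, 3, 3), groups=channels)

print(full, pointwise, groupwise, pointwise + groupwise)
```

Under these assumed sizes, the pointwise and groupwise filters together hold far fewer weights than the single full convolution, because channel mixing and spatiotemporal filtering are handled by separate filters.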
-
Publication No.: US20200160064A1
Publication date: 2020-05-21
Application No.: US16688367
Application date: 2019-11-19
Applicant: Facebook, Inc.
Inventor: Heng Wang, Du Le Hong Tran, Antoine Miech, Lorenzo Torresani
Abstract: In one embodiment, a method includes accessing a first set of images of multiple images of a scene, wherein the first set of images show the scene during a time period. The method includes generating, by processing the first set of images using a first machine-learning model, one or more attributes representing observed actions performed in the scene during the time period. The method includes predicting, by processing the generated one or more attributes using a second machine-learning model, one or more actions that would happen in the scene after the time period.
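The data flow in this abstract is a two-stage pipeline: a first model turns observed images into attributes, and a second model turns those attributes into predicted future actions. The sketch below shows only that flow. Every name in it (`predict_future_actions`, `toy_observe`, `toy_anticipate`) is hypothetical, and the two toy functions are trivial stand-ins for the machine-learning models, not the patented method.

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]   # toy stand-in for one image
Attributes = List[float]  # toy stand-in for observed-action attributes


def predict_future_actions(
    frames: Sequence[Frame],
    observe: Callable[[Sequence[Frame]], Attributes],
    anticipate: Callable[[Attributes], List[str]],
) -> List[str]:
    """Run the two stages in order: images -> attributes -> future actions."""
    attributes = observe(frames)   # stage 1: describe what was observed
    return anticipate(attributes)  # stage 2: predict what happens next


def toy_observe(frames: Sequence[Frame]) -> Attributes:
    # Hypothetical stage 1: one attribute per frame (its mean value).
    return [sum(frame) / len(frame) for frame in frames]


def toy_anticipate(attributes: Attributes) -> List[str]:
    # Hypothetical stage 2: threshold on the most recent attribute.
    return ["action_a" if attributes[-1] > 0.5 else "action_b"]


result = predict_future_actions(
    [[0.0, 1.0], [1.0, 1.0]], toy_observe, toy_anticipate
)
print(result)
```

Note that the second stage sees only the attributes, never the raw images, which matches the separation the abstract describes between the first and second models.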
-