-
公开(公告)号:US11106949B2
公开(公告)日:2021-08-31
申请号:US16362530
申请日:2019-03-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Muhammad Zeeshan Zia , Federica Bogo , Harpreet Singh Sawhney , Huseyin Coskun , Bugra Tekin
Abstract: A computing device, including a processor configured to receive a first video including a plurality of frames. For each frame, the processor may determine that a target region of the frame includes a target object. The processor may determine a surrounding region within which the target region is located. The surrounding region may be smaller than the frame. The processor may identify one or more features located in the surrounding region. From the one or more features, the processor may generate one or more manipulated object identifiers. For each of a plurality of pairs of frames, the processor may determine a respective manipulated object movement between a first manipulated object identifier of the first frame and a second manipulated object identifier of the second frame. The processor may classify at least one action performed in the first video based on the plurality of manipulated object movements.
-
公开(公告)号:US12216832B2
公开(公告)日:2025-02-04
申请号:US18463906
申请日:2023-09-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Julia Schwarz , Bugra Tekin , Sophie Stellmach , Erian Vazquez , Casey Leon Meekhof , Fabian Gobel
Abstract: A method for evaluating gesture input comprises receiving input data for sequential data frames, including hand tracking data for hands of a user. A first neural network is trained to recognize features indicative of subsequent gesture interactions and configured to evaluate input data for a sequence of data frames and to output an indication of a likelihood of the user performing gesture interactions during a predetermined window of data frames. A second neural network is trained to recognize features indicative of whether the user is currently performing one or more gesture interactions and configured to adjust parameters for gesture interaction recognition during the predetermined window based on the indicated likelihood. The second neural network evaluates the predetermined window for performed gesture interactions based on the adjusted parameters, and outputs a signal as to whether the user is performing one or more gesture interactions during the predetermined window.
-
公开(公告)号:US11004230B2
公开(公告)日:2021-05-11
申请号:US16362552
申请日:2019-03-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Marc Andre Leon Pollefeys , Bugra Tekin , Federica Bogo
IPC: G06T7/70
Abstract: A data processing system is provided that includes a processor having associated memory, the processor being configured to execute instructions using portions of the memory to cause the processor to, at classification time, receive an input image frame from an image source. The input image frame includes an articulated object and a target object. The processor is further caused to process the input image frame using a trained neural network configured to, for each input cell of a plurality of input cells in the input image frame predict a three-dimensional articulated object pose of the articulated object and a three-dimensional target object pose of the target object relative to the input cell. The processor is further caused to output the three-dimensional articulated object pose and the three-dimensional target object pose from the neural network.
-
公开(公告)号:US11768544B2
公开(公告)日:2023-09-26
申请号:US17649659
申请日:2022-02-01
Applicant: Microsoft Technology Licensing, LLC
Inventor: Julia Schwarz , Bugra Tekin , Sophie Stellmach , Erian Vazquez , Casey Leon Meekhof , Fabian Gobel
Abstract: A method for evaluating gesture input comprises receiving input data for sequential data frames, including hand tracking data for hands of a user. A first neural network is trained to recognize features indicative of subsequent gesture interactions and configured to evaluate input data for a sequence of data frames and to output an indication of a likelihood of the user performing gesture interactions during a predetermined window of data frames. A second neural network is trained to recognize features indicative of whether the user is currently performing one or more gesture interactions and configured to adjust parameters for gesture interaction recognition during the predetermined window based on the indicated likelihood. The second neural network evaluates the predetermined window for performed gesture interactions based on the adjusted parameters, and outputs a signal as to whether the user is performing one or more gesture interactions during the predetermined window.
-
-
-