-
公开(公告)号:US10860854B2
公开(公告)日:2020-12-08
申请号:US15912796
申请日:2018-03-06
Applicant: Google LLC
Inventor: Juan Carlos Anorga , David Lieb , Madhur Khandelwal , Evan Millar , Timothy Novikoff , Mugdha Kulkarni , Leslie Ikemoto , Jorge Verdu , Jingyu Cui , Sharadh Ramaswamy , Raja Ratna Murthy Ayyagari , Marc Cannon , Alexander Roe , Shaun Tungseth , Songbo Jin , Matthew Bridges , Ruirui Jiang , Jeremy Selier , Austin Suszek , Gang Song
IPC: G06K9/00 , G06K9/62 , G06N20/00 , G06F16/51 , G06F16/583 , G06F3/048 , G06F16/50 , G06N3/04 , G06F16/54 , G06N7/00 , G06N20/10 , G06K9/78
Abstract: Implementations relate to causing a command to be executed based on an image. In some implementations, a computer-implemented method includes obtaining and programmatically analyzing an image to determine suggested actions. The method causes a user interface to be displayed that includes user interface elements corresponding to default actions, and to suggested actions that are determined based on analyzing the image. The method receives user input indicative of selection of a particular action from the default actions and the suggested actions. The method causes a command to be executed by a computing device for the particular action that was selected.
-
公开(公告)号:US20200293783A1
公开(公告)日:2020-09-17
申请号:US16352605
申请日:2019-03-13
Applicant: Google LLC
Inventor: Sharadh Ramaswamy , Sourish Chaudhuri , Joseph Roth
Abstract: Implementations described herein relate to methods, devices, and computer-readable media to perform gating for video analysis. In some implementations, a computer-implemented method includes obtaining a video comprising a plurality of frames and corresponding audio. The method further includes performing sampling to select a subset of the plurality of frames based on a target frame rate and extracting a respective audio spectrogram for each frame in the subset of the plurality of frames. The method further includes reducing resolution of the subset of the plurality of frames. The method further includes applying a machine-learning based gating model to the subset of the plurality of frames and corresponding audio spectrograms and obtaining, as output of the gating model, an indication of whether to analyze the video to add one or more video annotations.
-