-
公开(公告)号:US12141199B2
公开(公告)日:2024-11-12
申请号:US17548859
申请日:2021-12-13
Applicant: Google LLC
Inventor: Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty
IPC: G06K9/62 , G06F16/78 , G06F16/783 , G06F18/214 , G06F18/22 , G06F18/2413 , G06V20/40 , G06V20/70 , H04N5/265
Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.
-
公开(公告)号:US20220207873A1
公开(公告)日:2022-06-30
申请号:US17548859
申请日:2021-12-13
Applicant: Google LLC
Inventor: Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty
Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.
-
公开(公告)号:US20220165259A1
公开(公告)日:2022-05-26
申请号:US17102408
申请日:2020-11-23
Applicant: Google LLC
Inventor: Aakash Goel , Tayfun Elmas , Keith Brady , Akshay Jaggi , Ester Lopez Berga , Arne Vansteenkiste , Robin Martinjak , Mahesh Palekar , Krish Narang , Nitin Khandelwal , Pravir Gupta
Abstract: Implementations include identifying, from a database of entries reflecting past automated assistant commands submitted within a threshold amount of time relative to a current time, particular entries that each reflect corresponding features of a corresponding user submission of a particular command. Further, those implementations include determining that the particular command is a golden command, for a particular automated assistant function, responsive to determining that: at least a threshold percentage of the user submissions of the particular command triggered the particular automated assistant function, and a quantity of the user submission of the particular command satisfies a threshold quantity. Those implementations further include, responsive to determining that the particular command is currently the golden command: processing a stream of current occurrences of submissions of the particular command to determine whether the particular automated assistant function is triggered; and when the processing indicates that a threshold quantity and/or percentage of the current occurrences fails to satisfy a potential problem threshold: causing one or more electronic alerts to be transmitted.
-
公开(公告)号:US20180025228A1
公开(公告)日:2018-01-25
申请号:US15722756
申请日:2017-10-02
Applicant: Google LLC
Inventor: Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty
CPC classification number: G06K9/00718 , G06F16/783 , G06F16/7867 , G06K9/52 , G06K9/6201 , G06K9/6256 , G06K9/627 , H04N5/265
Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.
-
公开(公告)号:US20240203423A1
公开(公告)日:2024-06-20
申请号:US18590549
申请日:2024-02-28
Applicant: GOOGLE LLC
Inventor: Akshay Goel , Nitin Khandelwal , Richard Park , Brian Chatham , Jonathan Eccles , David Sanchez , Dmytro Lapchuk
Abstract: Implementations described herein are directed to enabling collaborative ranking of interpretations of spoken utterances based on data that is available to an automated assistant and third-party agent(s), respectively. The automated assistant can determine first-party interpretation(s) of a spoken utterance provided by a user, and can cause the third-party agent(s) to determine third-party interpretation(s) of the spoken utterance provided by the user. In some implementations, the automated assistant can select a given interpretation, from the first-party interpretation(s) and the third-party interpretation(s), of the spoken utterance, and can cause a given third-party agent to satisfy the spoken utterance based on the given interpretation. In additional or alternative implementations, an independent third-party agent can obtain the first-party interpretation(s) and the third-party interpretation(s), select the given interpretation, and then transmit the given interpretation to the automated assistant and/or the given third-party agent.
-
公开(公告)号:US11948580B2
公开(公告)日:2024-04-02
申请号:US17537104
申请日:2021-11-29
Applicant: GOOGLE LLC
Inventor: Akshay Goel , Nitin Khandelwal , Richard Park , Brian Chatham , Jonathan Eccles , David Sanchez , Dmytro Lapchuk
Abstract: Implementations described herein are directed to enabling collaborative ranking of interpretations of spoken utterances based on data that is available to an automated assistant and third-party agent(s), respectively. The automated assistant can determine first-party interpretation(s) of a spoken utterance provided by a user, and can cause the third-party agent(s) to determine third-party interpretation(s) of the spoken utterance provided by the user. In some implementations, the automated assistant can select a given interpretation, from the first-party interpretation(s) and the third-party interpretation(s), of the spoken utterance, and can cause a given third-party agent to satisfy the spoken utterance based on the given interpretation. In additional or alternative implementations, an independent third-party agent can obtain the first-party interpretation(s) and the third-party interpretation(s), select the given interpretation, and then transmit the given interpretation to the automated assistant and/or the given third-party agent.
-
公开(公告)号:US11568869B2
公开(公告)日:2023-01-31
申请号:US17102408
申请日:2020-11-23
Applicant: Google LLC
Inventor: Aakash Goel , Tayfun Elmas , Keith Brady , Akshay Jaggi , Ester Lopez Berga , Arne Vansteenkiste , Robin Martinjak , Mahesh Palekar , Krish Narang , Nitin Khandelwal , Pravir Gupta
Abstract: Implementations include identifying, from a database of entries reflecting past automated assistant commands submitted within a threshold amount of time relative to a current time, particular entries that each reflect corresponding features of a corresponding user submission of a particular command. Further, those implementations include determining that the particular command is a golden command, for a particular automated assistant function, responsive to determining that: at least a threshold percentage of the user submissions of the particular command triggered the particular automated assistant function, and a quantity of the user submission of the particular command satisfies a threshold quantity. Those implementations further include, responsive to determining that the particular command is currently the golden command: processing a stream of current occurrences of submissions of the particular command to determine whether the particular automated assistant function is triggered; and when the processing indicates that a threshold quantity and/or percentage of the current occurrences fails to satisfy a potential problem threshold: causing one or more electronic alerts to be transmitted.
-
公开(公告)号:US11200423B2
公开(公告)日:2021-12-14
申请号:US16687118
申请日:2019-11-18
Applicant: Google LLC
Inventor: Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty
Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.
-
公开(公告)号:US11164576B2
公开(公告)日:2021-11-02
申请号:US16251982
申请日:2019-01-18
Applicant: Google LLC
Inventor: April Pufahl , Jared Strawderman , Harry Yu , Adriana Olmos Antillon , Jonathan Livni , Okan Kolak , James Giangola , Nitin Khandelwal , Jason Kearns , Andrew Watson , Joseph Ashear , Valerie Nygaard
Abstract: Systems, methods, and apparatus for using a multimodal response in the dynamic generation of client device output that is tailored to a current modality of a client device is disclosed herein. Multimodal client devices can engage in a variety of interactions across the multimodal spectrum including voice only interactions, voice forward interactions, multimodal interactions, visual forward interactions, visual only interactions etc. A multimodal response can include a core message to be rendered for all interaction types as well as one or more modality dependent components to provide a user with additional information.
-
公开(公告)号:US20210166035A1
公开(公告)日:2021-06-03
申请号:US17120525
申请日:2020-12-14
Applicant: Google LLC
Inventor: Sanketh Shetty , Tomas Izo , Min-Hsuan Tsai , Sudheendra Vijayanarasimhan , Apostol Natsev , Sami Abu-El-Haija , George Dan Toderici , Susana Ricco , Balakrishnan Varadarajan , Nicola Muscettola , WeiHsin Gu , Weilong Yang , Nitin Khandelwal , Phuong Le
IPC: G06K9/00 , G06F16/783
Abstract: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features including frame-based features and semantic features. The semantic features identifying likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selecting a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.
-
-
-
-
-
-
-
-
-