Patent search ap:("GOOGLE LLC") AND inv:"Nitin Khandelwal" Page 2

11.

发明授权
Feature-based video annotation 有权

公开(公告)号：US12141199B2

公开(公告)日：2024-11-12

申请号：US17548859

申请日：2021-12-13

Applicant: Google LLC

Inventor： Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty

IPC: G06K9/62 , G06F16/78 , G06F16/783 , G06F18/214 , G06F18/22 , G06F18/2413 , G06V20/40 , G06V20/70 , H04N5/265

Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.

12.

发明申请
FEATURE-BASED VIDEO ANNOTATION 有权

公开(公告)号：US20220207873A1

公开(公告)日：2022-06-30

申请号：US17548859

申请日：2021-12-13

Applicant: Google LLC

Inventor： Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty

IPC: G06V20/40 , G06F16/783 , G06F16/78 , G06V10/42 , G06K9/62 , H04N5/265

Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.

13.

发明申请
LOW LATENCY AUTOMATED IDENTIFICATION OF AUTOMATED ASSISTANT FUNCTION FAILURE 有权

公开(公告)号：US20220165259A1

公开(公告)日：2022-05-26

申请号：US17102408

申请日：2020-11-23

Applicant: Google LLC

Inventor： Aakash Goel , Tayfun Elmas , Keith Brady , Akshay Jaggi , Ester Lopez Berga , Arne Vansteenkiste , Robin Martinjak , Mahesh Palekar , Krish Narang , Nitin Khandelwal , Pravir Gupta

IPC: G10L15/22 , G10L15/08 , G08B21/18

Abstract: Implementations include identifying, from a database of entries reflecting past automated assistant commands submitted within a threshold amount of time relative to a current time, particular entries that each reflect corresponding features of a corresponding user submission of a particular command. Further, those implementations include determining that the particular command is a golden command, for a particular automated assistant function, responsive to determining that: at least a threshold percentage of the user submissions of the particular command triggered the particular automated assistant function, and a quantity of the user submission of the particular command satisfies a threshold quantity. Those implementations further include, responsive to determining that the particular command is currently the golden command: processing a stream of current occurrences of submissions of the particular command to determine whether the particular automated assistant function is triggered; and when the processing indicates that a threshold quantity and/or percentage of the current occurrences fails to satisfy a potential problem threshold: causing one or more electronic alerts to be transmitted.

14.

发明申请
FEATURE-BASED VIDEO ANNOTATION 审中-公开

公开(公告)号：US20180025228A1

公开(公告)日：2018-01-25

申请号：US15722756

申请日：2017-10-02

Applicant: Google LLC

Inventor： Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty

IPC: G06K9/00 , G06K9/52 , G06F17/30 , H04N5/265 , G06K9/62

CPC classification number: G06K9/00718 , G06F16/783 , G06F16/7867 , G06K9/52 , G06K9/6201 , G06K9/6256 , G06K9/627 , H04N5/265

Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.

15.

发明公开
COLLABORATIVE RANKING OF INTERPRETATIONS OF SPOKEN UTTERANCES 审中-公开

公开(公告)号：US20240203423A1

公开(公告)日：2024-06-20

申请号：US18590549

申请日：2024-02-28

Applicant: GOOGLE LLC

Inventor： Akshay Goel , Nitin Khandelwal , Richard Park , Brian Chatham , Jonathan Eccles , David Sanchez , Dmytro Lapchuk

IPC: G10L15/32 , G10L15/18 , G10L15/22 , G10L15/30

CPC classification number: G10L15/32 , G10L15/18 , G10L15/22 , G10L15/30

Abstract: Implementations described herein are directed to enabling collaborative ranking of interpretations of spoken utterances based on data that is available to an automated assistant and third-party agent(s), respectively. The automated assistant can determine first-party interpretation(s) of a spoken utterance provided by a user, and can cause the third-party agent(s) to determine third-party interpretation(s) of the spoken utterance provided by the user. In some implementations, the automated assistant can select a given interpretation, from the first-party interpretation(s) and the third-party interpretation(s), of the spoken utterance, and can cause a given third-party agent to satisfy the spoken utterance based on the given interpretation. In additional or alternative implementations, an independent third-party agent can obtain the first-party interpretation(s) and the third-party interpretation(s), select the given interpretation, and then transmit the given interpretation to the automated assistant and/or the given third-party agent.

16.

发明授权
Collaborative ranking of interpretations of spoken utterances 有权

公开(公告)号：US11948580B2

公开(公告)日：2024-04-02

申请号：US17537104

申请日：2021-11-29

Applicant: GOOGLE LLC

Inventor： Akshay Goel , Nitin Khandelwal , Richard Park , Brian Chatham , Jonathan Eccles , David Sanchez , Dmytro Lapchuk

IPC: G10L15/32 , G10L15/18 , G10L15/22 , G10L15/30

CPC classification number: G10L15/32 , G10L15/18 , G10L15/22 , G10L15/30

Abstract: Implementations described herein are directed to enabling collaborative ranking of interpretations of spoken utterances based on data that is available to an automated assistant and third-party agent(s), respectively. The automated assistant can determine first-party interpretation(s) of a spoken utterance provided by a user, and can cause the third-party agent(s) to determine third-party interpretation(s) of the spoken utterance provided by the user. In some implementations, the automated assistant can select a given interpretation, from the first-party interpretation(s) and the third-party interpretation(s), of the spoken utterance, and can cause a given third-party agent to satisfy the spoken utterance based on the given interpretation. In additional or alternative implementations, an independent third-party agent can obtain the first-party interpretation(s) and the third-party interpretation(s), select the given interpretation, and then transmit the given interpretation to the automated assistant and/or the given third-party agent.

17.

发明授权
Low latency automated identification of automated assistant function failure 有权

公开(公告)号：US11568869B2

公开(公告)日：2023-01-31

申请号：US17102408

申请日：2020-11-23

Applicant: Google LLC

Inventor： Aakash Goel , Tayfun Elmas , Keith Brady , Akshay Jaggi , Ester Lopez Berga , Arne Vansteenkiste , Robin Martinjak , Mahesh Palekar , Krish Narang , Nitin Khandelwal , Pravir Gupta

IPC: G10L15/22 , G10L15/08

Abstract: Implementations include identifying, from a database of entries reflecting past automated assistant commands submitted within a threshold amount of time relative to a current time, particular entries that each reflect corresponding features of a corresponding user submission of a particular command. Further, those implementations include determining that the particular command is a golden command, for a particular automated assistant function, responsive to determining that: at least a threshold percentage of the user submissions of the particular command triggered the particular automated assistant function, and a quantity of the user submission of the particular command satisfies a threshold quantity. Those implementations further include, responsive to determining that the particular command is currently the golden command: processing a stream of current occurrences of submissions of the particular command to determine whether the particular automated assistant function is triggered; and when the processing indicates that a threshold quantity and/or percentage of the current occurrences fails to satisfy a potential problem threshold: causing one or more electronic alerts to be transmitted.

18.

发明授权
Feature-based video annotation 有权

公开(公告)号：US11200423B2

公开(公告)日：2021-12-14

申请号：US16687118

申请日：2019-11-18

Applicant: Google LLC

Inventor： Balakrishnan Varadarajan , George Dan Toderici , Apostol Natsev , Nitin Khandelwal , Sudheendra Vijayanarasimhan , Weilong Yang , Sanketh Shetty

IPC: G06K9/62 , G06K9/00 , G06F16/783 , G06F16/78 , G06K9/52 , H04N5/265

Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.

19.

发明授权
Multimodal responses 有权

公开(公告)号：US11164576B2

公开(公告)日：2021-11-02

申请号：US16251982

申请日：2019-01-18

Applicant: Google LLC

Inventor： April Pufahl , Jared Strawderman , Harry Yu , Adriana Olmos Antillon , Jonathan Livni , Okan Kolak , James Giangola , Nitin Khandelwal , Jason Kearns , Andrew Watson , Joseph Ashear , Valerie Nygaard

IPC: G10L15/22 , G06F1/16 , G06F3/16

Abstract: Systems, methods, and apparatus for using a multimodal response in the dynamic generation of client device output that is tailored to a current modality of a client device is disclosed herein. Multimodal client devices can engage in a variety of interactions across the multimodal spectrum including voice only interactions, voice forward interactions, multimodal interactions, visual forward interactions, visual only interactions etc. A multimodal response can include a core message to be rendered for all interaction types as well as one or more modality dependent components to provide a user with additional information.

20.

发明申请
SELECTING AND PRESENTING REPRESENTATIVE FRAMES FOR VIDEO PREVIEWS 有权

公开(公告)号：US20210166035A1

公开(公告)日：2021-06-03

申请号：US17120525

申请日：2020-12-14

Applicant: Google LLC

Inventor： Sanketh Shetty , Tomas Izo , Min-Hsuan Tsai , Sudheendra Vijayanarasimhan , Apostol Natsev , Sami Abu-El-Haija , George Dan Toderici , Susana Ricco , Balakrishnan Varadarajan , Nicola Muscettola , WeiHsin Gu , Weilong Yang , Nitin Khandelwal , Phuong Le

IPC: G06K9/00 , G06F16/783

Abstract: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features including frame-based features and semantic features. The semantic features identifying likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selecting a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification