-
公开(公告)号:US20240005923A1
公开(公告)日:2024-01-04
申请号:US18368214
申请日:2023-09-14
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Sindhuja Chonat Sri , Aman Puniyani , Nishchit Mahajan
IPC: G10L15/22 , G06F16/638 , G06F16/683 , G06F16/635 , G10L15/08 , G10L25/51
CPC classification number: G10L15/22 , G06F16/638 , G06F16/683 , G06F16/635 , G10L15/08 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
-
公开(公告)号:US20230267272A1
公开(公告)日:2023-08-24
申请号:US18139533
申请日:2023-04-26
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Susanto Sen
IPC: G06F40/279 , G10L15/22 , H04L12/18
CPC classification number: G06F40/279 , G10L15/22 , H04L12/1813
Abstract: Systems and methods are presented herein for providing a user with a notification or with access to live media on an audio/visual user entertainment system based on a user’s conditional request for media content. The user may provide the condition of the request by speaking or by entering the condition of the request into an interactive interface. An identification application analyzes the elements of the user’s request and generates a question. The application finds a live media stream with identifiers related to the elements and posts the generated question to a live chat forum associated with the live media stream. The application analyzes posts on the forum made by other users to determine if the condition of the user’s request is met. When the application determines a post confirms the condition is met, the application generates a notification and provides the user access to the live media stream.
-
公开(公告)号:US20230140273A1
公开(公告)日:2023-05-04
申请号:US17882289
申请日:2022-08-05
Applicant: ROVI GUIDES, INC.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , G10L25/63
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
-
公开(公告)号:US11527234B2
公开(公告)日:2022-12-13
申请号:US16590243
申请日:2019-10-01
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
-
公开(公告)号:US11450306B2
公开(公告)日:2022-09-20
申请号:US15931261
申请日:2020-05-13
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , A63H11/00 , A63H3/31 , G10L15/22 , G10L25/63
Abstract: The system trains a model to provide information used to provide a synthesized speech response to a voice input. The model takes as input prosodic information that may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example. The system receives a plurality of voice inputs, each associated with prosodic metric, as well as a plurality of responses, each also associated with prosodic metrics. The system trains the model based on the plurality of voice inputs, the plurality of responses, the prosodic metrics of the voice inputs, and the prosodic metrics of the responses such that the model outputs information used to generate the response. The model may also take as input user profile information, emotion metrics, and transition information to generate output. The output of the training model may be used by the system to provide synthesized speech responses having relevant prosodic character to received voice inputs.
-
公开(公告)号:US11443731B2
公开(公告)日:2022-09-13
申请号:US15931074
申请日:2020-05-13
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , G10L25/63 , A63H11/00
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
-
47.
公开(公告)号:US20220264170A1
公开(公告)日:2022-08-18
申请号:US17739469
申请日:2022-05-09
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Charishma Chundi
IPC: H04N21/2662 , H04L1/00 , H04N21/2343 , H04N21/24 , H04N21/258
Abstract: Systems and methods for dynamically adapting quality levels of content is disclosed herein. A content transmission system determines whether to reduce streaming bandwidth of a device that transmits content. In response to determining to reduce the streaming bandwidth, the content transmission system identifies a first plurality of frames of the content based on a first context and a second plurality of frames of the content based on a second context. The content transmission system transmits the first plurality of frames at a first quality level based on the first context and the second plurality of frames at a second quality level that is higher than the first quality level based on the second context.
-
公开(公告)号:US20220252412A1
公开(公告)日:2022-08-11
申请号:US17732084
申请日:2022-04-28
Applicant: Rovi Guides, Inc.
Inventor: Nishchit Mahajan , Ankur Aher
IPC: G01C21/34
Abstract: Systems and methods are disclosed herein for selecting alternate routes with fewer directions during vehicle navigation. The disclosed techniques herein determine directions from route data and direction timestamps for each of the directions. For each direction, a corresponding media asset from media assets in a playlist having a media asset duration that matches a direction duration is determined. The direction duration is the time difference between the direction timestamp and a subsequent direction timestamp.
-
49.
公开(公告)号:US11356725B2
公开(公告)日:2022-06-07
申请号:US17072083
申请日:2020-10-16
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Charishma Chundi
IPC: H04N21/2662 , H04N21/2343 , H04N21/24 , H04N21/258 , H04L1/00
Abstract: Systems and methods for dynamically adapting quality levels of content is disclosed herein. A content transmission system determines whether to reduce streaming bandwidth of a device that transmits content. In response to determining to reduce the streaming bandwidth, the content transmission system identifies a first plurality of frames of the content based on a first context and a second plurality of frames of the content based on a second context. The content transmission system transmits the first plurality of frames at a first quality level based on the first context and the second plurality of frames at a second quality level that is higher than the first quality level based on the second context.
-
50.
公开(公告)号:US20220124397A1
公开(公告)日:2022-04-21
申请号:US17072083
申请日:2020-10-16
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Charishma Chundi
IPC: H04N21/2662 , H04N21/258 , H04N21/24 , H04N21/2343
Abstract: Systems and methods for dynamically adapting quality levels of content is disclosed herein. A content transmission system determines whether to reduce streaming bandwidth of a device that transmits content. In response to determining to reduce the streaming bandwidth, the content transmission system identifies a first plurality of frames of the content based on a first context and a second plurality of frames of the content based on a second context. The content transmission system transmits the first plurality of frames at a first quality level based on the first context and the second plurality of frames at a second quality level that is higher than the first quality level based on the second context.
-
-
-
-
-
-
-
-
-