-
公开(公告)号:US11494434B2
公开(公告)日:2022-11-08
申请号:US16528541
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G06F16/635 , G06F16/68 , G06F16/632 , G06F16/2457 , G10L15/187 , G10L15/22 , G06F40/295 , G10L15/08
Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata the indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities, which may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
-
公开(公告)号:US11410656B2
公开(公告)日:2022-08-09
申请号:US16528550
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G10L15/26 , G10L15/22 , G10L15/187 , G10L13/02
Abstract: The system identifies one or more entities or content items among a plurality of stored information. The system generates an audio file based on a first text string that represents the entity or content item. Based on the first text string and at least one speech criterion, the system generating, using a speech-to-text module a second text string based on the audio file. The system then compares the text strings and stores the second text string if it is not identical to the first text string. The system generates metadata that includes results from text-speech-text conversions to forecast possible misidentifications when responding to voice queries during search operations. The metadata includes alternative representations of the entity.
-
公开(公告)号:US20220114339A1
公开(公告)日:2022-04-14
申请号:US17067012
申请日:2020-10-09
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Susanto Sen
IPC: G06F40/279 , H04L12/18 , G10L15/22
Abstract: Systems and methods are presented herein for providing a user with a notification or with access to live media on an audio/visual user entertainment system based on a user's conditional request for media content. The user may provide the condition of the request by speaking or by entering the condition of the request into an interactive interface. An identification application analyzes the elements of the user's request and generates a question. The application finds a live media stream with identifiers related to the elements and posts the generated question to a live chat forum associated with the live media stream. The application analyzes posts on the forum made by other users to determine if the condition of the user's request is met. When the application determines a post confirms the condition is met, the application generates a notification and provides the user access to the live media stream.
-
公开(公告)号:US11133005B2
公开(公告)日:2021-09-28
申请号:US16397004
申请日:2019-04-29
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Sindhuja Chonat Sri , Aman Puniyani , Nishchit Mahajan
IPC: G10L15/22 , G06F16/638 , G06F16/683 , G06F16/635 , G10L15/08 , G10L25/51
Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
-
公开(公告)号:US20210097988A1
公开(公告)日:2021-04-01
申请号:US16590244
申请日:2019-10-01
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L15/22 , G06F3/16 , G10L15/26 , G06F16/951 , G06F17/27
Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
-
公开(公告)号:US20210063177A1
公开(公告)日:2021-03-04
申请号:US16557925
申请日:2019-08-30
Applicant: Rovi Guides, Inc.
Inventor: Nishchit Mahajan , Ankur Aher
IPC: G01C21/34
Abstract: Systems and methods are disclosed herein for selecting alternate routes with fewer directions during vehicle navigation. The disclosed techniques herein determine directions from route data and direction timestamps for each of the directions. For each direction, a corresponding media asset from media assets in a playlist having a media asset duration that matches a direction duration is determined. The direction duration is the time difference between the direction timestamp and a subsequent direction timestamp.
-
公开(公告)号:US20210037293A1
公开(公告)日:2021-02-04
申请号:US16528027
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Nikhil Gabhane , Raman Gupta , Aman Puniyani
IPC: H04N21/81 , H04N21/422 , H04N21/431
Abstract: Methods and systems are described for providing content, such as a movie, with dialogue including a quotation that was input. For example, using a voice search a viewer may input a quotation famous from a movie to find the original fil and related content. The methods and systems use a quotation engine in a digital device to receive an input including the quotation and access a plurality of content items that include dialogue. The quotation engine identifies a subset of content items that include dialogue similar to the input quotation. The quotation engine accesses metadata of each of the subset of content, ranks the subset based on predetermined criteria and the metadata, and provides the ranked subset of the plurality of content items for consumption. The quotation engine may use a graphical user interface to identify the earliest release, trending content, or the program best known for the quote.
-
公开(公告)号:US20250006173A1
公开(公告)日:2025-01-02
申请号:US18883144
申请日:2024-09-12
Applicant: ROVI GUIDES, INC.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/00 , G10L15/10 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26 , G10L25/63
Abstract: The system provides a synthesized speech response to a voice input, based on the emotion of the voice input. The system receives the voice input and determines the emotion of the voice input. The system identifies, based on the first emotion of the voice input, a second emotion, different from the first emotion, for a response to the voice input. The system generates a synthesized speech voice output of the response comprising prosodic characteristics corresponding to the second emotion. The system outputs the synthesized speech response.
-
公开(公告)号:US12073179B2
公开(公告)日:2024-08-27
申请号:US18139533
申请日:2023-04-26
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Susanto Sen
IPC: G06F40/279 , G10L15/22 , H04L12/18
CPC classification number: G06F40/279 , G10L15/22 , H04L12/1813
Abstract: Systems and methods are presented herein for providing a user with a notification or with access to live media on an audio/visual user entertainment system based on a user's conditional request for media content. The user may provide the condition of the request by speaking or by entering the condition of the request into an interactive interface. An identification application analyzes the elements of the user's request and generates a question. The application finds a live media stream with identifiers related to the elements and posts the generated question to a live chat forum associated with the live media stream. The application analyzes posts on the forum made by other users to determine if the condition of the user's request is met. When the application determines a post confirms the condition is met, the application generates a notification and provides the user access to the live media stream.
-
公开(公告)号:US20240153492A1
公开(公告)日:2024-05-09
申请号:US18387892
申请日:2023-11-08
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
CPC classification number: G10L15/02 , G10L15/22 , G10L2015/025 , G10L2015/223
Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
-
-
-
-
-
-
-
-
-