-
Publication No.: US11248927B2
Publication Date: 2022-02-15
Application No.: US16557918
Application Date: 2019-08-30
Applicant: Rovi Guides, Inc.
Inventor: Nishchit Mahajan, Ankur Aher
IPC: G01C21/36
Abstract: Systems and methods are disclosed herein for providing uninterrupted media content during vehicle navigation. The disclosed techniques determine directions from route data and a navigation announcement for each direction. For each navigation announcement, a determination is made whether current playback of a media asset in a playlist ends within a predefined time threshold before the navigation announcement. If so, playback of the playlist is paused until the navigation announcement has elapsed.
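The threshold check in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Announcement` type, the `pause_window` function, and the use of seconds measured from trip start are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Announcement:
    """Hypothetical navigation announcement (times in seconds from trip start)."""
    start: float
    duration: float


def pause_window(
    asset_end: float, announcements: List[Announcement], threshold: float
) -> Optional[Tuple[float, float]]:
    """Return (pause_from, resume_at) if the current asset ends within
    `threshold` seconds before an announcement, otherwise None."""
    for ann in announcements:
        gap = ann.start - asset_end
        if 0 <= gap <= threshold:
            # Hold the playlist until the announcement has elapsed.
            return (asset_end, ann.start + ann.duration)
    return None
```

Under these assumptions, a track ending at 100 s with an announcement at 105 s lasting 4 s and a 10 s threshold would hold the playlist from 100 s to 109 s.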
-
Publication No.: US11205430B2
Publication Date: 2021-12-21
Application No.: US16590244
Application Date: 2019-10-01
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher, Jeffry Copps Robert Jose
Abstract: Systems and methods for determining hint words that improve the accuracy of automated speech recognition (ASR) systems. Hint words are determined in the context of a user issuing voice commands in connection with a voice interface system. Terms are initially taken from most frequently occurring terms in operation of a voice interface system. For example, most frequently occurring terms that arise in electronic search queries or received commands are selected. Certain of these terms are selected as hint words, and the selected hint words are then transmitted to an ASR system to assist in translation of speech to text.
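The frequency-based selection described in this abstract might be sketched as follows. The abstract does not say how "certain of these terms" are chosen, so the minimum-length filter here is purely an assumed placeholder criterion, and `select_hint_words` is a hypothetical name.

```python
from collections import Counter
from typing import Iterable, List


def select_hint_words(
    queries: Iterable[str], top_n: int = 3, min_length: int = 4
) -> List[str]:
    """Pick hint words from the most frequent terms in past queries.

    The min_length filter is an assumed stand-in for the unspecified
    selection step; a real system would apply its own criteria.
    """
    counts = Counter(word for q in queries for word in q.lower().split())
    # most_common() orders by descending count; ties keep first-seen order.
    frequent = [word for word, _ in counts.most_common()]
    return [w for w in frequent if len(w) >= min_length][:top_n]
```

The returned list is what would then be passed to the ASR system as hints; how that hand-off works depends on the ASR service and is not shown.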
-
Publication No.: US20210063194A1
Publication Date: 2021-03-04
Application No.: US16557921
Application Date: 2019-08-30
Applicant: Rovi Guides, Inc.
Inventor: Nishchit Mahajan, Ankur Aher
IPC: G01C21/36, G06F16/635, G06F16/638, G06F16/64
Abstract: Systems and methods are disclosed herein for providing uninterrupted media content by reordering playlists during vehicle navigation. The disclosed techniques determine directions from route data and a navigation announcement for each direction. For each direction, the techniques identify a media asset in the playlist whose duration matches the direction duration, where the direction duration is the time difference between the direction's navigation announcement and the subsequent navigation announcement.
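One way to picture the duration matching is a greedy pass over the gaps between announcements. This is a sketch under stated assumptions, not the claimed method: the abstract does not define "matches", so a fixed tolerance in seconds is assumed, and `reorder_playlist` and the `(title, duration)` tuples are hypothetical.

```python
from typing import List, Tuple

Track = Tuple[str, float]  # (title, duration in seconds), assumed shape


def reorder_playlist(
    playlist: List[Track], announcement_times: List[float], tolerance: float = 15.0
) -> List[Track]:
    """Greedily assign to each gap between announcements the unused track
    whose duration is closest to the gap, within `tolerance` seconds."""
    gaps = [b - a for a, b in zip(announcement_times, announcement_times[1:])]
    remaining = list(playlist)
    ordered: List[Track] = []
    for gap in gaps:
        candidates = [t for t in remaining if abs(t[1] - gap) <= tolerance]
        if not candidates:
            continue  # no suitable track for this gap; leave it unfilled
        best = min(candidates, key=lambda t: abs(t[1] - gap))
        ordered.append(best)
        remaining.remove(best)
    # Tracks that matched no gap keep their original relative order.
    return ordered + remaining
```

A greedy choice is the simplest option; an optimal assignment across all gaps would need a matching algorithm, which the abstract neither requires nor rules out.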
-
Publication No.: US20210035587A1
Publication Date: 2021-02-04
Application No.: US16528550
Application Date: 2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher, Indranil Coomar Doss, Aashish Goyal, Aman Puniyani, Kandala Reddy, Mithun Umesh
IPC: G10L15/26, G10L15/22, G10L13/02, G10L15/187
Abstract: The system identifies one or more entities or content items among a plurality of stored information. The system generates an audio file based on a first text string that represents the entity or content item. Based on the first text string and at least one speech criterion, the system generates, using a speech-to-text module, a second text string based on the audio file. The system then compares the text strings and stores the second text string if it is not identical to the first text string. The system generates metadata that includes results from text-speech-text conversions to forecast possible misidentifications when responding to voice queries during search operations. The metadata includes alternative representations of the entity, to improve reachability in cases where the speech-to-text conversion does generate a pr
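The text-speech-text round trip in this abstract can be sketched with the speech engines passed in as callables. Nothing here reflects a real TTS or ASR API: `text_to_speech`, `speech_to_text`, and the string-valued speech criteria are hypothetical stand-ins, and applying the criterion at synthesis time is an assumption.

```python
from typing import Any, Callable, Iterable, List


def alternative_representations(
    name: str,
    text_to_speech: Callable[[str, str], Any],  # hypothetical TTS: (text, criterion) -> audio
    speech_to_text: Callable[[Any], str],       # hypothetical ASR: audio -> text
    criteria: Iterable[str],
) -> List[str]:
    """Run `name` through TTS then ASR once per speech criterion and
    collect transcripts that differ from the original text string."""
    alternatives: List[str] = []
    for criterion in criteria:
        audio = text_to_speech(name, criterion)
        heard = speech_to_text(audio)
        # Store only second strings that are not identical to the first.
        if heard != name and heard not in alternatives:
            alternatives.append(heard)
    return alternatives
```

The collected strings would be stored as metadata alongside the entity so that a later voice query transcribed the same way can still reach it.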
-
Publication No.: US20210034662A1
Publication Date: 2021-02-04
Application No.: US16528539
Application Date: 2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher, Indranil Coomar Doss, Aashish Goyal, Aman Puniyani, Kandala Reddy, Mithun Umesh
IPC: G06F16/635, G06F16/68, G10L15/22, G10L15/187, G06F17/27
Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata that indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities that may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
-
Publication No.: US20200342859A1
Publication Date: 2020-10-29
Application No.: US16397004
Application Date: 2019-04-29
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher, Sindhuja Chonat Sri, Aman Puniyani, Nishchit Mahajan
IPC: G10L15/22, G10L25/51, G10L15/08, G06F16/638, G06F16/635, G06F16/683
Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
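The signature comparison in this abstract could look roughly like the sketch below. The abstract does not say how an audio signature is represented or compared, so a numeric feature vector and a cosine-similarity threshold are assumed; `match_quotation`, the database layout, and the `content_id` field are hypothetical.

```python
import math
from typing import Dict, List, Optional


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_quotation(
    transcript: str,
    query_signature: List[float],
    quotation_db: Dict[str, dict],  # assumed: lowercase quote -> {"signature", "content_id"}
    threshold: float = 0.9,
) -> Optional[str]:
    """Return the content identifier if the transcribed query is a known
    quotation AND the query's audio signature resembles the stored one."""
    entry = quotation_db.get(transcript.lower())
    if entry is None:
        return None  # not a known quotation; treat the query as a plain command
    if cosine_similarity(query_signature, entry["signature"]) >= threshold:
        return entry["content_id"]
    return None
```

When the function returns `None`, the caller would fall back to interpreting the command keyword normally rather than as a quotation.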
-