-
公开(公告)号:US20240153483A1
公开(公告)日:2024-05-09
申请号:US18387211
申请日:2023-11-06
Applicant: ROVI GUIDES, INC.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , G10L25/63
CPC classification number: G10L13/0335 , G10L25/63
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
-
32.
公开(公告)号:US11889167B2
公开(公告)日:2024-01-30
申请号:US17466077
申请日:2021-09-03
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Sandeep Jangra , Aman Puniyani , Mohammed Yasir
IPC: H04N21/8545 , H04N21/8405 , H04N21/845
CPC classification number: H04N21/8545 , H04N21/8405 , H04N21/8456
Abstract: Systems and methods are provided for presenting an interactive content item matching a user-selected category to a user for a desired duration. A user selects a category and selects a first interactive content item on a media system. The system calculates a total duration of a storyline from the selected interactive content item that matches the selected category (e.g., a genre “comedy”) and compares the calculated duration to a desired predetermined duration for which the user wishes to watch the selected show. If the system determines, for instance, that the total duration of the selected storyline is less than the predetermined duration, the system identifies scenes from another show and interleaves them with scenes from the first interactive content item to generate a combined interactive content item that satisfies the user viewing preferences.
-
公开(公告)号:US11722749B2
公开(公告)日:2023-08-08
申请号:US16528027
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Nikhil Gabhane , Raman Gupta , Aman Puniyani
IPC: H04N21/81 , H04N21/431 , H04N21/422
CPC classification number: H04N21/8133 , H04N21/42203 , H04N21/4314 , H04N21/4316 , H04N21/8146
Abstract: Methods and systems are described for providing content, such as a movie, with dialogue including a quotation that was input. For example, using a voice search a viewer may input a quotation famous from a movie to find the original fil and related content. The methods and systems use a quotation engine in a digital device to receive an input including the quotation and access a plurality of content items that include dialogue. The quotation engine identifies a subset of content items that include dialogue similar to the input quotation. The quotation engine accesses metadata of each of the subset of content, ranks the subset based on predetermined criteria and the metadata, and provides the ranked subset of the plurality of content items for consumption. The quotation engine may use a graphical user interface to identify the earliest release, trending content, or the program best known for the quote.
-
公开(公告)号:US20230206920A1
公开(公告)日:2023-06-29
申请号:US18118343
申请日:2023-03-07
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Sindhuja Chonat Sri , Aman Puniyani , Nishchit Mahajan
IPC: G10L15/22 , G06F16/638 , G06F16/683 , G06F16/635 , G10L15/08 , G10L25/51
CPC classification number: G10L15/22 , G06F16/638 , G06F16/683 , G06F16/635 , G10L15/08 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Systems and methods are described herein for disambiguating a voice search query that contains a command keyword by determining whether the user spoke a quotation from a content item and whether the user mimicked or approximated the way the quotation is spoken in the content item. The voice search query is transcribed into a string, and an audio signature of the voice search query is identified. Metadata of a quotation matching the string is retrieved from a database that includes audio signature information for the string as spoken within the content item. The audio signature of the voice search query is compared with the audio signature information in the metadata to determine whether the audio signature matches the audio signature information in the quotation metadata. If a match is detected, then a search result comprising an identifier of the content item from which the quotation comes is generated.
-
公开(公告)号:US11227593B2
公开(公告)日:2022-01-18
申请号:US16456275
申请日:2019-06-28
Applicant: Rovi Guides, Inc.
IPC: G10L15/22 , G06F16/2457 , G06F16/248 , G10L15/26 , G06F3/01 , G06K9/00 , G06F16/242
Abstract: Systems and methods are described herein for disambiguating a voice search query by determining whether the user made a gesture while speaking a quotation from a content item and whether the user mimicked or approximated a gesture made by a character in the content item when the character spoke the words quoted by the user. If so, a search result comprising an identifier of the content item is generated. A search result representing the content item from which the quotation comes may be ranked highest among other search results returned and therefore presented first in a list of search results. If the user did not mimic or approximate a gesture made by a character in the content item when the quotation is spoken in the content item, then a search result may not be generated for the content item or may be ranked lowest among other search results.
-
公开(公告)号:US20210319779A1
公开(公告)日:2021-10-14
申请号:US15931074
申请日:2020-05-13
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , G10L25/63
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
-
公开(公告)号:US20210174795A1
公开(公告)日:2021-06-10
申请号:US16709734
申请日:2019-12-10
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Ankur Aher
Abstract: The system provides a voice command recommendation to a user to avoid a non-voice command. The system determines a command that is expected to be received, and generates a voice command recommendation that corresponds to the predicted command. The predicted command can be based on the user's behavior, a plurality of users' behavior, environmental circumstances such as a phone call ring, or a combination thereof. The system may access one or more databases to determine the predicted command. The voice command recommendation may include a displayed notification that describes the recommended voice command, and exemplary voice inputs that are recognized. The system also activates an audio interface, such as a microphone, that is configured to receive a voice input. If the system receives a recognizable voice input at the audio interface that corresponds to the recommendation, the system performs the predicted command in response to receiving the voice input.
-
公开(公告)号:US20210063193A1
公开(公告)日:2021-03-04
申请号:US16557918
申请日:2019-08-30
Applicant: Rovi Guides, Inc.
Inventor: Nishchit Mahajan , Ankur Aher
IPC: G01C21/36
Abstract: Systems and methods are disclosed herein for providing uninterrupted media content during vehicle navigation. The disclosed techniques herein discuss determining directions from route data and navigation announcements for each of the directions. For each navigation announcement, a determination is made whether current playback of a media asset in a playlist ends within a predefined time threshold before the navigation announcement. In a positive determination, the playback of the playlist is paused until the navigation announcement has elapsed.
-
公开(公告)号:US20210034663A1
公开(公告)日:2021-02-04
申请号:US16528541
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G06F16/635 , G06F16/68 , G06F16/632 , G06F16/2457 , G10L15/22 , G10L15/187 , G06F17/27
Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata the indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities, which may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
-
公开(公告)号:US20240040210A1
公开(公告)日:2024-02-01
申请号:US18211020
申请日:2023-06-16
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Nikhil Gabhane , Raman Gupta , Aman Puniyani
IPC: H04N21/81 , H04N21/431 , H04N21/422
CPC classification number: H04N21/8133 , H04N21/8146 , H04N21/4314 , H04N21/4316 , H04N21/42203
Abstract: Methods and systems are described for providing content, such as a movie, with dialogue including a quotation that was input. For example, using a voice search a viewer may input a quotation famous from a movie to find the original fil and related content. The methods and systems use a quotation engine in a digital device to receive an input including the quotation and access a plurality of content items that include dialogue. The quotation engine identifies a subset of content items that include dialogue similar to the input quotation. The quotation engine accesses metadata of each of the subset of content, ranks the subset based on predetermined criteria and the metadata, and provides the ranked subset of the plurality of content items for consumption. The quotation engine may use a graphical user interface to identify the earliest release, trending content, or the program best known for the quote.
-
-
-
-
-
-
-
-
-