-
Publication No.: US12118978B2
Publication date: 2024-10-15
Application No.: US18387211
Filing date: 2023-11-06
Applicant: ROVI GUIDES, INC.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/06 , G10L13/00 , G10L13/02 , G10L13/033 , G10L13/08 , G10L15/00 , G10L15/10 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26 , G10L25/63
CPC classification number: G10L13/0335 , G10L25/63 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/00 , G10L15/10 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response by determining prosodic characteristics based on the response and on the prosodic character of the voice input. The system outputs the synthesized speech response, which provides an answer to the voice input that is more natural, more relevant, or both. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
-
Publication No.: US20240330609A1
Publication date: 2024-10-03
Application No.: US18742539
Filing date: 2024-06-13
Applicant: Rovi Guides, Inc.
Inventor: Ajay Kumar Mishra , Jeffry Copps Robert Jose
Abstract: Systems and methods are presented herein for generating a new language understanding model, based on a user request. A user may input a root language and a locale into an application for generating a student language model. The application may generate the student language model and may identify a teacher language model related to the student language model. The application may compare data from the identified teacher language model to the student language model. The application may determine a subset of data from the teacher language model is not contained in the student language model. If the application determines at least a subset of data from the teacher language model is not in the student language model, the application may add at least the subset of data from the teacher language model to the student language model.
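The compare-and-add step in this abstract can be sketched as a set difference followed by a merge. This is only an illustration: representing each language model's data as a dictionary from phrase to intent, and the name `fill_student_from_teacher`, are assumptions made here and are not taken from the patent.

```python
def fill_student_from_teacher(
    student: dict[str, str], teacher: dict[str, str]
) -> dict[str, str]:
    """Return a copy of the student data extended with teacher-only entries."""
    # Subset of teacher data that the student model does not contain.
    missing = {key: value for key, value in teacher.items() if key not in student}
    merged = dict(student)   # leave the caller's dictionary untouched
    merged.update(missing)   # add only the missing subset
    return merged


# Hypothetical data: a teacher model for en-US and a new student for en-IN.
teacher_data = {"play": "PLAY_MEDIA", "pause": "PAUSE_MEDIA", "rewind": "SEEK_BACK"}
student_data = {"play": "PLAY_MEDIA"}
merged_data = fill_student_from_teacher(student_data, teacher_data)
```

Entries the student already has are kept as they are, so locale-specific data in the student model is not overwritten by the teacher.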
-
Publication No.: US12096076B2
Publication date: 2024-09-17
Application No.: US18242835
Filing date: 2023-09-06
Applicant: Rovi Guides, Inc.
Inventor: Susanto Sen , Ankur Anil Aher , Jeffry Copps Robert Jose
IPC: H04N21/442 , H04N21/45 , H04N21/472 , H04N21/81 , H04N21/845
CPC classification number: H04N21/44222 , H04N21/44218 , H04N21/4532 , H04N21/47217 , H04N21/812 , H04N21/8456
Abstract: Systems and methods are described for managing presentation of content. An action may be scheduled to occur at a first time within the presentation of a media asset, where the action may interrupt the presentation of the media asset. When a current presentation position is approaching the first time, an option to delay the action may be generated for presentation. In response to receiving selection of the option to delay the action, the action may be scheduled to occur at a later second time within the presentation of the media asset.
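A minimal sketch of the delay logic described in this abstract follows. The 10-second lead window, the 120-second delay, and the names `ScheduledAction`, `should_offer_delay`, and `delay_action` are all hypothetical choices for illustration; the abstract does not specify how "approaching" is measured or how far the action is moved.

```python
from dataclasses import dataclass


@dataclass
class ScheduledAction:
    name: str
    time_s: float  # position within the media asset, in seconds


def should_offer_delay(position_s: float, action: ScheduledAction,
                       lead_s: float = 10.0) -> bool:
    """True when playback is within lead_s seconds before the action (assumed rule)."""
    return 0 <= action.time_s - position_s <= lead_s


def delay_action(action: ScheduledAction, delay_s: float) -> ScheduledAction:
    """Reschedule the action to a later second time within the presentation."""
    return ScheduledAction(action.name, action.time_s + delay_s)


ad_break = ScheduledAction("ad break", 600.0)
if should_offer_delay(595.0, ad_break):
    # Assume the user selected the "delay" option.
    ad_break = delay_action(ad_break, 120.0)
```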
-
Publication No.: US12062386B2
Publication date: 2024-08-13
Application No.: US17876825
Filing date: 2022-07-29
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Reda Harb
IPC: G11B27/00 , G11B27/036 , G11B27/34 , H04N5/00
CPC classification number: G11B27/34 , G11B27/005 , G11B27/036
Abstract: A plurality of video clips comprising audio of a song are identified from a pool of videos. A portion of the song corresponding to the audio in each video clip is then determined. Using the determined portion, each video clip is mapped to a timeline of the song. A subset of video clips comprising audio from different sections of the song is then selected for inclusion in a personalized video. The personalized video is generated from the selected subset of video clips and is then generated for display to the user.
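One way to picture the timeline mapping and subset selection is the sketch below. It assumes each clip has already been matched to a start and end offset within the song, and it uses a simple greedy rule (take clips in timeline order, skipping any that overlap the previous pick). That rule and the names `Clip` and `select_clips` are illustrative assumptions, not the method claimed in the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clip:
    clip_id: str
    song_start_s: float  # where the clip's audio begins on the song timeline
    song_end_s: float    # where the clip's audio ends on the song timeline


def select_clips(clips: list[Clip]) -> list[Clip]:
    """Pick clips that cover different, non-overlapping sections of the song."""
    selected: list[Clip] = []
    cursor = 0.0  # end of the last selected section
    for clip in sorted(clips, key=lambda c: c.song_start_s):
        if clip.song_start_s >= cursor:
            selected.append(clip)
            cursor = clip.song_end_s
    return selected


pool = [
    Clip("b", 10.0, 40.0),
    Clip("a", 0.0, 30.0),
    Clip("d", 60.0, 90.0),
    Clip("c", 30.0, 60.0),
]
chosen_ids = [clip.clip_id for clip in select_clips(pool)]
```

With this pool the greedy pass keeps clips a, c, and d, which together span the first 90 seconds of the song without repeating a section.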
-
Publication No.: US20240265213A1
Publication date: 2024-08-08
Application No.: US18436347
Filing date: 2024-02-08
Applicant: Rovi Guides, Inc.
Inventor: Ajay Kumar Mishra , Jeffry Copps Robert Jose
IPC: G06F40/58 , G06F16/2452 , G06F40/263 , G06F40/47 , G06F40/51
CPC classification number: G06F40/58 , G06F16/24522 , G06F40/263 , G06F40/47 , G06F40/51
Abstract: Systems and methods for handling multilingual queries are provided. One example method includes receiving, at a computing device, an input, wherein the input comprises a multilingual query comprising at least a first source language and a second source language. The multilingual query is translated, word for word, into a destination language to produce a monolingual query, with the word order of the multilingual query and the word order of the monolingual query being the same. The monolingual query is processed using natural language processing to map the monolingual query to a natural language query in the destination language.
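The word-for-word translation step can be illustrated with a toy lookup that preserves word order. The lexicon, the example query, and the function name `translate_word_for_word` are hypothetical; a real system would use a proper translation resource, and the later natural language processing step that reorders the result is not shown.

```python
def translate_word_for_word(query: str, lexicon: dict[str, str]) -> str:
    """Translate each word independently, keeping the original word order."""
    # Words missing from the lexicon (names, destination-language words) pass through.
    return " ".join(lexicon.get(word.lower(), word) for word in query.split())


# Toy lexicon mixing Spanish and French source words, with English as destination.
toy_lexicon = {"muestra": "show", "películas": "movies", "avec": "with"}
monolingual = translate_word_for_word("muestra películas avec Tom Hanks", toy_lexicon)
```

Because each word is replaced in place, the monolingual query has exactly as many words as the multilingual one and in the same order.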
-
Publication No.: US12046230B2
Publication date: 2024-07-23
Application No.: US18113984
Filing date: 2023-02-24
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Mithun Umesh
CPC classification number: G10L15/063 , G06F16/23 , G06F16/90332 , G10L15/142 , G10L15/16 , G10L15/18 , G10L15/26
Abstract: Systems and methods for determining to perform an action of a query using a trained natural language model of a natural language understanding (NLU) system are disclosed herein. A text string that corresponds to a prescribed action and includes at least a content entity is received. A determination is made as to whether the text string corresponds to an audio input of a first group. In response to determining the text string corresponds to an audio input of a first group, a determination is made as to whether the text string includes an obsequious expression. In response to determining the text string corresponds to an audio input of a first group and in response to determining the text string includes an obsequious expression, a determination is made to perform the prescribed action. In response to determining the text string corresponds to an audio input of a first group and in response to determining the text string does not include the obsequious expression, a determination is made to not perform the prescribed action.
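The decision flow in this abstract reduces to two checks, sketched below. The list of polite phrases and the name `should_perform` are assumptions. The abstract also does not say what happens when the input is not from the first group; performing the action in that case is an assumption made here only so the function is total.

```python
import re

# Hypothetical set of "obsequious" (polite) expressions.
POLITE_PATTERN = re.compile(r"\b(please|thank you|kindly)\b", re.IGNORECASE)


def should_perform(text: str, in_first_group: bool) -> bool:
    """Decide whether to perform the prescribed action for a transcribed query."""
    if not in_first_group:
        # Assumption: inputs outside the first group are not gated.
        return True
    # First-group inputs are performed only when a polite expression is present.
    return POLITE_PATTERN.search(text) is not None


polite_result = should_perform("Please play Frozen", in_first_group=True)
blunt_result = should_perform("play Frozen", in_first_group=True)
```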
-
Publication No.: US12039265B2
Publication date: 2024-07-16
Application No.: US17108549
Filing date: 2020-12-01
Applicant: Rovi Guides, Inc.
Inventor: Ajay Kumar Mishra , Jeffry Copps Robert Jose
IPC: G06F40/279 , G06F16/2455 , G06F40/263 , G06N20/00
CPC classification number: G06F40/279 , G06F16/2455 , G06F40/263 , G06N20/00
Abstract: Systems and methods are presented herein for generating a new language understanding model, based on a user request. A user may input a root language and a locale into an application for generating a student language model. The application may generate the student language model and may identify a teacher language model related to the student language model. The application may compare data from the identified teacher language model to the student language model. The application may determine a subset of data from the teacher language model is not contained in the student language model. If the application determines at least a subset of data from the teacher language model is not in the student language model, the application may add at least the subset of data from the teacher language model to the student language model.
-
Publication No.: US20240194185A1
Publication date: 2024-06-13
Application No.: US18581666
Filing date: 2024-02-20
Applicant: Rovi Guides, Inc.
Inventor: Ankur Anil Aher , Jeffry Copps Robert Jose
CPC classification number: G10L15/01 , G10L15/063 , G10L15/075 , G10L15/1815 , G10L15/20
Abstract: Systems and methods are disclosed and described for correcting errors in ASR transcriptions. For an incorrect transcription, different words or phrases from the transcription, and/or related words or phrases, are submitted as hint words to the ASR system, and the voice query is submitted again, to determine new transcriptions. This process is repeated with different transcription terms, until a different and more proper transcription is generated. This increases the accuracy of ASR systems.
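The retry loop described in this abstract might look roughly like the sketch below. The `transcribe` and `is_acceptable` callables stand in for an ASR system that accepts hint words and for whatever check decides a transcription is better; both, along with `retry_with_hints` and the stub `fake_asr`, are hypothetical and do not correspond to any real ASR API.

```python
from typing import Callable, Iterable, List, Optional


def retry_with_hints(
    audio: bytes,
    first_transcription: str,
    hint_words: Iterable[str],
    transcribe: Callable[[bytes, List[str]], str],
    is_acceptable: Callable[[str], bool],
) -> Optional[str]:
    """Resubmit the voice query with one hint at a time until a new result passes."""
    for hint in hint_words:
        candidate = transcribe(audio, [hint])
        if candidate != first_transcription and is_acceptable(candidate):
            return candidate
    return None  # no hint produced a different, acceptable transcription


def fake_asr(audio: bytes, hints: List[str]) -> str:
    """Stub ASR: only the hint 'weeknd' steers it to the artist name."""
    return "play the weeknd" if "weeknd" in hints else "play the weekend"


corrected = retry_with_hints(
    b"",                        # placeholder audio
    "play the weekend",         # the incorrect first transcription
    ["weekend", "weeknd"],      # a word from the transcription and a related word
    fake_asr,
    lambda text: "weeknd" in text,
)
```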
-
Publication No.: US12002466B2
Publication date: 2024-06-04
Application No.: US17984394
Filing date: 2022-11-10
Applicant: Rovi Guides, Inc.
Inventor: Ankur Anil Aher , Jeffry Copps Robert Jose
CPC classification number: G10L15/22 , G10L25/18 , G10L25/60 , G10L2015/223
Abstract: Systems and methods are provided herein for avoiding inadvertently triggering a voice assistant with audio played through a speaker. An audio signal is captured by sampling a microphone of the voice assistant at a sampling frequency that is higher than an expected finite sampling frequency of previously recorded audio played through the speaker to generate a voice data sample. A quality metric of the generated voice data sample is calculated by determining whether the generated voice data sample comprises artifacts resulting from previous compression or approximation by the expected finite sampling frequency. Based on the calculated quality metric, it is determined whether the captured audio signal is previously recorded audio played through the speaker. Responsive to the determination that the captured audio signal is previously recorded audio played through the speaker, the voice assistant refrains from being activated.
-
Publication No.: US11948551B2
Publication date: 2024-04-02
Application No.: US17968239
Filing date: 2022-10-18
Applicant: Rovi Guides, Inc.
Inventor: Ankur Anil Aher , Jeffry Copps Robert Jose
CPC classification number: G10L15/01 , G10L15/063 , G10L15/075 , G10L15/1815 , G10L15/20
Abstract: Systems and methods are disclosed and described for correcting errors in ASR transcriptions. For an incorrect transcription, different words or phrases from the transcription, and/or related words or phrases, are submitted as hint words to the ASR system, and the voice query is submitted again, to determine new transcriptions. This process is repeated with different transcription terms, until a different and more proper transcription is generated. This increases the accuracy of ASR systems.