-
公开(公告)号:US20220375465A1
公开(公告)日:2022-11-24
申请号:US17623379
申请日:2019-12-20
Applicant: Rovi Guides, Inc.
Inventor: Ankur Anil Aher , Jeffry Copps Robert Jose
Abstract: Systems and methods for processing audio streams are disclosed herein. An audio stream including speech content is received. The audio stream is compacted to generate a compacted audio stream and the compacted audio stream is transmitted to an automatic speech recognition (ASR) service for transcription of the speech content to text content. In response to transmitting the compacted audio stream for transcription, text content, a transcription of the audio stream, is received from the ASR service.
-
公开(公告)号:US11507572B2
公开(公告)日:2022-11-22
申请号:US17038643
申请日:2020-09-30
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Ajay Kumar Mishra
IPC: G06F16/242 , G06F16/248 , G10L15/26 , G06F16/2457 , G06F16/28
Abstract: Systems and methods are described herein for interpreting natural language search queries that account for contextual relevance of words of the search query that would ordinarily not be processed, including, for example, processing each word of the query. Each term or phrase is associated with a respective part of speech, and a frequency of occurrence of a combination of adjacent terms or phrases public domain is determined. A relevance of each term is then determined based on its respective type of term and frequency of occurrence in the public domain. The natural language search query is then interpreted based on the importance or relevance of each term.
-
公开(公告)号:US11474999B2
公开(公告)日:2022-10-18
申请号:US17038643
申请日:2020-09-30
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Ajay Kumar Mishra
IPC: G06F16/242 , G06F16/248 , G10L15/26 , G06F16/2457 , G06F16/28
Abstract: Systems and methods are described herein for interpreting natural language search queries that account for contextual relevance of words of the search query that would ordinarily not be processed, including, for example, processing each word of the query. Each term or phrase is associated with a respective part of speech, and a frequency of occurrence of a combination of adjacent terms or phrases public domain is determined. A relevance of each term is then determined based on its respective type of term and frequency of occurrence in the public domain. The natural language search query is then interpreted based on the importance or relevance of each term.
-
公开(公告)号:US20220318283A1
公开(公告)日:2022-10-06
申请号:US17218963
申请日:2021-03-31
Applicant: Rovi Guides, Inc.
Inventor: Ajay Kumar Mishra , Jeffry Copps Robert Jose
IPC: G06F16/332 , G06F16/33 , G06F16/31 , G06N20/00
Abstract: Systems and methods are described to access a set of reattempt query pairs, where each respective pair comprises an initial query and a reattempt of the initial query, and is associated with an indication of whether a reply generated for output based on the respective query pair was acceptable. In response to determining that a second query received after a first query constitutes a reattempt of the first query, a query pair in the set of reattempt query pairs may be identified that matches at least one of the first query and the second query, and is associated with an indication that a reply generated for output based on the query pair was acceptable. A search may be performed based on the identified query pair in the set of reattempt query pairs, and a reply may be generated for output based on the performed search.
-
公开(公告)号:US11450306B2
公开(公告)日:2022-09-20
申请号:US15931261
申请日:2020-05-13
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , A63H11/00 , A63H3/31 , G10L15/22 , G10L25/63
Abstract: The system trains a model to provide information used to provide a synthesized speech response to a voice input. The model takes as input prosodic information that may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example. The system receives a plurality of voice inputs, each associated with prosodic metric, as well as a plurality of responses, each also associated with prosodic metrics. The system trains the model based on the plurality of voice inputs, the plurality of responses, the prosodic metrics of the voice inputs, and the prosodic metrics of the responses such that the model outputs information used to generate the response. The model may also take as input user profile information, emotion metrics, and transition information to generate output. The output of the training model may be used by the system to provide synthesized speech responses having relevant prosodic character to received voice inputs.
-
公开(公告)号:US11443731B2
公开(公告)日:2022-09-13
申请号:US15931074
申请日:2020-05-13
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Jeffry Copps Robert Jose
IPC: G10L13/033 , G10L25/63 , A63H11/00
Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
-
公开(公告)号:US20220269473A1
公开(公告)日:2022-08-25
申请号:US17180254
申请日:2021-02-19
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose
IPC: G06F3/16
Abstract: Systems and methods for adjusting a sound level during a conference call are disclosed herein. Conferencing application receives an audio input including a first sound and a second sound. A user selectable element is generated for each sound in an user interface, where a user selection setting a first user selectable element associated with the first sound at a user-specified level is received. The sound level for the first sound is adjusted based on the user selection and output at the user-specified level while the second sound is output at a default sound level.
-
公开(公告)号:US20220157348A1
公开(公告)日:2022-05-19
申请号:US17591358
申请日:2022-02-02
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Mithun Umesh , Sindhuja Chonat Sri
Abstract: Systems and methods for generating individualized content trailers. Content such as a video is divided into segments each representing a set of common features. With reference to a set of stored user preferences, certain segments are selected as aligning with the user's interests. Each selected segment may then be assigned a label corresponding to the plot portion or element to which it belongs. A coherent trailer may then be assembled from the selected segments, ordered according to their plot elements. This allows a user to see not only segments containing subject matter that aligns with their interests, but also a set of such segments arranged to give the user an idea of the plot, and a sense of drama, increasing the likelihood of engagement with the content.
-
公开(公告)号:US11281291B1
公开(公告)日:2022-03-22
申请号:US17075229
申请日:2020-10-20
Applicant: Rovi Guides, Inc.
Abstract: Systems and methods are described for extended reality environment interaction. An extended reality environment including an object is generated for display, and a sensor is used to detect a gaze directed to a first portion of the extended reality environment, where the object is included in the first portion of the extended reality environment. Opacity-based indicators are generated for display in the vicinity of the first portion of the extended reality environment, and a boundary of the object is identified. Based on the identified boundary of the object, an opacity of the at least one of the opacity-based indicators is varied.
-
公开(公告)号:US20220067308A1
公开(公告)日:2022-03-03
申请号:US17001911
申请日:2020-08-25
Applicant: Rovi Guides, Inc.
Inventor: Ajay Kumar Mishra , Jeffry Copps Robert Jose
IPC: G06F40/58 , G06F16/2452 , G06F40/263 , G06F40/51 , G06F40/47
Abstract: Systems and methods for handling multilingual queries are provided. One example method includes receiving, at a computing device, an input, wherein the input comprises a multi-lingual query comprising at least a first source language and a second source language. The multi-lingual query is translated, word for word, into a destination language to produce a monolingual query, with the word order of the multilingual query and the word order of the monolingual query being the same. The monolingual query is processed using natural language processing to map the mono-lingual query to a natural language query in the destination language.
-
-
-
-
-
-
-
-
-