-
公开(公告)号:US20210034663A1
公开(公告)日:2021-02-04
申请号:US16528541
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G06F16/635 , G06F16/68 , G06F16/632 , G06F16/2457 , G10L15/22 , G10L15/187 , G06F17/27
Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata the indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities, which may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
-
公开(公告)号:US20210035587A1
公开(公告)日:2021-02-04
申请号:US16528550
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G10L15/26 , G10L15/22 , G10L13/02 , G10L15/187
Abstract: The system identifies one or more entities or content items among a plurality of stored information. The system generates an audio file based on a first text string that represents the entity or content item. Based on the first text string and at least one speech criterion, the system generating, using a speech-to-text module a second text string based on the audio file. The system then compares the text strings and stores the second text string if it is not identical to the first text string. The system generates metadata that includes results from text-speech-text conversions to forecast possible misidentifications when responding to voice queries during search operations. The metadata includes alternative representations of the entity, to improve reachability in cases where the speech-to-text conversion does generate a pr
-
公开(公告)号:US20210034662A1
公开(公告)日:2021-02-04
申请号:US16528539
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G06F16/635 , G06F16/68 , G10L15/22 , G10L15/187 , G06F17/27
Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata that indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities that may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
-
公开(公告)号:US11494434B2
公开(公告)日:2022-11-08
申请号:US16528541
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G06F16/635 , G06F16/68 , G06F16/632 , G06F16/2457 , G10L15/187 , G10L15/22 , G06F40/295 , G10L15/08
Abstract: The system receives a voice query at an audio interface and converts the voice query to text. The system can determine pronunciation information during conversion and generate metadata the indicates a pronunciation of one or more words of the query, include phonetic information in the text query, or both. A query includes one or more entities, which may be more accurately identified based on pronunciation. The system searches for information, content, or both among one or more databases based on the generated text query, pronunciation information, user profile information, search histories or trends, and optionally other information. The system identifies one or more entities or content items that match the text query, and retrieves the identified information to provide to the user.
-
公开(公告)号:US11410656B2
公开(公告)日:2022-08-09
申请号:US16528550
申请日:2019-07-31
Applicant: Rovi Guides, Inc.
Inventor: Ankur Aher , Indranil Coomar Doss , Aashish Goyal , Aman Puniyani , Kandala Reddy , Mithun Umesh
IPC: G10L15/26 , G10L15/22 , G10L15/187 , G10L13/02
Abstract: The system identifies one or more entities or content items among a plurality of stored information. The system generates an audio file based on a first text string that represents the entity or content item. Based on the first text string and at least one speech criterion, the system generating, using a speech-to-text module a second text string based on the audio file. The system then compares the text strings and stores the second text string if it is not identical to the first text string. The system generates metadata that includes results from text-speech-text conversions to forecast possible misidentifications when responding to voice queries during search operations. The metadata includes alternative representations of the entity.
-
-
-
-