User-perceived latency while maintaining accuracy

Invention Grant

US11532312B2 User-perceived latency while maintaining accuracy (Active)

Patent Title: User-perceived latency while maintaining accuracy
Application No.: US17123087

Application Date: 2020-12-15
Publication No.: US11532312B2

Publication Date: 2022-12-20
Inventor: Hosam Adel Khalil, Emilian Stoimenov, Christopher Hakan Basoglu, Kshitiz Kumar, Jian Wu
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Main IPC: G10L15/30
IPC: G10L15/30; G10L15/16; G10L19/16; G10L25/51; G10L15/08

Abstract:

Disclosed speech recognition techniques improve user-perceived latency while maintaining accuracy by: receiving an audio stream, in parallel, by a primary (e.g., accurate) speech recognition engine (SRE) and a secondary (e.g., fast) SRE; generating, with the primary SRE, a primary result; generating, with the secondary SRE, a secondary result; appending the secondary result to a word list; and merging the primary result into the secondary result in the word list. Combining output from the primary and secondary SREs into a single decoder as described herein improves user-perceived latency while maintaining or improving accuracy, among other advantages.
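
The append-then-merge flow in the abstract can be illustrated with a minimal sketch. The abstract does not say how the merge is performed, so everything below is an assumption made for illustration: the `Word` record, the per-word timestamps, the function names `append_secondary` and `merge_primary`, and the rule that a primary result replaces any word whose time span it overlaps. The engines themselves, and their parallel consumption of the audio stream, are left out; their outputs are passed in as plain lists.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """Hypothetical word record; the field names are illustrative only."""
    text: str
    start: float  # seconds from the start of the audio stream
    end: float
    source: str   # "secondary" (fast) or "primary" (accurate)


def append_secondary(word_list: List[Word], result: List[Word]) -> None:
    """Append fast, provisional words so they can be displayed immediately."""
    word_list.extend(result)


def merge_primary(word_list: List[Word], result: List[Word]) -> None:
    """Replace words overlapped by the primary result's time span.

    Assumed merge rule: any word in the list that overlaps the interval
    covered by the primary result is dropped and the primary words take
    its place. Words outside that interval are kept unchanged.
    """
    if not result:
        return
    span_start = result[0].start
    span_end = result[-1].end
    kept = [w for w in word_list if w.end <= span_start or w.start >= span_end]
    kept.extend(result)
    kept.sort(key=lambda w: w.start)
    word_list[:] = kept  # update in place so callers see the merged list


# The secondary engine emits words quickly; the primary result arrives later.
words: List[Word] = []
append_secondary(words, [
    Word("wreck", 0.0, 0.4, "secondary"),
    Word("a", 0.4, 0.5, "secondary"),
    Word("nice", 0.5, 0.9, "secondary"),
])
append_secondary(words, [Word("beach", 0.9, 1.3, "secondary")])
merge_primary(words, [Word("recognize", 0.0, 0.9, "primary")])

print([w.text for w in words])  # ['recognize', 'beach']
```

In this sketch the user sees the provisional words as soon as the secondary engine produces them, and the displayed text is corrected once the primary result for the same stretch of audio arrives. The trailing word "beach" stays provisional until a later primary result covers it.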

Public/Granted literature

US20220189467A1 USER-PERCEIVED LATENCY WHILE MAINTAINING ACCURACY Public/Granted day: 2022-06-16

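IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/28	.Constructional details of speech recognition systems
G10L15/30	..Distributed recognition, e.g. in client-server systems, for mobile phones or network applications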