Patent search ap:("Microsoft Technology Licensing Page LLC") AND inv:"Shuangyu Chang"

1.

Granted invention patent
Online language model interpolation for automatic speech recognition (Active)

Publication (announcement) number: US11562738B2

Publication (announcement) date: 2023-01-24

Application number: US16665574

Application date: 2019-10-28

Applicant: Microsoft Technology Licensing, LLC

Inventor: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Veljko Miljanic, Aadyot Bhatnagar, Hosam Khalil, Christopher Basoglu

IPC: G10L15/00, G10L15/18, G10L15/193, G06Q10/10, G10L15/197, G10L15/22, G10L15/30

Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.

2.

Granted invention patent
Incremental utterance decoder combination for efficient and accurate decoding (Active)

Publication (announcement) number: US09552817B2

Publication (announcement) date: 2017-01-24

Application number: US14219642

Application date: 2014-03-19

Applicant: Microsoft Technology Licensing, LLC

Inventor: Shuangyu Chang, Michael Levit, Abhik Lahiri, Barlas Oguz, Benoit Dumoulin

IPC: G10L19/005, G10L15/32

CPC classification number: G10L15/32, G10L15/063, G10L15/14, G10L19/005

Abstract: An incremental speech recognition system. The incremental speech recognition system incrementally decodes a spoken utterance using an additional utterance decoder only when the additional utterance decoder is likely to add significant benefit to the combined result. The available utterance decoders are ordered in a series based on accuracy, performance, diversity, and other factors. A recognition management engine coordinates decoding of the spoken utterance by the series of utterance decoders, combines the decoded utterances, and determines whether additional processing is likely to significantly improve the recognition result. If so, the recognition management engine engages the next utterance decoder and the cycle continues. If the accuracy cannot be significantly improved, the result is accepted and decoding stops. Accordingly, a decoded utterance with accuracy approaching the maximum for the series is obtained without decoding the spoken utterance using all utterance decoders in the series, thereby minimizing resource usage.
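
The control loop described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the decoder interface (a callable returning a transcript and a confidence), the confidence-weighted vote in `combine`, and the `accept_threshold` stopping rule are all assumptions introduced here.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple

# Hypothetical decoder interface: audio in, (transcript, confidence in [0, 1]) out.
Decoder = Callable[[bytes], Tuple[str, float]]


def combine(results: List[Tuple[str, float]]) -> Tuple[str, float]:
    """Confidence-weighted vote (an assumed combination rule).

    Returns the best-supported transcript and its total confidence
    averaged over the number of decoders run so far.
    """
    weights: Counter = Counter()
    for text, conf in results:
        weights[text] += conf
    best, weight = max(weights.items(), key=lambda kv: kv[1])
    return best, weight / len(results)


def incremental_decode(
    audio: bytes,
    decoders: Sequence[Decoder],
    accept_threshold: float = 0.8,  # assumed stopping criterion
) -> Tuple[str, float, int]:
    """Run decoders in their given order and stop early once the combined
    result looks good enough that a further decoder is unlikely to help."""
    results: List[Tuple[str, float]] = []
    best, score, used = "", 0.0, 0
    for decoder in decoders:
        results.append(decoder(audio))
        used += 1
        best, score = combine(results)
        if score >= accept_threshold:
            break  # accept the result; remaining decoders are never invoked
    return best, score, used
```

The decoders are assumed to be pre-ordered (for example, cheapest or most accurate first); the third return value reports how many were actually run, which is where the resource saving shows up.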

3.

Invention patent application
SPEECH RECOGNITION ERROR DIAGNOSIS (Active)

Publication (announcement) number: US20160253989A1

Publication (announcement) date: 2016-09-01

Application number: US14634714

Application date: 2015-02-27

Applicant: Microsoft Technology Licensing, LLC

Inventor: Shiun-Zu Kuo, Thomas Reutter, Yifan Gong, Mark T. Hanson, Ye Tian, Shuangyu Chang, Jon Hamaker, Qi Miao, Yuancheng Tu

IPC: G10L15/01, G10L15/26, G10L15/19

CPC classification number: G10L15/01, G10L15/183

Abstract: Techniques and technologies for diagnosing speech recognition errors are described. In an example implementation, a system for diagnosing speech recognition errors may include an error detection module configured to determine that a speech recognition result is at least partially erroneous, and a recognition error diagnostics module. The recognition error diagnostics module may be configured to (a) perform a first error analysis of the at least partially erroneous speech recognition result to provide a first error analysis result; (b) perform a second error analysis of the at least partially erroneous speech recognition result to provide a second error analysis result; and (c) determine at least one category of recognition error associated with the at least partially erroneous speech recognition result based on a combination of the first error analysis result and the second error analysis result.

4.

Granted invention patent
Organizational-based language model generation (Active)

Publication (announcement) number: US11676576B2

Publication (announcement) date: 2023-06-13

Application number: US17400055

Application date: 2021-08-11

Applicant: Microsoft Technology Licensing, LLC

Inventor: Ziad Al Bawab, Anand U Desai, Cem Aksoylar, Michael Levit, Xin Meng, Shuangyu Chang, Suyash Choudhury, Dhiresh Rawal, Tao Li, Rishi Girish, Marcus Jager, Ananth Rampura Sheshagiri Rao

IPC: G10L15/06, G06N20/00, G10L15/14, G10L15/183

CPC classification number: G10L15/063, G06N20/00, G10L15/14, G10L15/183

Abstract: Systems and methods are provided for acquiring training data and building an organizational-based language model based on the training data. Organizational data is generated via one or more applications associated with an organization; the collected organizational data is aggregated and filtered into training data that is used to train an organizational-based language model for speech processing.

5.

Granted invention patent
Meeting-adapted language model for speech recognition (Active)

Publication (announcement) number: US11348574B2

Publication (announcement) date: 2022-05-31

Application number: US16531435

Application date: 2019-08-05

Applicant: Microsoft Technology Licensing, LLC

Inventor: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth

IPC: G10L15/193, G06Q10/10, G10L15/197, G10L15/22, G10L15/30

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

6.

Granted invention patent
Eyes-off training for automatic speech recognition (Active)

Publication (announcement) number: US10679610B2

Publication (announcement) date: 2020-06-09

Application number: US16036721

Application date: 2018-07-16

Applicant: Microsoft Technology Licensing, LLC

Inventor: Hemant Malhotra, Shuangyu Chang, Pradip Kumar Fatehpuria

IPC: G10L15/06, G06F40/279, G06F40/166, G10L15/02, G10L15/26

Abstract: A method for eyes-off training of a dictation system includes translating an audio signal featuring speech audio of a speaker into an initial recognized text using a previously-trained general language model. The initial recognized text is provided to the speaker for error correction. The audio signal is re-translated into an updated recognized text using a specialized language model biased to recognize words included in the corrected text. The general language model is retrained in an “eyes-off” manner, based on the audio signal and the updated recognized text.

7.

Invention patent application
CORRECTION OF SPEECH RECOGNITION ON REPETITIVE QUERIES (Under examination, published)

Publication (announcement) number: US20190287519A1

Publication (announcement) date: 2019-09-19

Application number: US15920231

Application date: 2018-03-13

Applicant: Microsoft Technology Licensing, LLC

Inventor: Meryem Pinar Donmez Ediz, Ranjitha Gurunath Kulkarni, Shuangyu Chang, Nitin Kamra

IPC: G10L15/197, G10L15/22, G10L15/18, G10L15/01, G10L15/07

Abstract: Disclosed in various examples are methods, systems, and machine-readable mediums for providing improved computer implemented speech recognition by detecting and correcting speech recognition errors during a speech session. The system recognizes repeated speech commands from a user in a speech session that are similar or identical to each other. To correct these repeated errors, the system creates a customized language model that is then utilized by the language modeler to produce a refined prediction of the meaning of the repeated speech commands. The custom language model may comprise clusters of similar past predictions of speech commands from the speech session of the user.

8.

Granted invention patent
Language modeling based on spoken and unspeakable corpuses (Active)

Publication (announcement) number: US09761220B2

Publication (announcement) date: 2017-09-12

Application number: US14711447

Application date: 2015-05-13

Applicant: Microsoft Technology Licensing, LLC

Inventor: Michael Levit, Shuangyu Chang, Benoit Dumoulin

IPC: G10L15/06, G10L15/10, G10L15/14, G10L15/18, G10L15/19

CPC classification number: G10L15/063, G10L15/10, G10L15/14, G10L15/18, G10L15/19, G10L2015/0633, G10L2015/0635

Abstract: A computer system for language modeling may collect training data from one or more information sources, generate a spoken corpus containing text of transcribed speech, and generate a typed corpus containing typed text. The computer system may derive feature vectors from the spoken corpus, analyze the typed corpus to determine feature vectors representing items of typed text, and generate an unspeakable corpus by filtering the typed corpus to remove each item of typed text represented by a feature vector that is within a similarity threshold of a feature vector derived from the spoken corpus. The computer system may derive feature vectors from the unspeakable corpus and train a classifier to perform discriminative data selection for language modeling based on the feature vectors derived from the spoken corpus and the feature vectors derived from the unspeakable corpus.
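
The filtering step in this abstract (dropping typed items whose feature vector falls within a similarity threshold of a spoken-corpus vector) can be illustrated with a small sketch. The bag-of-words `featurize` function, the use of cosine similarity, and the `threshold` value are assumptions made here for illustration; the abstract does not specify them, and the later classifier-training step is omitted.

```python
import math
from collections import Counter
from typing import List


def featurize(text: str) -> Counter:
    """Hypothetical feature vector: lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def unspeakable_corpus(
    spoken: List[str], typed: List[str], threshold: float = 0.5
) -> List[str]:
    """Keep only typed items that are NOT similar to any spoken item.

    An item is removed when its similarity to at least one spoken vector
    reaches the (assumed) threshold.
    """
    spoken_vecs = [featurize(s) for s in spoken]
    kept = []
    for item in typed:
        vec = featurize(item)
        if all(cosine(vec, s) < threshold for s in spoken_vecs):
            kept.append(item)
    return kept
```

For instance, with spoken items such as "what is the weather today", a typed query like "what is the weather" would be filtered out, while a pasted URL would remain in the unspeakable corpus.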

9.

Granted invention patent
Custom display post processing in speech recognition (Active)

Publication (announcement) number: US12061861B2

Publication (announcement) date: 2024-08-13

Application number: US17815211

Application date: 2022-07-26

Applicant: Microsoft Technology Licensing, LLC

Inventor: Wei Liu, Padma Varadharajan, Piyush Behre, Nicholas Kibre, Edward C. Lin, Shuangyu Chang, Che Zhao, Khuram Shahid, Heiko Willy Rahmel

IPC: G06F40/284, G06F40/117, G06F40/151, G06F40/166

CPC classification number: G06F40/151, G06F40/117, G06F40/166, G06F40/284

Abstract: Solutions for custom display post processing (DPP) in speech recognition (SR) use a customized multi-stage DPP pipeline that transforms a stream of SR tokens from lexical form to display form. A first transformation stage of the DPP pipeline receives the stream of tokens, in turn, by an upstream filter, a base model stage, and a downstream filter, and transforms a first aspect of the stream of tokens (e.g., disfluency, inverse text normalization (ITN), capitalization, etc.) from lexical form into display form. The upstream filter and/or the downstream filter alter the stream of tokens to change the default behavior of the DPP pipeline into custom behavior. Additional transformation stages of the DPP pipeline perform further transforms, allowing for outputting final text in a display format that is customized for a specific user. This permits each user to efficiently leverage a common baseline DPP pipeline to produce a custom output.

10.

Granted invention patent
Meeting-adapted language model for speech recognition (Active)

Publication (announcement) number: US11636854B2

Publication (announcement) date: 2023-04-25

Application number: US17752623

Application date: 2022-05-24

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventor: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth

IPC: G10L15/193, G06Q10/10, G10L15/197, G10L15/22, G10L15/30, G06Q10/107, G06Q10/1093

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.
