专利检索 ap:("Microsoft Technology Licensing, LLC") AND inv:"GONG, Yifan" 第 2 页

11.

发明公开
ATTENTIVE ADVERSARIAL DOMAIN-INVARIANT TRAINING 审中-公开

公开(公告)号：EP3956880A1

公开(公告)日：2022-02-23

申请号：EP20713802.5

申请日：2020-03-02

申请人： Microsoft Technology Licensing, LLC

发明人： MENG, Zhong , LI, Jinyu , GONG, Yifan

IPC分类号： G10L15/06 , G10L15/16 , G06N3/08 , G10L15/02 , G10L15/22 , G06N3/04

12.

发明公开
VARIABLE-COMPONENT DEEP NEURAL NETWORK FOR ROBUST SPEECH RECOGNITION 审中-公开
标题翻译：用于鲁棒语音识别的变分量深度神经网络

公开(公告)号：EP3192071A1

公开(公告)日：2017-07-19

申请号：EP14901500.0

申请日：2014-09-09

申请人： Microsoft Technology Licensing, LLC

发明人： LI, Jinyu , ZHAO, Rui , GONG, Yifan

IPC分类号： G10L15/20 , G10L15/06

CPC分类号： G10L15/20 , G10L15/16 , G10L19/24 , G10L25/84

摘要： System and method for speech recognition incorporating environmental variables are provided. The system comprises: a speech capture device (202); a feature extraction module (204); an environment variable module (206), wherein the environment variable module determines a value for an environment variable; and a speech recognition decoder (208), wherein the speech recognition decoder utilizes a deep neural network (DNN) to recognize speech captured by the speech capture device, wherein one or more components of the DNN are modeled as a set of functions of the environment variable.

摘要翻译： 提供了结合环境变量的语音识别系统和方法。该系统和方法捕捉到需要识别的语音。然后使用可变分量深度神经网络（DNN）识别语音。可变组件DNN通过合并环境变量来处理捕获的语音。环境变量可以是取决于环境条件或用户，客户端设备和环境的关系的任何变量。例如，环境变量可以基于环境噪声并表示为信噪比。可变组件DNN可以以不同方式结合环境变量。例如，可以将环境变量并入DNN的加权矩阵和偏差，DNN的隐藏层的输出或DNN的节点的激活功能。

13.

发明公开
ADAPTIVE FRAME BATCHING TO REDUCE SPEECH RECOGNITION LATENCY 审中-公开

公开(公告)号：EP4091163A1

公开(公告)日：2022-11-23

申请号：EP20842072.9

申请日：2020-12-15

申请人： Microsoft Technology Licensing, LLC

发明人： KHALIL, Hosam A. , STOIMENOV, Emilian Y. , GONG, Yifan , LIU, Chaojun , BASOGLU, Christopher H. , AGARWAL, Amit K. , PARIHAR, Naveen , PATHAK, Sayan

IPC分类号： G10L15/32 , G10L15/02 , G10L15/16

14.

发明公开
ADVERSARIAL SPEAKER ADAPTATION 审中-公开

公开(公告)号：EP3956881A1

公开(公告)日：2022-02-23

申请号：EP20718071.2

申请日：2020-03-16

申请人： Microsoft Technology Licensing, LLC

发明人： MENG, Zhong , LI, Jinyu , GONG, Yifan

IPC分类号： G10L15/07 , G10L15/16 , G10L25/30

15.

发明公开
AUTOMATED SPEECH RECOGNITION CONFIDENCE CLASSIFIER 审中-公开

公开(公告)号：EP3953928A1

公开(公告)日：2022-02-16

申请号：EP20719516.5

申请日：2020-03-05

申请人： Microsoft Technology Licensing, LLC

发明人： KUMAR, Kshitiz , ANASTASAKOS, Anastasios , GONG, Yifan

IPC分类号： G10L15/08 , G06F40/30

16.

发明公开
DYNAMIC COMBINATION OF ACOUSTIC MODEL STATES 审中-公开

公开(公告)号：EP3948851A1

公开(公告)日：2022-02-09

申请号：EP20708877.4

申请日：2020-01-30

申请人： Microsoft Technology Licensing, LLC

发明人： KUMAR, Kshitiz , GONG, Yifan

IPC分类号： G10L15/16 , G10L15/065 , G06N3/04 , G10L15/20

17.

发明授权
LOW-FOOTPRINT ADAPTATION AND PERSONALIZATION FOR A DEEP NEURAL NETWORK 有权

公开(公告)号：EP3114680B1

公开(公告)日：2020-06-24

申请号：EP15717284.2

申请日：2015-02-27

申请人： Microsoft Technology Licensing, LLC

发明人： XUE, Jian , LI, Jinyu , YU, Dong , SELTZER, Michael L. , GONG, Yifan

IPC分类号： G06N3/08 , G10L15/07 , G10L15/16

18.

发明公开
LOW-FOOTPRINT ADAPTATION AND PERSONALIZATION FOR A DEEP NEURAL NETWORK 有权
标题翻译：适应和个性化的小体积FOR A深层神经网络

公开(公告)号：EP3114680A1

公开(公告)日：2017-01-11

申请号：EP15717284.2

申请日：2015-02-27

申请人： Microsoft Technology Licensing, LLC

发明人： XUE, Jian , LI, Jinyu , YU, Dong , SELTZER, Michael L. , GONG, Yifan

IPC分类号： G10L15/07 , G10L15/16

摘要： The adaptation and personalization of a deep neural network (DNN) model for automatic speech recognition is provided. An utterance which includes speech features for one or more speakers may be received in ASR tasks such as voice search or short message dictation. A decomposition approach may then be applied to an original matrix in the DNN model. In response to applying the decomposition approach, the original matrix may be converted into multiple new matrices which are smaller than the original matrix. A square matrix may then be added to the new matrices. Speaker-specific parameters may then be stored in the square matrix. The DNN model may then be adapted by updating the square matrix. This process may be applied to all of a number of original matrices in the DNN model. The adapted DNN model may include a reduced number of parameters than those received in the original DNN model.

摘要翻译： 用于自动语音识别的深层神经网络（DNN）模型的适配和个性化设置。如语音搜索或短信听写：其中包括一个或多个扬声器的语音功能的话语可以在ASR任务接收。甲分解方法可接着在DNN模型被应用到原始矩阵。响应于施加的分解的方法中，原始矩阵可以被转换成多个新的矩阵，它们比原矩阵小。然后，方阵可被添加到新的矩阵。说话者特定参数然后可被存储在方阵。 DNN的模型然后可以通过更新方阵来适配。此过程可被应用到所有的数在DNN模型原始矩阵的。该angepasst DNN模型可以包括的参数比在原始DNN模型接收到的减少数目。

19.

发明公开
ADAPTIVE FRAME BATCHING TO REDUCE SPEECH RECOGNITION LATENCY 审中-实审

公开(公告)号：EP4401073A1

公开(公告)日：2024-07-17

申请号：EP24164657.9

申请日：2020-12-15

申请人： Microsoft Technology Licensing, LLC

发明人： KHALIL, Hosam A. , STOIMENOV, Emilian Y. , GONG, Yifan , LIU, Chaojun , BASOGLU, Christopher H. , AGARWAL, Amit K. , PARIHAR, Naveen , PATHAK, Sayan

IPC分类号： G10L15/32 , G10L15/02 , G10L15/16

CPC分类号： G10L15/02 , G10L15/16 , G10L15/32

摘要： Embodiments may include collection of a first batch of acoustic feature frames of an audio signal, the number of acoustic feature frames of the first batch equal to a first batch size, input of the first batch to a speech recognition network, collection, in response to detection of a word hypothesis output by the speech recognition network, of a second batch of acoustic feature frames of the audio signal, the number of acoustic feature frames of the second batch equal to a second batch size, and input of the second batch to the speech recognition network.

20.

发明公开
SEQUENCE-TO-SEQUENCE SPEECH RECOGNITION WITH LATENCY THRESHOLD 审中-公开

公开(公告)号：EP4133478A1

公开(公告)日：2023-02-15

申请号：EP21710789.5

申请日：2021-02-15

申请人： Microsoft Technology Licensing, LLC

发明人： GAUR, Yashesh , LI, Jinyu , LU, Liang , INAGUMA, Hirofumi , GONG, Yifan

IPC分类号： G10L15/16 , G10L15/32 , G10L15/06

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类