SPEECH RECOGNITION

Invention Application

US20250078839A1 SPEECH RECOGNITION 有权

Please log in to see more content

Patent Title: SPEECH RECOGNITION
Application No.: US18819018

Application Date: 2024-08-29
Publication No.: US20250078839A1

Publication Date: 2025-03-06
Inventor: Xiaoyin FU , Qiguang ZANG , Fenfen SHENG , Haifeng WANG , Lei JIA
Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Applicant Address: CN BEIJING
Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee Address: CN BEIJING
Priority: CN202311104070.7 20230829
Main IPC: G10L15/32
IPC: G10L15/32 ; G10L15/02 ; G10L15/04 ; G10L15/06 ; G10L15/183

Abstract:

A speech recognition method and a method for training a deep learning model are provided. The speech recognition method includes: obtaining a first speech feature of a speech to-be-recognized, which includes a plurality of speech segment features corresponding to a plurality of speech segments; decoding the first speech feature using a first decoder to obtain a plurality of first decoding results corresponding to a plurality of the words, indicating a first recognition result of words; extracting a second speech feature from the first speech feature based on first a priori information, which includes the plurality of first decoding results, and the second speech feature includes first word-level audio features corresponding to the plurality of words; and decoding the second speech feature using a second decoder to obtain a plurality of second decoding results corresponding to the plurality of words, indicating a second recognition result of the word.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/28	.语音识别系统的结构细节
G10L15/32	..以顺序或并行使用的多个识别器；相应的记分组合系统，例如投票系统