Invention Application
- Patent Title: SPEECH RECOGNITION
-
Application No.: US18819018Application Date: 2024-08-29
-
Publication No.: US20250078839A1Publication Date: 2025-03-06
- Inventor: Xiaoyin FU , Qiguang ZANG , Fenfen SHENG , Haifeng WANG , Lei JIA
- Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Applicant Address: CN BEIJING
- Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Current Assignee Address: CN BEIJING
- Priority: CN202311104070.7 20230829
- Main IPC: G10L15/32
- IPC: G10L15/32 ; G10L15/02 ; G10L15/04 ; G10L15/06 ; G10L15/183

Abstract:
A speech recognition method and a method for training a deep learning model are provided. The speech recognition method includes: obtaining a first speech feature of a speech to-be-recognized, which includes a plurality of speech segment features corresponding to a plurality of speech segments; decoding the first speech feature using a first decoder to obtain a plurality of first decoding results corresponding to a plurality of the words, indicating a first recognition result of words; extracting a second speech feature from the first speech feature based on first a priori information, which includes the plurality of first decoding results, and the second speech feature includes first word-level audio features corresponding to the plurality of words; and decoding the second speech feature using a second decoder to obtain a plurality of second decoding results corresponding to the plurality of words, indicating a second recognition result of the word.
Information query