- 专利标题: AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM, AND PROGRAM PRODUCT
-
申请号: EP22826640.9申请日: 2022-07-27
-
公开(公告)号: EP4261819A1公开(公告)日: 2023-10-18
- 发明人: WANG, Yipeng
- 申请人: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- 申请人地址: CN Beijing 100085 2/F Baidu Campus No. 10 Shangdi 10th Street Haidian District
- 代理机构: Laqua, Bernd Christian Kurt
- 优先权: CN202210106767 20220128
- 国际公布: WO2023142413 20230803
- 主分类号: G10H1/36
- IPC分类号: G10H1/36 ; G10H1/40 ; G10L25/03 ; G10L25/51
摘要:
The present disclosure provides an audio data processing method and apparatus, an electronic device, a computer-readable storage medium and a computer program product and relates to the field of artificial intelligence, in particular to an audio processing technology. An implementation solution is: obtaining human voice audio data to be adjusted and reference human voice audio data, wherein the reference human voice audio data and the human voice audio data to be adjusted are obtained based on the same text information; performing framing on the human voice audio data to be adjusted and the reference human voice audio data respectively so as to obtain a first audio frame set and a second audio frame set respectively; recognizing a pronunciation unit corresponding to each audio frame respectively; determining, based on a timestamp of each audio frame, a timestamp of each pronunciation unit in the human voice audio data to be adjusted and the reference human voice audio data respectively; and adjusting the timestamp of at least one pronunciation unit in the human voice audio data to be adjusted to make the timestamp of the pronunciation unit in the human voice audio data to be adjusted to be consistent with the timestamp of the corresponding pronunciation unit in the reference human voice audio data.
信息查询