AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM, AND PROGRAM PRODUCT

发明公开

EP4261819A1 AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM, AND PROGRAM PRODUCT 审中-公开

请登陆查看更多内容

专利标题： AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM, AND PROGRAM PRODUCT
申请号： EP22826640.9

申请日： 2022-07-27
公开(公告)号： EP4261819A1

公开(公告)日： 2023-10-18
发明人: WANG, Yipeng
申请人： BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
申请人地址： CN Beijing 100085 2/F Baidu Campus No. 10 Shangdi 10th Street Haidian District
代理机构： Laqua, Bernd Christian Kurt
优先权： CN202210106767 20220128
国际公布： WO2023142413 20230803
主分类号： G10H1/36
IPC分类号： G10H1/36 ; G10H1/40 ; G10L25/03 ; G10L25/51

AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM, AND PROGRAM PRODUCT

摘要：

The present disclosure provides an audio data processing method and apparatus, an electronic device, a computer-readable storage medium and a computer program product and relates to the field of artificial intelligence, in particular to an audio processing technology. An implementation solution is: obtaining human voice audio data to be adjusted and reference human voice audio data, wherein the reference human voice audio data and the human voice audio data to be adjusted are obtained based on the same text information; performing framing on the human voice audio data to be adjusted and the reference human voice audio data respectively so as to obtain a first audio frame set and a second audio frame set respectively; recognizing a pronunciation unit corresponding to each audio frame respectively; determining, based on a timestamp of each audio frame, a timestamp of each pronunciation unit in the human voice audio data to be adjusted and the reference human voice audio data respectively; and adjusting the timestamp of at least one pronunciation unit in the human voice audio data to be adjusted to make the timestamp of the pronunciation unit in the human voice audio data to be adjusted to be consistent with the timestamp of the corresponding pronunciation unit in the reference human voice audio data.

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10H	电声乐器；由机电装置或电子发生器产生音调的乐器，或从数据存储器合成音调的乐器
G10H1/00	电声乐器的零部件（也可适用于其他乐器的键盘入G10B，G10C；用于产生混响或回声的装置入G10K15/08）
G10H1/36	.伴奏设备