Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.") AND inv:"Jiankang HOU"

1.

Invention application
SPEECH SYNTHESIS METHOD AND APPARATUS, DEVICE AND COMPUTER STORAGE MEDIUM (In force)

Publication No.: US20230059882A1

Publication date: 2023-02-23

Application No.: US17738186

Application date: 2022-05-06

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Liqiang ZHANG, Jiankang HOU, Tao SUN, Lei JIA

IPC: G10L13/10, G06F40/20, G10L13/047

Abstract: The present disclosure discloses a speech synthesis method and apparatus, a device and a computer storage medium, and relates to speech and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: acquiring to-be-synthesized text; acquiring a prosody feature extracted from the text; inputting the text and the prosody feature into a speech synthesis model to obtain a vocoder feature; and inputting the vocoder feature into a vocoder to obtain synthesized speech.
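
The data flow in this abstract (text, then prosody feature, then vocoder feature, then speech) can be sketched as a composition of three stages. This is only an illustration of the ordering the abstract describes: the function name `synthesize` and the three callables are hypothetical placeholders, not components named in the patent.

```python
from typing import Any, Callable


def synthesize(
    text: str,
    extract_prosody: Callable[[str], Any],       # hypothetical prosody extractor
    synthesis_model: Callable[[str, Any], Any],  # hypothetical model: (text, prosody) -> vocoder feature
    vocoder: Callable[[Any], Any],               # hypothetical vocoder: vocoder feature -> speech
) -> Any:
    """Chain the stages in the order given in the abstract."""
    # Step 1: acquire a prosody feature extracted from the text.
    prosody = extract_prosody(text)
    # Step 2: feed the text and the prosody feature to the synthesis model.
    vocoder_feature = synthesis_model(text, prosody)
    # Step 3: feed the vocoder feature to the vocoder to obtain speech.
    return vocoder(vocoder_feature)
```

Passing the stages in as callables keeps the sketch independent of any particular model or vocoder implementation.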

2.

Invention application
METHOD OF PROCESSING AUDIO DATA, ELECTRONIC DEVICE AND STORAGE MEDIUM (In force)

Publication No.: US20230087531A1

Publication date: 2023-03-23

Application No.: US18071187

Application date: 2022-11-29

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Jiankang HOU, Zhipeng NIE, Liqiang ZHANG, Tao SUN, Lei JIA

IPC: G10L25/18, G10L25/30

Abstract: A method of processing audio data, an electronic device, and a storage medium, which relates to a field of artificial intelligence, in particular to a field of speech processing technology. The method includes: processing spectral data of the audio data to obtain a first feature information; obtaining a fundamental frequency indication information according to the first feature information, wherein the fundamental frequency indication information indicates valid audio data of the first feature information and invalid audio data of the first feature information; obtaining a fundamental frequency information and a spectral energy information according to the first feature information and the fundamental frequency indication information; and obtaining a harmonic structure information of the audio data according to the fundamental frequency information and the spectral energy information.

3.

Invention application
SPEECH PROCESSING METHOD AND APPARATUS, DEVICE AND COMPUTER STORAGE MEDIUM (In force)

Publication No.: US20230056128A1

Publication date: 2023-02-23

Application No.: US17736175

Application date: 2022-05-04

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Liqiang ZHANG, Jiankang HOU, Tao SUN, Lei JIA

IPC: G10L13/10, G10L25/21, G10L25/18

Abstract: The present disclosure discloses a speech processing method and apparatus, a device and a computer storage medium, and relates to speech and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: acquiring a vocoder feature obtained for text; correcting a value of an unvoiced and voiced (UV) feature in the vocoder feature according to an energy feature and/or a speech spectrum feature in the vocoder feature; and providing the corrected vocoder feature for a vocoder, so as to obtain synthesized speech.
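
One way to picture correcting the UV feature from the energy feature is a per-frame threshold rule. The abstract does not state the actual correction rule, so everything here is an assumption made for illustration: the `VocoderFrame` type, the `correct_uv` function, the `silence_energy` threshold and its default value, and the rule itself (force a frame to unvoiced when its energy is very low).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class VocoderFrame:
    """Hypothetical per-frame slice of a vocoder feature."""
    energy: float  # energy feature of the frame
    uv: int        # 1 = voiced, 0 = unvoiced


def correct_uv(frames: List[VocoderFrame], silence_energy: float = 0.01) -> List[VocoderFrame]:
    """Return corrected copies of the frames.

    Assumed rule (not taken from the patent): a frame whose energy is
    below `silence_energy` cannot be voiced, so its UV value is set to 0.
    All other frames keep their original UV value.
    """
    corrected = []
    for frame in frames:
        uv = 0 if frame.energy < silence_energy else frame.uv
        corrected.append(VocoderFrame(energy=frame.energy, uv=uv))
    return corrected
```

The function returns new frames rather than mutating its input, so the uncorrected feature remains available for comparison before the corrected one is handed to a vocoder.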
