-
Publication number: US20210217433A1
Publication date: 2021-07-15
Application number: US17215850
Application date: 2021-03-29
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhenyi LIU, Wenbin ZHAO, Feng LI
IPC: G10L21/0272, G10L21/0208, G10L25/78, G10L25/57, G06K9/00, G10L13/02
Abstract: A voice processing method is provided, including: when a terminal records a video, if a current video frame includes a face and a current audio frame includes a voice, determining a target face in the current video frame; obtaining a target distance between the target face and the terminal; determining a target gain based on the target distance, where a larger target distance indicates a larger target gain; separating a voice signal from a sound signal of the current audio frame; and performing enhancement processing on the voice signal based on the target gain, to obtain a target voice signal. This implements adaptive enhancement of a human voice signal during video recording.
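The distance-to-gain step described in this abstract can be sketched as follows. The abstract states only that a larger target distance indicates a larger target gain; the logarithmic mapping, the reference distance, the gain cap, and the function names below are illustrative assumptions, not the method claimed in the application. Face detection and voice separation are assumed to have already happened.

```python
import math


def distance_to_gain_db(distance_m, ref_distance_m=0.5, max_gain_db=12.0):
    """Map a face-to-terminal distance (meters) to a gain in dB.

    Assumption: gain grows with the log of the distance relative to a
    hypothetical reference distance, and is capped at max_gain_db.
    The only property taken from the abstract is monotonicity:
    a larger distance never yields a smaller gain.
    """
    if distance_m <= ref_distance_m:
        return 0.0  # at or inside the reference distance: no boost
    gain_db = 20.0 * math.log10(distance_m / ref_distance_m)
    return min(gain_db, max_gain_db)


def enhance_voice(voice_samples, distance_m):
    """Scale already-separated voice samples (floats in [-1, 1]) by the gain.

    Results are clipped to [-1, 1] so the boosted signal stays in range.
    """
    linear_gain = 10.0 ** (distance_to_gain_db(distance_m) / 20.0)
    return [max(-1.0, min(1.0, s * linear_gain)) for s in voice_samples]


if __name__ == "__main__":
    for d in (0.3, 1.0, 2.0, 10.0):
        print(f"{d:5.1f} m -> {distance_to_gain_db(d):5.2f} dB")
```

The cap is a design choice in this sketch: without it, a distant face would produce an unbounded boost and amplify any residual noise left after separation.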
-
Publication number: US20240096343A1
Publication date: 2024-03-21
Application number: US18522743
Application date: 2023-11-29
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Shanyi WEI, Chao WU, Yan QIU, Meng LIAO, Fan FAN, Shiqiang PENG, Bin LI, Wenbin ZHAO, Jiang LI, Haiting LI, Xueyan HUANG
IPC: G10L21/0232, G10L21/0308
CPC classification number: G10L21/0232, G10L21/0308
Abstract: This application relates to the artificial intelligence (AI) field, and specifically, to a voice quality enhancement method and a related device. The method includes: after a PNR mode is enabled, obtaining a noisy voice signal and target voice-related data, where the noisy voice signal includes a voice signal of a target user and an interfering noise signal, and the target voice-related data indicates a voice feature of the target user; and performing noise reduction on the noisy voice signal based on the target voice-related data by using a trained voice noise reduction model to obtain a noise-reduced voice signal of the target user, where the voice noise reduction model is implemented based on a neural network. In embodiments of this application, the voice of a target person can be enhanced, and interference can be suppressed.
-