-
Publication No.: US11715480B2
Publication date: 2023-08-01
Application No.: US17209621
Application date: 2021-03-23
Applicant: QUALCOMM Incorporated
Inventor: Kyungguen Byun, Shuhua Zhang, Lae-Hoon Kim, Erik Visser, Sunkuk Moon, Vahid Montazeri
IPC: G10L21/0232, G10L21/038, G10L21/02
CPC classification number: G10L21/0232, G10L21/02, G10L21/038
Abstract: A device to perform speech enhancement includes one or more processors configured to obtain input spectral data based on an input signal. The input signal represents sound that includes speech. The one or more processors are also configured to process, using a multi-encoder transformer, the input spectral data and context data to generate output spectral data that represents a speech enhanced version of the input signal.
-
Publication No.: US11676571B2
Publication date: 2023-06-13
Application No.: US17154372
Application date: 2021-01-21
Applicant: QUALCOMM Incorporated
Inventor: Kyungguen Byun, Sunkuk Moon, Shuhua Zhang, Vahid Montazeri, Lae-Hoon Kim, Erik Visser
IPC: G10L13/10, G10L13/06, G10L15/22, G10L13/00, G10L13/047, G10L13/033, G10L19/02, G10L25/63, G06N3/045, G10L21/013
CPC classification number: G10L13/047, G06N3/045, G10L13/033, G10L19/02, G10L25/63, G10L2021/0135
Abstract: A device for speech generation includes one or more processors configured to receive one or more control parameters indicating target speech characteristics. The one or more processors are also configured to process, using a multi-encoder, an input representation of speech based on the one or more control parameters to generate encoded data corresponding to an audio signal that represents a version of the speech based on the target speech characteristics.
-