Patent search ap:("Adobe Inc.") AND inv:"Maxwell Morrison" Page 1

1.

发明授权
Context-aware prosody correction of edited speech 有权

公开(公告)号：US11830481B2

公开(公告)日：2023-11-28

申请号：US17538683

申请日：2021-11-30

Applicant: Adobe Inc.

Inventor： Maxwell Morrison , Zeyu Jin , Nicholas Bryan , Juan Pablo Caceres Chomali , Lucas Rencker

IPC: G10L15/18 , G10L25/90 , G10L15/187 , G10L15/02 , G10L15/04 , G10L15/16 , G10L21/0208

CPC classification number: G10L15/1807 , G10L15/02 , G10L15/04 , G10L15/16 , G10L15/187 , G10L21/0208 , G10L25/90 , G10L2015/025 , G10L2021/02082

Abstract: Methods are performed by one or more processing devices for correcting prosody in audio data. A method includes operations for accessing subject audio data in an audio edit region of the audio data. The subject audio data in the audio edit region potentially lacks prosodic continuity with unedited audio data in an unedited audio portion of the audio data. The operations further include predicting, based on a context of the unedited audio data, phoneme durations including a respective phoneme duration of each phoneme in the unedited audio data. The operations further include predicting, based on the context of the unedited audio data, a pitch contour comprising at least one respective pitch value of each phoneme in the unedited audio data. Additionally, the operations include correcting prosody of the subject audio data in the audio edit region by applying the phoneme durations and the pitch contour to the subject audio data.

2.

发明授权
Neural pitch-shifting and time-stretching 有权

公开(公告)号：US11915714B2

公开(公告)日：2024-02-27

申请号：US17558580

申请日：2021-12-21

Applicant: Adobe Inc. , Northwestern University

Inventor： Maxwell Morrison , Juan Pablo Caceres Chomali , Zeyu Jin , Nicholas Bryan , Bryan A. Pardo

IPC: G10L21/013 , G10L15/02 , G10L15/18 , G10L25/90 , G10L25/30 , G10L19/032 , G10L21/04 , G10L25/24 , G10L15/06 , G10L19/028

CPC classification number: G10L21/013 , G10L15/02 , G10L15/063 , G10L15/1807 , G10L19/028 , G10L19/032 , G10L21/04 , G10L25/24 , G10L25/30 , G10L25/90 , G10L2021/0135

Abstract: Methods for modifying audio data include operations for accessing audio data having a first prosody, receiving a target prosody differing from the first prosody, and computing acoustic features representing samples. Computing respective acoustic features for a sample includes computing a pitch feature as a quantized pitch value of the sample by assigning a pitch value, of the target prosody or the audio data, to at least one of a set of pitch bins having equal widths in cents. Computing the respective acoustic features further includes computing a periodicity feature from the audio data. The respective acoustic features for the sample include the pitch feature, the periodicity feature, and other acoustic features. A neural vocoder is applied to the acoustic features to pitch-shift and time-stretch the audio data from the first prosody toward the target prosody.

3.

发明公开
NEURAL PITCH-SHIFTING AND TIME-STRETCHING 审中-公开

公开(公告)号：US20230197093A1

公开(公告)日：2023-06-22

申请号：US17558580

申请日：2021-12-21

Applicant: Adobe Inc. , Northwestern University

Inventor： Maxwell Morrison , Juan Pablo Caceres Chomali , Zeyu Jin , Nicholas Bryan , Bryan A. Pardo

IPC: G10L21/013 , G10L15/02 , G10L15/18 , G10L25/90 , G10L25/30 , G10L19/028 , G10L19/032 , G10L21/04 , G10L25/24 , G10L15/06

CPC classification number: G10L21/013 , G10L15/02 , G10L15/1807 , G10L25/90 , G10L25/30 , G10L19/028 , G10L19/032 , G10L21/04 , G10L25/24 , G10L15/063 , G10L2021/0135

Abstract: Methods for modifying audio data include operations for accessing audio data having a first prosody, receiving a target prosody differing from the first prosody, and computing acoustic features representing samples. Computing respective acoustic features for a sample includes computing a pitch feature as a quantized pitch value of the sample by assigning a pitch value, of the target prosody or the audio data, to at least one of a set of pitch bins having equal widths in cents. Computing the respective acoustic features further includes computing a periodicity feature from the audio data. The respective acoustic features for the sample include the pitch feature, the periodicity feature, and other acoustic features. A neural vocoder is applied to the acoustic features to pitch-shift and time-stretch the audio data from the first prosody toward the target prosody.

4.

发明公开
CONTEXT-AWARE PROSODY CORRECTION OF EDITED SPEECH 审中-公开

公开(公告)号：US20230169961A1

公开(公告)日：2023-06-01

申请号：US17538683

申请日：2021-11-30

Applicant: Adobe Inc.

Inventor： Maxwell Morrison , Zeyu Jin , Nicholas Bryan , Juan Pablo Caceres Chomali , Lucas Rencker

IPC: G10L15/18 , G10L25/90 , G10L15/187 , G10L15/02 , G10L15/04 , G10L21/0208 , G10L15/16 , G06N3/08

CPC classification number: G10L15/1807 , G10L25/90 , G10L15/187 , G10L15/02 , G10L15/04 , G10L21/0208 , G10L15/16 , G06N3/088 , G10L2015/025 , G10L2021/02082 , G06N3/0454

Abstract: Methods are performed by one or more processing devices for correcting prosody in audio data. A method includes operations for accessing subject audio data in an audio edit region of the audio data. The subject audio data in the audio edit region potentially lacks prosodic continuity with unedited audio data in an unedited audio portion of the audio data. The operations further include predicting, based on a context of the unedited audio data, phoneme durations including a respective phoneme duration of each phoneme in the unedited audio data. The operations further include predicting, based on the context of the unedited audio data, a pitch contour comprising at least one respective pitch value of each phoneme in the unedited audio data. Additionally, the operations include correcting prosody of the subject audio data in the audio edit region by applying the phoneme durations and the pitch contour to the subject audio data.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification