Invention Application
- Patent Title: SYSTEM AND METHOD FOR VOICE MORPHING IN A DATA ANNOTATOR TOOL
-
Application No.: US17539182Application Date: 2021-11-30
-
Publication No.: US20220092273A1Publication Date: 2022-03-24
- Inventor: Dylan H. Ross
- Applicant: SoundHound, Inc.
- Applicant Address: US CA Santa Clara
- Assignee: SoundHound, Inc.
- Current Assignee: SoundHound, Inc.
- Current Assignee Address: US CA Santa Clara
- Main IPC: G06F40/56
- IPC: G06F40/56 ; G10L19/26 ; G10L19/125 ; G10L15/18 ; G06F40/58

Abstract:
A system and method for masking an identity of a speaker of natural language speech, such as speech clips to be labeled by humans in a system generating voice transcriptions for training an automatic speech recognition model. The natural language speech is morphed prior to being presented to the human for labeling. In one embodiment, morphing comprises pitch shifting the speech randomly either up or down, then frequency shifting the speech, then pitch shifting the speech in a direction opposite the first pitch shift. Labeling the morphed speech comprises at least one or more of transcribing the morphed speech, identifying a gender of the speaker, identifying an accent of the speaker, and identifying a noise type of the morphed speech.
Public/Granted literature
- US12086564B2 System and method for voice morphing in a data annotator tool Public/Granted day:2024-09-10
Information query