-
公开(公告)号:US12086564B2
公开(公告)日:2024-09-10
申请号:US17539182
申请日:2021-11-30
Applicant: SoundHound, Inc.
Inventor: Dylan H. Ross
IPC: G10L15/18 , G06F40/56 , G06F40/58 , G10L15/06 , G10L19/125 , G10L19/26 , G10L21/013
CPC classification number: G06F40/56 , G06F40/58 , G10L15/06 , G10L15/18 , G10L19/125 , G10L19/265 , G10L21/013 , G10L2021/0135
Abstract: A system and method for masking an identity of a speaker of natural language speech, such as speech clips to be labeled by humans in a system generating voice transcriptions for training an automatic speech recognition model. The natural language speech is morphed prior to being presented to the human for labeling. In one embodiment, morphing comprises pitch shifting the speech randomly either up or down, then frequency shifting the speech, then pitch shifting the speech in a direction opposite the first pitch shift. Labeling the morphed speech comprises at least one or more of transcribing the morphed speech, identifying a gender of the speaker, identifying an accent of the speaker, and identifying a noise type of the morphed speech.
-
公开(公告)号:US20220092273A1
公开(公告)日:2022-03-24
申请号:US17539182
申请日:2021-11-30
Applicant: SoundHound, Inc.
Inventor: Dylan H. Ross
IPC: G06F40/56 , G10L19/26 , G10L19/125 , G10L15/18 , G06F40/58
Abstract: A system and method for masking an identity of a speaker of natural language speech, such as speech clips to be labeled by humans in a system generating voice transcriptions for training an automatic speech recognition model. The natural language speech is morphed prior to being presented to the human for labeling. In one embodiment, morphing comprises pitch shifting the speech randomly either up or down, then frequency shifting the speech, then pitch shifting the speech in a direction opposite the first pitch shift. Labeling the morphed speech comprises at least one or more of transcribing the morphed speech, identifying a gender of the speaker, identifying an accent of the speaker, and identifying a noise type of the morphed speech.
-
公开(公告)号:US20210089626A1
公开(公告)日:2021-03-25
申请号:US16578386
申请日:2019-09-22
Applicant: SoundHound, Inc.
Inventor: Dylan H. Ross
IPC: G06F17/28 , G10L15/18 , G10L19/125 , G10L19/26
Abstract: A system and method for masking an identity of a speaker of natural language speech, such as speech clips to be labeled by humans in a system generating voice transcriptions for training an automatic speech recognition model. The natural language speech is morphed prior to being presented to the human for labeling. In one embodiment, morphing comprises pitch shifting the speech randomly either up or down, then frequency shifting the speech, then pitch shifting the speech in a direction opposite the first pitch shift.
-
-