Invention Grant
- Patent Title: Self-supervised speech representations for fake audio detection
-
Application No.: US17110278Application Date: 2020-12-02
-
Publication No.: US11756572B2Publication Date: 2023-09-12
- Inventor: Joel Shor , Alanna Foster Slocum
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent Brett A. Krueger; Grant J. Griffith
- Main IPC: G10L25/69
- IPC: G10L25/69 ; G10L15/02 ; G10L15/06 ; G10L15/22

Abstract:
A method for determining synthetic speech includes receiving audio data characterizing speech in audio data obtained by a user device. The method also includes generating, using a trained self-supervised model, a plurality of audio features vectors each representative of audio features of a portion of the audio data. The method also includes generating, using a shallow discriminator model, a score indicating a presence of synthetic speech in the audio data based on the corresponding audio features of each audio feature vector of the plurality of audio feature vectors. The method also includes determining whether the score satisfies a synthetic speech detection threshold. When the score satisfies the synthetic speech detection threshold, the method includes determining that the speech in the audio data obtained by the user device comprises synthetic speech.
Public/Granted literature
- US20220172739A1 Self-Supervised Speech Representations for Fake Audio Detection Public/Granted day:2022-06-02
Information query