Invention Application
- Patent Title: DETECTION OF LIVE SPEECH
- Application No.: US16953104
- Application Date: 2020-11-19
- Publication No.: US20220157334A1
- Publication Date: 2022-05-19
- Inventor: César ALONSO
- Applicant: Cirrus Logic International Semiconductor Ltd.
- Applicant Address: GB Edinburgh
- Assignee: Cirrus Logic International Semiconductor Ltd.
- Current Assignee: Cirrus Logic International Semiconductor Ltd.
- Current Assignee Address: GB Edinburgh
- Main IPC: G10L25/93
- IPC: G10L25/93 ; G10L25/18 ; G10L25/21 ; G10L15/22 ; G10L25/78

Abstract:
A method of detecting live speech comprises: receiving a signal containing speech; forming a framed version of the received signal that comprises a plurality of frames; forming a first subset of the plurality of frames, wherein each frame of the first subset contains a signal that contains voiced speech; forming a second subset of the plurality of frames, wherein each frame of the second subset contains a signal that contains unvoiced speech; forming a first frame that is representative of a sum of a plurality of frames of the first subset; forming a second frame that is representative of a sum of a plurality of frames of the second subset; performing a time-frequency transformation operation on the first frame, to form an average voiced frequency spectrum; performing a time-frequency transformation operation on the second frame, to form an average unvoiced frequency spectrum; obtaining one or more voiced features from the voiced frequency spectrum; and obtaining one or more unvoiced features from the unvoiced frequency spectrum. Based on the one or more voiced features and the one or more unvoiced features, a determination is made whether the speech is live speech, or not.
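
The feature-extraction steps in the abstract can be sketched in Python with NumPy. This is only an illustration under stated assumptions, not the claimed implementation: the abstract does not say how frames are classified as voiced or unvoiced, which features are used, or how the final decision is made. Here the voiced/unvoiced split uses a simple zero-crossing-rate and energy heuristic, the single feature is a spectral centroid, and all function names, thresholds, and frame sizes are hypothetical. The final live/not-live determination is left out because the abstract gives no rule for it.

```python
import numpy as np

def frame_signal(x, frame_len=512, hop=256):
    """Form a framed version of the signal (rows are frames)."""
    x = np.asarray(x, dtype=float)
    n = max(0, 1 + (len(x) - frame_len) // hop)
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n)[:, None]
    return x[idx]

def split_voiced_unvoiced(frames, zcr_thresh=0.25, energy_thresh=1e-4):
    """Assumed heuristic: low zero-crossing rate -> voiced, high -> unvoiced.
    Frames below the energy threshold are treated as silence and dropped."""
    signs = np.signbit(frames)
    zcr = np.mean(signs[:, 1:] != signs[:, :-1], axis=1)
    energy = np.mean(frames ** 2, axis=1)
    active = energy > energy_thresh
    return frames[active & (zcr < zcr_thresh)], frames[active & (zcr >= zcr_thresh)]

def average_spectrum(frames):
    """Sum the frames in the time domain, scale by the frame count, and
    apply one time-frequency transform (windowed FFT magnitude)."""
    if len(frames) == 0:
        raise ValueError("no frames in this subset")
    mean_frame = frames.sum(axis=0) / len(frames)
    return np.abs(np.fft.rfft(mean_frame * np.hanning(len(mean_frame))))

def spectral_centroid(spectrum, sample_rate, frame_len):
    """Hypothetical feature: magnitude-weighted mean frequency in Hz."""
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    total = spectrum.sum()
    return float((freqs * spectrum).sum() / total) if total > 0 else 0.0

def extract_features(x, sample_rate, frame_len=512, hop=256):
    """Return one voiced and one unvoiced feature, plus subset sizes."""
    frames = frame_signal(x, frame_len, hop)
    voiced, unvoiced = split_voiced_unvoiced(frames)
    return {
        "n_voiced": len(voiced),
        "n_unvoiced": len(unvoiced),
        "voiced_centroid_hz": spectral_centroid(
            average_spectrum(voiced), sample_rate, frame_len),
        "unvoiced_centroid_hz": spectral_centroid(
            average_spectrum(unvoiced), sample_rate, frame_len),
    }
```

Note that only two transforms are computed, one per subset, because the frames are combined in the time domain before the transform. A classifier or threshold rule operating on the returned features would supply the final decision; its form is not given in the abstract.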