DETECTION OF LIVE SPEECH

Invention Application

US20220157334A1 DETECTION OF LIVE SPEECH 有权

Please log in to see more content

Patent Title: DETECTION OF LIVE SPEECH
Application No.: US16953104

Application Date: 2020-11-19
Publication No.: US20220157334A1

Publication Date: 2022-05-19
Inventor: César ALONSO
Applicant: Cirrus Logic International Semiconductor Ltd.
Applicant Address: GB Edinburgh
Assignee: Cirrus Logic International Semiconductor Ltd.
Current Assignee: Cirrus Logic International Semiconductor Ltd.
Current Assignee Address: GB Edinburgh
Main IPC: G10L25/93
IPC: G10L25/93 ; G10L25/18 ; G10L25/21 ; G10L15/22 ; G10L25/78

Abstract:

A method of detecting live speech comprises: receiving a signal containing speech; forming a framed version of the received signal that comprises a plurality of frames; forming a first subset of the plurality of frames, wherein each frame of the first subset contains a signal that contains voiced speech; forming a second subset of the plurality of frames, wherein each frame of the second subset contains a signal that contains unvoiced speech; forming a first frame that is representative of a sum of a plurality of frames of the first subset; forming a second frame that is representative of a sum of a plurality of frames of the second subset; performing a time-frequency transformation operation on the first frame, to form an average voiced frequency spectrum; performing a time-frequency transformation operation on the second frame, to form an average unvoiced frequency spectrum; obtaining one or more voiced features from the voiced frequency spectrum; and obtaining one or more unvoiced features from the unvoiced frequency spectrum. Based on the one or more voiced features and the one or more unvoiced features, a determination is made whether the speech is live speech, or not.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/93	.判别语音信号之间的浊音和清音部分（G10L25/90优先）