VOICE MODIFICATION DETECTION USING PHYSICAL MODELS OF SPEECH PRODUCTION

Invention Application

US20230015189A1 VOICE MODIFICATION DETECTION USING PHYSICAL MODELS OF SPEECH PRODUCTION (Status: In force)

Patent Title: VOICE MODIFICATION DETECTION USING PHYSICAL MODELS OF SPEECH PRODUCTION
Application No.: US17953156

Application Date: 2022-09-26
Publication No.: US20230015189A1

Publication Date: 2023-01-19
Inventor: David Looney, Nikolay D. Gaubitch
Applicant: Pindrop Security, Inc.
Applicant Address: US GA Atlanta
Assignee: Pindrop Security, Inc.
Current Assignee: Pindrop Security, Inc.
Current Assignee Address: US GA Atlanta
Main IPC: G10L25/51
IPC: G10L25/51; G10L25/90; G10L15/06; G10L15/22

Abstract:

A computer may train a single-class machine learning model using normal speech recordings. The machine learning model or any other model may estimate the normal range of parameters of a physical speech production model based on the normal speech recordings. For example, the computer may use a source-filter model of speech production, where voiced speech is represented by a pulse train and unvoiced speech by random noise, and a combination of the pulse train and the random noise is passed through an auto-regressive filter that emulates the human vocal tract. The computer leverages the fact that intentional modification of the human voice introduces errors into the source-filter model or any other physical model of speech production. The computer may identify anomalies in the physical model to generate a voice modification score for an audio signal. The voice modification score may indicate a degree of abnormality of the human voice in the audio signal.
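
The source-filter idea in the abstract can be illustrated with a minimal sketch. It is not the method claimed in the application. It assumes a toy two-pole auto-regressive filter, estimates the filter parameters by linear prediction (autocorrelation method with the Levinson-Durbin recursion), and replaces the single-class machine learning model with a simple per-feature mean and standard deviation. All function names, coefficient values, and the "modified" test signal (a different filter standing in for an altered voice) are hypothetical.

```python
import math
import random

def synthesize(n, ar_coeffs, pitch_period=80, noise_gain=0.05, seed=0):
    """Source-filter synthesis: pulse train plus noise through an all-pole (AR) filter."""
    rng = random.Random(seed)
    out = []
    for t in range(n):
        pulse = 1.0 if t % pitch_period == 0 else 0.0          # voiced source
        excitation = pulse + noise_gain * rng.gauss(0.0, 1.0)  # plus unvoiced noise
        # AR recursion: y[t] = e[t] - sum_k a[k] * y[t - k]
        feedback = sum(a * out[t - k]
                       for k, a in enumerate(ar_coeffs, start=1) if t - k >= 0)
        out.append(excitation - feedback)
    return out

def lpc(signal, order):
    """Estimate AR coefficients (autocorrelation method, Levinson-Durbin)."""
    n = len(signal)
    r = [sum(signal[t] * signal[t - lag] for t in range(lag, n))
         for lag in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k                  # remaining prediction error energy
    return a[1:], err / r[0]                # coefficients, normalized residual

def features(signal, order=2):
    coeffs, resid = lpc(signal, order)
    return coeffs + [resid]

def fit_normal_range(signals):
    """Stand-in for single-class training: per-feature mean and std."""
    cols = list(zip(*(features(s) for s in signals)))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((x - m) ** 2 for x in c) / len(c)) + 1e-6
            for c, m in zip(cols, means)]
    return means, stds

def modification_score(signal, means, stds):
    """Largest absolute z-score of the physical-model features."""
    return max(abs(x - m) / s for x, m, s in zip(features(signal), means, stds))

VOCAL_TRACT = [-1.3, 0.8]   # hypothetical stable two-pole "normal" filter
normal = [synthesize(4000, VOCAL_TRACT, seed=s) for s in range(1, 9)]
means, stds = fit_normal_range(normal)
normal_score = modification_score(synthesize(4000, VOCAL_TRACT, seed=99), means, stds)
# Hypothetical stand-in for a modified voice: a different filter shape.
modified_score = modification_score(synthesize(4000, [-0.6, 0.5], seed=99), means, stds)
print(normal_score, modified_score)
```

In this sketch a higher score means the estimated model parameters fall further outside the range seen in the normal recordings. A practical system would use real speech frames, a higher filter order, and a trained single-class model rather than the z-score used here.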

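IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L25/00	Speech or voice analysis techniques not restricted to a single one of groups G10L 15/00-G10L 21/00 (muting semiconductor-based amplifiers when some special characteristics of a signal are sensed by a speech detector, e.g. sensing when no signal is present, see H03G3/34)
G10L25/48	. Specially adapted for particular use
G10L25/51	. . For comparison or discrimination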