Invention Grant
US09378755B2 Detecting a user's voice activity using dynamic probabilistic models of speech features
有权
使用语音特征的动态概率模型来检测用户的语音活动
- Patent Title: Detecting a user's voice activity using dynamic probabilistic models of speech features
- Patent Title (中): 使用语音特征的动态概率模型来检测用户的语音活动
-
Application No.: US14502795Application Date: 2014-09-30
-
Publication No.: US09378755B2Publication Date: 2016-06-28
- Inventor: Harvey D. Thornburg , Charles P. Clark
- Applicant: Apple Inc.
- Applicant Address: US CA Cupertino
- Assignee: Apple Inc.
- Current Assignee: Apple Inc.
- Current Assignee Address: US CA Cupertino
- Agency: Blakely, Sokoloff, Taylor & Zafman LLP
- Main IPC: G10L15/20
- IPC: G10L15/20 ; G10L19/04 ; G10L25/84

Abstract:
Method of detecting voice activity starts with by generating probabilistic models that respectively model features of speech dynamically over time. Probabilistic models may model each feature dependent on a past feature and a current state. Features of speech may include a nonstationary signal presence feature, a periodicity feature, and a sparsity feature. Noise suppressor may then perform noise suppression on an acoustic signal to generate a nonstationary signal presence signal and a noise suppressed acoustic signal. An LPC module may then perform residual analysis on the noise suppressed data signal to generate a periodicity signal and a sparsity signal. Inference generator receives the probabilistic models and receives, in real-time, nonstationary signal presence signal, periodicity signal, and sparsity signal. Inference generator may then generate in real time an estimate of voice activity based on the probabilistic models, nonstationary signal presence signal, periodicity signal, and sparsity signal. Other embodiments are also described.
Public/Granted literature
- US20150348572A1 DETECTING A USER'S VOICE ACTIVITY USING DYNAMIC PROBABILISTIC MODELS OF SPEECH FEATURES Public/Granted day:2015-12-03
Information query