Method for detecting and classifying coughs or other non-semantic sounds using audio feature set learned from speech

Invention Grant

US11862188B2 Method for detecting and classifying coughs or other non-semantic sounds using audio feature set learned from speech 有权

Please log in to see more content

Patent Title: Method for detecting and classifying coughs or other non-semantic sounds using audio feature set learned from speech
Application No.: US17507461

Application Date: 2021-10-21
Publication No.: US11862188B2

Publication Date: 2024-01-02
Inventor: Jacob Garrison , Jacob Scott Peplinski , Joel Shor
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: McDonnell Boehnen Hulbert & Berghoff LLP
Main IPC: G10L25/66
IPC: G10L25/66 ; G10L15/02 ; G10L15/06 ; G10L15/04 ; A61B5/00 ; G16H40/67 ; A61B5/08 ; G10L25/78 ; G10L25/51 ; G10L25/30

Method for detecting and classifying coughs or other non-semantic sounds using audio feature set learned from speech

Abstract:

A method of detecting a cough in an audio stream includes a step of performing one or more pre-processing steps on the audio stream to generate an input audio sequence comprising a plurality of time-separated audio segments. An embedding is generated by a self-supervised triplet loss embedding model for each of the segments of the input audio sequence using an audio feature set, the embedding model having been trained to learn the audio feature set in a self-supervised triplet loss manner from a plurality of speech audio clips from a speech dataset. The embedding for each of the segments is provided to a model performing cough detection inference. This model generates a probability that each of the segments of the input audio sequence includes a cough episode. The method includes generating cough metrics for each of the cough episodes detected in the input audio sequence.

Public/Granted literature

US20220130415A1 Method for Detecting and Classifying Coughs or Other Non-Semantic Sounds Using Audio Feature Set Learned from Speech Public/Granted day:2022-04-28

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/48	.专门适用于特定用途
G10L25/51	..比较或判别
G10L25/66	...提取与健康状况相关的参数（用于诊断目的的检测或测量的入A61B5/00）