-
公开(公告)号:US11862188B2
公开(公告)日:2024-01-02
申请号:US17507461
申请日:2021-10-21
Applicant: Google LLC
Inventor: Jacob Garrison , Jacob Scott Peplinski , Joel Shor
IPC: G10L25/66 , G10L15/02 , G10L15/06 , G10L15/04 , A61B5/00 , G16H40/67 , A61B5/08 , G10L25/78 , G10L25/51 , G10L25/30
CPC classification number: G10L25/66 , A61B5/0823 , A61B5/4803 , A61B5/7267 , A61B5/7282 , G10L15/02 , G10L15/04 , G10L15/063 , G10L25/30 , G10L25/51 , G10L25/78 , G16H40/67
Abstract: A method of detecting a cough in an audio stream includes a step of performing one or more pre-processing steps on the audio stream to generate an input audio sequence comprising a plurality of time-separated audio segments. An embedding is generated by a self-supervised triplet loss embedding model for each of the segments of the input audio sequence using an audio feature set, the embedding model having been trained to learn the audio feature set in a self-supervised triplet loss manner from a plurality of speech audio clips from a speech dataset. The embedding for each of the segments is provided to a model performing cough detection inference. This model generates a probability that each of the segments of the input audio sequence includes a cough episode. The method includes generating cough metrics for each of the cough episodes detected in the input audio sequence.
-
2.
公开(公告)号:US20240161769A1
公开(公告)日:2024-05-16
申请号:US18509722
申请日:2023-11-15
Applicant: Google LLC
Inventor: Jacob Garrison , Jacob Scott Peplinski , Joel Shor
IPC: G10L25/66 , A61B5/00 , A61B5/08 , G10L15/02 , G10L15/04 , G10L15/06 , G10L25/30 , G10L25/51 , G10L25/78 , G16H40/67
CPC classification number: G10L25/66 , A61B5/0823 , A61B5/4803 , A61B5/7267 , A61B5/7282 , G10L15/02 , G10L15/04 , G10L15/063 , G10L25/30 , G10L25/51 , G10L25/78 , G16H40/67
Abstract: A method of detecting a cough in an audio stream includes a step of performing one or more pre-processing steps on the audio stream to generate an input audio sequence comprising a plurality of time-separated audio segments. An embedding is generated by a self-supervised triplet loss embedding model for each of the segments of the input audio sequence using an audio feature set, the embedding model having been trained to learn the audio feature set in a self-supervised triplet loss manner from a plurality of speech audio clips from a speech dataset. The embedding for each of the segments is provided to a model performing cough detection inference. This model generates a probability that each of the segments of the input audio sequence includes a cough episode. The method includes generating cough metrics for each of the cough episodes detected in the input audio sequence.
-
公开(公告)号:US12249346B2
公开(公告)日:2025-03-11
申请号:US18509722
申请日:2023-11-15
Applicant: Google LLC
Inventor: Jacob Garrison , Jacob Scott Peplinski , Joel Shor
IPC: G10L25/66 , A61B5/00 , A61B5/08 , G10L15/02 , G10L15/04 , G10L15/06 , G10L25/30 , G10L25/51 , G10L25/78 , G16H40/67
Abstract: A method of detecting a cough in an audio stream includes a step of performing one or more pre-processing steps on the audio stream to generate an input audio sequence comprising a plurality of time-separated audio segments. An embedding is generated by a self-supervised triplet loss embedding model for each of the segments of the input audio sequence using an audio feature set, the embedding model having been trained to learn the audio feature set in a self-supervised triplet loss manner from a plurality of speech audio clips from a speech dataset. The embedding for each of the segments is provided to a model performing cough detection inference. This model generates a probability that each of the segments of the input audio sequence includes a cough episode. The method includes generating cough metrics for each of the cough episodes detected in the input audio sequence.
-
公开(公告)号:US20220130415A1
公开(公告)日:2022-04-28
申请号:US17507461
申请日:2021-10-21
Applicant: Google LLC
Inventor: Jacob Garrison , Jacob Scott Peplinski , Joel Shor
Abstract: A method of detecting a cough in an audio stream includes a step of performing one or more pre-processing steps on the audio stream to generate an input audio sequence comprising a plurality of time-separated audio segments. An embedding is generated by a self-supervised triplet loss embedding model for each of the segments of the input audio sequence using an audio feature set, the embedding model having been trained to learn the audio feature set in a self-supervised triplet loss manner from a plurality of speech audio clips from a speech dataset. The embedding for each of the segments is provided to a model performing cough detection inference. This model generates a probability that each of the segments of the input audio sequence includes a cough episode. The method includes generating cough metrics for each of the cough episodes detected in the input audio sequence.
-
-
-