-
公开(公告)号:US11468904B2
公开(公告)日:2022-10-11
申请号:US16718847
申请日:2019-12-18
申请人: Audio Analytic Ltd
IPC分类号: G10L15/02 , G10L17/26 , G06F16/683 , H04N5/232
摘要: A computing device comprising a processor, the processor configured to: receive, from an image capture system, an image captured in an environment and image metadata associated with the image, the image metadata comprising an image capture time; receive a sound recognition message from a sound recognition module, the sound recognition message comprising (i) a sound recognition identifier indicating a target sound or scene that has been recognised based on captured audio data captured in the environment, and (ii) time information associated with the sound recognition identifier; detect that the target sound or scene occurred at a time that the image was captured based on the image metadata and the time information in the sound recognition message; and output a camera control command to said image capture system based on said detection.
-
公开(公告)号:US20210097727A1
公开(公告)日:2021-04-01
申请号:US16586050
申请日:2019-09-27
申请人: Audio Analytic Ltd
发明人: Chris James Mitchell , Sacha Krstulovic , Cagdas Bilen , Neil Cooper , Julian Harris , Arnoldas Jasonas , Joe Patrick Lynas
摘要: Sound detection and identification leads to responsiveness within an augmented reality environment. Information about an identified sound can be converted into a command for implementation by an augmented reality system, for display of a desired on-screen augmented reality effect.
-
公开(公告)号:US10783434B1
公开(公告)日:2020-09-22
申请号:US16594605
申请日:2019-10-07
申请人: Audio Analytic Ltd
发明人: Christopher James Mitchell , Sacha Krstulovic , Cagdas Bilen , Juan Azcarreta Ortiz , Giacomo Ferroni , Arnoldas Jasonas , Francesco Tuveri
摘要: A method of training a non-verbal sound class detection machine learning system, the non-verbal sound class detection machine learning system comprising a machine learning model configured to: receive data for each frame of a sequence of frames of audio data obtained from an audio signal; for each frame of the sequence of frames: process the data for multiple frames; and output data for at least one sound class score representative of a degree of affiliation of the frame with at least one sound class of a plurality of sound classes, wherein the plurality of sound classes comprises: one or more target sound classes; and a non-target sound class representative of an absence of each of the one or more target sound classes; wherein the method comprises: training the machine learning model using a loss function.
-
公开(公告)号:US09286911B2
公开(公告)日:2016-03-15
申请号:US14533837
申请日:2014-11-05
申请人: AUDIO ANALYTIC LTD.
IPC分类号: G06F15/18 , G10L25/51 , G10L15/02 , G06F17/18 , G10L15/14 , G10L25/63 , G10L25/57 , G08B13/00 , G10L15/20
CPC分类号: G10L25/51 , G06F17/18 , G08B13/00 , G08B13/16 , G10L15/02 , G10L15/142 , G10L15/20 , G10L25/57 , G10L25/63
摘要: A digital sound identification system for storing a Markov model is disclosed. A processor is coupled to a sound data input, working memory, and a stored program memory for executing processor control code to input sound data for a sound to be identified. The sample sound data defines a sample frequency domain data energy in a range of frequency. Mean and variance values for a Markov model of the sample sound are generated. The Markov model is stored in the non-volatile memory. Interference sound data defining interference frequency domain data is inputted. The mean and variance values of the Markov model using the interference frequency domain data are adjusted. Sound data defining other sound frequency domain data are inputted. A probability of the other sound frequency domain data fitting the Markov model is determined. Finally, sound identification data dependent on the probability is outputted.
摘要翻译: 公开了一种用于存储马尔可夫模型的数字声音识别系统。 处理器耦合到声音数据输入,工作存储器和存储的程序存储器,用于执行处理器控制代码以输入要识别的声音的声音数据。 样本声音数据定义频率范围内的采样频域数据能量。 产生样本声音的马尔可夫模型的均值和方差值。 马尔可夫模型存储在非易失性存储器中。 输入定义干扰频域数据的干扰声音数据。 调整使用干扰频域数据的马尔科夫模型的均值和方差值。 输入定义其他声频域数据的声音数据。 确定其他声频域数据拟合马尔可夫模型的概率。 最后,输出取决于概率的声音识别数据。
-
公开(公告)号:US10224019B2
公开(公告)日:2019-03-05
申请号:US15893015
申请日:2018-02-09
申请人: Audio Analytic Ltd.
摘要: Broadly speaking, embodiments of the present invention provide a wearable audio device including one or a plurality of microphones, a sound recognition systems and a controller to control the device based on one or more recognized sounds or classes of sound. Embodiments use stored sound models.
-
公开(公告)号:US08918343B2
公开(公告)日:2014-12-23
申请号:US13128588
申请日:2009-11-26
IPC分类号: G06F15/18 , G10L17/26 , G06N99/00 , G10L15/02 , G10L21/0216
CPC分类号: G10L17/26 , G06N99/005 , G10L15/02 , G10L21/0216 , G10L25/48
摘要: A digital sound identification system for storing a Markov model is disclosed. A processor is coupled to a sound data input, working memory, and a stored program memory for executing processor control code to input sound data for a sound to be identified. The sample sound data defines a sample frequency domain data energy in a range of frequency. Mean and variance values for a Markov model of the sample sound are generated. The Markov model is stored in the non-volatile memory. Interference sound data defining interference frequency domain data is inputted. The mean and variance values of the Markov model using the interference frequency domain data are adjusted. Sound data defining other sound frequency domain data are inputted. A probability of the other sound frequency domain data fitting the Markov model is determined. Finally, sound identification data dependent on the probability is outputted.
摘要翻译: 公开了一种用于存储马尔可夫模型的数字声音识别系统。 处理器耦合到声音数据输入,工作存储器和存储的程序存储器,用于执行处理器控制代码以输入要识别的声音的声音数据。 样本声音数据定义频率范围内的采样频域数据能量。 产生样本声音的马尔可夫模型的均值和方差值。 马尔可夫模型存储在非易失性存储器中。 输入定义干扰频域数据的干扰声音数据。 调整使用干扰频域数据的马尔科夫模型的均值和方差值。 输入定义其他声频域数据的声音数据。 确定其他声频域数据拟合马尔可夫模型的概率。 最后,输出取决于概率的声音识别数据。
-
公开(公告)号:US11250877B2
公开(公告)日:2022-02-15
申请号:US16521949
申请日:2019-07-25
申请人: Audio Analytic Ltd
IPC分类号: G10L25/66 , G06F16/483 , A61B5/00 , G10L25/93
摘要: A method for generating a health indicator for at least one person of a group of people, the method comprising: receiving, at a processor, captured sound, where the captured sound is sound captured from the group of people; comparing the captured sound to a plurality of sound models to detect at least one non-speech sound event in the captured sound, each of the plurality of sound models associated with a respective health-related sound type; determining metadata associated with the at least one non-speech sound event; assigning the at least one non-speech sound event and the metadata to at least one person of the group of people; and outputting a message identifying the at least one non-speech event and the metadata to a health indicator generator module to generate a health indicator for the at least one person to whom the at least one non-speech sound event is assigned.
-
公开(公告)号:US10978093B1
公开(公告)日:2021-04-13
申请号:US16718761
申请日:2019-12-18
申请人: Audio Analytic Ltd
摘要: A computing device, the computing device comprising a processor configured to: receive audio information relating to one or more non-verbal sounds captured by a microphone in an environment of a user; receive motion information that is based on motion sensor data captured by a motion sensor, said motion information relating to motion of said user in the environment; process the audio information and the motion information to recognise an activity of said user; and output an activity recognition notification indicating said activity.
-
公开(公告)号:US20210104255A1
公开(公告)日:2021-04-08
申请号:US16594274
申请日:2019-10-07
申请人: Audio Analytic Ltd
发明人: Christopher James Mitchell , Sacha Krstulovic , Cagdas Bilen , Neil Cooper , Julian Harris , Arnoldas Jasonas , Joe Patrick Lynas
摘要: A device or system is provided which is configured to detect one or more sound events and/or scenes associa ted with a predetermined context, and to provide an assistive output on fulfilment of that context.
-
公开(公告)号:US20210104230A1
公开(公告)日:2021-04-08
申请号:US16594624
申请日:2019-10-07
申请人: Audio Analytic Ltd.
发明人: Christopher James Mitchell , Sacha Krstulovic , Cagdas Bilen , Juan Azcarreta Ortiz , Giacomo Ferroni , Amoldas Jasonas , Francesco Tuveri
摘要: A method for recognising at least one of a non-verbal sound event and a scene in an audio signal comprising a sequence of frames of audio data, the method comprising: for each frame of the sequence: processing the frame of audio data to extract multiple acoustic features for the frame of audio data; and classifying the acoustic features to classify the frame by determining, for each of a set of sound classes, a score that the frame represents the sound class; processing the sound class scores for multiple frames of the sequence of frames to generate, for each frame, a sound class decision for each frame; and processing the sound class decisions for the sequence of frames to recognise the at least one of a non-verbal sound event and a scene.
-
-
-
-
-
-
-
-
-