-
公开(公告)号:US10224019B2
公开(公告)日:2019-03-05
申请号:US15893015
申请日:2018-02-09
申请人: Audio Analytic Ltd.
摘要: Broadly speaking, embodiments of the present invention provide a wearable audio device including one or a plurality of microphones, a sound recognition systems and a controller to control the device based on one or more recognized sounds or classes of sound. Embodiments use stored sound models.
-
公开(公告)号:US20180233125A1
公开(公告)日:2018-08-16
申请号:US15893015
申请日:2018-02-09
申请人: Audio Analytic Ltd
CPC分类号: G10K11/17827 , G10K11/17823 , G10K11/17837 , G10K11/34 , G10K2210/1081 , G10L25/21 , G10L25/51 , G10L25/78 , H04R1/1041 , H04R1/1083 , H04R1/406 , H04R3/005 , H04R2420/07 , H04S7/304 , H04S2400/11 , H04S2420/01
摘要: Broadly speaking, embodiments of the present invention provide a wearable audio device including one or a plurality of microphones, a sound recognition systems and a controller to control the device based on one or more recognised sounds or classes of sound. Embodiments use stored sound models.
-
公开(公告)号:US11587556B2
公开(公告)日:2023-02-21
申请号:US16594624
申请日:2019-10-07
申请人: Audio Analytic Ltd
发明人: Christopher James Mitchell , Sacha Krstulovic , Cagdas Bilen , Juan Azcarreta Ortiz , Giacomo Ferroni , Arnoldas Jasonas , Francesco Tuveri
摘要: A method for recognising at least one of a non-verbal sound event and a scene in an audio signal comprising a sequence of frames of audio data, the method comprising: for each frame of the sequence: processing the frame of audio data to extract multiple acoustic features for the frame of audio data; and classifying the acoustic features to classify the frame by determining, for each of a set of sound classes, a score that the frame represents the sound class; processing the sound class scores for multiple frames of the sequence of frames to generate, for each frame, a sound class decision for each frame; and processing the sound class decisions for the sequence of frames to recognise the at least one of a non-verbal sound event and a scene.
-
公开(公告)号:US20210090573A1
公开(公告)日:2021-03-25
申请号:US16580959
申请日:2019-09-24
申请人: Audio Analytic Ltd
IPC分类号: G10L15/30 , G10L15/22 , G06F3/0484 , G10L25/51
摘要: A computing device for controlling a user interface of the computing device, the computing device comprising a processor configured to: detect at least one target sound in a monitored environment; determine an operating mode of the computing device that is associated with the at least one target sound; output content, via the user interface of the computing device, that is associated with the operating mode, wherein the content prompts a user of the computing device to perform an action using an input device of the computing device to instruct the computing device to control a controllable device in the monitored environment in response to the recognition of the at least one target sound.
-
公开(公告)号:US11468904B2
公开(公告)日:2022-10-11
申请号:US16718847
申请日:2019-12-18
申请人: Audio Analytic Ltd
IPC分类号: G10L15/02 , G10L17/26 , G06F16/683 , H04N5/232
摘要: A computing device comprising a processor, the processor configured to: receive, from an image capture system, an image captured in an environment and image metadata associated with the image, the image metadata comprising an image capture time; receive a sound recognition message from a sound recognition module, the sound recognition message comprising (i) a sound recognition identifier indicating a target sound or scene that has been recognised based on captured audio data captured in the environment, and (ii) time information associated with the sound recognition identifier; detect that the target sound or scene occurred at a time that the image was captured based on the image metadata and the time information in the sound recognition message; and output a camera control command to said image capture system based on said detection.
-
公开(公告)号:US20210097727A1
公开(公告)日:2021-04-01
申请号:US16586050
申请日:2019-09-27
申请人: Audio Analytic Ltd
发明人: Chris James Mitchell , Sacha Krstulovic , Cagdas Bilen , Neil Cooper , Julian Harris , Arnoldas Jasonas , Joe Patrick Lynas
摘要: Sound detection and identification leads to responsiveness within an augmented reality environment. Information about an identified sound can be converted into a command for implementation by an augmented reality system, for display of a desired on-screen augmented reality effect.
-
公开(公告)号:US20200329330A1
公开(公告)日:2020-10-15
申请号:US16635788
申请日:2018-07-31
申请人: Audio Analytic LTD
摘要: A method, and system, of digital room correction for a device, such as a smart speaker, including a loudspeaker. The method comprises capturing audio from an environment local to the device, for example from one or more microphones of a smart speaker. The captured audio is then processed to recognize one or more categories of sound. A digital room correction procedure may then be controlled dependent upon recognition and/or analysis of at least one of the categories of sound.
-
公开(公告)号:US10783434B1
公开(公告)日:2020-09-22
申请号:US16594605
申请日:2019-10-07
申请人: Audio Analytic Ltd
发明人: Christopher James Mitchell , Sacha Krstulovic , Cagdas Bilen , Juan Azcarreta Ortiz , Giacomo Ferroni , Arnoldas Jasonas , Francesco Tuveri
摘要: A method of training a non-verbal sound class detection machine learning system, the non-verbal sound class detection machine learning system comprising a machine learning model configured to: receive data for each frame of a sequence of frames of audio data obtained from an audio signal; for each frame of the sequence of frames: process the data for multiple frames; and output data for at least one sound class score representative of a degree of affiliation of the frame with at least one sound class of a plurality of sound classes, wherein the plurality of sound classes comprises: one or more target sound classes; and a non-target sound class representative of an absence of each of the one or more target sound classes; wherein the method comprises: training the machine learning model using a loss function.
-
公开(公告)号:US20200035261A1
公开(公告)日:2020-01-30
申请号:US16521949
申请日:2019-07-25
申请人: Audio Analytic Ltd
IPC分类号: G10L25/66 , G10L25/93 , A61B5/00 , G06F16/483
摘要: A method for generating a health indicator for at least one person of a group of people, the method comprising: receiving, at a processor, captured sound, where the captured sound is sound captured from the group of people; comparing the captured sound to a plurality of sound models to detect at least one non-speech sound event in the captured sound, each of the plurality of sound models associated with a respective health-related sound type; determining metadata associated with the at least one non-speech sound event; assigning the at least one non-speech sound event and the metadata to at least one person of the group of people; and outputting a message identifying the at least one non-speech event and the metadata to a health indicator generator module to generate a health indicator for the at least one person to whom the at least one non-speech sound event is assigned.
-
公开(公告)号:US09286911B2
公开(公告)日:2016-03-15
申请号:US14533837
申请日:2014-11-05
申请人: AUDIO ANALYTIC LTD.
IPC分类号: G06F15/18 , G10L25/51 , G10L15/02 , G06F17/18 , G10L15/14 , G10L25/63 , G10L25/57 , G08B13/00 , G10L15/20
CPC分类号: G10L25/51 , G06F17/18 , G08B13/00 , G08B13/16 , G10L15/02 , G10L15/142 , G10L15/20 , G10L25/57 , G10L25/63
摘要: A digital sound identification system for storing a Markov model is disclosed. A processor is coupled to a sound data input, working memory, and a stored program memory for executing processor control code to input sound data for a sound to be identified. The sample sound data defines a sample frequency domain data energy in a range of frequency. Mean and variance values for a Markov model of the sample sound are generated. The Markov model is stored in the non-volatile memory. Interference sound data defining interference frequency domain data is inputted. The mean and variance values of the Markov model using the interference frequency domain data are adjusted. Sound data defining other sound frequency domain data are inputted. A probability of the other sound frequency domain data fitting the Markov model is determined. Finally, sound identification data dependent on the probability is outputted.
摘要翻译: 公开了一种用于存储马尔可夫模型的数字声音识别系统。 处理器耦合到声音数据输入,工作存储器和存储的程序存储器,用于执行处理器控制代码以输入要识别的声音的声音数据。 样本声音数据定义频率范围内的采样频域数据能量。 产生样本声音的马尔可夫模型的均值和方差值。 马尔可夫模型存储在非易失性存储器中。 输入定义干扰频域数据的干扰声音数据。 调整使用干扰频域数据的马尔科夫模型的均值和方差值。 输入定义其他声频域数据的声音数据。 确定其他声频域数据拟合马尔可夫模型的概率。 最后,输出取决于概率的声音识别数据。
-
-
-
-
-
-
-
-
-