Computer apparatus and method implementing sound detection with an image capture system

    Publication No.: US11468904B2

    Publication date: 2022-10-11

    Application No.: US16718847

    Filing date: 2019-12-18

    Abstract: A computing device comprising a processor, the processor configured to: receive, from an image capture system, an image captured in an environment and image metadata associated with the image, the image metadata comprising an image capture time; receive a sound recognition message from a sound recognition module, the sound recognition message comprising (i) a sound recognition identifier indicating a target sound or scene that has been recognised based on captured audio data captured in the environment, and (ii) time information associated with the sound recognition identifier; detect that the target sound or scene occurred at a time that the image was captured based on the image metadata and the time information in the sound recognition message; and output a camera control command to said image capture system based on said detection.
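
The detection step in this abstract amounts to comparing two timestamps. The following is a minimal sketch of that comparison, not the patented implementation: the class names, the `start_time`/`end_time` fields, the `tolerance` parameter, and the `RETAIN_IMAGE` command string are all hypothetical, and times are assumed to be seconds on a shared clock.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImageMetadata:
    capture_time: float  # assumed unit: seconds on a clock shared with the audio


@dataclass
class SoundRecognitionMessage:
    sound_id: str      # sound recognition identifier, e.g. "glass_break" (hypothetical)
    start_time: float  # assumed form of the "time information": start of the sound
    end_time: float    # assumed: end of the sound


def camera_command(meta: ImageMetadata,
                   msg: SoundRecognitionMessage,
                   tolerance: float = 0.5) -> Optional[str]:
    """Return a (hypothetical) camera control command if the image was
    captured while the recognised sound was occurring, else None."""
    window_start = msg.start_time - tolerance
    window_end = msg.end_time + tolerance
    if window_start <= meta.capture_time <= window_end:
        return f"RETAIN_IMAGE:{msg.sound_id}"
    return None
```

The tolerance window is one possible way to absorb clock skew between the camera and the sound recognition module; the abstract itself does not say how the two times are compared.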

    Method of training a sound event recognition system

    Publication No.: US10783434B1

    Publication date: 2020-09-22

    Application No.: US16594605

    Filing date: 2019-10-07

    Abstract: A method of training a non-verbal sound class detection machine learning system, the non-verbal sound class detection machine learning system comprising a machine learning model configured to: receive data for each frame of a sequence of frames of audio data obtained from an audio signal; for each frame of the sequence of frames: process the data for multiple frames; and output data for at least one sound class score representative of a degree of affiliation of the frame with at least one sound class of a plurality of sound classes, wherein the plurality of sound classes comprises: one or more target sound classes; and a non-target sound class representative of an absence of each of the one or more target sound classes; wherein the method comprises: training the machine learning model using a loss function.
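
The abstract does not say which loss function is used. As an illustration only, the sketch below assumes a per-frame softmax cross-entropy in which class index 0 stands for the non-target class and the remaining indices are target sound classes. The function names and the index convention are assumptions, not taken from the patent.

```python
import math


def softmax(scores):
    """Convert raw per-class scores for one frame into probabilities."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def frame_cross_entropy(frame_scores, frame_labels):
    """Mean cross-entropy over a sequence of frames.

    frame_scores: one list of class scores per frame
                  (index 0 = non-target class, by assumption).
    frame_labels: the true class index for each frame.
    """
    losses = []
    for scores, label in zip(frame_scores, frame_labels):
        probs = softmax(scores)
        losses.append(-math.log(probs[label]))
    return sum(losses) / len(losses)
```

With uniform scores over three classes, every class gets probability 1/3, so the loss for any label is ln 3. A model that scores the correct class higher yields a smaller loss.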

    Sound identification systems
    Granted invention patent (status: in force)

    Publication No.: US09286911B2

    Publication date: 2016-03-15

    Application No.: US14533837

    Filing date: 2014-11-05

    Abstract: A digital sound identification system for storing a Markov model is disclosed. A processor is coupled to a sound data input, working memory, and a stored program memory for executing processor control code to input sample sound data for a sound to be identified. The sample sound data defines sample frequency domain data energy in a range of frequencies. Mean and variance values for a Markov model of the sample sound are generated. The Markov model is stored in non-volatile memory. Interference sound data defining interference frequency domain data is input. The mean and variance values of the Markov model are adjusted using the interference frequency domain data. Sound data defining other sound frequency domain data is input. A probability of the other sound frequency domain data fitting the Markov model is determined. Finally, sound identification data dependent on the probability is output.
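
The sequence of steps in this abstract (fit mean and variance, adjust for interference, score new data) can be illustrated with a heavily simplified sketch. It uses a single Gaussian emission state with independent frequency bands rather than a full Markov model with transitions, and it assumes interference energy adds independently to the sample energy, so that means and variances add. The abstract does not state the actual adjustment rule, and all function names here are hypothetical.

```python
import math


def fit_mean_var(frames):
    """Per-band mean and (population) variance over a list of frames,
    where each frame is a list of frequency-band energies."""
    n = len(frames)
    bands = len(frames[0])
    mean = [sum(f[b] for f in frames) / n for b in range(bands)]
    var = [sum((f[b] - mean[b]) ** 2 for f in frames) / n for b in range(bands)]
    return mean, var


def adjust_for_interference(mean, var, interference_frames):
    """Assumed rule: interference adds independently to the sample sound,
    so per-band means and variances are summed."""
    i_mean, i_var = fit_mean_var(interference_frames)
    adj_mean = [m + im for m, im in zip(mean, i_mean)]
    adj_var = [v + iv for v, iv in zip(var, i_var)]
    return adj_mean, adj_var


def log_likelihood(frame, mean, var, floor=1e-6):
    """Log-probability of one frame under a diagonal Gaussian."""
    total = 0.0
    for x, m, v in zip(frame, mean, var):
        v = max(v, floor)  # avoid division by zero for constant bands
        total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total
```

A frame close to the adjusted mean scores a higher log-likelihood than a distant one. A threshold on that score would be one way to produce the "sound identification data dependent on the probability".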


    Sound identification systems
    Granted invention patent (status: in force)

    Publication No.: US08918343B2

    Publication date: 2014-12-23

    Application No.: US13128588

    Filing date: 2009-11-26

    Abstract: A digital sound identification system for storing a Markov model is disclosed. A processor is coupled to a sound data input, working memory, and a stored program memory for executing processor control code to input sample sound data for a sound to be identified. The sample sound data defines sample frequency domain data energy in a range of frequencies. Mean and variance values for a Markov model of the sample sound are generated. The Markov model is stored in non-volatile memory. Interference sound data defining interference frequency domain data is input. The mean and variance values of the Markov model are adjusted using the interference frequency domain data. Sound data defining other sound frequency domain data is input. A probability of the other sound frequency domain data fitting the Markov model is determined. Finally, sound identification data dependent on the probability is output.


    Sound detection
    Granted invention patent

    Publication No.: US11250877B2

    Publication date: 2022-02-15

    Application No.: US16521949

    Filing date: 2019-07-25

    Abstract: A method for generating a health indicator for at least one person of a group of people, the method comprising: receiving, at a processor, captured sound, where the captured sound is sound captured from the group of people; comparing the captured sound to a plurality of sound models to detect at least one non-speech sound event in the captured sound, each of the plurality of sound models associated with a respective health-related sound type; determining metadata associated with the at least one non-speech sound event; assigning the at least one non-speech sound event and the metadata to at least one person of the group of people; and outputting a message identifying the at least one non-speech event and the metadata to a health indicator generator module to generate a health indicator for the at least one person to whom the at least one non-speech sound event is assigned.
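
The final stage of this method, turning assigned sound events into a per-person indicator, might be sketched as a simple aggregation. The sketch assumes detection and person assignment have already happened upstream. The `SoundEvent` fields and the choice of event counts as the "health indicator" are hypothetical, since the abstract does not define the indicator's form.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass
class SoundEvent:
    sound_type: str   # health-related sound type, e.g. "cough" (hypothetical label)
    timestamp: float  # example of event metadata; unit assumed to be seconds
    person_id: str    # person the event was assigned to (assumed done upstream)


def health_indicators(events):
    """Count non-speech sound events per person and per sound type.

    Returns {person_id: {sound_type: count}} as a stand-in for a
    health indicator.
    """
    counts = defaultdict(Counter)
    for event in events:
        counts[event.person_id][event.sound_type] += 1
    return {person: dict(c) for person, c in counts.items()}
```

A real indicator would likely weight events by their metadata (time of day, duration, frequency over a period); plain counts are used here only to keep the example short.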