Computer apparatus and method implementing sound detection with an image capture system

    Publication No.: US11468904B2

    Publication date: 2022-10-11

    Application No.: US16718847

    Filing date: 2019-12-18

    Abstract: A computing device comprising a processor, the processor configured to: receive, from an image capture system, an image captured in an environment and image metadata associated with the image, the image metadata comprising an image capture time; receive a sound recognition message from a sound recognition module, the sound recognition message comprising (i) a sound recognition identifier indicating a target sound or scene that has been recognised based on captured audio data captured in the environment, and (ii) time information associated with the sound recognition identifier; detect that the target sound or scene occurred at a time that the image was captured based on the image metadata and the time information in the sound recognition message; and output a camera control command to said image capture system based on said detection.
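
The time-matching step in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, the start/end representation of the "time information", the tolerance parameter, and the `RECAPTURE:` command string are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImageMetadata:
    capture_time: float  # assumed unit: seconds since some shared clock origin


@dataclass
class SoundRecognitionMessage:
    sound_id: str      # identifier of the recognised target sound or scene
    start_time: float  # assumed form of the "time information": start of the sound
    end_time: float    # assumed: end of the sound, on the same clock as the camera


def sound_overlaps_capture(meta: ImageMetadata,
                           msg: SoundRecognitionMessage,
                           tolerance: float = 0.0) -> bool:
    """Return True if the image was captured while the sound was occurring."""
    return msg.start_time - tolerance <= meta.capture_time <= msg.end_time + tolerance


def camera_command(meta: ImageMetadata,
                   msg: SoundRecognitionMessage,
                   tolerance: float = 0.5) -> Optional[str]:
    """Return a hypothetical camera control command, or None if no match."""
    if sound_overlaps_capture(meta, msg, tolerance):
        # Hypothetical command format; the abstract does not specify one.
        return f"RECAPTURE:{msg.sound_id}"
    return None
```

The tolerance absorbs small clock offsets between the camera and the sound recognition module; whether the real system needs one is not stated in the abstract.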

    IMPROVEMENTS IN SOUND REPRODUCTION
    2.
    Invention application

    Publication No.: US20200329330A1

    Publication date: 2020-10-15

    Application No.: US16635788

    Filing date: 2018-07-31

    IPC classification: H04S7/00 H04R5/02 H04R3/00

    Abstract: A method, and system, of digital room correction for a device, such as a smart speaker, including a loudspeaker. The method comprises capturing audio from an environment local to the device, for example from one or more microphones of a smart speaker. The captured audio is then processed to recognize one or more categories of sound. A digital room correction procedure may then be controlled dependent upon recognition and/or analysis of at least one of the categories of sound.

    Method of training a sound event recognition system

    Publication No.: US10783434B1

    Publication date: 2020-09-22

    Application No.: US16594605

    Filing date: 2019-10-07

    Abstract: A method of training a non-verbal sound class detection machine learning system, the non-verbal sound class detection machine learning system comprising a machine learning model configured to: receive data for each frame of a sequence of frames of audio data obtained from an audio signal; for each frame of the sequence of frames: process the data for multiple frames; and output data for at least one sound class score representative of a degree of affiliation of the frame with at least one sound class of a plurality of sound classes, wherein the plurality of sound classes comprises: one or more target sound classes; and a non-target sound class representative of an absence of each of the one or more target sound classes; wherein the method comprises: training the machine learning model using a loss function.
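
The abstract does not say which loss function is used. As a rough illustration only, the sketch below assumes a per-frame softmax cross-entropy in which class index 0 stands for the non-target class and the remaining indices stand for target sound classes. The function names and the class layout are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one frame's class scores."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def frame_cross_entropy(frame_logits: Sequence[Sequence[float]],
                        frame_labels: Sequence[int]) -> float:
    """Mean cross-entropy over a sequence of frames.

    frame_logits[t][c] is the raw score of class c for frame t.
    frame_labels[t] is the true class index for frame t
    (assumed layout: 0 = non-target class, 1.. = target classes).
    """
    if not frame_logits or len(frame_logits) != len(frame_labels):
        raise ValueError("need one label per frame and at least one frame")
    losses = []
    for logits, label in zip(frame_logits, frame_labels):
        probs = softmax(logits)
        losses.append(-math.log(probs[label]))
    return sum(losses) / len(losses)
```

A training loop would minimise this value with respect to the model parameters that produce `frame_logits`; that part is omitted here because the abstract gives no detail on the model or optimiser.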

    Sound identification systems
    4.
    Granted invention patent
    Legal status: In force

    Publication No.: US09286911B2

    Publication date: 2016-03-15

    Application No.: US14533837

    Filing date: 2014-11-05

    Abstract: A digital sound identification system for storing a Markov model is disclosed. A processor is coupled to a sound data input, working memory, and a stored program memory for executing processor control code to input sound data for a sound to be identified. The sample sound data defines a sample frequency domain data energy in a range of frequency. Mean and variance values for a Markov model of the sample sound are generated. The Markov model is stored in the non-volatile memory. Interference sound data defining interference frequency domain data is inputted. The mean and variance values of the Markov model using the interference frequency domain data are adjusted. Sound data defining other sound frequency domain data are inputted. A probability of the other sound frequency domain data fitting the Markov model is determined. Finally, sound identification data dependent on the probability is outputted.
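
The mean/variance steps in this abstract can be sketched in Python under strong simplifying assumptions. The sketch models a single Gaussian state per frequency bin and omits the state transitions of a full Markov model. The adjustment rule (adding the interference mean and variance to the model's values, as for a sum of independent signals) is an assumption; the abstract does not state how the adjustment is made. All function names are hypothetical.

```python
import math
from statistics import fmean, pvariance
from typing import List, Sequence, Tuple

Frames = Sequence[Sequence[float]]  # frames x frequency bins of energy values


def fit_state(frames: Frames) -> Tuple[List[float], List[float]]:
    """Per-bin mean and variance of frequency-domain energy over all frames."""
    bins = list(zip(*frames))
    means = [fmean(b) for b in bins]
    # Floor the variance so the likelihood below never divides by zero.
    variances = [max(pvariance(b), 1e-6) for b in bins]
    return means, variances


def adjust_for_interference(means: Sequence[float],
                            variances: Sequence[float],
                            interference: Frames) -> Tuple[List[float], List[float]]:
    """Shift the model by the interference statistics (assumed additive rule)."""
    i_means, i_vars = fit_state(interference)
    return ([m + im for m, im in zip(means, i_means)],
            [v + iv for v, iv in zip(variances, i_vars)])


def log_likelihood(frame: Sequence[float],
                   means: Sequence[float],
                   variances: Sequence[float]) -> float:
    """Log-probability of one frame under independent per-bin Gaussians."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
               for x, m, v in zip(frame, means, variances))
```

Sound identification data could then be derived by comparing `log_likelihood` against a threshold or against the scores of competing models; the abstract leaves that decision rule open.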

    CONTROLLING A USER INTERFACE
    6.
    Invention application

    Publication No.: US20210090573A1

    Publication date: 2021-03-25

    Application No.: US16580959

    Filing date: 2019-09-24

    Abstract: A computing device for controlling a user interface of the computing device, the computing device comprising a processor configured to: detect at least one target sound in a monitored environment; determine an operating mode of the computing device that is associated with the at least one target sound; output content, via the user interface of the computing device, that is associated with the operating mode, wherein the content prompts a user of the computing device to perform an action using an input device of the computing device to instruct the computing device to control a controllable device in the monitored environment in response to the recognition of the at least one target sound.
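
The chain described here (target sound, then operating mode, then user prompt) can be sketched as two lookups. Everything concrete in the sketch is hypothetical: the sound identifiers, mode names, prompt wording, and device names are illustrative and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Prompt:
    operating_mode: str
    message: str  # content shown to the user via the user interface
    device: str   # controllable device the user is invited to control


# Hypothetical association of target sounds with operating modes.
MODE_FOR_SOUND: Dict[str, str] = {
    "smoke_alarm": "emergency",
    "baby_cry": "nursery",
}

# Hypothetical content associated with each operating mode.
PROMPT_FOR_MODE: Dict[str, Prompt] = {
    "emergency": Prompt("emergency",
                        "Smoke alarm detected. Tap to turn on the hallway lights.",
                        "hallway_lights"),
    "nursery": Prompt("nursery",
                      "Baby crying detected. Tap to play a lullaby.",
                      "nursery_speaker"),
}


def prompt_for_sound(sound_id: str) -> Optional[Prompt]:
    """Return the prompt for a detected target sound, or None if unmapped."""
    mode = MODE_FOR_SOUND.get(sound_id)
    if mode is None:
        return None
    return PROMPT_FOR_MODE[mode]
```

Note that the device is only controlled after the user acts on the prompt, which matches the abstract's wording that the content prompts the user to perform an action rather than triggering the device automatically.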