-
Publication No.: US20230332796A1
Publication Date: 2023-10-19
Application No.: US18340470
Filing Date: 2023-06-23
CPC Classification: F24F11/63, G06F3/167, G10L15/22, G10L25/51, G10L25/78, H04R1/406, H04R3/005, F24F2120/10
Abstract: An occupancy tracking device is configured to receive a plurality of sound samples over a predetermined time period. The device is further configured to compute an audio signature for each sound sample. The device is further configured to populate entries in a voice data log for the sound samples, to identify one or more clusters based on the audio signatures associated with the populated entries, and to determine the number of clusters that are identified. The device is further configured to determine a predicted occupancy level based on the number of clusters that are identified and to control a Heating, Ventilation, and Air Conditioning (HVAC) system based on the predicted occupancy level.
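
The clustering step in this abstract can be illustrated with a minimal sketch. The abstract does not say how audio signatures are computed or clustered, so everything here is an assumption: signatures are treated as small numeric tuples, clusters are formed greedily by distance to a running centroid, and the names `count_clusters`, `hvac_mode`, and the `radius` parameter are hypothetical.

```python
from math import dist


def count_clusters(signatures, radius):
    """Greedily group signatures; each distinct cluster stands for one voice.

    A signature joins the first cluster whose centroid lies within `radius`
    (Euclidean distance); otherwise it starts a new cluster.
    """
    clusters = []  # list of (centroid, member_count)
    for sig in signatures:
        for i, (centroid, n) in enumerate(clusters):
            if dist(centroid, sig) <= radius:
                # Update the running mean of the cluster.
                updated = tuple((c * n + s) / (n + 1) for c, s in zip(centroid, sig))
                clusters[i] = (updated, n + 1)
                break
        else:
            clusters.append((tuple(sig), 1))
    return len(clusters)


def hvac_mode(predicted_occupancy):
    """Map an occupancy estimate to a hypothetical HVAC setting."""
    if predicted_occupancy == 0:
        return "setback"
    return "comfort" if predicted_occupancy <= 3 else "high_ventilation"


# Five sound samples that fall into three groups of similar signatures.
log = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 4.9), (10.0, 0.0)]
occupancy = count_clusters(log, radius=1.0)
print(occupancy, hvac_mode(occupancy))
```

Here the predicted occupancy is simply the cluster count; a real device might scale or smooth that estimate before acting on it.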
-
Publication No.: US11790936B1
Publication Date: 2023-10-17
Application No.: US17808406
Filing Date: 2022-06-23
Applicant: RPS Group, Inc.
Inventor(s): David John O'Hara, Patrick Rath
Abstract: Methods and systems for detecting marine mammals. Acoustic data can be received from one or more hydrophones. The acoustic data can be sampled, and the sampled acoustic data can be transformed to time-frequency image data. The image data can be processed to transform the data for input to a model. The model can be trained to detect the presence or absence of marine mammal vocalizations in the acoustic data. The model can output a prediction of whether or not a mammal is present.
-
Publication No.: US20230315176A1
Publication Date: 2023-10-05
Application No.: US17921310
Filing Date: 2021-04-12
Inventor(s): CHUNJIAN LI, Dong Shi
Abstract: A speech wakeup method and device, and a readable storage medium are provided. The voice wakeup method includes: detecting voice signals that are input into at least two microphones and that meet a first condition; and determining, based on whether voice energy of the voice signals input into the at least two microphones meets a second condition, whether to wake up the electronic device; and if the second condition is met, waking up the electronic device; or if the second condition is not met, continuing to detect a voice signal input into the microphone. The electronic device can be woken up in a wakeup-keyword-free manner.
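
The two-stage decision described in this abstract might look roughly like the sketch below. The abstract does not define either condition, so both are assumptions made for illustration: the first condition is taken to be "every microphone sees frame energy above a detection floor", and the second "every microphone sees energy above a higher wake threshold". The names `frame_energy`, `should_wake`, `detect_floor`, and `wake_energy` are hypothetical.

```python
def frame_energy(samples):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in samples) / len(samples) if samples else 0.0


def should_wake(mic_frames, detect_floor=1e-4, wake_energy=1e-2):
    """Return True when the device should wake up.

    mic_frames: one frame of samples per microphone (at least two needed).
    """
    energies = [frame_energy(frame) for frame in mic_frames]
    # Assumed first condition: a voice-like signal is present on >= 2 mics.
    if len(energies) < 2 or any(e < detect_floor for e in energies):
        return False
    # Assumed second condition: the voice energy is high on every mic.
    return all(e >= wake_energy for e in energies)


loud = [0.2, -0.2] * 4    # frame energy of about 0.04
quiet = [0.02, -0.02] * 4  # frame energy of about 0.0004
print(should_wake([loud, loud]), should_wake([loud, quiet]))
```

When `should_wake` returns False, a caller would keep polling new frames, which mirrors the "continuing to detect" branch of the abstract.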
-
84.
Publication No.: US20230306986A1
Publication Date: 2023-09-28
Application No.: US18135500
Filing Date: 2023-04-17
Inventor(s): Lahar GUPTA, Sachin Chugh, Mangi Lal Sharma, Mohit Sharma
CPC Classification: G10L25/78, G10L25/93, G10L15/22, G10L2025/786, G10L2015/227
Abstract: A method of adjusting a predefined listening time of a voice assistant device includes receiving an audio input; extracting at least one of a speech component and a non-speech artifact from the audio input; determining a user breathing pattern based on the at least one of the speech component and the non-speech artifact; identifying at least one attribute that impacts the user breathing pattern based on at least one non-speech component captured from an environment and the voice assistant device; determining, after detecting a pause in the audio input, whether a user's intention is to continue a conversation based on an analysis of the user breathing pattern and the at least one attribute; and dynamically adjusting the predefined listening time of the voice assistant device to continue listening for voice commands in the conversation based on a determination that the user's intention is to continue the conversation.
-
Publication No.: US11770665B2
Publication Date: 2023-09-26
Application No.: US17876041
Filing Date: 2022-07-28
Applicant: Thomas Stachura
Inventor(s): Thomas Stachura
IPC Classification: G06F3/16, H04R29/00, G10L15/18, G10L25/78, G10L15/08, G10L15/22, G10L15/30, G10L25/51, G10L17/24, H04R3/00, G06F3/01, H04R5/04
CPC Classification: H04R29/004, G06F3/011, G06F3/017, G10L15/08, G10L15/18, G10L15/22, G10L15/30, G10L17/24, G10L25/51, G10L25/78, H04R3/005, H04R5/04, G10L2015/088, G10L2015/223, G10L2025/783, H04R2420/01, H04R2499/11
Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.
-
Publication No.: US11769508B2
Publication Date: 2023-09-26
Application No.: US18050958
Filing Date: 2022-10-28
Applicant: LG ELECTRONICS INC.
Inventor(s): Hansuk Shim
CPC Classification: G10L15/22, G06F3/167, G06F9/542, G06F18/241, G06N20/00, G10L15/08, G10L25/78, G10L2015/088
Abstract: Disclosed herein is an artificial intelligence apparatus including an input interface configured to receive speech data, and a processor configured to detect a non-utterance interval included in the speech data and determine presence/absence of a second utterance after the non-utterance interval according to characteristics of a first utterance before the non-utterance interval, when the non-utterance interval exceeds a set time.
-
Publication No.: US20230300526A1
Publication Date: 2023-09-21
Application No.: US18164052
Filing Date: 2023-02-03
Applicant: GN Audio A/S
CPC Classification: H04R3/005, G10L25/78, G10L25/60, H04R1/08, H04R1/406, H04R29/005, H04R2201/401
Abstract: The present disclosure relates to a microphone apparatus and an associated computer implemented method. The microphone apparatus comprises a main microphone array, an adaptive beamformer, a fixed beamformer, and an analyzer. The analyzer is configured to determine a first relative score based on the output of the fixed beamformer and the adaptive beamformer. The first relative score indicates a difference between the adaptive beamformer and the fixed beamformer.
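
One way to picture a "relative score" between the two beamformer outputs is as a power ratio in decibels. The abstract does not say how the score is calculated, so the formula below is only an assumed stand-in, and `mean_power`, `relative_score_db`, and `eps` are hypothetical names.

```python
import math


def mean_power(samples):
    """Mean squared amplitude of a block of output samples."""
    return sum(s * s for s in samples) / len(samples)


def relative_score_db(fixed_out, adaptive_out, eps=1e-12):
    """Assumed score: adaptive-to-fixed output power ratio in dB.

    0 dB means both beamformers deliver the same power; negative values mean
    the adaptive beamformer is suppressing more of the input.
    """
    ratio = (mean_power(adaptive_out) + eps) / (mean_power(fixed_out) + eps)
    return 10.0 * math.log10(ratio)


fixed = [1.0, -1.0, 1.0, -1.0]
adaptive = [0.5, -0.5, 0.5, -0.5]  # half the amplitude, a quarter of the power
print(round(relative_score_db(fixed, adaptive), 2))
```

The small `eps` term only guards against a division by zero or a logarithm of zero on silent blocks.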
-
88.
Publication No.: US20230298583A1
Publication Date: 2023-09-21
Application No.: US18200518
Filing Date: 2023-05-22
Applicant: GOOGLE LLC
Inventor(s): Matthew Sharifi, Victor Carbune
IPC Classification: G10L15/22, G06F3/04886, G10L25/78, G06F3/16
CPC Classification: G10L15/22, G06F3/04886, G10L25/78, G06F3/167, G10L2015/223, G10L2015/228
Abstract: Implementations set forth relate to suggesting an alternate interface modality when an automated assistant and/or a user is expected to not understand a particular interaction between the user and the automated assistant. In some instances, the automated assistant can pre-emptively determine that a forthcoming and/or ongoing interaction between a user and an automated assistant may experience interference. Based on this determination, the automated assistant can provide an indication that the interaction may not be successful and/or that the user should interact with the automated assistant through a different modality. For example, the automated assistant can render a keyboard interface at a portable computing device when the automated assistant determines that an audio interface of the portable computing device is experiencing interference.
-
89.
Publication No.: US11756576B2
Publication Date: 2023-09-12
Application No.: US17692640
Filing Date: 2022-03-11
Inventor(s): Zhe Wang
Abstract: An audio signal classification method includes determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory, and updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory, and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.
-
Publication No.: US11749294B2
Publication Date: 2023-09-05
Application No.: US16999233
Filing Date: 2020-08-21
Inventor(s): Wai Chung Chu
IPC Classification: G10L21/028, H04R1/40, G10L25/78, G10L21/0216
CPC Classification: G10L21/028, G10L25/78, H04R1/406, G10L2021/02166, H04R2430/20
Abstract: A system configured to perform directional speech separation. The system may dynamically associate directions of arrival with one or more audio sources in order to generate output audio data that separates each of the audio sources. The system identifies a target direction for each audio source, dynamically determines directions that are correlated with the target direction, and generates output signals for each audio source. The system may associate individual frequency bands with specific directions based on a time delay detected by two or more microphones. The system may determine a cross-correlation between each direction and the target direction and select directions with strong correlation. The system may generate time-frequency mask data indicating frequency bands corresponding to the directions associated with a particular audio source. Using the mask data, the system generates output audio data specific to the audio source, resulting in directional speech separation between different audio sources.
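
The link between an inter-microphone time delay and a direction can be sketched as follows. This is a simplified, full-band, time-domain illustration under stated assumptions (two microphones, a far-field source, a known spacing); the abstract describes a per-frequency-band association that is not reproduced here, and `best_lag` and `arrival_angle_deg` are hypothetical helper names.

```python
import math


def best_lag(x, y, max_lag):
    """Lag in samples that maximizes the cross-correlation of x and y.

    A positive result means y is a delayed copy of x.
    """
    best, best_val = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        val = 0.0
        for n in range(len(x)):
            m = n + lag
            if 0 <= m < len(y):
                val += x[n] * y[m]
        if val > best_val:
            best, best_val = lag, val
    return best


def arrival_angle_deg(lag, sample_rate, mic_spacing_m, speed_of_sound=343.0):
    """Far-field angle from broadside implied by a delay of `lag` samples."""
    s = (lag / sample_rate) * speed_of_sound / mic_spacing_m
    s = max(-1.0, min(1.0, s))  # clamp so asin stays defined
    return math.degrees(math.asin(s))


mic_a = [0, 0, 1, 0, 0, 0, 0, 0]
mic_b = [0, 0, 0, 0, 1, 0, 0, 0]  # same impulse, two samples later
lag = best_lag(mic_a, mic_b, max_lag=3)
print(lag, arrival_angle_deg(lag, sample_rate=16000, mic_spacing_m=0.1))
```

A per-band variant would apply the same idea to each frequency band's phase difference, which is closer to what the abstract outlines.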