-
公开(公告)号:US20230129867A1
公开(公告)日:2023-04-27
申请号:US17978730
申请日:2022-11-01
发明人: Raul Alejandro Casas , Marcin Ciolek , Samer Hijazi , Dror Maydan , Hua Mu , Erik Panu , Christopher Rowen
摘要: Systems and methods are disclosed for audio group identification for conferencing. For example, methods may include joining a conference call using a network interface; accessing an audio signal that has been captured using a microphone; detecting a control signal in the audio signal; and, responsive to detection of the control signal, invoking modification of an audio path of the conference call.
-
公开(公告)号:US11638078B2
公开(公告)日:2023-04-25
申请号:US17408630
申请日:2021-08-23
发明人: John Paul Lesso
摘要: There is described a switchable microphone device which may be switched between a digital output mode and an analog output mode. There is further described a system for use of such a device, which allows for the switching between analog and digital computing modes.
-
公开(公告)号:US11636674B2
公开(公告)日:2023-04-25
申请号:US16683369
申请日:2019-11-14
摘要: Techniques for virtual assistant situation commentary are provided. At least one image frame of a field of view (FOV) of a camera may be received, the at least one image frame intended to be sent to at least one participant of a talk group. A description associated with each element of a plurality of elements within the FOV of the camera may be generated. It may be determined that the at least one participant of the talk group is not currently visually engaged. Audio communication of a sender of the at least one image frame may be monitored to identify a reference to an element of the plurality of elements. The audio communication may be supplemented to include portions of the description of the element that were not included in the audio communication from the sender when it is determined that the at least one participant is not visually engaged.
-
公开(公告)号:US11632429B2
公开(公告)日:2023-04-18
申请号:US17207149
申请日:2021-03-19
发明人: Ramazan Demirli , Jiewei Jiang , Omid Sayadi , Steven Jay Young
IPC分类号: H04L67/125 , G10L21/0232 , H04L12/28 , H04R3/00 , H04K1/00 , A61B5/00 , A61F5/56 , G10L25/78 , G10L21/0208
摘要: An acoustic sensor is positioned in an environment and configured to generate a data stream responsive to acoustic energy in the environment. A controller is configured to receive the data stream. The controller is further configured to analyze the data stream to determine ambient acoustic signals. The controller is further configured to generate an ambient acoustic template based on the determined ambient acoustic signals. The controller is further configured to apply the ambient acoustic template to the data stream so that the ambient acoustic signals are suppressed in the data stream. The controller is further configured to analyze the data stream after the ambient acoustic signals are suppressed in order to determine if the acoustic energy in the environment includes acoustic energy of human snoring. The controller is further configured to issue a control signal to a second controller in order to engage a home automation device.
-
公开(公告)号:US11632346B1
公开(公告)日:2023-04-18
申请号:US16712761
申请日:2019-12-12
发明人: Abinash Mahapatra , Anuj Saluja , Ouning Zhang , Xinyu Miao , Ting Liu , Yanina Potashnik , Alfred Ying Fai Lui , Choon-Mun Hooi , Jeffrey John Easter , Oliver Huy Doan , Jonathan B. Assayag
IPC分类号: H04L12/58 , G06F3/0484 , G06F8/38 , G06F9/445 , G06F9/451 , H04L51/224 , H04R1/02 , G06F3/01 , G10L25/78 , G06F3/16 , G10L13/00
摘要: A device, such as a head-mounted wearable device (HMWD), provides audible notifications to a user with a voice user interface (VUI). A filtered subset of notifications addressed to the user, such as notifications from contacts associated with specified applications, are processed by a text to speech system that generates audio output for presentation to the user. The audio output may be presented using the HMWD. For example, the audio output generated from a text message received from a contact may be played on the device. The user may provide an input to play the notification again, initiate a reply, or take another action. The input may comprise a gesture on a touch sensor, activation of a button, verbal input acquired by a microphone, and so forth.
-
公开(公告)号:US20230093585A1
公开(公告)日:2023-03-23
申请号:US17480740
申请日:2021-09-21
摘要: An audio system for spatializing virtual sound sources is described. A microphone array of the audio system is configured to monitor sound in a local area. A controller of the audio system identifies sound sources within the local area using the monitored sound from the microphone array and determines their locations. The controller of the audio system generates a target position for a virtual sound source based on one or more constraints. The one or more constraints include that the target position be at least a threshold distance away from each of the determined locations of the identified sound sources. The controller generates one or more sound filters based in part on the target position to spatialize the virtual sound source. A transducer array of the audio system presents spatialized audio including the virtual sound source content based in part on the one or more sound filters.
-
公开(公告)号:US20230082325A1
公开(公告)日:2023-03-16
申请号:US17800943
申请日:2020-02-26
申请人: NEC Corporation
发明人: Shuji KOMEIJI , Hitoshi YAMAMOTO
摘要: An utterance end detection apparatus (2000) acquires source data 10 representing an audio signal including one or more utterances. The utterance end detection apparatus (2000) converts the source data (10) into text data (30). The utterance end detection apparatus (2000) detects a conversion unit that analyzes text data (30), acquires source data, and converts the source data into text data, and an end of each utterance included in an audio signal represented by the source data (10).
-
公开(公告)号:US11605381B2
公开(公告)日:2023-03-14
申请号:US17315890
申请日:2021-05-10
发明人: Dushyant Sharma , Patrick A. Naylor
IPC分类号: G10L15/22 , H04R1/40 , H04R3/00 , G10L25/84 , G10L15/32 , G10L15/20 , G06F16/65 , G10L17/06 , G10L25/78 , H04R5/04 , H04S7/00 , H04R29/00 , G10L21/028 , G06F16/68 , H04R3/04 , G16H15/00 , G06N20/00 , G16H10/60 , G16H40/20 , G10L15/26 , G10L21/0216 , G10L21/0272
摘要: A method, computer program product, and computing system for receiving information associated with an acoustic environment. Acoustic metadata associated with audio encounter information received by a first microphone system may be received. One or more speaker representations may be defined based upon, at least in part, the acoustic metadata associated with the audio encounter information and the information associated with the acoustic environment. One or more portions of the audio encounter information may be labeled with the one or more speaker representations and a speaker location within the acoustic environment.
-
公开(公告)号:US20230062598A1
公开(公告)日:2023-03-02
申请号:US17893693
申请日:2022-08-23
发明人: Roi Nathan , Tal Rosenwein , Nir Sancho , Yonatan Wexler , Amnon Shashua
IPC分类号: G06F3/16 , G10L15/22 , G10L15/30 , G10L25/78 , G06V20/50 , G06V40/16 , G10L15/25 , G06V20/62 , G06V40/20 , H04R1/08 , H04R3/00 , G06V30/19
摘要: A method for adjusting an audio transmission when a user of the system is being spoken to by another person includes receiving audio signals representative of sounds from an environment of the user captured by at least one microphone; determining at least from the received audio signals that the another person is speaking to user; and subject to the user being spoken to by the another person, adjusting the audio transmission to the user and signaling to the user that the user is being spoken to.
-
公开(公告)号:US11594244B2
公开(公告)日:2023-02-28
申请号:US16871587
申请日:2020-05-11
IPC分类号: G10L25/78 , G10L15/06 , G10L15/16 , G06N3/08 , G06N20/10 , G10L15/02 , G10L25/87 , G10L25/84
摘要: A voice event detection apparatus is disclosed. The apparatus comprises a vibration to digital converter and a computing unit. The vibration to digital converter is configured to convert an input audio signal into vibration data. The computing unit is configured to trigger a downstream module according to a sum of vibration counts of the vibration data for a number X of frames. In an embodiment, the voice event detection apparatus is capable of correctly distinguishing a wake phoneme from the input vibration data so as to trigger a downstream module of a computing system. Thus, the power consumption of the computing system is saved.
-
-
-
-
-
-
-
-
-