-
公开(公告)号:US20240203415A1
公开(公告)日:2024-06-20
申请号:US18471693
申请日:2023-09-21
申请人: Sonos, Inc.
IPC分类号: G10L15/22 , G10K11/178 , G10L15/08 , G10L21/0208 , G10L21/0232 , G10L25/78 , H04M3/53 , H04S7/00
CPC分类号: G10L15/22 , G10K11/1785 , G10L15/08 , G10L21/0208 , G10L21/0232 , G10L25/78 , G10L2015/088 , G10L2015/223 , G10L2021/02085 , H04M3/53 , H04S7/301
摘要: Example techniques involve systems with multiple acoustic echo cancellers. An example implementation captures first audio within an acoustic environment and detecting, within the captured first audio content, a wake-word. In response to the wake-word and before playing an acknowledgement tone, the implementation activates (a) a first sound canceller when one or more speakers are playing back audio content or (b) a second sound canceller when the one or more speakers are idle. In response to the wake-word and after activating either (a) the first sound canceller or (b) the second sound canceller, the implementation outputs the acknowledgement tone via the one or more speakers. The implementation captures second audio within the acoustic environment and cancelling the acoustic echo of the acknowledgement tone from the captured second audio using the activated sound canceller.
-
公开(公告)号:US12014716B2
公开(公告)日:2024-06-18
申请号:US17853471
申请日:2022-06-29
发明人: Jingfan Qin , Fan Fan , Yulong Li , Xiaowei Yu , Xiaohong Yang , Yangshan Ou
IPC分类号: G10K11/178 , G10L25/78 , H04R1/10 , H04R1/40 , H04R3/00
CPC分类号: G10K11/17827 , G10K11/17823 , G10K11/17825 , G10K11/17854 , G10K11/17881 , G10L25/78 , H04R1/1083 , H04R1/406 , H04R3/005 , G10K2210/1081 , G10K2210/3026 , G10K2210/3027 , G10K2210/3028 , G10K2210/3044 , G10K2210/3056 , H04R2460/01 , H04R2460/13
摘要: This application discloses a method for reducing an occlusion effect of an earphone, and a related apparatus. The method is applied to an earphone having at least one microphone and a speaker. The method includes: detecting an occurrence of at least one of the following events: a user speaks and the user is in motion; and triggering at least one of the following operations in response to the at least one event: processing the user's sound signal based on the at least one microphone to suppress an occlusion effect of the earphone, and playing an audio by using the speaker, to mask a sound signal in the user's auditory canal. Embodiments of this application can reduce or even eliminate the earphone occlusion effect, to improve user experience.
-
公开(公告)号:US12010488B2
公开(公告)日:2024-06-11
申请号:US18126938
申请日:2023-03-27
发明人: Robert J. Littrell
CPC分类号: H04R29/004 , G10L25/18 , G10L25/21 , G10L25/78 , H04R17/02 , H04R19/04 , H04R2201/003
摘要: An acoustic device is described and includes an acoustic sensor element configured to sense acoustic energy and produce an output signal and a threshold detector circuit including a switch having an input coupled to the output of the acoustic sensor element to receive the output signal, a control port that receives a control signal, and first and second output ports, a first channel including an analog-to-digital converter that operates at a first power level a second analog-to-digital converter that operates at a second higher power level, relative to the first power level and a threshold level detector that receives an output from the first analog-to-digital converter to produce the control signal having a first state that causes the switch feed the output signal from the acoustic sensor element to the second analog-to-digital converter when the first digitized output signal meets a threshold criteria.
-
公开(公告)号:US12010487B2
公开(公告)日:2024-06-11
申请号:US17750598
申请日:2022-05-23
申请人: Thomas Stachura
发明人: Thomas Stachura
IPC分类号: G10L15/18 , G06F3/01 , G10L15/08 , G10L15/22 , G10L15/30 , G10L17/24 , G10L25/51 , G10L25/78 , H04R3/00 , H04R5/04 , H04R29/00
CPC分类号: H04R29/004 , G06F3/011 , G06F3/017 , G10L15/08 , G10L15/18 , G10L15/22 , G10L15/30 , G10L17/24 , G10L25/51 , G10L25/78 , H04R3/005 , H04R5/04 , G10L2015/088 , G10L2015/223 , G10L2025/783 , H04R2420/00 , H04R2420/01 , H04R2499/11
摘要: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.
-
公开(公告)号:US20240185829A1
公开(公告)日:2024-06-06
申请号:US17987034
申请日:2022-11-15
申请人: Dell Products L.P.
发明人: Zijia Wang , Zhisong Liu , Zhen Jia
摘要: Embodiments of the present disclosure provide a method, an electronic device, and a computer program product for speech synthesis. The method for speech synthesis includes: extracting a plurality of voice feature vectors of a plurality of speakers from a plurality of audios corresponding to the plurality of speakers; calculating a first loss function based on distances between the plurality of voice feature vectors of the plurality of speakers; calculating a second loss function according to a plurality of texts and a plurality of corresponding real audios; and generating a speech synthesis model based on the first loss function and the second loss function. By implementing the method, the speech synthesis model can be optimized and trained, so that a high-quality audio with target voice features can be outputted based on the texts.
-
公开(公告)号:US12002450B2
公开(公告)日:2024-06-04
申请号:US17183743
申请日:2021-02-24
摘要: A computer-implemented method for speech recognition, comprising receiving a frame of speech audio; encoding the frame of speech audio; calculating a halting probability based on the frame of speech audio; adding the halting probability to a first accumulator variable; in response to the first accumulator variable exceeding or reaching a first threshold, calculating a context vector based on the halting probability and the encoding of the frame of speech audio; performing a decoding step using the context vector to derive a token; and executing a function based on the derived token, wherein the executed function comprises at least one of text output or command performance.
-
27.
公开(公告)号:US11999296B2
公开(公告)日:2024-06-04
申请号:US18537724
申请日:2023-12-12
申请人: Robert D. Pedersen
发明人: Robert D. Pedersen
IPC分类号: B60Q9/00 , G06N5/02 , G06N5/048 , G06V20/56 , G06V20/59 , G08G1/00 , G08G1/01 , G08G1/048 , G08G1/0967 , G08G1/16 , G10L15/22 , G10L15/26 , G10L21/0232 , G10L25/78 , H04B5/26 , H04B5/77 , H04M1/72454 , H04M1/72463 , H04R1/40 , H04R3/00 , H04W4/02 , H04W4/40 , H04W4/80 , H04W4/90 , G10L21/0216 , H04B5/73 , H04B7/06
CPC分类号: B60Q9/008 , G06N5/02 , G06N5/048 , G06V20/56 , G06V20/597 , G08G1/0116 , G08G1/012 , G08G1/0129 , G08G1/0141 , G08G1/048 , G08G1/096716 , G08G1/096741 , G08G1/096775 , G08G1/096783 , G08G1/166 , G08G1/167 , G08G1/205 , G10L15/22 , G10L15/26 , G10L21/0232 , G10L25/78 , H04B5/26 , H04B5/77 , H04M1/72454 , H04M1/72463 , H04R1/406 , H04R3/005 , H04W4/023 , H04W4/40 , H04W4/80 , H04W4/90 , G10L2021/02166 , H04B5/73 , H04B7/0617 , H04R2201/403 , H04R2499/13
摘要: Specifically programmed, integrated motor vehicle dangerous driving warning and control system and methods comprising at least one specialized communication computer machine including electronic artificial intelligence expert system decision making capability further comprising one or more motor vehicle electronic sensors for monitoring the motor vehicle and for monitoring activities of the driver and/or passengers including activities related to the use of cellular telephones and/or other wireless communication devices and further comprising electronic communications transceiver assemblies for communications with external sensor networks for monitoring dangerous driving situations, weather conditions, roadway conditions, pedestrian congestion and motor vehicle traffic congestion conditions to derive warning and/or control signals for warning the driver of dangerous driving situations and/or for controlling the motor vehicle driver use of a cellular telephone and/or other wireless communication devices.
-
公开(公告)号:US20240177731A1
公开(公告)日:2024-05-30
申请号:US18432139
申请日:2024-02-05
发明人: Brett Rogers , Tommy Naugle , Stephen Bye , Craig Sparks , Arman Kirakosyan
摘要: An embedded sensor can include an audio detector, a digital signal processor, a library, and a rules engine. The digital signal processor can be configured to receive signals from the audio detector and to identify the environment in which the embedded sensor is located. The library can store statistical models associated with specific environments, and the digital signal processor can be configured identify specific events based on detected sounds within the particular environment by utilizing the statistical model associated with the particular environment. The DSP can associate a probability of accuracy for the identified audible event. A rules engine can be configured to receive the probability and transmit a report of the detected audible event.
-
公开(公告)号:US11996121B2
公开(公告)日:2024-05-28
申请号:US17644363
申请日:2021-12-15
摘要: A method, computer system, and a computer program product for detecting face mask usage based on a crowd sound is provided. The present invention may include capturing an audio stream including a crowd voice data. The present invention may also include analyzing the crowd voice data using a machine learning model to determine an amount of people wearing masks. The present invention may further include in response to determining that the amount of people wearing masks does not meet a compliance threshold, displaying a content to promote face mask usage.
-
公开(公告)号:US11996092B1
公开(公告)日:2024-05-28
申请号:US17516227
申请日:2021-11-01
发明人: Ty Loren Carlson , Rohan Mutagi
IPC分类号: G10L15/02 , G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/78 , G10L25/84 , G10L25/87
CPC分类号: G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/84 , G10L2015/223 , G10L2021/02087 , G10L2025/783
摘要: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
-
-
-
-
-
-
-
-
-