Patent search: ipc:G10L25/78, page 3

21.

Invention publication
TONE AND ECHO CANCELLATION USING TWO ACOUSTIC SOUND CANCELLERS (status: under examination, published)

Publication No.: US20240203415A1

Publication date: 2024-06-20

Application No.: US18471693

Filing date: 2023-09-21

Applicant: Sonos, Inc.

Inventor(s): Saeed Bagheri Sereshki

IPC classification: G10L15/22, G10K11/178, G10L15/08, G10L21/0208, G10L21/0232, G10L25/78, H04M3/53, H04S7/00

CPC classification: G10L15/22, G10K11/1785, G10L15/08, G10L21/0208, G10L21/0232, G10L25/78, G10L2015/088, G10L2015/223, G10L2021/02085, H04M3/53, H04S7/301

Abstract: Example techniques involve systems with multiple acoustic echo cancellers. An example implementation captures first audio within an acoustic environment and detects, within the captured first audio content, a wake-word. In response to the wake-word and before playing an acknowledgement tone, the implementation activates (a) a first sound canceller when one or more speakers are playing back audio content or (b) a second sound canceller when the one or more speakers are idle. In response to the wake-word and after activating either (a) the first sound canceller or (b) the second sound canceller, the implementation outputs the acknowledgement tone via the one or more speakers. The implementation captures second audio within the acoustic environment and cancels the acoustic echo of the acknowledgement tone from the captured second audio using the activated sound canceller.
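
The ordering in this abstract (select a canceller from the playback state, activate it, then play the tone) can be sketched as follows. This is a minimal illustration only: the `Device` class, its `activate` and `play_tone` methods, and the `"first"`/`"second"` labels are hypothetical names that do not come from the patent, and no actual echo cancellation is performed.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Device:
    """Hypothetical stand-in for a playback device with two sound cancellers."""
    playing_audio: bool
    log: List[str] = field(default_factory=list)

    def activate(self, canceller: str) -> None:
        # Record the activation; a real device would start the canceller here.
        self.log.append(f"activate:{canceller}")

    def play_tone(self) -> None:
        # Record the acknowledgement tone being output.
        self.log.append("play_tone")


def on_wake_word(device: Device) -> str:
    """Pick a canceller from the playback state, activate it, then play the tone."""
    # First canceller while audio content is playing, second when speakers are idle.
    canceller = "first" if device.playing_audio else "second"
    device.activate(canceller)  # activation happens before the tone is output
    device.play_tone()
    return canceller
```

The log makes the ordering explicit: the selected canceller is always activated before the tone plays, so it is already running when the tone's echo reaches the microphones.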

22.

Granted invention
Method for reducing occlusion effect of earphone, and related apparatus (status: in force)

Publication No.: US12014716B2

Publication date: 2024-06-18

Application No.: US17853471

Filing date: 2022-06-29

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor(s): Jingfan Qin, Fan Fan, Yulong Li, Xiaowei Yu, Xiaohong Yang, Yangshan Ou

IPC classification: G10K11/178, G10L25/78, H04R1/10, H04R1/40, H04R3/00

CPC classification: G10K11/17827, G10K11/17823, G10K11/17825, G10K11/17854, G10K11/17881, G10L25/78, H04R1/1083, H04R1/406, H04R3/005, G10K2210/1081, G10K2210/3026, G10K2210/3027, G10K2210/3028, G10K2210/3044, G10K2210/3056, H04R2460/01, H04R2460/13

Abstract: This application discloses a method for reducing an occlusion effect of an earphone, and a related apparatus. The method is applied to an earphone having at least one microphone and a speaker. The method includes: detecting an occurrence of at least one of the following events: a user speaks, and the user is in motion; and triggering at least one of the following operations in response to the at least one event: processing the user's sound signal based on the at least one microphone to suppress an occlusion effect of the earphone, and playing audio by using the speaker to mask a sound signal in the user's auditory canal. Embodiments of this application can reduce or even eliminate the earphone occlusion effect, to improve user experience.

23.

Granted invention
Microphone having a digital output determined at different power consumption levels (status: in force)

Publication No.: US12010488B2

Publication date: 2024-06-11

Application No.: US18126938

Filing date: 2023-03-27

Applicant: Qualcomm Technologies Inc.

Inventor(s): Robert J. Littrell

IPC classification: H04R29/00, G10L25/18, G10L25/21, G10L25/78, H04R17/02, H04R19/04

CPC classification: H04R29/004, G10L25/18, G10L25/21, G10L25/78, H04R17/02, H04R19/04, H04R2201/003

Abstract: An acoustic device is described that includes an acoustic sensor element configured to sense acoustic energy and produce an output signal, and a threshold detector circuit including a switch having an input coupled to the output of the acoustic sensor element to receive the output signal, a control port that receives a control signal, and first and second output ports; a first channel including a first analog-to-digital converter that operates at a first power level; a second analog-to-digital converter that operates at a second, higher power level relative to the first power level; and a threshold level detector that receives an output from the first analog-to-digital converter to produce the control signal, the control signal having a first state that causes the switch to feed the output signal from the acoustic sensor element to the second analog-to-digital converter when the first digitized output signal meets a threshold criterion.
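
As a rough software analogy of the gating described here, the sketch below derives a per-sample control state from the low-power converter's output. The abstract does not say what the threshold criterion is, so the magnitude comparison, the function name `control_states`, and the integer sample type are all assumptions made for illustration.

```python
from typing import List, Sequence


def control_states(low_power_samples: Sequence[int], threshold: int) -> List[bool]:
    """Return True wherever the low-power ADC output meets the threshold.

    True models the control state in which the switch would feed the sensor
    signal to the higher-power ADC. The magnitude test is an assumed criterion.
    """
    return [abs(sample) >= threshold for sample in low_power_samples]
```

For example, `control_states([1, -5, 3, 8], 4)` marks the second and fourth samples as the ones that would route the signal to the higher-power path.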

24.

Granted invention
Privacy device for smart speakers (status: in force)

Publication No.: US12010487B2

Publication date: 2024-06-11

Application No.: US17750598

Filing date: 2022-05-23

Applicant: Thomas Stachura

Inventor(s): Thomas Stachura

IPC classification: G10L15/18, G06F3/01, G10L15/08, G10L15/22, G10L15/30, G10L17/24, G10L25/51, G10L25/78, H04R3/00, H04R5/04, H04R29/00

CPC classification: H04R29/004, G06F3/011, G06F3/017, G10L15/08, G10L15/18, G10L15/22, G10L15/30, G10L17/24, G10L25/51, G10L25/78, H04R3/005, H04R5/04, G10L2015/088, G10L2015/223, G10L2025/783, H04R2420/00, H04R2420/01, H04R2499/11

Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.

25.

Invention publication
METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR SPEECH SYNTHESIS (status: under examination, published)

Publication No.: US20240185829A1

Publication date: 2024-06-06

Application No.: US17987034

Filing date: 2022-11-15

Applicant: Dell Products L.P.

Inventor(s): Zijia Wang, Zhisong Liu, Zhen Jia

IPC classification: G10L13/02, G10L25/78

CPC classification: G10L13/02, G10L25/78

Abstract: Embodiments of the present disclosure provide a method, an electronic device, and a computer program product for speech synthesis. The method for speech synthesis includes: extracting a plurality of voice feature vectors of a plurality of speakers from a plurality of audio recordings corresponding to the plurality of speakers; calculating a first loss function based on distances between the plurality of voice feature vectors of the plurality of speakers; calculating a second loss function according to a plurality of texts and a plurality of corresponding real audio recordings; and generating a speech synthesis model based on the first loss function and the second loss function. By implementing the method, the speech synthesis model can be optimized and trained, so that high-quality audio with target voice features can be output based on the texts.

26.

Granted invention
Speech recognition systems and methods (status: in force)

Publication No.: US12002450B2

Publication date: 2024-06-04

Application No.: US17183743

Filing date: 2021-02-24

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventor(s): Mohan Li, Tudor-Catalin Zorila, Rama Sanand Doddipatla

IPC classification: G10L15/00, G06N3/047, G06N3/08, G10L19/02, G10L25/78

CPC classification: G10L15/00, G06N3/047, G06N3/08, G10L19/02, G10L25/78, G10L2025/783

Abstract: A computer-implemented method for speech recognition, comprising receiving a frame of speech audio; encoding the frame of speech audio; calculating a halting probability based on the frame of speech audio; adding the halting probability to a first accumulator variable; in response to the first accumulator variable exceeding or reaching a first threshold, calculating a context vector based on the halting probability and the encoding of the frame of speech audio; performing a decoding step using the context vector to derive a token; and executing a function based on the derived token, wherein the executed function comprises at least one of text output or command performance.
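
The accumulate-then-fire step in this abstract can be illustrated with a small sketch. The abstract says only that the context vector is based on the halting probability and the frame encoding, so the probability-weighted sum used here, the reset of the accumulator after each firing, and the name `accumulate_and_fire` are assumptions for illustration, not the patented method. The encoder and decoder are omitted; halting probabilities and encodings are passed in directly.

```python
from typing import List, Sequence, Tuple


def accumulate_and_fire(
    halting_probs: Sequence[float],
    encodings: Sequence[Sequence[float]],
    threshold: float = 1.0,
) -> List[Tuple[int, List[float]]]:
    """Return (frame_index, context_vector) each time the accumulator reaches the threshold."""
    fired: List[Tuple[int, List[float]]] = []
    dim = len(encodings[0]) if encodings else 0
    acc = 0.0
    context = [0.0] * dim
    for i, (p, enc) in enumerate(zip(halting_probs, encodings)):
        acc += p  # add the halting probability to the accumulator
        # Assumed form: context is the halting-probability-weighted sum of encodings.
        context = [c + p * e for c, e in zip(context, enc)]
        if acc >= threshold:  # "exceeding or reaching" the threshold
            fired.append((i, context))
            # Assumed: start a fresh accumulation for the next decoding step.
            acc = 0.0
            context = [0.0] * dim
    return fired
```

Each returned context vector would then be handed to a decoding step that derives a token; frames that do not push the accumulator to the threshold produce no decoding step.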

27.

Granted invention
Motor vehicle artificial intelligence expert system dangerous driving warning and control system and method (status: in force)

Publication No.: US11999296B2

Publication date: 2024-06-04

Application No.: US18537724

Filing date: 2023-12-12

Applicant: Robert D. Pedersen

Inventor(s): Robert D. Pedersen

IPC classification: B60Q9/00, G06N5/02, G06N5/048, G06V20/56, G06V20/59, G08G1/00, G08G1/01, G08G1/048, G08G1/0967, G08G1/16, G10L15/22, G10L15/26, G10L21/0232, G10L25/78, H04B5/26, H04B5/77, H04M1/72454, H04M1/72463, H04R1/40, H04R3/00, H04W4/02, H04W4/40, H04W4/80, H04W4/90, G10L21/0216, H04B5/73, H04B7/06

CPC classification: B60Q9/008, G06N5/02, G06N5/048, G06V20/56, G06V20/597, G08G1/0116, G08G1/012, G08G1/0129, G08G1/0141, G08G1/048, G08G1/096716, G08G1/096741, G08G1/096775, G08G1/096783, G08G1/166, G08G1/167, G08G1/205, G10L15/22, G10L15/26, G10L21/0232, G10L25/78, H04B5/26, H04B5/77, H04M1/72454, H04M1/72463, H04R1/406, H04R3/005, H04W4/023, H04W4/40, H04W4/80, H04W4/90, G10L2021/02166, H04B5/73, H04B7/0617, H04R2201/403, H04R2499/13

Abstract: Specifically programmed, integrated motor vehicle dangerous driving warning and control system and methods comprising at least one specialized communication computer machine including electronic artificial intelligence expert system decision-making capability; further comprising one or more motor vehicle electronic sensors for monitoring the motor vehicle and for monitoring activities of the driver and/or passengers, including activities related to the use of cellular telephones and/or other wireless communication devices; and further comprising electronic communications transceiver assemblies for communications with external sensor networks for monitoring dangerous driving situations, weather conditions, roadway conditions, pedestrian congestion, and motor vehicle traffic congestion conditions, to derive warning and/or control signals for warning the driver of dangerous driving situations and/or for controlling the motor vehicle driver's use of a cellular telephone and/or other wireless communication devices.

28.

Invention publication
EMBEDDED AUDIO SENSOR SYSTEM AND METHODS (status: under examination, published)

Publication No.: US20240177731A1

Publication date: 2024-05-30

Application No.: US18432139

Filing date: 2024-02-05

Applicant: CELLULAR SOUTH, INC. DBA C SPIRE WIRELESS

Inventor(s): Brett Rogers, Tommy Naugle, Stephen Bye, Craig Sparks, Arman Kirakosyan

IPC classification: G10L25/78, G06N5/04, G06N7/01, G16Y20/10, G16Y40/10

CPC classification: G10L25/78, G06N5/04, G06N7/01, G16Y20/10, G16Y40/10

Abstract: An embedded sensor can include an audio detector, a digital signal processor (DSP), a library, and a rules engine. The digital signal processor can be configured to receive signals from the audio detector and to identify the environment in which the embedded sensor is located. The library can store statistical models associated with specific environments, and the digital signal processor can be configured to identify specific events based on detected sounds within the particular environment by utilizing the statistical model associated with the particular environment. The DSP can associate a probability of accuracy with the identified audible event. A rules engine can be configured to receive the probability and transmit a report of the detected audible event.

29.

Granted invention
Acoustic analysis of crowd sounds (status: in force)

Publication No.: US11996121B2

Publication date: 2024-05-28

Application No.: US17644363

Filing date: 2021-12-15

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor(s): Rachel Ostrand, Vagner Figueredo de Santana, Alecio Pedro Delazari Binotto

IPC classification: G10L25/78, G06N20/00, G10L25/21, G10L25/51, G10L25/93

CPC classification: G10L25/78, G06N20/00, G10L25/21, G10L25/51, G10L25/93, G10L2025/937

Abstract: A method, a computer system, and a computer program product for detecting face mask usage based on a crowd sound are provided. The present invention may include capturing an audio stream including crowd voice data. The present invention may also include analyzing the crowd voice data using a machine learning model to determine a number of people wearing masks. The present invention may further include, in response to determining that the number of people wearing masks does not meet a compliance threshold, displaying content to promote face mask usage.

30.

Granted invention
Noise cancellation for open microphone mode (status: in force)

Publication No.: US11996092B1

Publication date: 2024-05-28

Application No.: US17516227

Filing date: 2021-11-01

Applicant: Amazon Technologies, Inc.

Inventor(s): Ty Loren Carlson, Rohan Mutagi

IPC classification: G10L15/02, G10L15/20, G10L15/22, G10L15/26, G10L17/00, G10L21/0208, G10L21/0272, G10L25/78, G10L25/84, G10L25/87

CPC classification: G10L15/20, G10L15/22, G10L15/26, G10L17/00, G10L21/0208, G10L21/0272, G10L25/84, G10L2015/223, G10L2021/02087, G10L2025/783

Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
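
Of the end-of-speech cues this abstract lists, the simplest is a lack of speech for a period of time. The sketch below shows one way such a timeout could work, assuming a per-frame speech/no-speech flag is already available from some voice activity detector. The function name, the 20 ms frame length, and the 600 ms timeout are illustrative assumptions, not values from the patent.

```python
from typing import Iterable, Optional


def end_of_speech_frame(
    speech_flags: Iterable[bool],
    frame_ms: int = 20,
    silence_timeout_ms: int = 600,
) -> Optional[int]:
    """Return the index of the frame where trailing silence reaches the timeout, else None."""
    silent_ms = 0
    heard_speech = False
    for i, is_speech in enumerate(speech_flags):
        if is_speech:
            heard_speech = True
            silent_ms = 0  # any speech resets the silence timer
        elif heard_speech:
            # Count silence only after the user has started talking.
            silent_ms += frame_ms
            if silent_ms >= silence_timeout_ms:
                return i
    return None
```

A device following the abstract would switch from transmission mode to reception mode at the returned frame. The other cues mentioned (context change, noise cancellation) are not modeled here.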
