专利检索 ipc:"G10L15/20" 第 1 页

1.

发明公开
RECOGNITION OR SYNTHESIS OF HUMAN-UTTERED HARMONIC SOUNDS 审中-公开

公开(公告)号：US20240363105A1

公开(公告)日：2024-10-31

申请号：US18290574

申请日：2022-05-13

申请人： Boris Fridman-Mintz

发明人： Boris Fridman-Mintz

IPC分类号： G10L15/187 , G10L15/20 , G10L21/0232 , G10L21/0264 , G10L25/78

CPC分类号： G10L15/187 , G10L15/20 , G10L21/0232 , G10L21/0264 , G10L25/78 , G10L2025/783

摘要： Within each harmonic spectrum of a sequence of spectra derived from analysis of a waveform representing human speech are identified two or more fundamental or harmonic components that have frequencies that are separated by integer multiples of a fundamental acoustic frequency. The highest harmonic frequency that is also greater than 410 Hz is a primary cap frequency, which is used to select a primary phonetic note that corresponds to a subset of phonetic chords from a set of phonetic chords for which acoustic spectral is available. The spectral data can also include frequencies for primary band, secondary band (or secondary note), basal band, or reduced basal band acoustic components, which can be used to select a phonetic chord from the subset of phonetic chords corresponding to the selected primary note.

2.

发明授权
Occupancy tracking using environmental information 有权

公开(公告)号：US12104815B2

公开(公告)日：2024-10-01

申请号：US17139250

申请日：2020-12-31

申请人： Lennox Industries Inc.

发明人： Sunil Bondalapati , Prasad Mecheri Chandravihar , Bhavana Chadive , F N U Kriti

IPC分类号： F24F11/63 , F24F120/10 , G05B13/02 , G05B13/04 , G10L15/20 , H04B17/318 , H04L67/12

CPC分类号： F24F11/63 , G05B13/0265 , G05B13/048 , G10L15/20 , H04B17/318 , H04L67/12 , F24F2120/10

摘要： An occupancy tracking device configured to receive sound samples, to identify voices within the sound samples, and to determine a first occupancy level based on the identified voices. The device is further configured to identify user devices connected to an access point and to determine a second occupancy level based on the user devices that are connected to the access point. The device is further configured to measure a signal strength of a network connection with the access point and to determine a third occupancy level based on the signal strength of the network connection with the access point. The device is further configured to determine a predicted occupancy level based on the first occupancy level, the second occupancy level, and the third occupancy level and to control a Heating, Ventilation, and Air Conditioning (HVAC) system based on the predicted occupancy level.

3.

发明授权
Methods and devices for selectively ignoring captured audio data 有权

公开(公告)号：US12094455B2

公开(公告)日：2024-09-17

申请号：US18242860

申请日：2023-09-06

申请人： Amazon Technologies, Inc.

发明人： James David Meyers , Kurt Wesley Piersol

IPC分类号： G10L15/08 , G10L15/04 , G10L15/20 , G10L15/22 , G10L21/0208 , G10L21/028

CPC分类号： G10L15/08 , G10L15/04 , G10L15/20 , G10L21/028 , G10L2015/088 , G10L15/22 , G10L2021/02082

摘要： Systems and methods for selectively ignoring an occurrence of a wakeword within audio input data is provided herein. In some embodiments, a wakeword may be detected to have been uttered by an individual within a modified time window, which may account for hardware delays and echoing offsets. The detected wakeword that occurs during this modified time window may, in some embodiments, correspond to a word included within audio that is outputted by a voice activated electronic device. This may cause the voice activated electronic device to activate itself, stopping the audio from being outputted. By identifying when these occurrences of the wakeword within outputted audio are going to happen, the voice activated electronic device may selectively determine when to ignore the wakeword, and furthermore, when not to ignore the wakeword.

4.

发明公开
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION 审中-公开

公开(公告)号：US20240304187A1

公开(公告)日：2024-09-12

申请号：US18662334

申请日：2024-05-13

申请人： GOOGLE LLC

发明人： Christopher Hughes , Yiteng Huang , Turaj Zakizadeh Shabestary , Taylor Applebaum

IPC分类号： G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0216 , G10L21/0232 , G10L25/84

CPC分类号： G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84 , G10L2015/025 , G10L2015/088 , G10L2015/223 , G10L2021/02166

摘要： Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

5.

发明授权
Pre-conditioning audio for echo cancellation in machine perception 有权

公开(公告)号：US12080317B2

公开(公告)日：2024-09-03

申请号：US17639317

申请日：2020-08-27

申请人： DOLBY LABORATORIES LICENSING CORPORATION

发明人： Hadis Nosrati , Glenn N. Dickins , Nicholas Luke Appleton

IPC分类号： G10L15/20 , G10L21/02 , G10L21/0208 , G10L21/0316

CPC分类号： G10L21/0316 , G10L15/20 , G10L21/0208 , G10L2021/02082

摘要： An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.

6.

发明公开
INFORMATION PROVIDING METHOD 审中-公开

公开(公告)号：US20240249744A1

公开(公告)日：2024-07-25

申请号：US18625507

申请日：2024-04-03

申请人： Panasonic Intellectual Property Corporation of America

发明人： Masaki Yamauchi , Nanami FUJIWARA

IPC分类号： G10L25/78 , G10L15/20 , G10L15/22

CPC分类号： G10L25/78 , G10L15/20 , G10L15/22 , G10L2015/228 , G10L2025/783

摘要： An information providing method includes: generating first information indicating that a friendly gathering is occurring in a home when (i) a threshold amount of time or longer has elapsed from a start time of food preparation by a user and (ii) the volume of sound in a dining space is a first threshold volume or greater; obtaining, from a second information processing apparatus connected to a first information processing apparatus, information indicating first request content over a network; and when content of the first information is included in the first request content, outputting, to the second information processing apparatus, second information including information for identifying the user or the home, using the first information generated.

7.

发明授权
Heliumspeech unscrambling method and system for saturation diving based on multi-objective optimization 有权

公开(公告)号：US12039988B1

公开(公告)日：2024-07-16

申请号：US18424695

申请日：2024-01-26

申请人： Nantong University

发明人： Shibing Zhang , Jianrong Wu

IPC分类号： G10L21/02 , G10L15/06 , G10L15/20 , G10L25/51

CPC分类号： G10L21/02 , G10L15/063 , G10L15/20 , G10L25/51 , G10L2015/0631

摘要： The present application discloses a method and a system for saturation diving heliumspeech unscrambling based on multi-objective optimization. In a system including a diver and a filter at least, a working language phonetic symbol library and a common working word library for divers are constructed. The divers read them one by one, and a phonetic symbol standard speech library, a phonetic symbol heliumspeech library and a common working word speech library are generated. The filter uses the multi-objective optimization algorithm to design its impulse response coefficients, corrects and unscrambles the tagged and sampled heliumspeech signal word by word, and continuously updates the impulse response coefficients to complete the perfect heliumspeech unscrambling.

8.

发明授权
Voice processing method, electronic device, and storage medium 有权

公开(公告)号：US12014730B2

公开(公告)日：2024-06-18

申请号：US17322238

申请日：2021-05-17

申请人： BEIJING XIAOMI MOBILE SOFTWARE CO., LTD.

发明人： Xiangyan Xu

IPC分类号： G10L15/20 , G10L15/02

CPC分类号： G10L15/20 , G10L15/02 , G10L2015/025

摘要： A voice processing method includes: collecting a voice signal by a microphone of an electronic device, and signal-processing the collected voice signal to obtain a first voice frame segment; performing voice recognition on the first voice frame segment to obtain a first recognition result; in response to the first recognition result not matching a target content and a plurality of tokens in the first recognition result meeting a preset condition, performing frame compensation on the first voice frame segment to obtain a second voice frame segment; and performing voice recognition on the second voice frame segment to obtain a second recognition result. A matching degree between the second recognition result and the target content is greater than a matching degree between the first recognition result and the target content.

9.

发明公开
AUTHENTICATION OF IMPAIRED VOICES 审中-公开

公开(公告)号：US20240194195A1

公开(公告)日：2024-06-13

申请号：US18581960

申请日：2024-02-20

申请人： Wells Fargo Bank, N.A.

发明人： Andrew J. Garner, IV , Tyua Larsen Fraser , Kimberly Ann Maclnnis , Paul R. McMahon , Darrell Lee Suen , Zhong Wan

IPC分类号： G10L15/20 , G06F21/32 , G10L17/08 , G10L17/20 , G10L17/22 , G10L17/24

CPC分类号： G10L15/20 , G06F21/32 , G10L17/20 , G10L17/22 , G10L17/24 , G10L17/08

摘要： Systems and techniques for are described herein. A voice profile may be generated for a user. An audio stream may be received including an authentication voice of the user. It may be determined that the authentication voice does not match a first set of authentication criteria. The audio stream may be compared to a second set of authentication criteria. The user may be authenticated based on the comparison.

10.

发明授权
Mixed speech recognition method and apparatus, and computer-readable storage medium 有权

公开(公告)号：US11996091B2

公开(公告)日：2024-05-28

申请号：US16989844

申请日：2020-08-10

申请人： TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

发明人： Jun Wang , Jie Chen , Dan Su , Dong Yu

IPC分类号： G10L15/20 , G10L15/02 , G10L15/16 , G10L15/22 , G10L17/06 , G10L21/02 , G10L21/0272 , G10L21/0208

CPC分类号： G10L15/20 , G10L15/02 , G10L15/16 , G10L15/22 , G10L17/06 , G10L21/02 , G10L21/0272 , G10L2015/223 , G10L2021/02087

摘要： A mixed speech recognition method, a mixed speech recognition apparatus, and a computer-readable storage medium are provided. The mixed speech recognition method includes: monitoring an input of speech input and detecting an enrollment speech and a mixed speech; acquiring speech features of a target speaker based on the enrollment speech; and determining speech belonging to the target speaker in the mixed speech based on the speech features of the target speaker. The enrollment speech includes preset speech information, and the mixed speech is non-enrollment speech inputted after the enrollment speech.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类