专利检索 ipc:G10L25/00 第 3 页

21.

发明授权
Method and system for ordering content using a voice menu system 有权

公开(公告)号：US10827066B2

公开(公告)日：2020-11-03

申请号：US12200905

申请日：2008-08-28

申请人： Alistair E. Jeffs

发明人： Alistair E. Jeffs

IPC分类号： G10L21/00 , G10L25/00 , H04M3/493 , G06F16/70 , G06F16/432

摘要： A method and system for ordering content includes a voice menu system and a phone device communicating a phone signal to the voice menu system. The voice menu system determines the phone number associated with the phone device through the phone signal and generates a voice prompt for recording a content selection from the voice menu system. The phone device selects a recording content option. The voice menu system generates prompts for determining a content title. The phone device selects a content title by communicating a selection signal to the voice menu system. The voice menu system enables a content recording at a recording device in response to the selection signal.

22.

发明授权
Development of voice and other interaction applications 有权

公开(公告)号：US10762890B1

公开(公告)日：2020-09-01

申请号：US16544508

申请日：2019-08-19

申请人： Voicify, LLC

发明人： Jeffrey K. McMahon , Robert T. Naughton , Nicholas G. Laidlaw , Alexander M. Dunn , Jason Green

IPC分类号： G10L21/00 , G10L25/00 , G10L21/06 , H04M3/493 , G10L13/08 , G06F3/0481 , G10L13/033 , G10L13/047

摘要： Among other things, a developer of an interaction application for an enterprise can create items of content to be provided to an assistant platform for use in responses to requests of end-users. The developer can deploy the interaction application using defined items of content and an available general interaction model including intents and sample utterances having slots. The developer can deploy the interaction application without requiring the developer to formulate any of the intents, sample utterances, or slots of the general interaction model.

23.

发明授权
Speech dialogue device and speech dialogue method 有权

公开(公告)号：US10706853B2

公开(公告)日：2020-07-07

申请号：US15763322

申请日：2015-11-25

申请人： Mitsubishi Electric Corporation

发明人： Naoya Baba , Yuki Furumoto , Masanobu Osawa , Takumi Takei

IPC分类号： G10L15/00 , G10L15/26 , G10L13/00 , G10L13/08 , G10L21/00 , G10L25/00 , G10L15/32 , G10L15/22 , G10L15/30 , G10L17/00 , G10L17/22

摘要： A correspondence relationship between keywords for instructing the start of a speech dialogue and modes of a response is defined in a response-mode correspondence table. A response-mode selecting unit selects a mode of a response corresponding to a keyword included in the recognition result of a speech recognition unit using the response-mode correspondence table. A dialogue controlling unit starts the speech dialogue when the keyword is included in the recognition result of the speech recognition unit, determines a response in accordance with the subsequent recognition result from the speech recognition unit, and controls a mode of the response in such a manner as to match the mode selected by the response-mode selecting unit. A speech output controlling unit generates speech data on the basis of the response and mode controlled by the dialogue controlling unit and outputs the speech data to a speaker.

24.

发明授权
User voice activity detection methods, devices, assemblies, and components 有权

公开(公告)号：US10564925B2

公开(公告)日：2020-02-18

申请号：US15711793

申请日：2017-09-21

申请人： Avnera Corporation

发明人： Jiajin An , Michael Jon Wurtz , David Wurtz , Manpreet Khaira , Amit Kumar , Shawn O'Connor , Shankar Rathoud , James Scanlan , Eric Sorensen

IPC分类号： G10L25/00 , G06F3/16 , G10L25/84 , H04R1/10 , H04R3/00 , G10L15/08 , G10L15/22 , H04R1/40

摘要： Many headsets include automatic noise cancellation (ANC) which dramatically reduces perceived background noise and improves user listening experience. Unfortunately, the voice microphones in these devices often capture ambient noise that the headsets output during phone calls or other communication sessions to other users. In response, many headsets and communication devices provide manual muting circuitry, but users frequently forget to turn the muting on and/or off, creating further problems as they communicate. To address this, the present inventors devised, among other things, an exemplary headset that detects the absence or presence of user speech, automatically muting and unmuting the voice microphone without user intervention. Some embodiments leverage relationships between feedback and feedforward signals in ANC circuitry to detect user speech, avoiding the addition of extra hardware to the headset. Other embodiments also leverage the speech detection function to activate and deactivate keyword detectors, and/or sidetone circuits, thus extending battery.

25.

发明授权
Very short pitch detection and coding 有权

公开(公告)号：US10482892B2

公开(公告)日：2019-11-19

申请号：US15662302

申请日：2017-07-28

申请人： HUAWEI TECHNOLOGIES CO.,LTD.

发明人： Yang Gao , Fengyan Qi

IPC分类号： G10L25/00 , G10L21/003 , G10L25/90 , G10L25/21 , G10L25/06 , G10L19/00 , G10L19/09

摘要： System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.

26.

发明授权
Multi-lingual semantic parser based on transferred learning 有权

公开(公告)号：US10460036B2

公开(公告)日：2019-10-29

申请号：US15959833

申请日：2018-04-23

申请人： VoiceBox Technologies Corporation

发明人： Long Duong , Hadi Afshar , Dominique Estival , Glen Pink , Philip Cohen , Mark Edward Johnson

IPC分类号： G06F17/20 , G06F17/27 , G06F17/28 , G10L21/00 , G10L25/00

摘要： The disclosure relates to transferred learning from a first language (e.g., a source language for which a semantic parser has been defined) to a second language (e.g., a target language for which a semantic parser has not been defined). A system may use knowledge from a trained model in one language to model another language. For example, the system may transfer knowledge of a semantic parser from a first (e.g., source) language to a second (e.g., target) language. Such transfer of knowledge may occur and be useful when the first language has sufficient training data but the second language has insufficient training data. The foregoing transfer of knowledge may extend the semantic parser for multiple languages (e.g., the first language and the second language).

27.

发明授权
De-reverberation control method and device of sound producing equipment 有权

公开(公告)号：US10410651B2

公开(公告)日：2019-09-10

申请号：US15849091

申请日：2017-12-20

申请人： Beijing Xiaoniao Tingting Technology Co., LTD.

发明人： Shasha Lou , Bo Li

IPC分类号： G10L21/00 , G10L25/00 , G10L15/00 , G10L21/0208 , G10L21/02 , H04R1/32 , G10L21/0216 , G10L15/22

摘要： A de-reverberation control method and device of sound producing equipment are disclosed. The method includes that: when a piece of equipment performs audio playing, a voice signal from a user is collected in real time; a relative position of the user with respect to the equipment and acoustic parameters of a room environment in which the equipment is located, are acquired; according to one or more of the relative position and the acoustic parameters, a corresponding microphone in the equipment is selected, and a corresponding voice enhancement mode is called to perform de-reverberation; a voice command word from the user is acquired to control the equipment to perform a corresponding function, as a respond to the user. The present solution can improve the recognition accuracy of a voice command, and improve user interaction experience.

28.

发明授权
Audio processing techniques for semantic audio recognition and report generation 有权

公开(公告)号：US10366685B2

公开(公告)日：2019-07-30

申请号：US15728775

申请日：2017-10-10

申请人： The Nielsen Company (US), LLC

发明人： Alan Neuhauser , John Stavropoulos

IPC分类号： G10L25/00 , G10H1/40 , G06F17/28 , G10L19/018 , G10L15/18

摘要： Example apparatus, articles of manufacture and methods to determine semantic audio information for audio are disclosed. Example methods include extracting a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature. Example methods also include comparing the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith. Example methods further include determining a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio.

29.

发明授权
Voiceprint authentication method and apparatus 有权

公开(公告)号：US10325603B2

公开(公告)日：2019-06-18

申请号：US14757928

申请日：2015-12-23

申请人： BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

发明人： Chao Li , Yong Guan

IPC分类号： G10L15/00 , G10L21/00 , G10L25/00 , G10L17/00 , G10L15/04 , G10L15/06 , G10L15/26 , G10L13/00 , G10L13/08 , G06F7/04 , G06F15/16 , G06F17/30 , G10L17/24 , G10L17/04 , G10L17/08 , G10L17/14

摘要： The present disclosure provides a voiceprint authentication method and a voiceprint authentication apparatus. The method includes: displaying a tip text to a user, the tip text being a combination of a preregistered phrase; obtaining a speech of the tip text read by the user; obtaining a pre-established registration model and determining a result of a voiceprint authentication according to the speech of the tip text and the pre-established registration model, if the speech of the tip text corresponds to the tip text.

30.

发明授权
Enhanced voice recognition task completion 有权

公开(公告)号：US10325592B2

公开(公告)日：2019-06-18

申请号：US15433754

申请日：2017-02-15

申请人： GM GLOBAL TECHNOLOGY OPERATIONS LLC

发明人： Gaurav Talwar , Xu Fang Zhao , Md Foezur Rahman Chowdhury

IPC分类号： G10L15/00 , G10L15/26 , G10L25/00 , G10L15/22 , B60R11/02 , G01C21/36 , G07C5/00 , G10L15/06 , G10L25/51 , G10L15/32 , G10L15/30

摘要： A method for recognizing speech in a vehicle includes receiving speech at a microphone installed to a vehicle, and determining whether the speech includes a navigation instruction. If the speech includes a navigation instruction, the speech may be sent to a remote facility. After sending the speech to the remote facility, a local speech recognition result is provided in the vehicle to the user. The speech sent to the remote facility may be used to provide corrective action. A system for recognizing speech in a vehicle may include a microphone, and may be configured to determine a local speech recognition result from the speech command and determine when the speech command includes a navigation instruction. The system may further include a remote server in communication with the vehicle that receives a sample of the speech command from the speech recognition system when the speech command includes a navigation instruction.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类