-
Publication No.: US10827066B2
Publication date: 2020-11-03
Application No.: US12200905
Filing date: 2008-08-28
Applicant: Alistair E. Jeffs
Inventor(s): Alistair E. Jeffs
IPC classification: G10L21/00, G10L25/00, H04M3/493, G06F16/70, G06F16/432
Abstract: A method and system for ordering content includes a voice menu system and a phone device communicating a phone signal to the voice menu system. The voice menu system determines the phone number associated with the phone device through the phone signal and generates a voice prompt for recording a content selection from the voice menu system. The phone device selects a recording content option. The voice menu system generates prompts for determining a content title. The phone device selects a content title by communicating a selection signal to the voice menu system. The voice menu system enables a content recording at a recording device in response to the selection signal.
-
Publication No.: US10762890B1
Publication date: 2020-09-01
Application No.: US16544508
Filing date: 2019-08-19
Applicant: Voicify, LLC
Inventor(s): Jeffrey K. McMahon, Robert T. Naughton, Nicholas G. Laidlaw, Alexander M. Dunn, Jason Green
IPC classification: G10L21/00, G10L25/00, G10L21/06, H04M3/493, G10L13/08, G06F3/0481, G10L13/033, G10L13/047
Abstract: Among other things, a developer of an interaction application for an enterprise can create items of content to be provided to an assistant platform for use in responses to requests of end-users. The developer can deploy the interaction application using defined items of content and an available general interaction model including intents and sample utterances having slots. The developer can deploy the interaction application without requiring the developer to formulate any of the intents, sample utterances, or slots of the general interaction model.
-
Publication No.: US10706853B2
Publication date: 2020-07-07
Application No.: US15763322
Filing date: 2015-11-25
Inventor(s): Naoya Baba, Yuki Furumoto, Masanobu Osawa, Takumi Takei
IPC classification: G10L15/00, G10L15/26, G10L13/00, G10L13/08, G10L21/00, G10L25/00, G10L15/32, G10L15/22, G10L15/30, G10L17/00, G10L17/22
Abstract: A correspondence relationship between keywords for instructing the start of a speech dialogue and modes of a response is defined in a response-mode correspondence table. A response-mode selecting unit selects a mode of a response corresponding to a keyword included in the recognition result of a speech recognition unit using the response-mode correspondence table. A dialogue controlling unit starts the speech dialogue when the keyword is included in the recognition result of the speech recognition unit, determines a response in accordance with the subsequent recognition result from the speech recognition unit, and controls a mode of the response in such a manner as to match the mode selected by the response-mode selecting unit. A speech output controlling unit generates speech data on the basis of the response and mode controlled by the dialogue controlling unit and outputs the speech data to a speaker.
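The keyword-to-mode lookup described in this abstract can be illustrated with a minimal Python sketch. The keywords, the mode fields (`speed`, `detail`), and the names `RESPONSE_MODES`, `select_response_mode`, and `DialogueController` are all hypothetical; the abstract does not specify them, and the recognizer and speech output are reduced to plain strings and dictionaries.

```python
# Hypothetical response-mode correspondence table: start keyword -> response mode.
RESPONSE_MODES = {
    "hello assistant": {"speed": "slow", "detail": "verbose"},
    "quick start": {"speed": "fast", "detail": "brief"},
}


def select_response_mode(recognition_result):
    """Return the mode of the first table keyword found in the text, else None."""
    text = recognition_result.lower()
    for keyword, mode in RESPONSE_MODES.items():
        if keyword in text:
            return mode
    return None


class DialogueController:
    """Starts a dialogue on a keyword and keeps later responses in the selected mode."""

    def __init__(self):
        self.mode = None  # None means no dialogue is in progress

    def handle(self, recognition_result):
        if self.mode is None:
            self.mode = select_response_mode(recognition_result)
            if self.mode is None:
                return None  # no keyword recognized, so the dialogue does not start
            return self._render("How can I help?")
        # Subsequent recognition results determine the response content.
        return self._render("You said: " + recognition_result)

    def _render(self, response):
        # Stand-in for the speech output controlling unit: text plus mode settings.
        return {"text": response, **self.mode}
```

In this sketch the mode is chosen once, when the dialogue starts, and every later response is rendered with the same settings, which is one plausible reading of "controls a mode of the response in such a manner as to match the mode selected".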
-
Publication No.: US10564925B2
Publication date: 2020-02-18
Application No.: US15711793
Filing date: 2017-09-21
Applicant: Avnera Corporation
Inventor(s): Jiajin An, Michael Jon Wurtz, David Wurtz, Manpreet Khaira, Amit Kumar, Shawn O'Connor, Shankar Rathoud, James Scanlan, Eric Sorensen
Abstract: Many headsets include automatic noise cancellation (ANC) which dramatically reduces perceived background noise and improves user listening experience. Unfortunately, the voice microphones in these devices often capture ambient noise that the headsets output during phone calls or other communication sessions to other users. In response, many headsets and communication devices provide manual muting circuitry, but users frequently forget to turn the muting on and/or off, creating further problems as they communicate. To address this, the present inventors devised, among other things, an exemplary headset that detects the absence or presence of user speech, automatically muting and unmuting the voice microphone without user intervention. Some embodiments leverage relationships between feedback and feedforward signals in ANC circuitry to detect user speech, avoiding the addition of extra hardware to the headset. Other embodiments also leverage the speech detection function to activate and deactivate keyword detectors, and/or sidetone circuits, thus extending battery life.
-
Publication No.: US10482892B2
Publication date: 2019-11-19
Application No.: US15662302
Filing date: 2017-07-28
Inventor(s): Yang Gao, Fengyan Qi
Abstract: System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
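The combination of a time-domain correlation test and a frequency-domain low-energy test can be sketched in Python as follows. This is only an illustration under stated assumptions: the sample rate, the lag limits (`PIT_MIN`, `PIT_MIN_SHORT`), the thresholds, and all function names are hypothetical, and the patent's actual decision logic is not given in the abstract.

```python
import math

SAMPLE_RATE = 8000        # Hz (assumed)
PIT_MIN = 34              # conventional minimum pitch lag, in samples (assumed)
PIT_MIN_SHORT = 10        # minimum very short pitch lag, in samples (assumed)
CORR_THRESHOLD = 0.8      # minimum normalized correlation (assumed)
LOW_FREQ_CUTOFF_HZ = 300  # upper edge of the "low frequency" band (assumed)
LOW_ENERGY_RATIO = 0.05   # maximum share of energy allowed in that band (assumed)


def normalized_correlation(x, lag):
    """Time domain: normalized correlation between the signal and its lagged copy."""
    a, b = x[lag:], x[:len(x) - lag]
    num = sum(p * q for p, q in zip(a, b))
    den = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return num / den if den > 0 else 0.0


def low_frequency_energy_ratio(x):
    """Frequency domain: share of spectral energy below the cutoff (naive DFT)."""
    n = len(x)
    low = total = 0.0
    for k in range(1, n // 2):
        re = sum(v * math.cos(2 * math.pi * k * t / n) for t, v in enumerate(x))
        im = sum(v * math.sin(2 * math.pi * k * t / n) for t, v in enumerate(x))
        energy = re * re + im * im
        total += energy
        if k * SAMPLE_RATE / n < LOW_FREQ_CUTOFF_HZ:
            low += energy
    return low / total if total > 0 else 0.0


def detect_very_short_pitch(x):
    """Return a lag below PIT_MIN if both tests pass, otherwise None."""
    best_lag, best_corr = None, CORR_THRESHOLD
    for lag in range(PIT_MIN_SHORT, PIT_MIN):
        corr = normalized_correlation(x, lag)
        if corr > best_corr + 1e-9:  # tolerance keeps the shortest of tied lags
            best_lag, best_corr = lag, corr
    if best_lag is None:
        return None
    if low_frequency_energy_ratio(x) > LOW_ENERGY_RATIO:
        return None  # low-frequency energy suggests the true pitch lag is longer
    return best_lag


def harmonic_tone(f0, harmonics=3, n=256):
    """Test signal: sum of the first few harmonics of f0."""
    return [sum(math.sin(2 * math.pi * f0 * h * t / SAMPLE_RATE)
                for h in range(1, harmonics + 1)) for t in range(n)]
```

With these assumed constants, a 500 Hz harmonic tone has a period of 16 samples, which is below the assumed conventional minimum of 34, so it falls in the very short range; a 125 Hz tone has energy below the cutoff and is rejected.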
-
Publication No.: US10460036B2
Publication date: 2019-10-29
Application No.: US15959833
Filing date: 2018-04-23
Inventor(s): Long Duong, Hadi Afshar, Dominique Estival, Glen Pink, Philip Cohen, Mark Edward Johnson
Abstract: The disclosure relates to transferred learning from a first language (e.g., a source language for which a semantic parser has been defined) to a second language (e.g., a target language for which a semantic parser has not been defined). A system may use knowledge from a trained model in one language to model another language. For example, the system may transfer knowledge of a semantic parser from a first (e.g., source) language to a second (e.g., target) language. Such transfer of knowledge may occur and be useful when the first language has sufficient training data but the second language has insufficient training data. The foregoing transfer of knowledge may extend the semantic parser for multiple languages (e.g., the first language and the second language).
-
Publication No.: US10410651B2
Publication date: 2019-09-10
Application No.: US15849091
Filing date: 2017-12-20
Inventor(s): Shasha Lou, Bo Li
IPC classification: G10L21/00, G10L25/00, G10L15/00, G10L21/0208, G10L21/02, H04R1/32, G10L21/0216, G10L15/22
Abstract: A de-reverberation control method and device for sound producing equipment are disclosed. The method includes the following: when a piece of equipment plays audio, a voice signal from a user is collected in real time; a relative position of the user with respect to the equipment and acoustic parameters of the room environment in which the equipment is located are acquired; according to one or more of the relative position and the acoustic parameters, a corresponding microphone in the equipment is selected, and a corresponding voice enhancement mode is called to perform de-reverberation; a voice command word from the user is acquired to control the equipment to perform a corresponding function as a response to the user. The present solution can improve the recognition accuracy of a voice command and improve the user interaction experience.
-
Publication No.: US10366685B2
Publication date: 2019-07-30
Application No.: US15728775
Filing date: 2017-10-10
Inventor(s): Alan Neuhauser, John Stavropoulos
IPC classification: G10L25/00, G10H1/40, G06F17/28, G10L19/018, G10L15/18
Abstract: Example apparatus, articles of manufacture and methods to determine semantic audio information for audio are disclosed. Example methods include extracting a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature. Example methods also include comparing the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith. Example methods further include determining a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio.
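Comparing extracted features against stored, tagged feature ranges can be sketched in Python as below. The feature names, range values, tags, and the distance measure are illustrative assumptions; the abstract does not say how "closest match" is computed.

```python
# Hypothetical stored audio feature ranges, each set carrying an associated tag.
STORED_RANGES = [
    {"tag": "rock",
     "ranges": {"tempo_bpm": (110, 160), "spectral_centroid_hz": (2000, 4000)}},
    {"tag": "classical",
     "ranges": {"tempo_bpm": (50, 100), "spectral_centroid_hz": (500, 1800)}},
    {"tag": "speech",
     "ranges": {"tempo_bpm": (0, 40), "spectral_centroid_hz": (300, 1200)}},
]


def range_distance(value, low, high):
    """0 inside the range; otherwise distance to the nearest bound, scaled by range width."""
    if low <= value <= high:
        return 0.0
    nearest = low if value < low else high
    return abs(value - nearest) / (high - low)


def closest_tag(features):
    """Return the tag of the stored range set that most closely matches the features."""
    def total_distance(entry):
        return sum(range_distance(features[name], low, high)
                   for name, (low, high) in entry["ranges"].items())
    return min(STORED_RANGES, key=total_distance)["tag"]
```

Scaling each distance by the range width is a design choice made here so that features with different units (beats per minute, hertz) contribute comparably to the total.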
-
Publication No.: US10325603B2
Publication date: 2019-06-18
Application No.: US14757928
Filing date: 2015-12-23
IPC classification: G10L15/00, G10L21/00, G10L25/00, G10L17/00, G10L15/04, G10L15/06, G10L15/26, G10L13/00, G10L13/08, G06F7/04, G06F15/16, G06F17/30, G10L17/24, G10L17/04, G10L17/08, G10L17/14
Abstract: The present disclosure provides a voiceprint authentication method and a voiceprint authentication apparatus. The method includes: displaying a tip text to a user, the tip text being a combination of a preregistered phrase; obtaining a speech of the tip text read by the user; and, if the speech of the tip text corresponds to the tip text, obtaining a pre-established registration model and determining a result of a voiceprint authentication according to the speech of the tip text and the pre-established registration model.
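The two-stage check in this abstract (first confirm the spoken content matches the tip text, then compare against the registration model) can be sketched in Python. This is an illustration only: it assumes the tip text is built by combining several preregistered phrases, that a speech recognizer has already produced a transcript, and that the voiceprint and registration model are plain embedding vectors compared by cosine similarity. None of these details, nor the function names or the threshold, come from the abstract.

```python
import math
import random


def make_tip_text(registered_phrases, count=2, rng=random):
    """Build a tip text by combining randomly chosen preregistered phrases (assumed scheme)."""
    return " ".join(rng.sample(registered_phrases, count))


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def authenticate(tip_text, transcript, speech_embedding, registration_model,
                 threshold=0.8):
    """Return True only if the content matches the tip text and the voiceprint matches."""
    # Stage 1: the speech must correspond to the displayed tip text.
    if transcript.strip().lower() != tip_text.strip().lower():
        return False
    # Stage 2: compare the speaker's voiceprint with the registration model.
    return cosine_similarity(speech_embedding, registration_model) >= threshold
```

Checking the content before the voiceprint means a replayed recording of a different phrase is rejected without ever reaching the speaker comparison.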
-
Publication No.: US10325592B2
Publication date: 2019-06-18
Application No.: US15433754
Filing date: 2017-02-15
IPC classification: G10L15/00, G10L15/26, G10L25/00, G10L15/22, B60R11/02, G01C21/36, G07C5/00, G10L15/06, G10L25/51, G10L15/32, G10L15/30
Abstract: A method for recognizing speech in a vehicle includes receiving speech at a microphone installed in a vehicle, and determining whether the speech includes a navigation instruction. If the speech includes a navigation instruction, the speech may be sent to a remote facility. After sending the speech to the remote facility, a local speech recognition result is provided in the vehicle to the user. The speech sent to the remote facility may be used to provide corrective action. A system for recognizing speech in a vehicle may include a microphone, and may be configured to determine a local speech recognition result from the speech command and determine when the speech command includes a navigation instruction. The system may further include a remote server in communication with the vehicle that receives a sample of the speech command from the speech recognition system when the speech command includes a navigation instruction.
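The flow in this abstract (recognize locally, forward navigation speech to a remote facility, then present the local result) can be sketched in Python. The keyword list and the names `is_navigation_instruction` and `handle_speech` are hypothetical, and the local recognizer and the remote upload are passed in as callables because the abstract does not describe either interface.

```python
# Hypothetical phrases that mark a recognized utterance as a navigation instruction.
NAVIGATION_KEYWORDS = ("navigate", "directions", "route", "take me to")


def is_navigation_instruction(recognized_text):
    text = recognized_text.lower()
    return any(keyword in text for keyword in NAVIGATION_KEYWORDS)


def handle_speech(audio, local_recognizer, send_to_remote):
    """Return the local recognition result, forwarding navigation speech first."""
    local_result = local_recognizer(audio)
    if is_navigation_instruction(local_result):
        # The remote facility receives the speech sample, e.g. for corrective action.
        send_to_remote(audio)
    # The local result is provided to the user after any upload has been issued.
    return local_result
```

A usage example with stand-in callables: `handle_speech(audio, lambda a: "navigate to Main Street", uploads.append)` returns the local text and appends `audio` to the `uploads` list, while a non-navigation result such as "call home" leaves `uploads` unchanged.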
-