-
公开(公告)号:US20200168239A1
公开(公告)日:2020-05-28
申请号:US16775228
申请日:2020-01-28
IPC分类号: G10L21/06
摘要: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
-
公开(公告)号:US10438582B1
公开(公告)日:2019-10-08
申请号:US14573943
申请日:2014-12-17
发明人: Peter Spalding VanLund , Nicolas Anton Medhurst Hertl , Peter Paul Henri Carbon , Frederic Johan Georges Deramat
摘要: A voice-controlled device may receive a voice command uttered by a user, where the voice command may request that the voice-controlled device perform an operation. The voice-controlled device and/or one or more remote computing resources may process an audio signal associated with the voice command to determine text corresponding to the voice command. The resulting user utterance may be associated with a unique identifier, which may be provided to a third party and/or third party application that is to provide information responsive to the user request. The information provided by the third party/third party application may be output to the user based at least partly on the unique identifier, without disclosing user data associated with the user.
-
公开(公告)号:US10283119B2
公开(公告)日:2019-05-07
申请号:US15966400
申请日:2018-04-30
发明人: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
摘要: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US20180315425A1
公开(公告)日:2018-11-01
申请号:US15966400
申请日:2018-04-30
发明人: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
CPC分类号: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
摘要: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US09959869B2
公开(公告)日:2018-05-01
申请号:US15694996
申请日:2017-09-04
发明人: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
CPC分类号: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
摘要: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US09712625B2
公开(公告)日:2017-07-18
申请号:US13858753
申请日:2013-04-08
摘要: Techniques for creating a persistent connection between client devices and one or more remote computing resources, which may form a portion of a network-accessible computing platform. This connection may be considered “permanent” or “nearly permanent” to allow the client device to both send data to and receive data from the remote resources at nearly any time. In addition, both the client device and the remote resources may establish virtual channels over this single connection. If no data is exchanged between the client device and the remote computing resources for a threshold amount of time, then the connection may be severed and the client device may attempt to establish a new connection with the remote computing resources.
-
公开(公告)号:US11468889B1
公开(公告)日:2022-10-11
申请号:US16806516
申请日:2020-03-02
发明人: Gregory Michael Hart , Peter Paul Henri Carbon , John Daniel Thimsen , Vikram Kumar Gundeti , Scott Ian Blanksteen , Allan Timothy Lindsay , Frederic Johan Georges Deramat
摘要: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform a corresponding action, such as streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user. The speech recognition platform, in combination with the device, may therefore facilitate efficient interactions between the user and a voice-controlled device.
-
公开(公告)号:US11333378B1
公开(公告)日:2022-05-17
申请号:US15708002
申请日:2017-09-18
发明人: David A. Limp , Melissa J. Cha , Matthew Liang Chaboud , Rohan Mutagi , Frederic Johan Georges Deramat , Lindo St. Angel
摘要: Described are systems, methods, and apparatus that enable power management and reduction at both the individual and group level to help reduce the overall power demand on a power system. One or more sensors may be positioned at different locations that collect and provide various sensor data to a remote computing system, referred to herein as a management system. The management system maintains location profiles for each location, user profiles for users at the various locations, and may also receive third party data, such as weather patterns, power system load, etc. The management system utilizes the received data to determine one or more energy saving actions that may be performed at the location(s) to reduce energy consumption and lower the demand on the power system.
-
公开(公告)号:US20220148590A1
公开(公告)日:2022-05-12
申请号:US17454716
申请日:2021-11-12
发明人: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
IPC分类号: G10L15/22 , G10L15/26 , G06F40/35 , G06F40/40 , G06F40/56 , G06F40/284 , G06F40/295 , G10L13/08
摘要: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US10853031B2
公开(公告)日:2020-12-01
申请号:US16222751
申请日:2018-12-17
发明人: Gautham Kumar Jayakumar , Nishant Kumar , Steven Michael Saxon , Frederic Johan Georges Deramat
摘要: Systems and methods for audio output control are disclosed. Audio may be output via a speaker of a communal device associated with a first portion of an environment. A user may provide a user utterance indicating an intent to add another device in a second portion of the environment to the audio-output session, and/or an intent to move the audio-output session from the first device to the second device, and/or an intent to remove a device from an audio-output session. Based on this determined intent, audio-session queues may be associated and dissociated from devices and device states may be altered to effectuate the intent of the user utterance.
-
-
-
-
-
-
-
-
-