-
公开(公告)号:US12266367B2
公开(公告)日:2025-04-01
申请号:US17234111
申请日:2021-04-19
Applicant: Amazon Technologies, Inc.
Inventor: Stanislaw Ignacy Pasko , Michal Papierski , Maciej Makowski , Marcin Fuszara
Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.
-
公开(公告)号:US20210241775A1
公开(公告)日:2021-08-05
申请号:US17234111
申请日:2021-04-19
Applicant: Amazon Technologies, Inc.
Inventor: Stanislaw Ignacy Pasko , Michal Papierski , Maciej Makowski , Marcin Fuszara
Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.
-
公开(公告)号:US20190295552A1
公开(公告)日:2019-09-26
申请号:US15934726
申请日:2018-03-23
Applicant: Amazon Technologies, Inc.
Inventor: Stanislaw Ignacy Pasko , Michal Papierski , Maciej Makowski , Marcin Fuszara
Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.
-
公开(公告)号:US11763819B1
公开(公告)日:2023-09-19
申请号:US17350451
申请日:2021-06-17
Applicant: Amazon Technologies, Inc.
Inventor: Benjamin Charles Eagan , Maciej Makowski , Zack Shahaf Matorin
IPC: G10L19/018 , G10L15/26 , H04L9/40 , G10L15/22
CPC classification number: G10L15/26 , G10L15/22 , G10L19/018 , H04L63/0428 , G10L2015/223
Abstract: A speech interface device is configured to defer encryption of audio data on-device until a time when the encryption operation is not competing with other computationally-intensive operations for responding to the audio data. For example, audio data based on sound captured in an environment of the speech interface device can be stored in volatile memory of the speech interface device, without encrypting it, until a set of processing operations (e.g., ASR processing, NLU processing, audio event processing, etc.) performed based on the audio data have stopped. Based on a determination that these processing operations for responding to the audio data have stopped, the logic may encrypt the audio data to generate encrypted data, and the encrypted data can be stored in non-volatile memory of the speech interface device for uploading to a remote system when a connection is available.
-
公开(公告)号:US11295743B1
公开(公告)日:2022-04-05
申请号:US16883379
申请日:2020-05-26
Applicant: Amazon Technologies, Inc.
Inventor: Fabian Andreas Bumberger , Sabria Farheen , Maciej Makowski , Eli Joshua Fidler , Sasitheran Shanmugarajah
Abstract: This disclosure proposes systems and methods enabling on-device/hybrid processing of speech requests using a hub device. The hub device is capable of receiving audio data from surrounding devices and performing speech processing on the audio data to improve latency and/or provide functionality to other devices within a private network. The hub device may receive multiple requests corresponding to different utterances. If the hub device receives a second utterance while processing a first utterance, the hub device may send an error notification, process the first utterance and the second utterance sequentially, suspend processing of the first utterance to process the second utterance first, send the second utterance to another hub device or remote system, or suspend processing of the first utterance and send the first utterance to the remote system in order to process the second utterance.
-
公开(公告)号:US12080291B2
公开(公告)日:2024-09-03
申请号:US17708077
申请日:2022-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Fabian Andreas Bumberger , Sabria Farheen , Maciej Makowski , Eli Joshua Fidler , Sasitheran Shanmugarajah
CPC classification number: G10L15/22 , G10L15/02 , G10L2015/223 , G10L15/30
Abstract: This disclosure proposes systems and methods enabling on-device/hybrid processing of speech requests using a hub device. The hub device is capable of receiving audio data from surrounding devices and performing speech processing on the audio data to improve latency and/or provide functionality to other devices within a private network. The hub device may receive multiple requests corresponding to different utterances. If the hub device receives a second utterance while processing a first utterance, the hub device may send an error notification, process the first utterance and the second utterance sequentially, suspend processing of the first utterance to process the second utterance first, send the second utterance to another hub device or remote system, or suspend processing of the first utterance and send the first utterance to the remote system in order to process the second utterance.
-
公开(公告)号:US20220358921A1
公开(公告)日:2022-11-10
申请号:US17708077
申请日:2022-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Fabian Andreas Bumberger , Sabria Farheen , Maciej Makowski , Eli Joshua Fidler , Sasitheran Shanmugarajah
Abstract: This disclosure proposes systems and methods enabling on-device/hybrid processing of speech requests using a hub device. The hub device is capable of receiving audio data from surrounding devices and performing speech processing on the audio data to improve latency and/or provide functionality to other devices within a private network. The hub device may receive multiple requests corresponding to different utterances. If the hub device receives a second utterance while processing a first utterance, the hub device may send an error notification, process the first utterance and the second utterance sequentially, suspend processing of the first utterance to process the second utterance first, send the second utterance to another hub device or remote system, or suspend processing of the first utterance and send the first utterance to the remote system in order to process the second utterance.
-
公开(公告)号:US10984799B2
公开(公告)日:2021-04-20
申请号:US15934726
申请日:2018-03-23
Applicant: Amazon Technologies, Inc.
Inventor: Stanislaw Ignacy Pasko , Michal Papierski , Maciej Makowski , Marcin Fuszara
Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.
-
公开(公告)号:US12170086B1
公开(公告)日:2024-12-17
申请号:US17525050
申请日:2021-11-12
Applicant: Amazon Technologies, Inc.
Inventor: Ashwin Venkatesh Raman , Bruno Dufour , Sasi Kiran Vepanjeri Lokanadha Reddy , Michal Kowalczuk , Maciej Grabon , Maciej Makowski , Fabian Andreas Bumberger
IPC: G10L15/22 , G06F3/16 , G06F8/65 , G06F9/445 , G06F9/54 , G10L15/00 , G10L15/19 , H04L67/14 , H04L67/56
Abstract: A speech interface device is configured to switch between languages, at the request of a user, in order to locally process utterances spoken in different languages, even in instances when a remote system is unavailable to, slower than, or otherwise less preferred than the speech interface device. For example, a user can request to set the language setting of the speech interface device to a second language, different from a first language to which the language setting of the device is currently set. Based on this user request, a local speech processing component of the device may load a language model(s) associated with the second language. The speech interface can also output voice prompts in the second language to manage the user's experience while a language update is in progress on the speech interface device.
-
公开(公告)号:US11176934B1
公开(公告)日:2021-11-16
申请号:US16362408
申请日:2019-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Ashwin Venkatesh Raman , Bruno Dufour , Sasi Kiran Vepanjeri Lokanadha Reddy , Michal Kowalczuk , Maciej Grabon , Maciej Makowski , Fabian Andreas Bumberger
Abstract: A speech interface device is configured to switch between languages, at the request of a user, in order to locally process utterances spoken in different languages, even in instances when a remote system is unavailable to, slower than, or otherwise less preferred than the speech interface device. For example, a user can request to set the language setting of the speech interface device to a second language, different from a first language to which the language setting of the device is currently set. Based on this user request, a local speech processing component of the device may load a language model(s) associated with the second language. The speech interface can also output voice prompts in the second language to manage the user's experience while a language update is in progress on the speech interface device.
-
-
-
-
-
-
-
-
-