-
Publication No.: US11289087B2
Publication date: 2022-03-29
Application No.: US16752262
Filing date: 2020-01-24
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Lalithkumar Rajendran, Ravi Kiran Rachakonda, Venkata Snehith Cherukuri, Deepak Uttam Shah, Maziyar Khorasani, Vinodth Kumar Mohanam
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
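The arbitration flow described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Device` class, the `can_perform` rule, and the single numeric `metric` field are hypothetical stand-ins for the device states and audio signal metric values the abstract mentions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Device:
    name: str
    state: str                      # hypothetical device state, e.g. "playing" or "idle"
    metric: Optional[float] = None  # audio signal metric value; None if the device did not hear the utterance


def can_perform(device: Device, command: str) -> bool:
    # Hypothetical rule for illustration: only a device that is playing can act on "stop".
    if command == "stop":
        return device.state == "playing"
    return True


def arbitrate(command: str, account_devices: List[Device]) -> Optional[Device]:
    # Rank the devices that detected the utterance by metric value, best first.
    heard = [d for d in account_devices if d.metric is not None]
    ranked = sorted(heard, key=lambda d: d.metric, reverse=True)

    # Move through the ranked list until a device's state allows the action.
    for device in ranked:
        if can_perform(device, command):
            return device

    # Fall back to the other devices associated with the account.
    for device in account_devices:
        if device.metric is None and can_perform(device, command):
            return device
    return None
```

With a kitchen and a bedroom device that both heard "stop" while idle, and a living-room device that is playing but heard nothing, this sketch selects the living-room device through the fallback step.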
-
Publication No.: US11138977B1
Publication date: 2021-10-05
Application No.: US16895419
Filing date: 2020-06-08
Applicant: Amazon Technologies, Inc.
Inventor: Venkata Snehith Cherukuri, Joseph White, Vinodth Kumar Mohanam, Rami Habal, Menghan Li
Abstract: This disclosure describes, in part, techniques for determining device groupings, or clusters, for multiple voice-enabled devices. The device clusters may be determined based on metadata for audio signals (or audio data) generated by each of the multiple voice-enabled devices. For example, a remote system may analyze timestamp data for the audio signals received from the devices, and determine that the devices detected the same voice command of a user based on the timestamp data indicating that the audio signals were received within a threshold period of time from each other. Additionally, the remote system may analyze other metadata of the audio data, such as signal-to-noise ratio (SNR) values, and determine that the SNR values are within a threshold value. The remote system may determine device clusters for the voice-enabled devices of a user based on these, and potentially other, types of metadata of the audio signals.
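As a rough illustration of the clustering idea in this abstract, the Python sketch below groups devices whose audio metadata falls within a time threshold and an SNR threshold of each other. The `AudioMeta` record, the threshold defaults (0.5 s and 6 dB), and the greedy merging strategy are assumptions made for the example; the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AudioMeta:
    device_id: str
    received_at: float  # arrival time of the audio signal, in seconds
    snr_db: float       # signal-to-noise ratio reported for the audio signal


def same_utterance(a: AudioMeta, b: AudioMeta,
                   max_dt: float = 0.5, max_snr_diff: float = 6.0) -> bool:
    # Both thresholds are hypothetical values chosen for illustration.
    close_in_time = abs(a.received_at - b.received_at) <= max_dt
    close_in_snr = abs(a.snr_db - b.snr_db) <= max_snr_diff
    return close_in_time and close_in_snr


def cluster_devices(records: List[AudioMeta]) -> List[List[str]]:
    clusters: List[List[AudioMeta]] = []
    for rec in records:
        merged = [rec]
        remaining = []
        # Merge every existing cluster that contains a matching record.
        for cluster in clusters:
            if any(same_utterance(rec, other) for other in cluster):
                merged.extend(cluster)
            else:
                remaining.append(cluster)
        clusters = remaining + [merged]
    # Report each cluster as a sorted list of unique device IDs.
    return [sorted({r.device_id for r in cluster}) for cluster in clusters]
```

Two devices whose signals arrive 0.2 s apart with SNR values 3 dB apart end up in one cluster, while a device that reported audio half a minute later forms its own cluster.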
-
Publication No.: US12125483B1
Publication date: 2024-10-22
Application No.: US17492371
Filing date: 2021-10-01
Applicant: Amazon Technologies, Inc.
Inventor: Venkata Snehith Cherukuri, Joseph White, Vinodth Kumar Mohanam, Rami Habal, Menghan Li
CPC classification number: G10L15/22, G10L15/28, G10L17/00, G10L25/84, G10L2015/223
Abstract: This disclosure describes, in part, techniques for determining device groupings, or clusters, for multiple voice-enabled devices. The device clusters may be determined based on metadata for audio signals (or audio data) generated by each of the multiple voice-enabled devices. For example, a remote system may analyze timestamp data for the audio signals received from the devices, and determine that the devices detected the same voice command of a user based on the timestamp data indicating that the audio signals were received within a threshold period of time from each other. Additionally, the remote system may analyze other metadata of the audio data, such as signal-to-noise ratio (SNR) values, and determine that the SNR values are within a threshold value. The remote system may determine device clusters for the voice-enabled devices of a user based on these, and potentially other, types of metadata of the audio signals.
-
Publication No.: US20220215837A1
Publication date: 2022-07-07
Application No.: US17701387
Filing date: 2022-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Ravi Kiran Rachakonda, Vinodth Kumar Mohanam, Lalithkumar Rajendran, Deepak Uttam Shah, Maziyar Khorasani, Venkata Snehith Cherukuri
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
Publication No.: US20240404521A1
Publication date: 2024-12-05
Application No.: US18732131
Filing date: 2024-06-03
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Ravi Kiran Rachakonda, Vinodth Kumar Mohanam, Lalithkumar Rajendran, Deepak Uttam Shah, Maziyar Khorasani, Venkata Snehith Cherukuri
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
Publication No.: US10685652B1
Publication date: 2020-06-16
Application No.: US15928682
Filing date: 2018-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Venkata Snehith Cherukuri, Joseph White, Vinodth Kumar Mohanam, Rami Habal, Menghan Li
Abstract: This disclosure describes, in part, techniques for determining device groupings, or clusters, for multiple voice-enabled devices. The device clusters may be determined based on metadata for audio signals (or audio data) generated by each of the multiple voice-enabled devices. For example, a remote system may analyze timestamp data for the audio signals received from the devices, and determine that the devices detected the same voice command of a user based on the timestamp data indicating that the audio signals were received within a threshold period of time from each other. Additionally, the remote system may analyze other metadata of the audio data, such as signal-to-noise ratio (SNR) values, and determine that the SNR values are within a threshold value. The remote system may determine device clusters for the voice-enabled devices of a user based on these, and potentially other, types of metadata of the audio signals.
-
Publication No.: US10546583B2
Publication date: 2020-01-28
Application No.: US15691460
Filing date: 2017-08-30
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Ravi Kiran Rachakonda, Vinodth Kumar Mohanam, Lalithkumar Rajendran, Deepak Uttam Shah, Maziyar Khorasani, Venkata Snehith Cherukuri
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
Publication No.: US12033632B2
Publication date: 2024-07-09
Application No.: US17701387
Filing date: 2022-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Ravi Kiran Rachakonda, Vinodth Kumar Mohanam, Lalithkumar Rajendran, Deepak Uttam Shah, Maziyar Khorasani, Venkata Snehith Cherukuri
CPC classification number: G10L15/22, G10L15/1815, G10L15/28, G10L25/84, G10L2015/223
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
Publication No.: US20200211554A1
Publication date: 2020-07-02
Application No.: US16752262
Filing date: 2020-01-24
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Lalithkumar Rajendran, Ravi Kiran Rachakonda, Venkata Snehith Cherukuri, Deepak Uttam Shah, Maziyar Khorasani, Vinodth Kumar Mohanam
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
Publication No.: US20190066670A1
Publication date: 2019-02-28
Application No.: US15691460
Filing date: 2017-08-30
Applicant: Amazon Technologies, Inc.
Inventor: Joseph White, Ravi Kiran Rachakonda, Vinodth Kumar Mohanam, Lalithkumar Rajendran, Deepak Uttam Shah, Maziyar Khorasani, Venkata Snehith Cherukuri
Abstract: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.