-
公开(公告)号:US10685669B1
公开(公告)日:2020-06-16
申请号:US15926507
申请日:2018-03-20
发明人: Gang Lan , Joseph Pedro Tavares , Deepak Uttam Shah , Mckay Clawson , Vijay Shankar Tennety , Ravi Kiran Rachakonda , Venkata Snehith Cherukuri , Charles James Torbert
摘要: This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.
-
公开(公告)号:US11289087B2
公开(公告)日:2022-03-29
申请号:US16752262
申请日:2020-01-24
发明人: Joseph White , Lalithkumar Rajendran , Ravi Kiran Rachakonda , Venkata Snehith Cherukuri , Deepak Uttam Shah , Maziyar Khorasani , Vinodth Kumar Mohanam
摘要: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
公开(公告)号:US11138977B1
公开(公告)日:2021-10-05
申请号:US16895419
申请日:2020-06-08
摘要: This disclosure describes, in part, techniques for determining device groupings, or clusters, for multiple voice-enabled devices. The device clusters may be determined based on metadata data for audio signals (or audio data) generated by each of the multiple voice-enabled devices. For example, a remote system may analyze timestamp data for the audio signals received from the devices, and determine that the devices detected the same voice command of a user based on the timestamp data indicating that the audio signals were received within a threshold period of time from each other. Additionally, the remote system may analyze other metadata of the audio data, such as signal-to-noise (SNR) values, and determine that the SNR values are within a threshold value. The remote system may determine device clusters for the voice-enabled devices of a user based on these, and potentially other, types of metadata of the audio signals.
-
公开(公告)号:US12125483B1
公开(公告)日:2024-10-22
申请号:US17492371
申请日:2021-10-01
CPC分类号: G10L15/22 , G10L15/28 , G10L17/00 , G10L25/84 , G10L2015/223
摘要: This disclosure describes, in part, techniques for determining device groupings, or clusters, for multiple voice-enabled devices. The device clusters may be determined based on metadata data for audio signals (or audio data) generated by each of the multiple voice-enabled devices. For example, a remote system may analyze timestamp data for the audio signals received from the devices, and determine that the devices detected the same voice command of a user based on the timestamp data indicating that the audio signals were received within a threshold period of time from each other. Additionally, the remote system may analyze other metadata of the audio data, such as signal-to-noise (SNR) values, and determine that the SNR values are within a threshold value. The remote system may determine device clusters for the voice-enabled devices of a user based on these, and potentially other, types of metadata of the audio signals.
-
公开(公告)号:US20220215837A1
公开(公告)日:2022-07-07
申请号:US17701387
申请日:2022-03-22
发明人: Joseph White , Ravi Kiran Rachakonda , Vinodth Kumar Mohanam , Lalithkumar Rajendran , Deepak Uttam Shah , Maziyar Khorasani , Venkata Snehith Cherukuri
摘要: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
公开(公告)号:US11600291B1
公开(公告)日:2023-03-07
申请号:US16900593
申请日:2020-06-12
发明人: Gang Lan , Joseph Pedro Tavares , Deepak Uttam Shah , Mckay Clawson , Vijay Shankar Tennety , Ravi Kiran Rachakonda , Venkata Snehith Cherukuri , Charles James Torbert
摘要: This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.
-
公开(公告)号:US10685652B1
公开(公告)日:2020-06-16
申请号:US15928682
申请日:2018-03-22
摘要: This disclosure describes, in part, techniques for determining device groupings, or clusters, for multiple voice-enabled devices. The device clusters may be determined based on metadata data for audio signals (or audio data) generated by each of the multiple voice-enabled devices. For example, a remote system may analyze timestamp data for the audio signals received from the devices, and determine that the devices detected the same voice command of a user based on the timestamp data indicating that the audio signals were received within a threshold period of time from each other. Additionally, the remote system may analyze other metadata of the audio data, such as signal-to-noise (SNR) values, and determine that the SNR values are within a threshold value. The remote system may determine device clusters for the voice-enabled devices of a user based on these, and potentially other, types of metadata of the audio signals.
-
公开(公告)号:US10546583B2
公开(公告)日:2020-01-28
申请号:US15691460
申请日:2017-08-30
发明人: Joseph White , Ravi Kiran Rachakonda , Vinodth Kumar Mohanam , Lalithkumar Rajendran , Deepak Uttam Shah , Maziyar Khorasani , Venkata Snehith Cherukuri
摘要: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
公开(公告)号:US12033632B2
公开(公告)日:2024-07-09
申请号:US17701387
申请日:2022-03-22
发明人: Joseph White , Ravi Kiran Rachakonda , Vinodth Kumar Mohanam , Lalithkumar Rajendran , Deepak Uttam Shah , Maziyar Khorasani , Venkata Snehith Cherukuri
CPC分类号: G10L15/22 , G10L15/1815 , G10L15/28 , G10L25/84 , G10L2015/223
摘要: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
公开(公告)号:US20200211554A1
公开(公告)日:2020-07-02
申请号:US16752262
申请日:2020-01-24
发明人: Joseph White , Lalithkumar Rajendran , Ravi Kiran Rachakonda , Venkata Snehith Cherukuri , Deepak Uttam Shah , Maziyar Khorasani , Vinodth Kumar Mohanam
摘要: This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
-
-
-
-
-
-
-
-
-