-
公开(公告)号:US10978081B2
公开(公告)日:2021-04-13
申请号:US16141578
申请日:2018-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Yuan-Yen Tai , Mohamed Mansour , Parind Shah
IPC: G10L19/018 , G10L19/16 , G10L13/08 , G10L15/22 , G10L15/05
Abstract: A system may embed audio watermarks in audio data using a sign sequence. The system may detect audio watermarks in audio data despite the effects of reverberation. For example, the system may embed multiple repetitions of an audio watermark before generating output audio using loudspeaker(s). To detect the audio watermark in audio data generated by a microphone, the system may perform a self-correlation that indicates where the audio watermark is repeated. In some examples, the system may encode the audio watermark using multiple repetitions of a multi-segment Eigenvector. Additionally or alternatively, the system may encode the audio watermark using a binary sequence of positive and negative values, which may be used as a shared key for encoding/decoding the audio watermark. The audio watermark can be embedded in output audio data to enable wakeword suppression (e.g., avoid cross-talk between devices) and/or local signal transmission between devices in proximity to each other.
-
公开(公告)号:US20230162728A1
公开(公告)日:2023-05-25
申请号:US18070830
申请日:2022-11-29
Applicant: Amazon Technologies, Inc.
Inventor: Christin Jose , Yuriy Mishchenko , Anish N. Shah , Alex Escott , Parind Shah , Shiv Naga Prasad Vitaladevuni , Thibaud Senechal
CPC classification number: G10L15/16 , G06F17/15 , G10L15/063 , G10L2015/088
Abstract: A system and method performs wakeword detection using a feedforward neural network model. A first output of the model indicates when the wakeword appears on a right side of a first window of input audio data. A second output of the model indicates when the wakeword appears in the center of a second window of input audio data. A third output of the model indicates when the wakeword appears on a left side of a third window of input audio data. Using these outputs, the system and method determine a beginpoint and endpoint of the wakeword.
-
公开(公告)号:US20210327442A1
公开(公告)日:2021-10-21
申请号:US17201843
申请日:2021-03-15
Applicant: Amazon Technologies, Inc.
Inventor: Yuan-Yen Tai , Mohamed Mansour , Parind Shah
IPC: G10L19/018 , G10L19/16 , G10L13/08 , G10L15/22 , G10L15/05
Abstract: A system may embed audio watermarks in audio data using an Eigenvector matrix. The system may detect audio watermarks in audio data despite the effects of reverberation. For example, the system may embed multiple repetitions of an audio watermark before generating output audio using loudspeaker(s). To detect the audio watermark in audio data generated by a microphone, the system may perform a self-correlation that indicates where the audio watermark is repeated. In some examples, the system may encode the audio watermark using multiple repetitions of a multi-segment Eigenvector. Additionally or alternatively, the system may encode the audio watermark using a binary sequence of positive and negative values, which may be used as a shared key for encoding/decoding the audio watermark. The audio watermark can be embedded in output audio data to enable wakeword suppression (e.g., avoid cross-talk between devices) and/or local signal transmission between devices in proximity to each other.
-
公开(公告)号:US10950249B2
公开(公告)日:2021-03-16
申请号:US16141489
申请日:2018-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Yuan-Yen Tai , Mohamed Mansour , Parind Shah
IPC: G10L19/018 , G10L19/16 , G10L13/08 , G10L15/22 , G10L15/05
Abstract: A system may embed audio watermarks in audio data using an Eigenvector matrix. The system may detect audio watermarks in audio data despite the effects of reverberation. For example, the system may embed multiple repetitions of an audio watermark before generating output audio using loudspeaker(s). To detect the audio watermark in audio data generated by a microphone, the system may perform a self-correlation that indicates where the audio watermark is repeated. In some examples, the system may encode the audio watermark using multiple repetitions of a multi-segment Eigenvector. Additionally or alternatively, the system may encode the audio watermark using a binary sequence of positive and negative values, which may be used as a shared key for encoding/decoding the audio watermark. The audio watermark can be embedded in output audio data to enable wakeword suppression (e.g., avoid cross-talk between devices) and/or local signal transmission between devices in proximity to each other.
-
公开(公告)号:US20200098379A1
公开(公告)日:2020-03-26
申请号:US16141489
申请日:2018-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Yuan-Yen Tai , Mohamed Mansour , Parind Shah
IPC: G10L19/018 , G10L19/16 , G10L15/05 , G10L13/08 , G10L15/22
Abstract: A system may embed audio watermarks in audio data using an Eigenvector matrix. The system may detect audio watermarks in audio data despite the effects of reverberation. For example, the system may embed multiple repetitions of an audio watermark before generating output audio using loudspeaker(s). To detect the audio watermark in audio data generated by a microphone, the system may perform a self-correlation that indicates where the audio watermark is repeated. In some examples, the system may encode the audio watermark using multiple repetitions of a multi-segment Eigenvector. Additionally or alternatively, the system may encode the audio watermark using a binary sequence of positive and negative values, which may be used as a shared key for encoding/decoding the audio watermark. The audio watermark can be embedded in output audio data to enable wakeword suppression (e.g., avoid cross-talk between devices) and/or local signal transmission between devices in proximity to each other.
-
公开(公告)号:US11521599B1
公开(公告)日:2022-12-06
申请号:US16577351
申请日:2019-09-20
Applicant: Amazon Technologies, Inc.
Inventor: Christin Jose , Yuriy Mishchenko , Anish N. Shah , Alex Escott , Parind Shah , Shiv Naga Prasad Vitaladevuni , Thibaud Senechal
Abstract: A system and method performs wakeword detection using a feedforward neural network model. A first output of the model indicates when the wakeword appears on a right side of a first window of input audio data. A second output of the model indicates when the wakeword appears in the center of a second window of input audio data. A third output of the model indicates when the wakeword appears on a left side of a third window of input audio data. Using these outputs, the system and method determine a beginpoint and endpoint of the wakeword.
-
公开(公告)号:US20200098380A1
公开(公告)日:2020-03-26
申请号:US16141578
申请日:2018-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Yuan-Yen Tai , Mohamed Mansour , Parind Shah
IPC: G10L19/018 , G10L19/16 , G10L15/05 , G10L13/08 , G10L15/22
Abstract: A system may embed audio watermarks in audio data using a sign sequence. The system may detect audio watermarks in audio data despite the effects of reverberation. For example, the system may embed multiple repetitions of an audio watermark before generating output audio using loudspeaker(s). To detect the audio watermark in audio data generated by a microphone, the system may perform a self-correlation that indicates where the audio watermark is repeated. In some examples, the system may encode the audio watermark using multiple repetitions of a multi-segment Eigenvector. Additionally or alternatively, the system may encode the audio watermark using a binary sequence of positive and negative values, which may be used as a shared key for encoding/decoding the audio watermark. The audio watermark can be embedded in output audio data to enable wakeword suppression (e.g., avoid cross-talk between devices) and/or local signal transmission between devices in proximity to each other.
-
-
-
-
-
-