-
公开(公告)号:US20240428797A1
公开(公告)日:2024-12-26
申请号:US18823198
申请日:2024-09-03
Applicant: Amazon Technologies, Inc.
Inventor: Beiye Liu , Wael Hamza , Liwei Cai , Konstantine Arkoudas , Chengwei Su , Subendhu Rongali
Abstract: Techniques for performing spoken language understanding (SLU) processing are described. An SLU component may include an audio encoder configured to perform an audio-to-text processing task and an audio-to-NLU processing task. The SLU component may also include a joint decoder configured to perform the audio-to-text processing task, the audio-to-NLU processing task and a text-to-NLU processing task. Input audio data, representing a spoken input, is processed by the audio encoder and the joint decoder to determine NLU data corresponding to the spoken input.
-
公开(公告)号:US20230368796A1
公开(公告)日:2023-11-16
申请号:US18324440
申请日:2023-05-26
Applicant: Amazon Technologies, Inc.
Inventor: Beiye Liu , Wael Hamza , Liwei Cai , Konstantine Arkoudas , Chengwei Su , Subendhu Rongali
CPC classification number: G10L15/26 , G10L15/1822
Abstract: Techniques for performing spoken language understanding (SLU) processing are described. An SLU component may include an audio encoder configured to perform an audio-to-text processing task and an audio-to-NLU processing task. The SLU component may also include a joint decoder configured to perform the audio-to-text processing task, the audio-to-NLU processing task and a text-to-NLU processing task. Input audio data, representing a spoken input, is processed by the audio encoder and the joint decoder to determine NLU data corresponding to the spoken input.
-
公开(公告)号:US12087305B2
公开(公告)日:2024-09-10
申请号:US18324440
申请日:2023-05-26
Applicant: Amazon Technologies, Inc.
Inventor: Beiye Liu , Wael Hamza , Liwei Cai , Konstantine Arkoudas , Chengwei Su , Subendhu Rongali
CPC classification number: G10L15/26 , G10L15/1822
Abstract: Techniques for performing spoken language understanding (SLU) processing are described. An SLU component may include an audio encoder configured to perform an audio-to-text processing task and an audio-to-NLU processing task. The SLU component may also include a joint decoder configured to perform the audio-to-text processing task, the audio-to-NLU processing task and a text-to-NLU processing task. Input audio data, representing a spoken input, is processed by the audio encoder and the joint decoder to determine NLU data corresponding to the spoken input.
-
公开(公告)号:US11682400B1
公开(公告)日:2023-06-20
申请号:US17106600
申请日:2020-11-30
Applicant: Amazon Technologies, Inc.
Inventor: Beiye Liu , Wael Hamza , Liwei Cai , Konstantine Arkoudas , Chengwei Su , Subendhu Rongali
CPC classification number: G10L15/26 , G10L15/1822
Abstract: Techniques for performing spoken language understanding (SLU) processing are described. An SLU component may include an audio encoder configured to perform an audio-to-text processing task and an audio-to-NLU processing task. The SLU component may also include a joint decoder configured to perform the audio-to-text processing task, the audio-to-NLU processing task and a text-to-NLU processing task. Input audio data, representing a spoken input, is processed by the audio encoder and the joint decoder to determine NLU data corresponding to the spoken input.
-
-
-