-
公开(公告)号:US09728188B1
公开(公告)日:2017-08-08
申请号:US15195587
申请日:2016-06-28
Applicant: Amazon Technologies, Inc.
Inventor: Alexander David Rosen , Michael James Rodehorst , George Jay Tucker , Aaron Lee Mathers Challenner
CPC classification number: G10L15/22 , G10L19/08 , G10L25/18 , G10L25/51 , G10L2015/223
Abstract: Systems and methods for detecting similar audio being received by separate voice activated electronic devices, and ignoring those commands, is described herein. In some embodiments, a voice activated electronic device may be activated by a wakeword that is output by the additional electronic device, such as a television or radio, may capture audio of sound subsequently following the wakeword, and may send audio data representing the sound to a backend system. Upon receipt, the backend system may, in parallel to performing automated speech recognition processing to the audio data, generate a sound profile of the audio data, and may compare that sound profile to sound profiles of recently received audio data and/or flagged sound profiles. If the generated sound profile is determined to match another sound profiles, then the automated speech recognition processing may be stopped, and the voice activated electronic device may be instructed to return to a keyword spotting mode. If the matching sound profile is not already stored in a database of known sound profiles, it can be stored for future comparisons.
-
公开(公告)号:US10074364B1
公开(公告)日:2018-09-11
申请号:US15085772
申请日:2016-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Colin Wills Wightman , Naresh Narayanan , Alexander David Rosen , Michael James Rodehorst , Daniel Robert Rashid
CPC classification number: G10L15/20 , G06F17/2775 , G10L15/10 , G10L15/26 , G10L15/265 , G10L17/04 , G10L25/51 , G10L2015/223
Abstract: Systems and methods for generating sound profiles of artificial commands detected by multiple voice activated electronic devices is described herein. In some embodiments, numerous voice activated electronic devices may send audio data representing a phrase to a backend system at a substantially same time. Text data representing the phrase, and counts for instances of that text data, may be generated. If the number of counts exceeds a predefined threshold, the backend system may cause any remaining response generation functionality that particular command that is in excess of the predefined threshold to be stopped, and those devices returned to a sleep state. In some embodiments, a sound profile unique to the phrase that caused the excess of the predefined threshold may be generated such that future instances of the same phrase may be recognized prior to text data being generated, conserving the backend system's resources.
-