-
公开(公告)号:US10460722B1
公开(公告)日:2019-10-29
申请号:US15639175
申请日:2017-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Ming Sun , David Snyder , Yixin Gao , Nikko Strom , Spyros Matsoukas , Shiv Naga Prasad Vitaladevuni
Abstract: A method for selective transmission of audio data to a speech processing server uses detection of an acoustic trigger in the audio data in determining the data to transmit. Detection of the acoustic trigger makes use of an efficient computation approach that reduces the amount of run-time computation required, or equivalently improves accuracy for a given amount of computation, by combining a “time delay” structure in which intermediate results of computations are reused at various time delays, thereby avoiding computation of computing new results, and decomposition of certain transformations to require fewer arithmetic operations without sacrificing significant performance. For a given amount of computation capacity the combination of these two techniques provides improved accuracy as compared to current approaches.
-
公开(公告)号:US09600764B1
公开(公告)日:2017-03-21
申请号:US14307412
申请日:2014-06-17
Applicant: Amazon Technologies, Inc.
Inventor: Ariya Rastrow , Spyros Matsoukas , Sri Venkata Surya Siva Rama Krishna Garimella , Nikko Ström , Bjorn Hoffmeister
CPC classification number: G06N3/08 , G06N3/0445 , G06N3/049
Abstract: Features are disclosed for using a neural network to tag sequential input without using an internal representation of the neural network generated when scoring previous positions in the sequence. A predicted or determined label (e.g., the highest scoring or otherwise most probable label) for input at a given position in the sequence can be used when scoring input corresponding to the next position the sequence. Additional features are disclosed for training a neural network for use in tagging sequential input without using an internal representation of the neural network generated when scoring previous positions the sequence.
-