-
Publication No.: US11501102B2
Publication date: 2022-11-15
Application No.: US16691205
Filing date: 2019-11-21
Applicant: Adobe Inc.
Inventor: Justin Salamon, Yu Wang, Nicholas J. Bryan
Abstract: Certain embodiments involve techniques for automatically identifying sounds in an audio recording that match a selected sound. An audio search and editing system receives the audio recording and preprocesses it into audio portions. The audio portions are provided as a query to a neural network that includes a trained embedding model, which analyzes the audio portions in view of the selected sound to estimate feature vectors. The audio search and editing system compares the feature vector for each audio portion against the feature vector for the selected sound and the feature vector for the negative samples to generate an audio score, a numerical representation of the level of similarity between the audio portion and the selected sound. The system then uses the audio scores to classify the audio portions into a first class of matching sounds and a second class of non-matching sounds.
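The scoring and classification step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which similarity measure is used, how the positive and negative comparisons are combined, or what decision threshold applies, so everything below is an assumption made for illustration: cosine similarity, a score equal to the similarity to the selected sound minus the highest similarity to any negative sample, and a threshold of 0. The function names (`cosine`, `audio_score`, `classify`) are hypothetical, and the feature vectors are assumed to have already been produced by the embedding model.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def audio_score(portion, selected, negatives):
    """Assumed scoring rule: similarity to the selected sound minus the
    highest similarity to any negative sample."""
    positive = cosine(portion, selected)
    negative = max((cosine(portion, n) for n in negatives), default=0.0)
    return positive - negative


def classify(portions, selected, negatives, threshold=0.0):
    """Split portion indices into (matching, non_matching) by score.
    The threshold value is an illustrative assumption."""
    matching, non_matching = [], []
    for index, vector in enumerate(portions):
        if audio_score(vector, selected, negatives) > threshold:
            matching.append(index)
        else:
            non_matching.append(index)
    return matching, non_matching


# Toy 2-D feature vectors standing in for embedding-model output.
selected_vec = [1.0, 0.0]
negative_vecs = [[0.0, 1.0]]
portion_vecs = [[0.9, 0.1], [0.1, 0.9], [1.0, 0.0]]
matching, non_matching = classify(portion_vecs, selected_vec, negative_vecs)
```

In this toy setup, portions close to the selected sound's vector land in the first class and the portion close to the negative sample lands in the second. A real system would likely tune the threshold or learn the decision rule rather than fix it at 0.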
-
Publication No.: US20210158086A1
Publication date: 2021-05-27
Application No.: US16691205
Filing date: 2019-11-21
Applicant: Adobe Inc.
Inventor: Justin Salamon, Yu Wang, Nicholas J. Bryan
-