摘要:
Techniques for generating an auditory memory for an auditory event are described. An example technique includes obtaining a first audio content associated with an event in an environment. At least one attribute of the environment is determined, based on evaluating the first audio content. At least one emotional attribute associated with the event in the environment is determined, based on evaluating the first audio content. A second audio content is determined, based at least in part on the at least one attribute of the environment. A third audio content is determined, based at least in part on the at least one emotional attribute. An auditory memory including fourth audio content associated with the event in the environment is generated, based on the first audio content, the second audio content, and the third audio content.
摘要:
Audio recognition methods and systems are disclosed. An audio recognition method comprises: performing diffusion processing on a plurality of first feature points in a spectrogram of a to-be-recognized audio file to obtain a feature point map (S110); searching in a spectrogram of a target audio file to determine whether second feature points that respectively correspond to the diffused first feature points in the feature point map (S120) exist; and if yes, determining that the spectrogram of the to-be-recognized audio file is a part of the target audio file (S130). The method can improve the matching success rate of feature points in audio recognition.
摘要:
Systems and methods for the matching of datasets, such as input audio segments, with known datasets in a database are disclosed. In an illustrative embodiment, the use of the presently disclosed systems and methods is described in conjunction with recognizing known network message recordings encountered during an outbound telephone call. The methodologies include creation of a ternary fingerprint bitmap to make the comparison process more efficient. Also disclosed are automated methodologies for creating the database of known datasets from a larger collection of datasets.