Abstract:
According to one embodiment, audio and non-audio data can be represented as sound sources in a three-dimensional sound space adapted to also provide visual data. Non-audio data can be associated with audio sound sources presented in the sound space. Navigation within this combined three-dimensional audio/visual space can be based primarily on the audio aspects of the sound sources with the details of the non-audio data being presented on demand, for example, when the listener navigates through the combined three-dimensional audio/visual space to a particular sound source at which point the non-audio data associated with that sound source can be presented.
Abstract:
Embodiments are directed to using a three-dimensional sound space to analyze security surveillance information. According to one embodiment, the three-dimensional sound space can comprise part of a security surveillance system in which sound sources related to security surveillance information can be presented and a user can efficiently navigate even a large number of sound sources in the three-dimensional sound space. Effective audio surveillance relies on the ability of the surveillance personnel to efficiently identify calls that need further analysis and calls that need no further analysis without introducing too many false negative or false positive conditions. Utilization of three-dimensional space described herein can increase the ease with which security analysts review audio content and identify relevant audio content that requires further analysis.
Abstract:
Systems, methods, and computer-readable storage media for generating an immersive three-dimensional sound space for searching audio. The system generates a three-dimensional sound space having a plurality of sound sources playing at a same time, wherein each of the plurality of sound sources is assigned a respective location in the three-dimensional sound space relative to one another, and wherein a user is assigned a current location in the three-dimensional sound space relative to each respective location. Next, the system receives input from the user to navigate to a new location in the three-dimensional sound space. Based on the input, the system then changes each respective location of the plurality of sound sources relative to the new location in the three-dimensional sound space.
Abstract:
Systems, methods, and computer-readable storage media for generating personalized tag recommendations using speech analytics. The system first analyzes an audio stream to identify topics in the audio stream. Next, the system identifies tags related to the topics to yield identified tags. Based on the identified tags, the system then generates a tag recommendation for tagging the audio stream. The system can also send the tag recommendation to a device associated with a user for presentation to the user.
Abstract:
An apparatus and methods are disclosed for enabling a telecommunications terminal to notify its user of the arrival of a message via an acoustic or visual signal whose properties are based on attributes of the message. A network infrastructure element (e.g., a switch, a private branch exchange [PBX], a server, etc.) receives a message directed to a terminal and sets the values of ringtone properties (e.g., tempo, volume, pitch, rhythm, etc.) based on attributes of an incoming message (e.g., the sender, a priority, a subject, the location from which the message was sent, etc.). In a first illustrative embodiment the network infrastructure element sends the message and the instantiated ringtone to the terminal, while in a second illustrative embodiment the network infrastructure element sends the message and the property values to the terminal, and the terminal plays a locally-stored ringtone in accordance with the property values.