Abstract:
Voice controlled multimedia content creation techniques are discussed in which a multimedia package is created and shared to a specified destination responsive to voice commands. The voice commands can be received by a device as a single stream (e.g., a single phrase) that causes automatic performance of a sharing sequence or as a series of multiple voice commands that are input in response to prompts for voice input as part of the sharing sequence. The voice commands can be recognized and handled by a content creation system of the device to select a clip for tagging of content (such as captured audio or video). The selected clip is then combined with the content to create the multimedia package. Voice commands can also be employed to specify a destination for sharing of the content, such as one or more contacts or a particular sharing site.
Abstract:
Configuring an adaptive microphone array to gather signals from a main lobe of the array, and configuring the array to reduce side interference gathered from sources that are not situated within the main lobe. A memory stores test signals gathered by the array at a plurality of predetermined angular bearings with reference to the array in an anechoic chamber. Signals gathered in real time are processed to provide a preliminary output and preliminary weights. The test signals are retrieved from memory. The preliminary weights are applied to the test signals to provide null steering weights. The null steering weights and the preliminary output are processed to reduce or minimize the amplitude response of the array at the angular orientation.
Abstract:
Systems and methods of providing improved directional noise suppression in an electronic device implement a technique that specifies a direction or speaker of interest, determines the directions corresponding to speakers not lying in the direction of interest, beam forms the reception pattern of the device microphone array to focus in the direction of interest and suppresses signals from the other directions, creating beam formed reception data. A spatial mask is generated as a function of direction relative to the direction of interest. The spatial mask emphasizes audio reception in the direction of interest and attenuates audio reception in the other directions. The beam formed reception data is then multiplied by the spatial mask to generate an audio signal with directional noise suppression.
Abstract:
A method and apparatus increase audio output of a device. A mode of audio output operation of a device can be determined. Audio output can operate in a determined first mode of audio output operation that powers at least one first speaker at a first bandwidth and at a first output level below a first output level threshold. The audio output can operate in a determined second mode of audio output operation. The second mode of audio output operation can power the at least one first speaker at a high pass filtered second bandwidth that filters out low frequencies of the first bandwidth. The second mode of audio output operation can power the at least one first speaker at a second output level below a second output level threshold. The second output level threshold can be higher than the first output level threshold. The second output level can exceed the first output level threshold at least once. The second mode of audio output operation can power at least one second speaker at a low pass filtered second speaker bandwidth that includes at least the low frequencies of the first bandwidth.
Abstract:
A method and apparatus increase audio output of a device. A mode of audio output operation of a device can be determined. Audio output can operate in a determined first mode of audio output operation that powers at least one first speaker at a first bandwidth and at a first output level below a first output level threshold. The audio output can operate in a determined second mode of audio output operation. The second mode of audio output operation can power the at least one first speaker at a high pass filtered second bandwidth that filters out low frequencies of the first bandwidth. The second mode of audio output operation can power the at least one first speaker at a second output level below a second output level threshold. The second output level threshold can be higher than the first output level threshold. The second output level can exceed the first output level threshold at least once. The second mode of audio output operation can power at least one second speaker at a low pass filtered second speaker bandwidth that includes at least the low frequencies of the first bandwidth.
Abstract:
Voice controlled multimedia content creation techniques are discussed in which a multimedia package is created and shared to a specified destination responsive to voice commands. The voice commands can be received by a device as a single stream (e.g., a single phrase) that causes automatic performance of a sharing sequence or as a series of multiple voice commands that are input in response to prompts for voice input as part of the sharing sequence. The voice commands can be recognized and handled by a content creation system of the device to select a clip for tagging of content (such as captured audio or video). The selected clip is then combined with the content to create the multimedia package. Voice commands can also be employed to specify a destination for sharing of the content, such as one or more contacts or a particular sharing site.
Abstract:
Systems and methods for voice recognition determine energy levels for speech and noise and generate adaptive thresholds based on the determined energy levels. The adaptive thresholds are applied to determine the presence of speech and to generate noise-dependent triggers for indicating the presence of speech during high-noise conditions. In an embodiment, the signal energy is averaged in the presence of speech and in the presence of background noise. Audio energy calculations may be made by averaging via a sliding window or via a memory filter.
Abstract:
Systems and methods of providing improved directional noise suppression in an electronic device implement a technique that specifies a direction or speaker of interest, determines the directions corresponding to speakers not lying in the direction of interest, beam forms the reception pattern of the device microphone array to focus in the direction of interest and suppresses signals from the other directions, creating beam formed reception data. A spatial mask is generated as a function of direction relative to the direction of interest. The spatial mask emphasizes audio reception in the direction of interest and attenuates audio reception in the other directions. The beam formed reception data is then multiplied by the spatial mask to generate an audio signal with directional noise suppression.
Abstract:
A method and electronic device that enables quick presentation of user briefs on the electronic device includes: receiving, at the electronic device, an informational brief (IB) request (IBR) input that includes an identifier of a topic and a trigger that causes the electronic device to open an IB content screen that temporarily presents specific information corresponding to the topic. In response to receipt of the IB input, the method further includes: retrieving the IB content screen, with content that includes at least one of the specific information; presenting the IB content screen on the electronic device; monitoring an elapsed presentation time for the IB content screen on the electronic device; comparing the elapsed presentation time against a time limit allocated for presenting the IB content screen; and in response to expiration of the time limit, closing the IB content screen to return the device to its previous operating state.
Abstract:
A method, a system, and a computer program product for preventing initiation of a voice recognition session. The method includes monitoring at least one audio output channel for at least one audio trigger phrase that initiates a voice recognition session. The method further includes in response to detecting the at least one audio trigger phrase on the at least one audio output channel, setting a logic state of at least one output trigger detector of the at least one audio output channel to a first state. The method further includes gating a logic state of at least one input trigger detector of at least one audio input channel to the first state for a time period and preventing initiation of a voice recognition session by the at least one audio trigger phrase on the at least one audio input channel while the logic state is the first state.