Abstract:
A device performs a method for using image data to aid voice recognition. The method includes the device capturing image data of a vicinity of the device and adjusting, based on the image data, a set of parameters for voice recognition performed by the device. The set of parameters for the device performing voice recognition include, but are not limited to: a trigger threshold of a trigger for voice recognition; a set of beamforming parameters; a database for voice recognition; and/or an algorithm for voice recognition, wherein the algorithm can include using noise suppression or using acoustic beamforming.
Abstract:
A method and apparatus for adjusting a trigger parameter related to voice recognition processing includes receiving into the device an acoustic signal comprising a speech signal, which is provided to a voice recognition module, and comprising noise. The method further includes determining a noise profile for the acoustic signal, wherein the noise profile identifies a noise level for the noise and identifies a noise type for the noise based on a frequency spectrum for the noise, and adjusting the voice recognition module based on the noise profile by adjusting a trigger parameter related to voice recognition processing.
Abstract:
A method is performed in an electronic device for orienting an audio beam generated by the device. The method includes: detecting that the device is coupled to a docking station; and determining a geographic location of the device. The method further includes determining, based on the geographic location of the device, an orientation for an audio beam relative from the docking station; and generating the audio beam by the device.
Abstract:
A method and apparatus provide two-dimensional to three-dimensional image conversion. The apparatus can include an input configured to receive a first image. The apparatus can include a controller configured to segment the first image into a plurality of regions, configured to perform a Fast Fourier Transform on at least one of the regions, and configured to determine a relative horizontal displacement distance between a first frame and a second frame of at least one region based on performing the Fast Fourier Transform.