Abstract:
Systems and methods for detecting motion in compressed video are provided. Some methods can include parsing a stream of compressed video, obtaining macroblock size information from the parsed stream, computing factors derived from the macroblock size information, computing adaptive threshold values derived from relative frame characteristics of the compressed video, comparing the factors derived from the macroblock size information with the adaptive threshold values, and detecting motion based upon the comparing when at least one of the factors exceeds at least one of the adaptive threshold values. In some embodiments, detecting the motion can include performing spatio-temporal filtering on macroblocks in which the motion is detected or performing spatio-temporal filtering on at least one non-motion macroblock.
Abstract:
A system includes a speech recognition processor, a depth sensor coupled to the speech recognition processor, and an array of microphones coupled to the speech recognition processor. The depth sensor is operable to calculate a distance and a direction from the array of microphones to a source of audio data. The speech recognition processor is operable to select an acoustic model as a function of the distance and the direction from the array of microphones to the source of audio data. The speech recognition processor is operable to apply the distance measure in the microphone array beam formation so as to boost portions of the signals originating from the source of audio data and to suppress portions of the signals resulting from noise.
Abstract:
A method and apparatus wherein the method includes the steps of parsing a stream of compressed video, obtaining macroblock size information from the parsed stream, computing factors derived from the macroblock size, wherein the factors include a normalized bit size, a bit size ratio and a neighbor score, computing corresponding adaptive threshold values derived from the relative frame characteristics of the compressed video, comparing the factors derived from the macroblock size information with the corresponding adaptive threshold values and detecting motion based upon combinations of the comparisons when the factors exceed the threshold value.
Abstract:
Systems and methods for detecting motion in compressed video are provided. Some methods can include parsing a stream of compressed video, obtaining macroblock size information from the parsed stream, computing factors derived from the macroblock size information, computing adaptive threshold values derived from relative frame characteristics of the compressed video, comparing the factors derived from the macroblock size information with the adaptive threshold values, and detecting motion based upon the comparing when at least one of the factors exceeds at least one of the adaptive threshold values. In some embodiments, detecting the motion can include performing spatio-temporal filtering on macroblocks in which the motion is detected or performing spatio-temporal filtering on at least one non-motion macroblock.
Abstract:
A method and apparatus wherein the method includes the steps of parsing a stream of compressed video, obtaining macroblock size information from the parsed stream, computing factors derived from the macroblock size, wherein the factors include a normalized bit size, a bit size ratio and a neighbor score, computing corresponding adaptive threshold values derived from the relative frame characteristics of the compressed video, comparing the factors derived from the macroblock size information with the corresponding adaptive threshold values and detecting motion based upon combinations of the comparisons when the factors exceed the threshold value.
Abstract:
A system includes a speech recognition processor, a depth sensor coupled to the speech recognition processor, and an array of microphones coupled to the speech recognition processor. The depth sensor is operable to calculate a distance and a direction from the array of microphones to a source of audio data. The speech recognition processor is operable to select an acoustic model as a function of the distance and the direction from the array of microphones to the source of audio data. The speech recognition processor is operable to apply the distance measure in the microphone array beam formation so as to boost portions of the signals originating from the source of audio data and to suppress portions of the signals resulting from noise.
Abstract:
A method and apparatus wherein the method includes the steps of parsing a stream of compressed video, obtaining macroblock size information from the parsed stream, computing factors derived from the macroblock size, wherein the factors include a normalized bit size, a bit size ratio and a neighbor score, computing corresponding adaptive threshold values derived from the relative frame characteristics of the compressed video, comparing the factors derived from the macroblock size information with the corresponding adaptive threshold values and detecting motion based upon combinations of the comparisons when the factors exceed the threshold value.