Abstract:
A compact signal coding method and apparatus including obtaining cumulative probabilities for successive quantized samples of a signal, determine the direct probability for the leading one of the successive quantized samples, and encode the quantized sample based at least in part on the determined direct probability.
Abstract:
An audio-on-demand communication system provides real-time playback of audio data transferred via telephone lines or other communication links. One or more audio servers include memory banks which store compressed audio data. At the request of a user at a subscriber PC, an audio server transmits the compressed audio data over the communication link to the subscriber PC. The subscriber PC receives and decompresses the transmitted audio data in less than real-time using only the processing power of the CPU within the subscriber PC. According to one aspect of the present invention, high quality audio data compressed according to lossless compression techniques is transmitted together with normal quality audio data. According to another aspect of the present invention, metadata, or extra data, such as text, captions, still images, etc., is transmitted with audio data and is simultaneously displayed with corresponding audio data. The audio-on-demand system also provides a table of contents indicating significant divisions in the audio clip to be played and allows the user immediate access to audio data at the listed divisions. According to a further aspect of the present invention, servers and subscriber PCs are dynamically allocated based upon geographic location to provide the highest possible quality in the communication link.
Abstract:
A method is provided for performing a fast Discrete Cosine Transform (DCT) and a fast Inverse Discrete Cosine Transform (IDCT) in a software implementation. The method provided exploits symmetries found in both the DCT and IDCT. As a result of the symmetries found in the DCT and IDCT, both transforms may be performed using a combination of look-up tables and butterfly operations, thus employing only a small number of additions and subtractions and no multiplications. Furthermore, there is provided an aspect of the present invention which exploits the excess precision available in current central processing units (CPUs) relative to the precision required by the DCT and IDCT calculations.
Abstract:
Techniques are described herein for receiving and analyzing messages originating from one sender for distribution to a recipient. A plurality of messages are received from a sender, each of the plurality of messages includes metadata and content. A sender profile is generated for the sender based on an analysis of the metadata of each of the plurality of messages. Each respective message of the plurality of messages is classified as one of a plurality of categories based on a deep learning network analysis of the content of each respective message. A sender fingerprint is generated based on a machine learning analysis of the content of each respective message. A probability that the sender is a spammer is determined based on the sender profile, the message classifications, and the sender fingerprint. The sender is tagged based on the determined probability.
Abstract:
Embodiments are directed towards metadata service for video enhancement. An example method includes receiving a request for metadata applicable to instruct a display device to transform low dynamic range (LDR) video content to high dynamic range (HDR) video content, determining availability of candidate metadata, and responsive to determining that no candidate metadata is available, providing options for generating the metadata applicable to instruct the display device. The example method can also include causing generation of the metadata based at least one of the options, and providing the generated metadata to the display device.
Abstract:
Technologies are provided for assessing liveness of a subject presented to a set of one or more cameras. A first image and a second image depicting a face are captured during a first time period, and a third image and a fourth image depicting the face are captured during a second time period. A first variance in facial appearance is detected between the first depiction of the face and the third depiction of the face, and a second variance in facial appearance is detected between the second depiction of the face and the fourth depiction of the face. A liveliness of a person is verified based on a determination that the first variance in facial appearance corresponds in time to the second variance in facial appearance.
Abstract:
Provided herein are systems and methods for encoding an unencoded video frame of a sequence of video frames using a condensed coding header format. After a frame is divided into coding blocks and the visual data encoded, the coding block header data for each coding block is combined and encoded. A flag is set in the frame's frame header indicating whether the condensed or non-condensed coding block header format is being used.
Abstract:
A block processing procedure to determine coding-block or transform-block sizes for encoded and decoding video data. An encoder obtains unencoded video data and selects a region-of-interest in the video data. Image characteristics of the video data are determined from the region-of-interest and a block size is determined from those image characteristics. The video data is encoded using the block size without storing the block size in a header associated with the encoded data. A decoder obtains the encoded data and selects a second region-of-interest in the encoded data that corresponds to the unencoded data region-of-interest. Second image characteristics are determined from the second region-of-interest, where the second characteristics are the same as the characteristics determined for the unencoded data. A block size is determined from the second image characteristics without using a header associated with the encoded data. The encoded data is decoded based on the determined block size.
Abstract:
Summarization segments of an encoded video can be efficiently identified, without the need to decode the encoded video to obtain image data, by analyzing encoded-buffer-size deltas, each indicating an encoded-buffer-size difference between a pair of intra-coded frames of an encoded video.