Abstract:
Pull frame interpolation may include identifying a plurality of input video frames, generating a plurality of motion vectors indicating motion from a first frame of the plurality of input video frames to a second frame of the plurality of input video frames, identifying an interpolation point between the first frame and the second frame, generating a plurality of candidate interpolation motion vectors indicating motion from the first frame to the interpolation point and from the second frame to the interpolation point based on the plurality of motion vectors, selecting an interpolation motion vector from the plurality of candidate interpolation motion vectors based on a metric, and generating an interpolated frame at the interpolation point based on the selected interpolation motion vector.
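The selection step described above can be illustrated with a minimal sketch, assuming 1-D frames stored as Python lists, a short list of candidate displacements, and the absolute difference between the two pulled samples as the metric. The function name `pull_interpolate` and all of these choices are hypothetical; the abstract does not specify the metric or the data layout.

```python
def pull_interpolate(frame1, frame2, candidates, t):
    """Build a 1-D frame at fraction t (0 < t < 1) between frame1 and frame2.

    candidates: displacements (in samples) from frame1 to frame2.
    Assumption: the selection metric is |sample1 - sample2|.
    """
    n = len(frame1)
    out = []
    for x in range(n):
        best = None  # (cost, blended value)
        for v in candidates:
            # Each output position "pulls" one sample from each input frame.
            x1 = round(x - t * v)        # source position in frame1
            x2 = round(x + (1 - t) * v)  # source position in frame2
            if not (0 <= x1 < n and 0 <= x2 < n):
                continue
            a, b = frame1[x1], frame2[x2]
            cost = abs(a - b)
            if best is None or cost < best[0]:
                best = (cost, (1 - t) * a + t * b)
        if best is None:
            # No candidate stays in bounds: fall back to a plain blend.
            best = (0, (1 - t) * frame1[x] + t * frame2[x])
        out.append(best[1])
    return out
```

Because every output sample is computed by looking back into the inputs, the interpolated frame has no holes, which is presumably the appeal of a pull-based formulation.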
Abstract:
Implementations generally relate to enhancing content appearance. In some implementations, a method includes receiving an image and selecting a reference object in the image. The method also includes determining one or more image parameter adjustments based on the selected reference object, and applying the one or more image parameter adjustments to the entire image.
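As a rough illustration of deriving an adjustment from a reference object and applying it to the whole image, the sketch below assumes a grayscale image stored as a list of rows, a reference object given as a bounding box, and a single brightness gain as the only adjusted parameter. The name `adjust_from_reference`, the box format, and the target mean are hypothetical and not taken from the abstract.

```python
def adjust_from_reference(image, box, target_mean):
    """Scale the whole image so the reference region reaches target_mean.

    image: list of rows of 0-255 intensities.
    box: (top, left, bottom, right), bottom/right exclusive (assumed format).
    """
    top, left, bottom, right = box
    region = [image[r][c] for r in range(top, bottom) for c in range(left, right)]
    mean = sum(region) / len(region)
    # The adjustment is determined from the reference region only...
    gain = target_mean / mean if mean else 1.0
    # ...but applied to every pixel in the image, clipped to the valid range.
    return [[min(255, round(p * gain)) for p in row] for row in image]
```

For example, if the reference region averages 50 and the target is 100, every pixel in the image is doubled and then clipped at 255.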
Abstract:
Provided are methods for synchronizing multiple video recordings of the same event using audio signals of the recordings. An algorithm is designed to generate candidate time offsets for pairwise matches of video recordings based on audio signals corresponding to the recordings, and combine the candidate time offsets to obtain the most accurate actual time offset between each pair of recordings. Each candidate time offset for each pairwise match of recordings is assigned a confidence measure indicating how likely it is that the particular time offset is correct for the given pair of recordings. The confidence measure assigned to a particular time offset is based on various components and characteristics of the audio signal pair corresponding to the video recordings.
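One way to picture candidate offsets with confidence measures is the sketch below, which assumes short audio signals as lists of samples and uses the normalized cross-correlation score at each lag as a stand-in confidence. This is only an illustration: the abstract does not say how candidates or confidences are computed, the name `candidate_offsets` is hypothetical, and the step that combines candidates across pairs is not covered.

```python
import math

def candidate_offsets(a, b, max_lag, top_k=3):
    """Return up to top_k (lag, confidence) pairs, best first.

    A lag of k means b[i + k] lines up with a[i].
    Assumption: confidence is the normalized cross-correlation at that lag.
    """
    scored = []
    for lag in range(-max_lag, max_lag + 1):
        pairs = [(a[i], b[i + lag]) for i in range(len(a)) if 0 <= i + lag < len(b)]
        if not pairs:
            continue
        num = sum(x * y for x, y in pairs)
        den = math.sqrt(sum(x * x for x, _ in pairs) * sum(y * y for _, y in pairs))
        scored.append((lag, num / den if den else 0.0))
    scored.sort(key=lambda s: s[1], reverse=True)
    return scored[:top_k]
```

Keeping several ranked candidates per pair, rather than only the best one, leaves room for a later stage to reject an offset that is inconsistent with the other pairs.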
Abstract:
Described herein are techniques related to noise reduction for image sequences or videos. A noise reduction tool includes a motion estimator configured to estimate motion in the video, a noise spectrum estimator configured to estimate noise in the video, a shot detector configured to trigger the noise estimation process, a noise spectrum validator configured to validate the estimated noise spectrum, and a noise reducer configured to reduce noise in the video using the estimated noise spectrum.
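The role of the shot detector as a trigger can be sketched as follows, assuming frames are flat lists of pixel intensities and that a shot boundary is declared when the mean absolute difference between consecutive frames exceeds a threshold. The detection rule, the threshold, and the name `shot_starts` are assumptions for illustration; the abstract does not describe how shots are detected.

```python
def shot_starts(frames, threshold):
    """Return indices of frames that start a new shot.

    Assumption: a shot boundary is a mean absolute frame difference
    above threshold. Noise estimation would be re-run at each index.
    """
    if not frames:
        return []
    starts = [0]  # the first frame always opens a shot
    for i in range(1, len(frames)):
        prev, cur = frames[i - 1], frames[i]
        diff = sum(abs(p - q) for p, q in zip(cur, prev)) / len(cur)
        if diff > threshold:
            starts.append(i)
    return starts
```

Each returned index marks a point where a noise spectrum estimate from the previous shot may no longer be valid and a fresh estimate would be requested.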
Abstract:
Implementations disclose bitrate optimization for multi-representation encoding using playback statistics. A method includes generating multiple versions of a segment of a source video, the versions comprising encodings of the segment at different encoding bitrates for each resolution of the segment, measuring a quality metric for each version of the segment, generating rate-quality models for each resolution of the segment based on the measured quality metrics corresponding to the resolutions, generating a probability model to predict requesting probabilities that representations of the segment are requested, the probability model based on a joint probability distribution of network speed and viewport size that is generated from client-side feedback statistics associated with prior playbacks of other videos, determining an encoding bitrate for each of the representations of the segment based on the rate-quality models and the probability model, and assigning determined encoding bitrates to corresponding representations of the segment.
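The final selection step can be illustrated with a small brute-force sketch. It assumes a logarithmic rate-quality model Q(b) = a * ln(b) + c per representation, a fixed request probability per representation, and a cap on the expected (probability-weighted) bitrate as the constraint. The model form, the constraint, and the name `choose_bitrates` are all assumptions; the abstract only says that bitrates are determined from the rate-quality models and the probability model.

```python
import itertools
import math

def choose_bitrates(models, probs, candidates, budget):
    """Pick one bitrate per representation to maximize expected quality.

    models: {rep: (a, c)} for the assumed model Q(b) = a * ln(b) + c.
    probs: {rep: probability that the representation is requested}.
    candidates: bitrates to try for every representation.
    budget: assumed upper bound on the probability-weighted bitrate.
    """
    reps = sorted(models)
    best = None  # (expected quality, assignment)
    for combo in itertools.product(candidates, repeat=len(reps)):
        expected_rate = sum(probs[r] * b for r, b in zip(reps, combo))
        if expected_rate > budget:
            continue
        quality = sum(
            probs[r] * (models[r][0] * math.log(b) + models[r][1])
            for r, b in zip(reps, combo)
        )
        if best is None or quality > best[0]:
            best = (quality, dict(zip(reps, combo)))
    return best[1] if best else None
```

Exhaustive search is only practical for a handful of representations and candidate bitrates; it is used here to keep the objective and the constraint easy to read.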
Abstract:
An interactive multi-view module identifies a plurality of media items associated with a real-world event, each of the plurality of media items comprising a video portion and an audio portion. The interactive multi-view module synchronizes the audio portions of the plurality of media items according to a common reference timeline, determines a relative geographic position associated with each of the plurality of media items, and presents the plurality of media items in an interactive multi-view player interface based at least on the synchronized audio portions and the relative geographic positions.