Abstract:
A system and method disposed to enable encoding, decoding and manipulation of digital video with substantially less processing load than would otherwise required. In particular, one disclosed method is directed to generating a compressed video data structure that is selectively decodable to a plurality of resolutions including the full resolution of the uncompressed stream. The desired number of data components and the content of the data components that make up the compressed video data, which determine the available video resolutions, are variable based upon the processing carried out and the resources available to decode and process the data components. During decoding, efficiency is substantially improved because only the data components necessary to generate a desired resolution are decoded. In variations, both temporal and spatial decoding are utilized to reduce frame rates, and hence, further reduce processor load. The system and method are particularly useful for real-time video editing applications.
Abstract:
Video and corresponding metadata is accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. In one example, best scenes are identified based on the motion values associated with frames or portions of a frame of a video. Motion values are determined for each frame and portions of the video including frames with the most motion are identified as best scenes. Best scenes may also be identified based on the motion profile of a video. The motion profile of a video is a measure of global or local motion within frames throughout the video. For example, best scenes are identified from portion of the video including steady global motion. A video summary can be generated including one or more of the identified best scenes.
Abstract:
An image sensor of a camera system captures an image over an image capture interval of time, and waits a blanking interval of time before capturing an additional image. The captured image is provided to a frame controller, and is buffered until an image signal processor accesses the captured image. The image signal processor processes the accessed image over an image processing interval of time, producing a processed image. The image processing interval of time is selected to be greater than the image capture interval of time, but less than the sum of the image capture interval of time and the blanking interval of time. By reducing the image capture interval of time but maintaining an image processing interval of time, rolling shutter artifacts are beneficially reduced without increasing the processing resources or power required by the image signal processor to process the image.
Abstract:
A pair of cameras having an overlapping field of view is aligned based on images captured by image sensors of the pair of cameras. A pixel shift is identified between the images. Based on the identified pixel shift, a calibration is applied to one or both of the pair of cameras. To determine the pixel shift, the camera applies correlation methods including edge matching. Calibrating the pair of cameras may include adjusting a read window on an image sensor. The pixel shift can also be used to determine a time lag, which can be used to synchronize subsequent image captures.
Abstract:
An image sensor compresses image data prior to transmitting the image data to a DSP. The image sensor captures light representing an image, for instance via a camera's aperture. A focal plane array converts the captured light into pixel data. The pixel data is sorted into categories, and is compressed in parallel by a compression engine. The compressed pixel data is then sent to a DSP, which may be located off-chip. The DSP then decompresses the compressed pixel data, performs image signal processing operations on the compressed pixel data, and then compresses the processed pixel data into a digital image format. The image sensor may buffer the pixel data for one or more images to accommodate for slowdown by the compression engine. The pixel data may be sorted by row and column of a pixel array. Alternatively, the pixel data may be sorted by color from a Bayer color filter.
Abstract:
Methods and apparatus for processing of video content to optimize codec bandwidth. In one embodiment, the method includes capturing panoramic imaging content (e.g., a 360° panorama), mapping the panoramic imaging content into an equi-angular cubemap (EAC) format, and splitting the EAC format into segments for transmission to maximize codec bandwidth. In one exemplary embodiment, the EAC segments are transmitted at a different frame rate than the subsequent display rate of the panoramic imaging content. For example, the mapping and frame rate may be chosen to enable the rendering of 8K, 360° content at 24 fps, using commodity encoder hardware and software that nominally supports 4K content at 60 fps.
Abstract:
Methods and apparatus for seamlessly transitioning between lens projections. In one exemplary embodiment, a piecewise lens projection is composed of three (3) functions: (i) a first polynomial-based lens projection, (ii) a second “joining” lens projection, and (iii) a trigonometric lens projection. The piecewise lens projection characterizes virtualized lens distortion as a function of FOV; image data can be dynamically projected based on the virtualized lens distortion, regardless of FOV. In this manner, a user may achieve the visually familiar effects associated with a first lens definition for a first FOV, while still smoothly animating transitions to other lens projections (e.g., a larger FOV using stereographic projections).
Abstract:
Methods and apparatus for metadata-based cinematography, production effects, shot selection, and/or other content augmentation. Effective cinematography conveys storyline, emotion, excitement, etc. Unfortunately, most amateur filmmakers lack the knowledge and ability to create cinema quality media. Various aspects of the present disclosure are directed to, among other things, rendering media based on instantaneous metadata. Unlike traditional post-processing techniques that rely on human subjectivity, some of the various techniques described herein leverage the camera's actual experiential data to enable cinema-quality post-processing for the general consuming public. Instantaneous metadata-based cinematography and shot selection advisories and architectures are also described.
Abstract:
Apparatus and methods for applying motion blur to overcapture content. In one embodiment, the motion blur is applied by selecting a number of frames of the captured image content for application of motion blur; selecting a plurality of pixel locations within the number of frames of the captured image content for the application of motion blur; applying motion blur to the captured image content in accordance with the selected number of frames and the selected plurality of pixel locations; and outputting the captured image content with the applied motion blur. In some implementations, motion blur is applied via implementation of a virtualized neutral density filter. Computerized devices and computer-readable apparatus for the application of motion blur are also disclosed.
Abstract:
In a video capture system, a virtual lens is simulated when applying a crop or zoom effect to an input video. An input video frame is received from the input video that has a first field of view and an input lens distortion caused by a lens used to capture the input video frame. A selection of a sub-frame representing a portion of the input video frame is obtained that has a second field of view smaller than the first field of view. The sub-frame is processed to remap the input lens distortion to a desired lens distortion in the sub-frame. The processed sub-frame is the outputted.