Automatic generation of video and directional audio from spherical content

    Publication No.: US10410680B2

    Publication date: 2019-09-10

    Application No.: US16105304

    Filing date: 2018-08-20

    Applicant: GoPro, Inc.

    Abstract: A spherical content capture system captures spherical video and audio content. In one embodiment, captured metadata or video/audio processing is used to identify content relevant to a particular user based on time and location information. The platform can then generate an output video from one or more shared spherical content files relevant to the user. The output video may include a non-spherical reduced field of view such as those commonly associated with conventional camera systems. Particularly, relevant sub-frames having a reduced field of view may be extracted from each frame of spherical video to generate an output video that tracks a particular individual or object of interest. For each sub-frame, a corresponding portion of an audio track is generated that includes a directional audio signal having a directionality based on the selected sub-frame.
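
The link between a selected sub-frame and the direction of its audio can be illustrated with a minimal sketch. It is not the patented method. It assumes an equirectangular frame covering 360 degrees horizontally with azimuth 0 at the middle column, a hypothetical ring of microphones with known azimuths, and a cardioid-style weighting chosen only for illustration. The names `subframe_azimuth`, `mic_weights`, and `directional_sample` are invented for this example.

```python
import math
from typing import List, Sequence


def subframe_azimuth(center_x: float, frame_width: float) -> float:
    """Azimuth in radians of a pixel column in an equirectangular frame.

    Assumes the frame spans 360 degrees horizontally and that the middle
    column corresponds to azimuth 0.
    """
    return (center_x / frame_width) * 2.0 * math.pi - math.pi


def mic_weights(azimuth: float, mic_azimuths: Sequence[float]) -> List[float]:
    """Cardioid-style weight per microphone, normalized to sum to 1."""
    raw = [0.5 * (1.0 + math.cos(azimuth - m)) for m in mic_azimuths]
    total = sum(raw)
    if total == 0.0:
        # Degenerate case: every microphone points directly away.
        return [1.0 / len(raw)] * len(raw)
    return [r / total for r in raw]


def directional_sample(samples: Sequence[float], weights: Sequence[float]) -> float:
    """Mix one sample per microphone into a single directional sample."""
    return sum(s * w for s, w in zip(samples, weights))
```

As the sub-frame center moves from frame to frame, recomputing the weights for each frame would steer the mixed audio toward the tracked subject.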

    Scene and activity identification in video summary generation based on motion detected in a video

    Publication No.: US10192585B1

    Publication date: 2019-01-29

    Application No.: US15467804

    Filing date: 2017-03-23

    Applicant: GOPRO, INC.

    Abstract: Video and corresponding metadata are accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. In one example, best scenes are identified based on the motion values associated with frames or portions of a frame of a video. Motion values are determined for each frame, and portions of the video including frames with the most motion are identified as best scenes. Best scenes may also be identified based on the motion profile of a video. The motion profile of a video is a measure of global or local motion within frames throughout the video. For example, best scenes are identified from portions of the video including steady global motion. A video summary can be generated including one or more of the identified best scenes.
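
The "most motion" selection described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: frames are modeled as small grayscale grids, the motion value is taken to be the mean absolute pixel difference from the previous frame, and a best scene is the fixed-length window with the largest summed motion. The names `motion_values` and `best_scene` are hypothetical.

```python
from typing import List, Sequence, Tuple

# Hypothetical frame model: rows of grayscale pixel intensities.
Frame = Sequence[Sequence[int]]


def motion_values(frames: Sequence[Frame]) -> List[float]:
    """Mean absolute pixel difference between each frame and its predecessor."""
    values = [0.0]  # the first frame has no predecessor
    for prev, cur in zip(frames, frames[1:]):
        total = 0
        count = 0
        for row_prev, row_cur in zip(prev, cur):
            for a, b in zip(row_prev, row_cur):
                total += abs(a - b)
                count += 1
        values.append(total / count if count else 0.0)
    return values


def best_scene(values: Sequence[float], window: int) -> Tuple[int, int]:
    """Return (start, end) frame indices, end exclusive, of the window with the most motion."""
    if window <= 0 or window > len(values):
        raise ValueError("window must be between 1 and the number of frames")
    current = sum(values[:window])
    best_sum, best_start = current, 0
    for start in range(1, len(values) - window + 1):
        # Slide the window one frame to the right.
        current += values[start + window - 1] - values[start - 1]
        if current > best_sum:
            best_sum, best_start = current, start
    return best_start, best_start + window
```

A steady-motion variant would score windows by low variance of the motion values rather than by their sum.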

    Apparatus and methods for video image post-processing for segmentation-based interpolation

    Publication No.: US10134114B2

    Publication date: 2018-11-20

    Application No.: US15270971

    Filing date: 2016-09-20

    Applicant: GoPro, Inc.

    Abstract: Apparatus and methods for video image post-processing for segmentation-based interpolation. In one embodiment, a computerized apparatus is utilized in order to obtain a first frame of video data; segment one or more objects within the first frame of video data; obtain a second frame of video data; segment one or more objects within the second frame of video data; match at least a portion of the one or more objects within the first frame of video data with the one or more objects within the second frame of video data; compute the motion of the pixels for the matched portion of the one or more objects; compute the motion of pixels associated with a background image; and generate an interpolated frame of video data, the interpolated frame of video data residing temporally between the first frame of video data and the second frame of video data.
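
The match-then-interpolate sequence in this abstract can be sketched in a heavily simplified form. The sketch assumes segmentation has already reduced each object to a centroid, uses greedy nearest-centroid matching as a stand-in for whatever matching the patent employs, and interpolates object positions linearly. Background motion is omitted. The names `match_objects` and `interpolate_centroids` are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # hypothetical object model: (x, y) centroid


def match_objects(objs_a: Sequence[Point], objs_b: Sequence[Point]) -> List[Tuple[int, int]]:
    """Greedily pair each object in the first frame with its nearest unused object in the second."""
    unused = set(range(len(objs_b)))
    matches = []
    for i, (ax, ay) in enumerate(objs_a):
        if not unused:
            break
        j = min(unused, key=lambda k: (objs_b[k][0] - ax) ** 2 + (objs_b[k][1] - ay) ** 2)
        unused.remove(j)
        matches.append((i, j))
    return matches


def interpolate_centroids(objs_a: Sequence[Point], objs_b: Sequence[Point],
                          matches: Sequence[Tuple[int, int]], t: float = 0.5) -> List[Point]:
    """Position of each matched object at fraction t between the two frames."""
    result = []
    for i, j in matches:
        ax, ay = objs_a[i]
        bx, by = objs_b[j]
        result.append((ax + t * (bx - ax), ay + t * (by - ay)))
    return result
```

With `t = 0.5` the result corresponds to a frame midway in time between the two inputs.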

    Associating audio with three-dimensional objects in videos

    Publication No.: US10045120B2

    Publication date: 2018-08-07

    Application No.: US15187695

    Filing date: 2016-06-20

    Applicant: GoPro, Inc.

    Abstract: Disclosed is a system and method for generating a model of the geometric relationships between various audio sources recorded by a multi-camera system. The spatial audio scene module associates source signals, extracted from recorded audio, of audio sources to visual objects identified in videos recorded by one or more cameras. This association may be based on estimated positions of the audio sources based on relative signal gains and delays of the source signal received at each microphone. The estimated positions of audio sources are tracked indirectly by tracking the associated visual objects with computer vision. A virtual microphone module may receive a position for a virtual microphone and synthesize a signal corresponding to the virtual microphone position based on the estimated positions of the audio sources.
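
How a delay between microphones constrains a source position can be shown with the standard far-field two-microphone relation, sin(theta) = c * delay / spacing. The sketch covers only the delay cue (not relative gains), assumes a distant source and a single microphone pair, and is not taken from the patent. The function name `arrival_angle` and its parameters are hypothetical.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in air at room temperature


def arrival_angle(delay_s: float, mic_spacing_m: float,
                  speed: float = SPEED_OF_SOUND_M_S) -> float:
    """Far-field angle of arrival in radians, measured from broadside of a two-microphone pair."""
    if mic_spacing_m <= 0:
        raise ValueError("mic_spacing_m must be positive")
    ratio = speed * delay_s / mic_spacing_m
    # Clamp to the valid asin domain to absorb measurement noise and rounding.
    ratio = max(-1.0, min(1.0, ratio))
    return math.asin(ratio)
```

One pair yields only an angle. Combining angles from several pairs, or adding gain differences, would be needed to estimate an actual position.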

    Combining independent solutions to an image or video processing task

    Publication No.: US09965703B2

    Publication date: 2018-05-08

    Application No.: US15177310

    Filing date: 2016-06-08

    Applicant: GoPro, Inc.

    Abstract: An algorithm for performing an image or video processing task is generated that may be used to combine a plurality of different independent solutions to the image or video processing task in an optimized manner. A plurality of base algorithms may be applied to a training set of images or video and a first generation of different combining algorithms may be applied to combine the respective solutions from each of the respective base algorithms into respective combined solutions. The respective combined solutions may be evaluated to generate respective fitness scores representing measures of how well the plurality of different combining algorithms each perform the image or video processing task. The algorithms may be iteratively updated to generate an optimized combining algorithm that may be applied to an input image or video.
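
The loop of combining base solutions, scoring them, and iteratively updating the combiner can be sketched with a toy example. It assumes the combining algorithm is a plain weighted sum of base outputs, the fitness score is negative mean squared error against a known target, and the update is random mutation that keeps the best candidate. All of these are illustrative choices, not the patent's specifics, and the names `combine`, `fitness`, and `evolve_weights` are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def combine(solutions: Sequence[Sequence[float]], weights: Sequence[float]) -> List[float]:
    """Weighted element-wise sum of the base algorithms' outputs."""
    return [sum(w * s[i] for w, s in zip(weights, solutions))
            for i in range(len(solutions[0]))]


def fitness(candidate: Sequence[float], target: Sequence[float]) -> float:
    """Negative mean squared error, so higher is better."""
    return -sum((c - t) ** 2 for c, t in zip(candidate, target)) / len(target)


def evolve_weights(solutions: Sequence[Sequence[float]], target: Sequence[float],
                   generations: int = 200, population: int = 8,
                   seed: int = 0) -> Tuple[List[float], float]:
    """Mutate the weights each generation and keep the fittest combiner found so far."""
    rng = random.Random(seed)
    n = len(solutions)
    best = [1.0 / n] * n  # start from an equal-weight combiner
    best_fit = fitness(combine(solutions, best), target)
    for _ in range(generations):
        for _ in range(population):
            child = [w + rng.gauss(0.0, 0.1) for w in best]
            child_fit = fitness(combine(solutions, child), target)
            if child_fit > best_fit:
                best, best_fit = child, child_fit
    return best, best_fit
```

In practice the fitness would be averaged over a training set of images or video rather than computed against a single target vector.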
