Abstract:
The invention relates to a method for applying metadata to immersive media files, comprising the following steps: - providing an immersive media file comprising at least one frame coding immersive media content, - adding a spatial grid overlay on the at least one frame, said spatial grid overlay defining latitudes and longitudes on a spheroid surface, - identifying an object of interest in the immersive media content, - determining frame sequence data of a first reference frame in which said object of interest is present, - determining latitude and longitude data of a point associated with the object of interest in the spatial grid overlay of the first reference frame, - providing information data associated with the object of interest - generating frame and location based metadata comprising the frame sequence data, the latitude and longitude data and the information data, and - applying the generated metadata to the immersive media file.
Abstract:
Security video searching device, systems, and methods determine behavior associated with each of a plurality of video clips captured by security devices, find video clips that include behavior indicative of search characteristics, and send the found video clips to a user. A search engine analyzes each of the plurality of video clips to determine behavior associated with the video clip and finds ones of the plurality of video clips where the behavior matches the search characteristics defined by of the user. Previously determined behavioral patterns associated with motion signals generated by the security devices may also be used to determine behavior within the video clips.
Abstract:
A method of tracking an object across a sequence of video frames using a natural language query includes receiving the natural language query and identifying an initial target in an initial frame of the sequence of video frames based on the natural language query. The method also includes adjusting the natural language query, for a subsequent frame, based on content of the subsequent frame and/or a likelihood of a semantic property of the initial target appearing in the subsequent frame. The method further includes identifying a text driven target and a visual driven target in the subsequent frame. The method still further includes combining the visual driven target with the text driven target to obtain a final target in the subsequent frame.
Abstract:
A computer at a content management system receives a first digital content item from a content provider. The computer matches the first digital content item to each of a plurality of reference digital content items in a database. The system determines a plurality of match metrics from the matches. Each match metric is indicative of a similarity between the first digital content item and one of the plurality of reference digital content items. Responsive to one of the match metrics being greater than a threshold level, the system sets a content age of the first digital content item to equal a content age of a reference digital content item associated with the match metric. Responsive to none of the match metrics being greater than the threshold, the system sets the content age of the first digital content item to a time of receiving the first digital content item.
Abstract:
Frames of video data from a surveillance system can be analyzed in near real time to allow for action to be taken based on the analysis. Task-based resources can be allocated to process each individual frame. Pre-processing can be performed to determine whether to analyze a given video frame. Each frame to be analyzed can be processed using at least one recognition algorithm to detect objects of interest, which can also be compared against corresponding data from earlier frames to determine relevant behaviors, moods, actions, or patterns of use. Each determination can have a corresponding confidence value. Information about the determinations and confidence levels can be analyzed to determine whether an action should be taken, as well as the type of action to take. Information for the determinations can also be used to apply tags to the video content to allow for searching and indexing of the video content.
Abstract:
Systems and methods are provided to record portions of media assets. User request is received to record a media asset together with a criterion for recording portions of that media asset. A content recognition algorithm is executed against segments of the media asset to determine a set of keywords associated with those segments. Separately a set of keywords associated with the criterion is generated. Sets of keywords are compared and segments that match the criterion are discovered. If it is determined that a first segment and third segment each match the criterion and a second segment does not, a delete indicator is added to the second segment and the third and first segments are compared. If those segments match the delete indicator is removed from the second segment.