摘要:
A method comprises: accessing a plurality of videos; retrieving a video access log; differentiating a viewers from the video access log; extracting demographic information from one or more viewers associated with the viewer that accessed the video; determining whether the demographic information includes demographic attributes of interest; generating and storing a multi-attribute demographic distribution for the viewer; generating a feature vectors for each of the videos based on video content of frames; associating each feature vector with a visual object within a frame within each video; generating and storing a subset of representative feature vectors by performing dimensionality reduction; in response to storing the subset of feature vectors and the multi-attribute demographic distribution for each video, generating a predicted distribution; and determining, for the viewer having extracted demographic information that does not include the demographic attributes of interest, predicted demographic attribute values using the predicted distribution.
摘要:
A solution is provided to generate video recommendations in a video sharing environment. A video recommendation system selects a video as a target video and extracts target keywords from the title of the identified target video or the title of a non-video trending news item. The system receives multiple candidate videos. For each candidate video, the system extracts keywords from the title of the candidate video and compares the extracted keywords with the target words. Based on the comparison, the system generates a similarity score for the candidate video. The system ranks the candidate videos based their associated similarity scores and selects a candidate video having the highest similarity score as the video recommendation for the target video.
摘要:
A system for identifying one or more locations of a video clip within video content comprises means for producing segments of the video clip each segment derived from a plurality of frames. The segments are compared to segments of video content. From the comparison a first measure of similarity is prepared at virtually displaced temporal positions of the video clip and the video content. A filtering arrangement filters the measure of similarity to exclude potential matches based on temporal separation of positions of high similarity. A means is then arranged to compare the video clip to the video content at the positions identified as candidate matches. In this way, apparent matches can be removed based on likely knowledge of the content such as likely repetitions of a video clip within larger video content.
摘要:
A device, method, and computer-readable media for managing interrupt times for content items based on metadata encapsulating user behavior. A user controls the playback of content items such as audio or videos. The content items may be episodes in a programming series. Tags are associated with the content items. These tags include metadata having playback time of the received user interactions. In turn, the device processes the tags to identify user interaction that include stop events for the content items. The tags are processed based on a sliding time window. The playback times of the stop events are selected as potential interrupt times that the device may include recommendations for unaired episodes. The selected interrupt times are also used to identify inconsistencies in the metadata.
摘要:
The present invention relates to a mobile device and a method of controlling therefor. The method includes the steps of, if an input for touching the specific screen image for more than prescribed time is received from a user, recognizing the specific screen image, extracting tag information from the recognized specific screen image, executing a specific application related to the specific screen image based on the extracted tag information and displaying an execution screen of the specific application.
摘要:
The present invention relates to a video categorization method and apparatus, a computer program and a recording medium. The method comprises: acquiring a key frame, which contains a face, in a video; acquiring a face feature in the key frame; acquiring one or more face features corresponding to one or more picture categories; determining, according to the face feature in the key frame and the one or more face features corresponding to the one or more picture categories, a picture category to which the video belongs; and assigning the video to the picture category to which the video belongs. With the above technical solution, a video may be smartly and automatically classified into a picture category corresponding to a user appearing in the video, thereby not only eliminating the need for manual categorization by the user but also improving the accuracy of categorization.
摘要:
The present disclosure relates to a method for annotating a content element of a video stream which has been at least partially received by an electronic device, said method being implemented by said electronic device during a restitution of said video stream. According to the present disclosure, the method comprises:—receiving at least one item of information for identifying an image part in said video stream, comprising a temporal and/or spatial stamping of said image part;—when said identified image part belongs to a portion already restituted of said video stream: analysing said restituted portion, and obtaining a significant content element from said identified image part; searching for the presence of said significant content element in an image, called marked image, of at least one portion remaining to be restituted of said video stream; when a marked image is found, associating an annotation linked to said content item with a marked image; when no marked image is found, restituting said identified image again, while delivering at least one annotation linked to said content element.
摘要:
To provide a novel algorithm for visual attention detection in videos that can be easily implemented and is of superior reliability, a visual attention detector includes a feature extraction unit configured to extract a spatiotemporal feature from a local region in a video; a hashing unit configured to convert a spatiotemporal feature value for the local region into a hash value, and to select a training value mapped to the hash using a hash table; and an attention measure determining unit for determining an attention measure on the basis of the distance between a spatiotemporal feature value for the local region and the selected training value such that the larger the distance the larger the attention measure.
摘要:
Methods, apparatus, and systems for time-based and geographic navigation of video content are provided. Video content and associated metadata information are recorded and encoded using a video capture and encoding module. The associated metadata information includes at least one of date and time information of the recording and geographic position information indicative of a recording location. The recorded video content and the associated metadata information are communicated to a remote storage and web server device. A graphical user interface enables the display of an interactive map showing a route and current location of the video capture and encoding module. The video content may be searched using at least one of the graphical user interface and the interactive map by the date and/or time information and the geographic position information. Selected video content can be streamed or downloaded to a select location for display or storage.