摘要:
System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.
摘要:
System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.
摘要:
System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.
摘要:
Computer implemented method, system and computer usable program code for detecting topic shift boundaries in a multimedia stream. A computer implemented method for detecting topic shift boundaries in a multimedia stream includes receiving a multimedia stream, and performing multimodal analysis on the multimedia stream to locate a plurality of temporal positions within the multimedia stream at which topic changes have an increased likelihood of occurring to provide a sequence of multimedia portions. Characteristics for a sliding window for each multimedia portion in the sequence of multimedia portions are automatically determined, and topic shift boundaries are detected in each multimedia portion by applying a text-based topic shift detector over the media stream's text transcript using a sliding window, wherein the sliding window used with each multimedia portion has the characteristics determined from its respective multimedia portion.
摘要:
A system, method and computer program product for generating personal profiles for subjects appearing in a video media source. The method includes extracting audiovisual-related personal information related to a subject appearing in the video media source; extracting text-related personal information that are related to the subject in the video source; correlating the extracted audiovisual-related personal information and the extracted text-related personal information related to the subject; and assembling a personal profile data structure for the subject, the personal profile data structure comprising the text-related personal information and audiovisual-related personal information related to the subject. The text-related personal information forms the name identity of the subject, while the audiovisual-related personal information includes audiovisual-related features including information forming one or more of: a visual identity, a kinematic identity and, a voice identity of the subject. In an alternate embodiment, in an iterative manner, the correlated extracted audiovisual-related personal information and extracted text-related personal information may be fed back and utilized for performing an additional search from external information sources, via a search engine, to obtain additional texts relating to the subject or obtain additional video media sources having the subject. There is further enabled the updating of an assembled personal profile of a subject as a new video media source having said subject becomes available.
摘要:
Computer implemented method, system and computer program product for extracting salient keywords for videos. A computer implemented method for extracting salient keywords for videos includes extracting a set of candidate keywords from a text source of a video, assigning a salience value to each candidate keyword based on statistical information to provide a set of statistically significant keywords, exploiting additional cues that are available to the video and that can be used to further measure the significance of existing keywords or to extract new keywords, and selecting a set of salient keywords for the video based on the set of statistically significant keywords and the additional cues.
摘要:
System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.
摘要:
A leak detection system and method of use thereof is disclosed based on shock wave propagation in a fluid. In one form, the system includes at least one shock wave generator for introducing at least one shock wave signal into a fluid medium; at least one detector for detecting signals in the fluid medium; and at least one processor configured to identify excitation signals in the fluid medium caused by the at least one shock wave signal, wherein the identification of excitation signals is indicative of a fluid leak.
摘要:
Systems, methods, and computer-readable media for dynamically generating text associated with an advertisement are provided. Core text associated with an advertisement is received from an advertiser, as is at least one attribute relevant to the advertiser and/or a user. Based upon the received attribute(s), it is determined whether customization of the core text is desired. If customization is desired, the core text is modified and presented in association with the advertisement. If customization is not desired, the core text is presented in association with the advertisement. In one embodiment, target advertisement placement information may also be utilized to determine whether customization of the core text is desired.
摘要:
A system for processing a video signal into a processed video signal includes a pattern recognition module for detecting a region of text in the image sequence based on coding feedback data and generating pattern recognition data in response thereto. A video codec generates the processed video signal and generates the coding feedback data in conjunction with the processing of the image sequence.