Abstract:
A method and apparatus for symbol-space based compression of patterns are provided. The method comprises generating an output sequence responsive of an input sequence, the input sequence being of a first length and includes a plurality of symbols, by extraction of all common patterns, wherein a common pattern includes at least two symbols and the output sequence is of a second length that is shorter than the first length; and storing in a memory the output sequence as a data layer.
Abstract:
There is provided a method for searching a plurality of information sources using a multimedia element, the method may include receiving at least one multimedia element; generating, by a signature generator, for the at least one multimedia element at least one signature that is unidirectional, and yields compression; generating at least one textual search query using the at least one signature; wherein the generating of the textual search query comprises: (a) searching for at least one matching stored signature that matches one or more of the at least one signature; and (b) using a mapping between stored signatures and textual search queries, selecting at least one textual search query mapped to at least one matching stored signature; searching the plurality of information sources using the at least one textual search query; and causing a display of search results retrieved from the plurality of information sources.
Abstract:
A system and method for identifying influential entities depicted in multimedia content. The method includes determining, for each of a plurality of social linking graphs, a number of related entities, wherein each related entity has a social linking score above a first predetermined threshold, wherein each social linking score is generated based on at least one context of at least one multimedia content element (MMCE), and wherein each context is determined based on signatures generated for the at least one MMCE; and identifying, based on the determined number of related entities, at least one influential entity, wherein each influential entity is associated with one of the social linking graphs for which the determined number of related entities is above a second predetermined threshold.
Abstract:
A system and method for enriching a concept database with homogenous concepts. The method includes determining, based on signatures of a first multimedia content element (MMCE) and signatures of a plurality of existing concepts in the concept database, at least one first concept; generating a reduced representation of the first MMCE, wherein the reduced representation excludes the signatures of the first MMCE that match the at least one first concept; comparing the reduced representation to signatures representing a plurality of second MMCEs to select a first plurality of top matching second MMCEs; generating, based on the reduced representation and the first plurality of top matching second MMCEs, at least one second concept; determining, for each second concept, whether the second concept is a homogenous concept, wherein each homogenous concept uniquely represents the same content; and adding each homogenous concept to the concept database.
Abstract:
Content-based clustering, recognition, classification and search of high volumes of multimedia data in real-time. The embodiments disclosed herein are dedicated to real-time fast generation of signatures to high-volume of multimedia content-segments, based on relevant audio and visual signals, and to scalable matching of signatures of high-volume database of content-segments' signatures. The embodiments disclosed herein can be implemented in any applications which involve large-scale content-based clustering, recognition and classification of multimedia data, such as, content-tracking, video filtering, multimedia taxonomy generation, video fingerprinting, speech-to-text, audio classification, object recognition, video search and any other application requiring content-based signatures generation and matching for large content volumes such as, web and other large-scale databases.
Abstract:
A system and method for signature-based clustering of multimedia content elements. The method includes generating at least one signature for a first multimedia content element; determining, based on the generated at least one signature, at least one tag for the first multimedia content element; searching, using the determined at least one tag, for at least one matching multimedia content element cluster in at least one data source, wherein each multimedia content element cluster includes a plurality of clustered multimedia content elements sharing a common concept; and adding the first multimedia content element to each matching multimedia content element cluster.
Abstract:
A method and system for generating a complex signature respective of a multimedia data element (MMDE) are provided. The method includes partitioning the MMDE into a plurality of different minimum size MMDEs; generating, for each of the different minimum MMDEs, at least one signature, wherein generation of each at least one signature is performed by a plurality of computational cores, each computational core having at least one configurable property characterizing the core, and wherein configuration of the at least one configurable property respective of each core results in statistical independence among the plurality of cores; and assembling at least a complex signature for the MMDE comprised of a plurality of the generated signatures.
Abstract:
A system and method for generating signatures of an input multimedia data element (MMDE) are provided. The method includes identifying low-level characteristics of the input MMDE; identifying a plurality of portions of the input MMDE, wherein each portion of the plurality of portions of the input MMDE comprises an identified low-level characteristic of the input MMDE; partitioning the input MMDE into a plurality of minimum size MMDEs, wherein each minimum size MMDE comprises a portion of the plurality of portions of the input MMDE; generating a signature for each minimum size MMDE of the plurality of minimum size MMDEs; assembling at least a complex signature comprising a plurality of signatures of the minimum size MMDEs; and storing the signatures of each of the minimum size MMDEs and the complex signature in association with the multimedia data element and the plurality of minimum size MMDEs in a storage.
Abstract:
A method and system for method for generating concept structures are disclosed. The method comprises receiving a request to create a new concept structure, wherein the request includes at least a multimedia data element (MMDE) related to the new concept structure; querying a deep-content-classification (DCC) system using the MMDE to find at least one sub-concept, wherein a sub-concept is a concept structure that partially matches the received MMDE; checking if the at least one sub-concept satisfies at least one predefined logic rule; generating one or more sub-concepts from the at least MMDE; and generating the new concept structure using one or more sub-concepts out of the at least one sub-concepts that satisfies the predefined logic rule.
Abstract:
A system and method for clustering multimedia content. The method includes: detecting at least one clustering trigger event related to at least one multimedia content element to be clustered; generating at least one signature for the at least one multimedia content element, each signature representing at least a portion of the at least one multimedia content element; determining, based on the generated at least one signature, at least one multimedia content element cluster, wherein each multimedia content element cluster includes a plurality of clustered multimedia content elements sharing at least one common concept with the at least one multimedia content element; and adding, to each determined cluster, the at least one multimedia content element.