Abstract:
A method and apparatus for symbol-space based compression of patterns are provided. The method comprises receiving an input sequence, the input sequence being of a first length and comprising a plurality of symbols; extracting all common patterns within the input sequence, wherein a common pattern includes at least two symbols; generating an output sequence responsive of the extraction of all common patterns, wherein the output sequence has a second length that is shorter than the first length; and storing in a memory the output sequence as a data layer, wherein the output sequence is provided as a new input sequence for a subsequent generation of a data layer.
Abstract:
A method and server for analyzing a multimedia content item are provided. The method comprises receiving a multimedia content item; extracting from the multimedia content item a plurality of multimedia elements; generating at least one signature for each of the plurality of multimedia elements; for each of the plurality of multimedia elements, querying a deep-content-classification (DCC) system to identify at least one concept that matches one of the plurality of multimedia elements, wherein querying is performed using the at least one signature generated for the multimedia elements and wherein an unidentified multimedia content element does not have a matching concept; generating a context for the multimedia content item using matching concepts; and characterizing each unidentified multimedia element using the generating context and signatures of the matching concepts.
Abstract:
A method for reducing an amount of storage required for maintaining a large-scale collection of multimedia data elements by unsupervised clustering of multimedia data elements. The method comprises processing the multimedia data elements in the large-scale collection to generate a first cluster of multimedia data elements; storing the first cluster in a storage unit; repeating the generation of a new cluster from the first cluster and un-clustered multimedia elements in the large-scale collection until a single cluster is reached; and storing the new cluster generated at each iteration in the storage unit, wherein a N-th cluster generated at the N-th iteration is stored in the storage unit, wherein the amount of storage required to store the N-th cluster is less than an amount of storage of the large-scale collection, thereby the unsupervised clustering enables reducing the storage amount required to store the multimedia data elements in the large-scale collection.
Abstract:
A method for detection of common patterns within unstructured data elements. The method includes extracting a plurality of unstructured data elements retrieved from a plurality of big data sources; generating at least one signature for each of the plurality of unstructured data elements; identifying common patterns among the generated signatures; clustering the signatures identified to have common patterns; and correlating the generated clusters to identify associations between their respective identified common patterns.
Abstract:
A system and method for monitoring a brand sentiment through a plurality of different web sources are provided. The method comprises receiving a request to monitor a brand; searching the plurality of web sources for multimedia content elements related to the brand; generating at least one signature for each multimedia content element determined to be related to the brand, wherein each of the at least one generated signatures represents a concept; and correlating the concepts respective of the generated signatures to determine a context of the multimedia content elements determined to be related to the brand, wherein the context is the brand sentiment.
Abstract:
A method and system for determining a context of a web-page containing a plurality of multimedia content elements. The method comprises receiving a uniform resource locator (URL) of the web-page; downloading the web-page respective of the received URL; analyzing the web-page to identify the existence of each of the plurality of multimedia content elements; generating at least one signature for each of the plurality of multimedia content elements, wherein each of the generated signatures represents a concept; and correlating the concepts respective of the generated signatures to determine the context of each of the plurality of multimedia content elements, thereby determining the context of the web-page.
Abstract:
A system for generating signatures of an input multimedia data element comprises a partitioning unit for recursively partitioning the input multimedia data element into a plurality of multimedia data elements, wherein each of the plurality of the minimum size multimedia data elements is a minimal partition of the input multimedia data elements; a signature generator for generating for each of the plurality of minimum size multimedia data elements a respective signature; and a storage unit for storing the respective signatures respective of the plurality of minimum size multimedia data elements.
Abstract:
A method and apparatus for unsupervised clustering of a large-scale collection of multimedia data elements. The method comprises generating a first cluster from the large-scale collection by: matching each of the multimedia data elements to all other multimedia data elements in the large-scale collection, determining a clustering score for each match being performed, clustering multimedia data elements having a clustering score above a threshold to create the first cluster; and storing the first cluster in a storage unit.
Abstract:
A method for identifying nutritional data related to food substances contained in a multimedia content item is provided. The method includes analyzing a received multimedia content item to identify multimedia elements containing food substance; generating at least one signature for each identified multimedia element; querying a deep-content-classification (DCC) system for each of the identified multimedia elements to find at least one concept that matches at least one of the identified multimedia elements; matching the at least one signature of each of the at least one matching concepts to previously generated signatures of food substances maintained in a data warehouse; retrieving, for each of the at least one matching signature, nutritional data associated with the at least one matching signature from the data warehouse, thereby providing nutritional data for the food substances substance contained in the received multimedia content item; and sending the nutritional data to the user device.
Abstract:
A method for tagging multimedia content elements is provided. The method comprises receiving at least one multimedia content element from a user device; generating at least one signature for the at least one multimedia content element; generating at least one tag based on the least one generated signature, wherein the at least one tag is searchable by the user device; and sending the tag generated for the received multimedia content element to storage on the user device.