Method and system for entropy-based semantic hashing
    1.
    发明授权
    Method and system for entropy-based semantic hashing 有权
    基于熵的语义散列的方法和系统

    公开(公告)号:US08676725B1

    公开(公告)日:2014-03-18

    申请号:US12794380

    申请日:2010-06-04

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005

    摘要: Methods, systems and articles of manufacture for identifying semantic nearest neighbors in a feature space are described herein. A method embodiment includes generating an affinity matrix for objects in a given feature space, wherein the affinity matrix identifies the semantic similarity between each pair of objects in the feature space, training a multi-bit hash function using a greedy algorithm that increases the Hamming distance between dissimilar objects in the feature space while minimizing the Hamming distance between similar objects, and identifying semantic nearest neighbors for an object in a second feature space using the multi-bit hash function. A system embodiment includes a hash generator configured to generate the affinity matrix and train the multi-bit hash function, and a similarity determiner configured to identify semantic nearest neighbors for an object in a second feature space using the multi-bit hash function.

    摘要翻译: 本文描述了用于识别特征空间中的语义最近邻居的方法,系统和制品。 方法实施例包括为给定特征空间中的对象生成亲和度矩阵,其中亲和矩阵识别特征空间中每对对象之间的语义相似性,使用增加汉明距离的贪心算法训练多比特哈希函数 在特征空间中的不相似对象之间,同时使相似对象之间的汉明距离最小化,并且使用多位哈希函数来识别第二特征空间中的对象的语义最近邻居。 系统实施例包括被配置为生成亲和度矩阵并训练多比特哈希函数的哈希发生器,以及被配置为使用多比特哈希函数来识别第二特征空间中的对象的语义最近邻居的相似性确定器。

    Audio identification using ordinal transformation

    公开(公告)号:US09684715B1

    公开(公告)日:2017-06-20

    申请号:US13415704

    申请日:2012-03-08

    申请人: David Ross Jay Yagnik

    发明人: David Ross Jay Yagnik

    IPC分类号: G06F17/30 G06F17/00

    摘要: This disclosure relates to audio identification using ordinal transformations. A media matching component receives a sample audio file. The sample audio file can include, for example, a cover song. The media matching component includes a vector component that computes a set of vectors using auditory feature values included in the sample audio file. A hashing component employs a hash function to generate a fingerprint, including a set of sub-fingerprints, for the sample audio file using the set of vectors. The fingerprint is invariant to variations including but not limited to variations in key, instrumentation, encoding formats, performers, performance conditions, arrangement, and/or recording and processing variations. An identification component determines if any reference audio files are similar to the sample audio file using the fingerprint and/or sub-fingerprints, and identifies any similar reference audio files.

    Matching based upon rank
    3.
    发明授权
    Matching based upon rank 有权
    基于等级匹配

    公开(公告)号:US08805090B1

    公开(公告)日:2014-08-12

    申请号:US13368317

    申请日:2012-02-07

    IPC分类号: G06K9/68

    CPC分类号: G06K9/6212

    摘要: Systems and methods for measuring consistency between two objects based upon a rank of object elements instead of based upon the values of those object elements. Objects being compared can be represented by d-dimension feature vectors, U and V, where each dimension includes an associated value. U and V can be converted to rank vectors, P and Q, where values of U and V dimensions are replaced by an ordered rank or a function thereof. Analysis directed to the consistency between U and V can be accomplished by determining consistency between P and Q, which can be more efficient and more accurate, particularly with regard to illumination-invariant comparisons.

    摘要翻译: 基于对象元素的等级而不是基于这些对象元素的值来测量两个对象之间的一致性的系统和方法。 被比较的对象可以由d维特征向量U和V表示,其中每个维度包括相关联的值。 U和V可以被转换为等级向量P和Q,其中U和V维度的值被有序等级或其功能所代替。 可以通过确定P和Q之间的一致性来实现对U和V之间的一致性的分析,这可以更有效和更准确,特别是在照明不变比较方面。

    Detection and classification of matches between time-based media
    4.
    发明授权
    Detection and classification of matches between time-based media 有权
    基于时间的媒体之间的匹配检测和分类

    公开(公告)号:US08238669B2

    公开(公告)日:2012-08-07

    申请号:US12174366

    申请日:2008-07-16

    IPC分类号: G06K9/62 H04N7/10 H04N7/025

    CPC分类号: G06K9/00758 G06F17/30784

    摘要: A system and method detects matches between portions of video content. A matching module receives an input video fingerprint representing an input video and a set of reference fingerprints representing reference videos in a reference database. The matching module compares the reference fingerprints and input fingerprints to generate a list of candidate segments from the reference video set. Each candidate segment comprises a time-localized portion of a reference video that potentially matches the input video. A classifier is applied to each of the candidate segments to classify the segment as a matching segment or a non-matching segment. A result is then outputted identifying a matching portion of a reference video from the reference video set based on the segments classified as matches.

    摘要翻译: 系统和方法检测视频内容的部分之间的匹配。 匹配模块接收表示参考数据库中的参考视频的输入视频和一组参考指纹的输入视频指纹。 匹配模块比较参考指纹和输入指纹,以从参考视频集中生成候选片段的列表。 每个候选片段包括潜在地匹配输入视频的参考视频的时间局部化部分。 将分类器应用于每个候选片段以将片段分类为匹配片段或非匹配片段。 然后基于被分类为匹配的段,从参考视频集中输出标识参考视频的匹配部分的结果。

    Rewarding creative use of product placements in user-contributed videos
    5.
    发明授权
    Rewarding creative use of product placements in user-contributed videos 有权
    奖励用户贡献的视频中的产品展示位置的广告使用

    公开(公告)号:US08180667B1

    公开(公告)日:2012-05-15

    申请号:US12132576

    申请日:2008-06-03

    IPC分类号: G06Q40/00

    摘要: A video hosting service automatically identifies, in a video database, a set of videos associated with an advertiser, and presents the identified videos to the advertiser for consideration. The videos may be selected based on analysis of their video content for images of logos associated with the advertisers. The video hosting service may then receive from the advertisers a listing of which of the presented videos should be given an award.

    摘要翻译: 视频托管服务在视频数据库中自动识别与广告商相关联的一组视频,并将识别的视频呈现给广告商以供考虑。 可以基于对与广告商相关联的徽标的图像的视频内容的分析来选择视频。 然后,视频托管服务可以从广告商接收应该给出哪个被呈现的视频的列表。

    Inferring user interests
    7.
    发明授权
    Inferring user interests 有权
    推荐用户兴趣

    公开(公告)号:US08055664B2

    公开(公告)日:2011-11-08

    申请号:US11742995

    申请日:2007-05-01

    IPC分类号: G06F17/30 G06Q30/00

    摘要: The subject matter of this specification can be embodied in, among other things, a method that includes determining, for a portion of users of a social network, label values each comprising an inferred interest level of a user in a subject indicated by a label, associating a first user with one or more second users based on one or more relationships specified by the first user, and outputting a first label value for the first user based on one or more second label values of the one or more second users.

    摘要翻译: 本说明书的主题可以包括一种方法,其包括为社交网络的用户的一部分确定每个包括由标签指示的对象中的用户的推定的兴趣级别的标签值, 基于由第一用户指定的一个或多个关系将第一用户与一个或多个第二用户相关联,并且基于一个或多个第二用户的一个或多个第二标签值输出第一用户的第一标签值。

    Fast efficient vocabulary computation with hashed vocabularies applying hash functions to cluster centroids that determines most frequently used cluster centroid IDs
    9.
    发明授权
    Fast efficient vocabulary computation with hashed vocabularies applying hash functions to cluster centroids that determines most frequently used cluster centroid IDs 有权
    快速有效的词汇计算,使用散列词汇表将哈希函数应用于确定最常用的群集中心ID的群集质心

    公开(公告)号:US09054876B1

    公开(公告)日:2015-06-09

    申请号:US13314294

    申请日:2011-12-08

    申请人: Jay Yagnik

    发明人: Jay Yagnik

    IPC分类号: G06F17/00 H04L9/32

    摘要: The disclosed embodiments describe a method, an apparatus, an application specific integrated circuit, and a server that provides a fast and efficient look up for data analysis. The apparatus and server may be configured to obtain data segments from a plurality of input devices. The data segments may be individual unique subsets of the entire data set obtained by a plurality input devices. A hash function may be applied to an aggregated set of the data segments. A result of the hash function may be stored in a data structure. A codebook may be generated from the hash function results.

    摘要翻译: 所公开的实施例描述了提供快速和有效地查找数据分析的方法,装置,专用集成电路和服务器。 设备和服务器可以被配置为从多个输入设备获取数据段。 数据段可以是由多个输入设备获得的整个数据集的独立子集。 哈希函数可以应用于数据段的聚合集合。 散列函数的结果可以存储在数据结构中。 可以从散列函数结果生成码本。

    Identifying images using face recognition
    10.
    发明授权
    Identifying images using face recognition 有权
    使用脸部识别识别图像

    公开(公告)号:US09053357B2

    公开(公告)日:2015-06-09

    申请号:US13324058

    申请日:2011-12-13

    申请人: Jay Yagnik

    发明人: Jay Yagnik

    IPC分类号: G06K9/46 G06K9/00 G06F17/30

    摘要: A method includes identifying a named entity, retrieving images associated with the named entity, and using a face detection algorithm to perform face detection on the retrieved images to detect faces in the retrieved images. At least one representative face image from the retrieved images is identified, and the representative face image is used to identify one or more additional images representing the at least one named entity.

    摘要翻译: 一种方法包括识别命名实体,检索与命名实体相关联的图像,以及使用面部检测算法对检索到的图像执行面部检测,以检测检索到的图像中的面部。 识别来自检索到的图像的至少一个代表性面部图像,并且使用代表性面部图像来识别表示至少一个命名实体的一个或多个附加图像。