Accounting for behavioral variability in web search
    1.
    Granted patent
    Accounting for behavioral variability in web search (in force)

    Publication No.: US07743047B2

    Publication date: 2010-06-22

    Application No.: US11904103

    Filing date: 2007-09-26

    IPC classification: G06F7/00

    CPC classification: G06F17/30867

    Abstract: The concept of variability pertains to whether users exhibit consistent search interaction patterns, for example, in terms of interaction flow or information targeted. Methods are provided for analyzing variability, and then adapting search-related functionality (e.g., processes and/or interfaces) to account for variability characteristics, for example, to account for predictable search interaction behavior.

    Question answering over structured content on the web
    2.
    Patent application
    Question answering over structured content on the web (lapsed)

    Publication No.: US20070094285A1

    Publication date: 2007-04-26

    Application No.: US11256503

    Filing date: 2005-10-21

    IPC classification: G06F7/00

    Abstract: Structured content and associated metadata from the Web are leveraged to provide specific answer string responses to user questions. The structured content can also be indexed at crawl-time to facilitate searching of the content at search-time. Ranking techniques can also be employed to help provide an optimum answer string and/or a top K list of answer strings for a query. Ranking can be based on trainable algorithms that utilize feature vectors for candidate answer strings. In one instance, at crawl-time, structured content is indexed and automatically associated with metadata relating to the structured content and the source web page. At search-time, candidate indexed structured content is then utilized to extract an appropriate answer string in response to a user query.

    Auto playlist generator
    3.
    Granted patent
    Auto playlist generator (in force)

    Publication No.: US07313571B1

    Publication date: 2007-12-25

    Application No.: US11263347

    Filing date: 2005-10-31

    IPC classification: G06F17/30, G03B19/18

    Abstract: A system and method for generating a list is provided. The system includes a seed item input subsystem, an item identifying subsystem, a descriptive metadata similarity determining subsystem and a list generating subsystem that builds a list based, at least in part, on similarity processing performed on seed item descriptive metadata and user item descriptive metadata and user selected thresholds applied to such similarity processing. The method includes inexact matching between identifying metadata associated with new user items and identifying metadata stored in a reference metadata database. The method further includes subjecting candidate user items to similarity processing, where the degree to which the candidate user items are similar to the seed item is determined, and placing user items in a list of items based on user selected preferences for (dis)similarity between items in the list and the seed item.
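
    The thresholded similarity step described in this abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Track` type, the use of Jaccard overlap on tag sets as the similarity measure, and the `min_sim`/`max_sim` parameters standing in for the user-selected thresholds. The abstract does not specify any of these.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Track:
    # Hypothetical item type; "tags" stands in for descriptive metadata.
    title: str
    tags: frozenset

def similarity(a: Track, b: Track) -> float:
    """Jaccard overlap of descriptive tags, in [0, 1] (an assumed measure)."""
    union = a.tags | b.tags
    return len(a.tags & b.tags) / len(union) if union else 0.0

def build_playlist(seed, library, min_sim=0.5, max_sim=1.0):
    """Keep items whose similarity to the seed lies within the chosen band."""
    scored = [(similarity(seed, t), t) for t in library if t != seed]
    kept = [(s, t) for s, t in scored if min_sim <= s <= max_sim]
    kept.sort(key=lambda pair: pair[0], reverse=True)  # most similar first
    return [t for _, t in kept]

seed = Track("A", frozenset({"rock", "upbeat"}))
library = [
    seed,
    Track("B", frozenset({"rock", "upbeat", "guitar"})),
    Track("C", frozenset({"jazz"})),
    Track("D", frozenset({"rock"})),
]
similar = [t.title for t in build_playlist(seed, library)]
dissimilar = [t.title for t in build_playlist(seed, library, 0.0, 0.4)]
print(similar, dissimilar)
```

    Lowering `max_sim` selects deliberately dissimilar items, which is one way to read the "(dis)similarity" preference mentioned in the abstract.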

    System and method for learning ranking functions on data
    4.
    Patent application
    System and method for learning ranking functions on data (in force)

    Publication No.: US20060195406A1

    Publication date: 2006-08-31

    Application No.: US11066514

    Filing date: 2005-02-25

    IPC classification: G06F15/18

    CPC classification: G06F17/30864

    Abstract: A machine learning system to rank data within sets is disclosed. The system comprises a ranking module that has differentiable parameters. The system further comprises a cost calculation module that uses a cost function that depends on pairs of examples and that describes an output of the ranking module. Methods of using the disclosed system are also provided.
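
    A pairwise cost over a differentiable ranker can be sketched as follows. This is only an illustration under stated assumptions: the linear scoring function, the logistic form of the pair cost, and the plain gradient-descent loop are choices made here, not details taken from the abstract, and the names `score`, `pair_cost`, and `train` are hypothetical.

```python
import math

def score(w, x):
    """Linear ranking module: differentiable in the weights w."""
    return sum(wi * xi for wi, xi in zip(w, x))

def pair_cost(w, hi, lo):
    """Logistic cost on the score difference of a pair (assumed form).
    It is small when the preferred example `hi` outscores `lo`."""
    diff = score(w, hi) - score(w, lo)
    return math.log1p(math.exp(-diff))

def train(pairs, dim, lr=0.1, epochs=200):
    """Gradient descent on the summed pair cost."""
    w = [0.0] * dim
    for _ in range(epochs):
        for hi, lo in pairs:
            diff = score(w, hi) - score(w, lo)
            grad_diff = -1.0 / (1.0 + math.exp(diff))  # d(cost)/d(diff)
            w = [wi - lr * grad_diff * (h - l) for wi, h, l in zip(w, hi, lo)]
    return w

# Each pair is (preferred example, less preferred example).
pairs = [((1.0, 0.0), (0.0, 1.0)), ((2.0, 1.0), (1.0, 2.0))]
w = train(pairs, dim=2)
```

    Because the cost depends only on score differences within a pair, the ranker learns an ordering rather than absolute target values.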

    System and method providing automated margin tree analysis and processing of sampled data

    Publication No.: US20050165732A1

    Publication date: 2005-07-28

    Application No.: US11086831

    Filing date: 2005-03-22

    IPC classification: G06F7/00, G06F17/30

    Abstract: The present invention relates to a system and methodology to facilitate database processing in accordance with a plurality of various applications. In one aspect, a large database of objects is processed, wherein the objects can be represented as points in a vector space, and two or more objects are deemed ‘close’ if a Euclidean distance between the points is small. This can apply for substantially any type of object, provided a suitable distance measure can be defined. In another aspect, a ‘test’ object having a vector x is processed to determine if there exists an object y in the database such that the distance between x and y falls below a threshold t. If several objects in the database satisfy this criterion, a list of objects can be returned, together with their corresponding distances. If no objects were to satisfy the criterion, an indication of this condition can also be provided, but in addition, the condition or information relating to the condition can be provided.
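
    The threshold query described in this abstract can be illustrated with a brute-force scan. Note the substitution: the patent title refers to margin tree analysis, whereas this sketch checks every object linearly, which shows the query semantics only and not the indexing structure. The function name `within_threshold` and the dictionary layout are assumptions.

```python
import math

def within_threshold(x, database, t):
    """Return (name, distance) for every object closer to x than t,
    nearest first. An empty list signals that nothing qualified."""
    hits = []
    for name, y in database.items():
        d = math.dist(x, y)  # Euclidean distance (Python 3.8+)
        if d < t:
            hits.append((name, d))
    hits.sort(key=lambda pair: pair[1])
    return hits

database = {"a": (0.0, 0.0), "b": (3.0, 4.0), "c": (1.0, 0.0)}
near = within_threshold((0.0, 0.0), database, t=2.0)
none = within_threshold((10.0, 10.0), database, t=1.0)
```

    A linear scan costs one distance computation per stored object, which is the cost a tree-based index would aim to reduce for large databases.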

    Ranking results using multiple nested ranking
    6.
    Patent application
    Ranking results using multiple nested ranking (in force)

    Publication No.: US20060195440A1

    Publication date: 2006-08-31

    Application No.: US11294269

    Filing date: 2005-12-05

    IPC classification: G06F17/30

    Abstract: A unique system and method that facilitates improving the ranking of items is provided. The system and method involve re-ranking decreasing subsets of high ranked items in separate stages. In particular, a basic ranking component can rank a set of items. A subset of the top or high ranking items can be taken and used as a new training set to train a component for improving the ranking among these high ranked documents. This process can be repeated on an arbitrary number of successive high ranked subsets. Thus, high ranked items can be reordered in separate stages by focusing on the higher ranked items to facilitate placing the most relevant items at the top of a search results list.
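
    The staged re-ranking of shrinking top subsets can be sketched as below. This is a simplified illustration: each stage is given as a ready-made scoring function, standing in for a ranker that would, per the abstract, be trained on the previous stage's top subset. The function name `nested_rank` and the `(score_fn, top_k)` stage format are assumptions.

```python
def nested_rank(items, stages):
    """Apply each stage to the current head of the list only.
    stages: list of (score_fn, top_k); the head never grows between stages."""
    ranked = list(items)
    head_size = len(ranked)
    for score_fn, top_k in stages:
        head_size = min(head_size, top_k)
        head = sorted(ranked[:head_size], key=score_fn, reverse=True)
        ranked = head + ranked[head_size:]  # the tail keeps its earlier order
    return ranked

# Each item: (name, coarse score, fine score); the scores are made up.
items = [("a", 1, 5), ("b", 3, 1), ("c", 2, 9), ("d", 0, 0)]
stages = [
    (lambda it: it[1], 4),  # stage 1: basic ranker over all items
    (lambda it: it[2], 2),  # stage 2: finer ranker over the top 2 only
]
order = [name for name, _, _ in nested_rank(items, stages)]
```

    Items outside the current head are never reordered by later stages, so the effort of each successive ranker is concentrated on the top of the list.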

    Audio duplicate detector
    10.
    Patent application
    Audio duplicate detector (in force)

    Publication No.: US20050091275A1

    Publication date: 2005-04-28

    Application No.: US10785561

    Filing date: 2004-02-24

    Abstract: The present invention relates to a system and methodology to facilitate automatic management and pruning of audio files residing in a database. Audio fingerprinting is a powerful tool for identifying streaming or file-based audio, using a database of fingerprints. Duplicate detection identifies duplicate audio clips in a set, even if the clips differ in compression quality or duration. The present invention can be provided as a self-contained application that does not require an external database of fingerprints. Also, a user interface provides various options for managing and pruning the audio files.
