Keyword usage score based on frequency impulse and frequency weight
    31.
    Granted patent
    Status: Lapsed

    Publication No.: US07644075B2

    Publication date: 2010-01-05

    Application No.: US11756740

    Filing date: 2007-06-01

    IPC classification: G06F17/30

    Abstract: A method and system for assessing keyword usage based on frequency of usage of the keywords during various periods is provided. A keyword usage measurement system is provided with the frequency of keywords during various periods. The measurement system then calculates a recent usage score for a keyword by combining a frequency impulse score for the keyword with a frequency weight for the keyword. The frequency impulse score for a keyword indicates whether a recent change in the frequency of the keyword has occurred. The frequency weight for a keyword indicates a recent measure of the frequency of the keyword.
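
The combination described in this abstract can be illustrated with a minimal sketch. The abstract does not give formulas, so everything here is an assumption made for illustration: the impulse is taken as the ratio of the latest period's frequency to the mean of earlier periods, the weight is a log-damped recent frequency, and the two are multiplied. The function names are hypothetical.

```python
import math

def frequency_impulse(freqs):
    """Assumed impulse: latest frequency relative to the mean of earlier periods."""
    *history, recent = freqs          # requires at least two periods
    baseline = sum(history) / len(history)
    return recent / (baseline + 1.0)  # +1 smoothing avoids division by zero

def frequency_weight(freqs):
    """Assumed weight: log-damped frequency of the latest period."""
    return math.log1p(freqs[-1])

def recent_usage_score(freqs):
    """Assumed combination: product of impulse and weight."""
    return frequency_impulse(freqs) * frequency_weight(freqs)

steady = [100, 100, 100, 100]   # popular keyword, no recent change
spiking = [10, 10, 10, 100]     # same recent frequency, but a sudden jump
print(recent_usage_score(steady), recent_usage_score(spiking))
```

Under these assumptions the two keywords share the same frequency weight, and the spiking keyword scores higher only because of its impulse term.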

    Person disambiguation using name entity extraction-based clustering
    32.
    Patent application
    Status: In force

    Publication No.: US20080065623A1

    Publication date: 2008-03-13

    Application No.: US11796818

    Filing date: 2007-04-30

    IPC classification: G06F17/30

    CPC classification: G06F17/3071 G06F17/30696

    Abstract: Described is a technology for disambiguating data corresponding to persons that are located from search results, so that different persons having the same name can be clearly distinguished. Name entity extraction locates words (terms) that are within a certain distance of persons' names in the search results. The terms are used in disambiguating search results that correspond to different persons having the same name, such as location information, organization information, career information, and/or partner information. In one example, each person is represented as a vector, and similarity among vectors is calculated based on weighting that corresponds to nearness of the terms to a person, and/or the types of terms. Based on the similarity data, the person vectors that represent the same person are then merged into one cluster, so that each cluster represents (to a high probability) only one distinct person.
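
A rough sketch of the vector-and-cluster idea in this abstract follows. It is not the patented method: it skips name entity extraction and term-type weighting, assumes the name is a single token, weights each nearby term by 1/distance, compares vectors with cosine similarity, and uses a greedy single-pass clustering. The window size and threshold are arbitrary illustrative values.

```python
import math
from collections import defaultdict

def person_vector(tokens, name_index, window=5):
    """Weight each term within `window` tokens of the name by 1/distance."""
    vec = defaultdict(float)
    for i, tok in enumerate(tokens):
        dist = abs(i - name_index)
        if 0 < dist <= window:
            vec[tok.lower()] += 1.0 / dist
    return dict(vec)

def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cluster(vectors, threshold=0.3):
    """Greedy clustering: join the first cluster whose seed vector is similar enough."""
    clusters = []
    for idx, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(idx)
                break
        else:
            clusters.append([idx])
    return clusters

snippets = [
    ("Smith professor Stanford physics".split(), 0),
    ("physics professor Smith Stanford".split(), 2),
    ("Smith guitarist Nashville band".split(), 0),
]
vectors = [person_vector(tokens, i) for tokens, i in snippets]
print(cluster(vectors))
```

The first two snippets share context terms and end up in one cluster, while the third forms its own.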

    Efficient retrieval algorithm by query term discrimination
    33.
    Granted patent
    Status: In force

    Publication No.: US07925644B2

    Publication date: 2011-04-12

    Application No.: US12038652

    Filing date: 2008-02-27

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30675 G06Q10/10

    Abstract: A method and system for use in information retrieval includes, for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term. When a plurality of terms are received, optionally as a query, the system ranks, using an inverse document frequency algorithm, the plurality of terms for importance based on the document sets for the plurality of terms. Then a number of ranked terms are selected based on importance and a union set is formed based on the document sets associated with the selected number of ranked terms.
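
The three steps in this abstract (per-term top-scoring document sets, IDF ranking of query terms, union of the selected terms' sets) can be sketched as below. The data layout, the smoothed IDF formula, and computing IDF from full posting counts are assumptions for illustration; the abstract does not specify them.

```python
import math

def build_doc_sets(scores, k):
    """scores: term -> {doc_id: score}. Keep the k top-scoring documents per term."""
    return {t: set(sorted(d, key=d.get, reverse=True)[:k]) for t, d in scores.items()}

def idf(term, scores, n_docs):
    """Assumed smoothed inverse document frequency."""
    return math.log(n_docs / (1 + len(scores.get(term, {}))))

def candidate_union(query_terms, scores, doc_sets, n_docs, n_terms):
    """Rank query terms by IDF, keep the top n_terms, and union their document sets."""
    ranked = sorted(query_terms, key=lambda t: idf(t, scores, n_docs), reverse=True)
    selected = ranked[:n_terms]
    union = set().union(*(doc_sets.get(t, set()) for t in selected))
    return selected, union

scores = {
    "the": {1: 0.1, 2: 0.2, 3: 0.1, 4: 0.3, 5: 0.2},
    "jaguar": {2: 0.9, 5: 0.7},
    "speed": {1: 0.5, 3: 0.8, 4: 0.4},
}
doc_sets = build_doc_sets(scores, k=2)
print(candidate_union(["the", "jaguar", "speed"], scores, doc_sets, n_docs=6, n_terms=2))
```

In this toy data the common term "the" ranks last and is dropped, so only the document sets of the two more discriminating terms contribute candidates.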

    Efficient retrieval algorithm by query term discrimination
    34.
    Granted patent
    Status: In force

    Publication No.: US07822752B2

    Publication date: 2010-10-26

    Application No.: US11804627

    Filing date: 2007-05-18

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30675

    Abstract: Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
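
The pointer-jump plus binary-search lookup mentioned in this abstract might look roughly like the sketch below. It assumes each index row is a list of (doc_id, score) pairs sorted by doc_id with scores precomputed offline, treats the first row as the driver row of the most important term, and sums scores across rows; those choices are illustrative, not taken from the patent.

```python
from bisect import bisect_left

def intersect_with_scores(rows):
    """rows[0] drives the search; returns {doc_id: summed score} for docs in every row."""
    ids = [[doc_id for doc_id, _ in row] for row in rows]
    pointers = [0] * len(rows)      # per-row position that only moves forward
    results = {}
    for doc_id, score in rows[0]:
        total = score
        for r in range(1, len(rows)):
            # Binary search restricted to the part of the row right of the pointer.
            pos = bisect_left(ids[r], doc_id, pointers[r])
            pointers[r] = pos       # jump the pointer; earlier entries are never revisited
            if pos == len(ids[r]) or ids[r][pos] != doc_id:
                break               # document missing from this row
            total += rows[r][pos][1]
        else:
            results[doc_id] = total
    return results

rows = [
    [(2, 1.0), (5, 0.5), (9, 0.25)],
    [(1, 3.0), (2, 0.5), (9, 0.5), (12, 1.0)],
]
print(intersect_with_scores(rows))  # {2: 1.5, 9: 0.75}
```

Because the driver row is sorted, each pointer only ever advances, so every row is scanned at most once overall apart from the logarithmic search steps.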

    Determination of time dependency of search queries
    35.
    Granted patent
    Status: Lapsed

    Publication No.: US07693908B2

    Publication date: 2010-04-06

    Application No.: US11770358

    Filing date: 2007-06-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864 G06Q30/02

    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
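
One simple way to illustrate the time-dependent versus time-independent distinction is an autocorrelation test for periodicity. The abstract does not say how periodicity is determined, so the test below, its threshold, and the function names are assumptions made only for this sketch.

```python
def autocorrelation(series, lag):
    """Sample autocorrelation of the series at the given lag."""
    n = len(series)
    mean = sum(series) / n
    var = sum((x - mean) ** 2 for x in series)
    if var == 0:
        return 0.0
    cov = sum((series[i] - mean) * (series[i + lag] - mean) for i in range(n - lag))
    return cov / var

def dominant_period(series, max_lag, threshold=0.5):
    """Return the lag with the strongest autocorrelation, or None if it is too weak."""
    best = max(range(1, max_lag + 1), key=lambda lag: autocorrelation(series, lag))
    return best if autocorrelation(series, best) >= threshold else None

weekly = [10, 10, 10, 10, 10, 50, 50] * 8   # weekend spike repeated for 8 weeks
print(dominant_period(weekly, max_lag=10))   # 7
print(dominant_period([5] * 20, max_lag=10)) # None
```

A query whose frequency series yields a period would be treated as time-dependent in this sketch, and one that yields None as time-independent.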

    Forecasting time-independent search queries
    36.
    Granted patent
    Status: In force

    Publication No.: US07685099B2

    Publication date: 2010-03-23

    Application No.: US11770445

    Filing date: 2007-06-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864 G06Q30/02

    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
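
The "query event" idea in this abstract can be sketched as a spike detector plus a check of how often one query's events precede another's. The spike rule (frequency at least a fixed factor above the mean of the preceding window), the lag range, and the example series are assumptions for illustration; the abstract does not define "significant increase" or the causal test.

```python
def detect_events(freqs, window=3, factor=2.0):
    """Indices where frequency is at least `factor` times the mean of the preceding window."""
    events = []
    for t in range(window, len(freqs)):
        baseline = sum(freqs[t - window:t]) / window
        if baseline > 0 and freqs[t] >= factor * baseline:
            events.append(t)
    return events

def precedence_rate(leader_events, follower_events, max_lag=2):
    """Fraction of follower events that come 1..max_lag intervals after a leader event."""
    if not follower_events:
        return 0.0
    leaders = set(leader_events)
    hits = sum(
        any(t - lag in leaders for lag in range(1, max_lag + 1))
        for t in follower_events
    )
    return hits / len(follower_events)

storm = [5, 5, 5, 20, 5, 5, 5, 5, 25, 5]        # hypothetical leading query
flashlight = [2, 2, 2, 2, 9, 2, 2, 2, 2, 10]    # hypothetical following query
leader, follower = detect_events(storm), detect_events(flashlight)
print(leader, follower, precedence_rate(leader, follower))  # [3, 8] [4, 9] 1.0
```

A high precedence rate in one direction and a low one in the other is the kind of asymmetry a forecaster could exploit, although real data would call for a statistical test rather than a raw ratio.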

    Adaptive grouping in a file network
    37.
    Granted patent
    Status: In force

    Publication No.: US07634471B2

    Publication date: 2009-12-15

    Application No.: US11392760

    Filing date: 2006-03-30

    IPC classification: G06F17/30

    Abstract: Extraction of semantic information and the generation of semantic attributes allows for improved organization and management of data. Semantic attributes are automatically generated and eliminate the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
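
A semantic file network of the kind this abstract describes can be sketched by linking files whose attribute sets overlap enough. Jaccard similarity, the threshold, and the file names are assumptions for illustration; attribute extraction and user operation patterns are outside the scope of this sketch.

```python
from itertools import combinations

def jaccard(a, b):
    """Overlap of two attribute sets, from 0.0 (disjoint) to 1.0 (identical)."""
    return len(a & b) / len(a | b) if a | b else 0.0

def build_file_network(attributes, threshold=0.5):
    """attributes: file name -> set of semantic attributes. Returns an adjacency mapping."""
    links = {name: set() for name in attributes}
    for f, g in combinations(attributes, 2):
        if jaccard(attributes[f], attributes[g]) >= threshold:
            links[f].add(g)
            links[g].add(f)
    return links

files = {
    "trip_report.doc": {"travel", "paris", "2006"},
    "paris_photos.zip": {"travel", "paris", "photos"},
    "budget.xls": {"finance", "2006"},
}
print(build_file_network(files))
```

Here the two Paris files share half of their combined attributes and are linked, while the budget file stays isolated.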

    Internet Visualization System and Related User Interfaces
    38.
    Patent application
    Status: In force

    Publication No.: US20080256444A1

    Publication date: 2008-10-16

    Application No.: US11972073

    Filing date: 2008-01-10

    IPC classification: G06F3/00

    CPC classification: G06F17/30864 G06F2216/03

    Abstract: Systems and methods are described for an Internet visualization system and related user interfaces. In one implementation, the system analyzes Internet search logs to determine most popular search queries across the world at a current time. A user interface displays a keyword of each of the most popular queries in a single visual display that relates each query to a geographical location of greatest popularity. The system can also filter queries according to demographics. In one implementation the user interface provides a 3-dimensional Internet visualization that adopts an ocean or seascape theme. The ocean floor displays a map of the world, and query bubbles rise from geographical locations on the map. The size and duration of each query bubble denotes the relative popularity of a given query.
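
The data side of this visualization (finding where each query is most popular, and sizing its bubble) can be sketched as follows. The log format, the area-proportional sizing rule, and all names are assumptions for illustration; rendering and demographic filtering are not covered.

```python
from collections import Counter, defaultdict

def peak_regions(log):
    """log: iterable of (region, query). Returns query -> (region of greatest popularity, count)."""
    counts = defaultdict(Counter)
    for region, query in log:
        counts[query][region] += 1
    return {query: c.most_common(1)[0] for query, c in counts.items()}

def bubble_radius(count, max_count, max_radius=50.0):
    """Scale bubble area, rather than radius, in proportion to popularity."""
    return max_radius * (count / max_count) ** 0.5

log = [("Tokyo", "sumo"), ("Tokyo", "sumo"), ("Paris", "sumo"), ("Paris", "louvre")]
print(peak_regions(log))     # {'sumo': ('Tokyo', 2), 'louvre': ('Paris', 1)}
print(bubble_radius(1, 4))   # 25.0
```

Scaling area instead of radius is a common charting choice so that a query four times as popular does not look sixteen times as large.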

    WEBPAGE BLOCK TRACKING GADGET
    39.
    Patent application
    Status: Under examination (published)

    Publication No.: US20080215997A1

    Publication date: 2008-09-04

    Application No.: US12038687

    Filing date: 2008-02-27

    IPC classification: G06F3/048

    CPC classification: G06F3/0481

    Abstract: An exemplary web browser system includes a selection module for selecting a webpage block and recording information about a selected webpage block; a tracking module for tracking changes to a selected webpage block based at least in part on the recorded information for that webpage block; and a display module for displaying a selected webpage block wherein the tracking module updates the display module as to changes to the selected webpage block. Various other exemplary systems, methods, devices are also disclosed.
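
The selection and tracking modules in this abstract can be illustrated with a small sketch that records a digest of a block's text at selection time and compares it on later fetches. The record fields, the use of a CSS-style selector string, and the SHA-256 digest are assumptions for illustration; fetching pages and extracting the block are left out.

```python
import hashlib
from dataclasses import dataclass

@dataclass
class TrackedBlock:
    url: str        # page containing the block
    selector: str   # hypothetical locator for the block within the page
    digest: str     # digest of the block text when last seen

def content_digest(text):
    return hashlib.sha256(text.encode("utf-8")).hexdigest()

def select_block(url, selector, current_text):
    """Selection step: record what is needed to recognize later changes."""
    return TrackedBlock(url, selector, content_digest(current_text))

def check_for_change(block, fetched_text):
    """Tracking step: report whether the block changed and refresh the stored digest."""
    new_digest = content_digest(fetched_text)
    changed = new_digest != block.digest
    block.digest = new_digest
    return changed

block = select_block("https://example.com/scores", "#headline", "Home 1 - 0 Away")
print(check_for_change(block, "Home 1 - 0 Away"))  # False
print(check_for_change(block, "Home 2 - 0 Away"))  # True
```

A display component would redraw the block whenever the tracking step reports a change.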

    Representing queries and determining similarity based on an ARIMA model
    40.
    Granted patent
    Status: Lapsed

    Publication No.: US08090709B2

    Publication date: 2012-01-03

    Application No.: US11770307

    Filing date: 2007-06-28

    IPC classification: G06F17/30

    CPC classification: G06Q30/02

    Abstract: Representing queries and determining similarity of queries based on an autoregressive integrated moving average (“ARIMA”) model is provided. A query analysis system represents each query by its ARIMA coefficients. The query analysis system may estimate the frequency information for a desired past or future interval based on frequency information for some initial intervals. The query analysis system may also determine the similarity of a pair of queries based on the similarity of their ARIMA coefficients. The query analysis system may use various metrics, such as a correlation metric, to determine the similarity of the ARIMA coefficients.
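
The correlation metric mentioned in this abstract can be sketched with a plain Pearson correlation over coefficient vectors. The sketch assumes the ARIMA coefficients have already been estimated elsewhere and that both queries use the same model order; the coefficient values below are made up for illustration.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length coefficient vectors."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    sd_a = math.sqrt(sum((x - mean_a) ** 2 for x in a))
    sd_b = math.sqrt(sum((y - mean_b) ** 2 for y in b))
    return cov / (sd_a * sd_b) if sd_a and sd_b else 0.0

# Hypothetical coefficient vectors, one per query, in a fixed (AR..., MA...) order.
query_a = [0.60, 0.30, -0.20]
query_b = [0.55, 0.35, -0.25]
query_c = [-0.60, -0.30, 0.20]
print(pearson(query_a, query_b), pearson(query_a, query_c))
```

Queries whose coefficient vectors correlate strongly would be treated as similar under this metric, and other metrics could be swapped in without changing the representation.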
