Efficient retrieval algorithm by query term discrimination
    31.
    发明授权
    Efficient retrieval algorithm by query term discrimination 有权
    通过查询词辨别的有效检索算法

    公开(公告)号:US07822752B2

    公开(公告)日:2010-10-26

    申请号:US11804627

    申请日:2007-05-18

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30675

    摘要: Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.

    摘要翻译: 描述了一种有效的检索机制,其基于查询词辨别快速定位文档(例如,对应于在线广告)。 根据其排序的重要性来选择搜索项的最顶层子集(例如,两个),例如按照倒排的文档频率排序。 然后使用最上面的术语来缩小被搜索以查找文档标识符和相关分数的反向查询索引的行数,例如通过BM25算法离线计算。 例如,对于每个重要术语的每个文档标识符,可以通过比较文档标识符来跳过每个其他行中的指针,然后是一个指针,来执行每个狭窄的行子集(也包含该文档标识符)的快速搜索 二进制搜索查找特定文档。 然后可以使用该组特定文件的分数来排列其作为结果返回的相对重要性。

    Efficient bitstream compression
    32.
    发明授权
    Efficient bitstream compression 有权
    高效的比特流压缩

    公开(公告)号:US07808405B1

    公开(公告)日:2010-10-05

    申请号:US12413787

    申请日:2009-03-30

    申请人: Zheng Chen

    发明人: Zheng Chen

    IPC分类号: H03M7/30 H03K19/173

    CPC分类号: H03M7/40 H03K19/177

    摘要: In one embodiment of the invention, a method of generating a compressed configuration bitstream for a programmable logic device comprises encoding the most-prevalent data word within the configuration data of the bitstream into a codeword of a first type; encoding a set of more-prevalent data words within the configuration data into codewords of a second type; and including in the compressed bitstream at least some of the data words that are members of the set of more-prevalent data words. The included data words, when received by the programmable logic device, are adapted to be identified by the device as members of the set of more-prevalent data words. The included data words are stored for selection by the device when a codeword of the second type representing an included data word is received by the device.

    摘要翻译: 在本发明的一个实施例中,生成用于可编程逻辑器件的压缩配置比特流的方法包括将比特流的配置数据内最流行的数据字编码为第一类型的码字; 将配置数据内的一组更流行的数据字编码成第二类型的码字; 并且在所述压缩比特流中包括作为所述一组更普遍数据字的成员的至少一些所述数据字。 所包括的数据字在被可编程逻辑器件接收时被适配为被该器件识别为该组更普遍的数据字的成员。 当由设备接收到表示所包括的数据字的第二类型的代码字时,包含的数据字被存储以供设备选择。

    Modulation of T cell signaling threshold and T cell sensitivity to antigens
    33.
    发明授权
    Modulation of T cell signaling threshold and T cell sensitivity to antigens 有权
    调节T细胞信号阈值和T细胞对抗原的敏感性

    公开(公告)号:US07803784B2

    公开(公告)日:2010-09-28

    申请号:US11977506

    申请日:2007-10-24

    摘要: MicroRNAs (miRNAs) are a diverse and abundant class of ˜22-nucleotide (nt) endogenous regulatory RNAs that play a variety of roles in animal cells by controlling gene expression at the posttranscriptional level. Increased miR-181a expression in mature T cells is shown to cause a marked increase in T cell activation and augments T cell sensitivity to peptide antigens. Moreover, T cell blasts with higher miR-181a expression become reactive to antagonists. The effects of miR-181a on antigen discrimination are in part achieved by dampening the expression of multiple negative regulators in the T cell receptor (TCR) signaling pathway, including PTPN22 and the dual specificity phosphatases DUSP5 and DUSP6. This results in a reduction in the TCR signaling threshold, thus quantitatively and qualitatively enhancing T cell sensitivity to antigens.

    摘要翻译: 微RNA(miRNA)是一种多样且丰富的〜22核苷酸(nt)内源性调控RNA,通过控制转录后水平的基因表达,在动物细胞中发挥各种作用。 在成熟T细胞中增加的miR-181a表达显示引起T细胞活化的显着增加并增加T细胞对肽抗原的敏感性。 此外,具有较高miR-181a表达的T细胞爆发成为对拮抗剂的反应性。 miR-181a对抗原鉴别的影响部分通过抑制T细胞受体(TCR)信号通路中多种负调节因子的表达,包括PTPN22和双特异性磷酸酶DUSP5和DUSP6来实现。 这导致TCR信号阈值的降低,从而定量和定性地增强T细胞对抗原的敏感性。

    PROVIDING CONTENT ITEMS SELECTED BASED ON CONTEXT
    34.
    发明申请
    PROVIDING CONTENT ITEMS SELECTED BASED ON CONTEXT 审中-公开
    提供基于背景选择的内容项目

    公开(公告)号:US20100241663A1

    公开(公告)日:2010-09-23

    申请号:US12791091

    申请日:2010-06-01

    IPC分类号: G06F17/30 G06F12/08

    摘要: Systems, methods, and computer storage media having computer-executable instructions embodied thereon that provide content items selected based on context are provided. Contextual indicators associated with a user are identified and utilized to determine one or more content items that the user is likely to desire to access at a particular point in time. Upon receiving an indication that the user desires to perform a context-aware search, the identified content items (or references thereto) are presented automatically to the user, that is, without the user having to input any search query terms. The indication that the user desires to perform a context-aware search may be received, for instance, upon receiving an indication that a selectable context-aware search button has been selected by the user. This single-button action is particularly useful for mobile computing devices, wherein alpha-numeric textual input is relatively difficult.

    摘要翻译: 提供了具有包含其上提供基于上下文选择的内容项的计算机可执行指令的系统,方法和计算机存储介质。 与用户相关联的上下文指标被识别并用于确定用户可能希望在特定时间点访问的一个或多个内容项目。 在接收到用户希望执行上下文感知搜索的指示之后,所识别的内容项目(或其参考文献)被自动呈现给用户,即,用户不必输入任何搜索查询项。 例如,在接收到用户选择了可选择的上下文感知搜索按钮的指示之后,可以接收用户期望执行上下文感知搜索的指示。 这种单按钮动作对于移动计算设备特别有用,其中字母数字文本输入相当困难。

    MINING NEW WORDS FROM A QUERY LOG FOR INPUT METHOD EDITORS
    35.
    发明申请
    MINING NEW WORDS FROM A QUERY LOG FOR INPUT METHOD EDITORS 有权
    从输入法编辑器的查询记录中挖掘新的词

    公开(公告)号:US20100088303A1

    公开(公告)日:2010-04-08

    申请号:US12244774

    申请日:2008-10-03

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731

    摘要: Described is a technology in which new words (including a phrase or set of Chinese characters) are mined from a query log. The new words may be added to (or otherwise supplement) an IME dictionary. A set of candidate queries may be selected from the log based upon market (e.g., the Chinese market) and/or by language. From this set, various filtering steps are performed to locate only new words that are frequently in used. For example, only frequent queries are kept for further processing, which may include filtering out queries based on length (e.g., less than two or greater than eight Chinese characters), and/or filtering out queries based on too many stop-words in the query. Processing may also include filtering out a query that is a substring of a larger query, or Vice-versa. Also described is Pinyin-based clustering and filtering, and filtering out queries already handled in the dictionary.

    摘要翻译: 描述了从查询日志中挖出新词(包括短语或一组汉字)的技术。 新词可能会添加到(或以其他方式补充)IME词典。 可以基于市场(例如,中国市场)和/或按语言从日志中选择一组候选查询。 从该集合中,执行各种过滤步骤以仅定位经常使用的新词。 例如,只有频繁的查询被保留用于进一步的处理,其可以包括基于长度(例如,少于两个或大于八个汉字)过滤掉查询,和/或基于过多的停止词过滤掉查询 查询。 处理还可以包括过滤掉作为较大查询的子串的查询,反之亦然。 还描述了基于拼音的群集和过滤,并且过滤掉已经在字典中处理的查询。

    Determination of time dependency of search queries
    36.
    发明授权
    Determination of time dependency of search queries 失效
    确定搜索查询的时间依赖关系

    公开(公告)号:US07693908B2

    公开(公告)日:2010-04-06

    申请号:US11770358

    申请日:2007-06-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06Q30/02

    摘要: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.

    摘要翻译: 用于分析和建模查询频率的技术由查询分析系统提供。 查询分析系统分析查询的频率,以确定查询是时间依赖还是时间无关。 查询分析系统根据其周期性预测与时间相关的查询的频率。 查询分析系统根据与其他查询的因果关系预测与时间无关的查询的频率。 为了预测时间无关查询的频率,查询分析系统随时间分析查询的频率,以识别频率的显着增加,这被称为“查询事件”或“事件”。查询分析系统预测频率 基于具有事件倾向于在要预测的查询的事件之前的查询的与时间无关的查询。

    Graph-based search leveraging sentiment analysis of user comments
    37.
    发明授权
    Graph-based search leveraging sentiment analysis of user comments 有权
    基于图形的搜索利用用户评论的情绪分析

    公开(公告)号:US07689624B2

    公开(公告)日:2010-03-30

    申请号:US11712808

    申请日:2007-03-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867

    摘要: A search system and method is provided. The method includes constructing a graph-based query that is indicative of a user's preference-levels for different features of a search item (a product, for example). The constructed graph-based query is executed by comparing the user's preference-levels for the different features of the product, which are graphically represented in the query, with information related to sentiments expressed by other users regarding the product. Information related to the sentiments expressed by other users regarding the product can include system-generated product performance graphs constructed from comments regarding the product obtained from the World Wide Web (or other network). Results returned and output upon execution of the graph-based query include system-generated product performance graphs that are similar to the user-submitted query.

    摘要翻译: 提供了一种搜索系统和方法。 该方法包括构建基于图表的查询,其指示用户对于搜索项(例如,产品)的不同特征的偏好级别。 通过将查询中以图形表示的产品的不同特征的用户偏好等级与由其他用户对产品表达的情绪相关的信息进行比较来执行构造的基于图的查询。 与其他用户对产品表达的情绪相关的信息可以包括由从万维网(或其他网络)获得的产品的评论构成的系统生成的产品性能图。 在执行基于图形的查询时返回并输出的结果包括与用户提交的查询类似的系统生成的产品性能图。

    Person disambiguation using name entity extraction-based clustering
    38.
    发明授权
    Person disambiguation using name entity extraction-based clustering 有权
    使用基于名称实体提取的聚类方法消除歧义

    公开(公告)号:US07685201B2

    公开(公告)日:2010-03-23

    申请号:US11796818

    申请日:2007-04-30

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/3071 G06F17/30696

    摘要: Described is a technology for disambiguating data corresponding to persons that are located from search results, so that different persons having the same name can be clearly distinguished. Name entity extraction locates words (terms) that are within a certain distance of persons' names in the search results. The terms are used in disambiguating search results that correspond to different persons having the same name, such as location information, organization information, career information, and/or partner information. In one example, each person is represented as a vector, and similarity among vectors is calculated based on weighting that corresponds to nearness of the terms to a person, and/or the types of terms. Based on the similarity data, the person vectors that represent the same person are then merged into one cluster, so that each cluster represents (to a high probability) only one distinct person.

    摘要翻译: 描述了一种用于消除对应于从搜索结果定位的人的数据的技术,使得可以清楚地区分具有相同名称的不同的人。 名称实体提取查找搜索结果中某人距离内的单词(术语)。 这些术语用于消除与具有相同名称的不同人员相对应的搜索结果,例如位置信息,组织信息,职业信息和/或合作伙伴信息。 在一个示例中,每个人被表示为向量,并且基于对应于对人的术语的接近度的加权和/或术语的类型来计算向量之间的相似性。 基于相似性数据,代表同一个人的人物向量然后被合并成一个群集,使得每个群集只代表一个不同的人。

    Forecasting time-independent search queries
    39.
    发明授权
    Forecasting time-independent search queries 有权
    预测与时间无关的搜索查询

    公开(公告)号:US07685099B2

    公开(公告)日:2010-03-23

    申请号:US11770445

    申请日:2007-06-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06Q30/02

    摘要: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.

    摘要翻译: 用于分析和建模查询频率的技术由查询分析系统提供。 查询分析系统分析查询的频率,以确定查询是时间依赖还是时间无关。 查询分析系统根据其周期性预测与时间相关的查询的频率。 查询分析系统根据与其他查询的因果关系预测与时间无关的查询的频率。 为了预测时间无关查询的频率,查询分析系统随时间分析查询的频率,以识别频率的显着增加,这被称为“查询事件”或“事件”。查询分析系统预测频率 基于具有事件倾向于在要预测的查询的事件之前的查询的与时间无关的查询。

    METHOD AND SYSTEM FOR ADAPTING SEARCH RESULTS TO PERSONAL INFORMATION NEEDS
    40.
    发明申请
    METHOD AND SYSTEM FOR ADAPTING SEARCH RESULTS TO PERSONAL INFORMATION NEEDS 有权
    搜索结果适用于个人信息需求的方法和系统

    公开(公告)号:US20100057798A1

    公开(公告)日:2010-03-04

    申请号:US12616739

    申请日:2009-11-11

    IPC分类号: G06F17/30

    摘要: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.

    摘要翻译: 提供了一种用于将查询的搜索结果适应于提交查询的用户的信息需求的方法和系统。 搜索系统分析点击三胞胎,指示用户提交了查询,并且用户从查询的结果中选择了文档。 为了克服点击数据的大尺寸和稀疏性,搜索系统当呈现包括用户,查询和文档的输入三元组时,确定用户将通过平滑点击数据来重新找到输入文档的概率, 通过三胞胎。 然后,搜索系统基于其对输入用户的重要性的概率来订购结果的文档。