-
公开(公告)号:US07849077B2
公开(公告)日:2010-12-07
申请号:US11481686
申请日:2006-07-06
申请人: Ciya Liao , Shamim A. Alpha
发明人: Ciya Liao , Shamim A. Alpha
IPC分类号: G06F17/30
CPC分类号: G06F17/30672 , G06F17/30864
摘要: Systems, methods, media, and other embodiments associated with ranking documents by providing a search engine with a series of sub-queries generated from an original query are described. One example system includes input logic for receiving a query. The example system may include a relaxation logic configured to produce sub-queries from the query. The sub-queries may describe metadata string matching, content string matching, and/or metadata numerical attribute analysis. The sub-queries may be provided by an output logic to a search engine in an order that facilitates defining document relevance without requiring post-retrieval relevance ranking.
摘要翻译: 描述了通过向搜索引擎提供从原始查询生成的一系列子查询来对与文档排序相关联的系统,方法,媒体和其他实施例。 一个示例系统包括用于接收查询的输入逻辑。 示例系统可以包括被配置为从查询产生子查询的放松逻辑。 子查询可以描述元数据字符串匹配,内容字符串匹配和/或元数据值属性分析。 子查询可以通过输出逻辑以搜索引擎的顺序提供,这有助于定义文档相关性,而不需要后检索相关性排名。
-
公开(公告)号:US20080010268A1
公开(公告)日:2008-01-10
申请号:US11481686
申请日:2006-07-06
申请人: Ciya Liao , Shamim A. Alpha
发明人: Ciya Liao , Shamim A. Alpha
CPC分类号: G06F17/30672 , G06F17/30864
摘要: Systems, methods, media, and other embodiments associated with ranking documents by providing a search engine with a series of sub-queries generated from an original query are described. One example system includes input logic for receiving a query. The example system may include a relaxation logic configured to produce sub-queries from the query. The sub-queries may describe metadata string matching, content string matching, and/or metadata numerical attribute analysis. The sub-queries may be provided by an output logic to a search engine in an order that facilitates defining document relevance without requiring post-retrieval relevance ranking.
摘要翻译: 描述了通过向搜索引擎提供从原始查询生成的一系列子查询来对与文档排序相关联的系统,方法,媒体和其他实施例。 一个示例系统包括用于接收查询的输入逻辑。 示例系统可以包括被配置为从查询产生子查询的放松逻辑。 子查询可以描述元数据字符串匹配,内容字符串匹配和/或元数据值属性分析。 子查询可以通过输出逻辑以搜索引擎的顺序提供,这有助于定义文档相关性,而不需要后检索相关性排名。
-
公开(公告)号:US07856598B2
公开(公告)日:2010-12-21
申请号:US11481750
申请日:2006-07-06
申请人: Ciya Liao , Shamim A. Alpha
发明人: Ciya Liao , Shamim A. Alpha
IPC分类号: G06F17/00 , G06F17/20 , G06F17/21 , G06F17/22 , G06F17/24 , G06F17/25 , G06F17/26 , G06F17/27 , G06F17/28
CPC分类号: G06F17/273
摘要: Systems, methods, media, and other embodiments associated with (non)contiguous n-gram based spell correction are described. One exemplary system embodiment includes logic for creating contiguous and non-contiguous trigrams, logic for creating an inverted index relating trigrams and the words from which they were generated, and logic for comparing trigrams associated with a word to spell check to trigrams associated with the words selected using the inverted index.
摘要翻译: 描述与(非)连续的基于n-gram的拼写校正相关联的系统,方法,介质和其他实施例。 一个示例性系统实施例包括用于创建连续和不连续的三元组的逻辑,用于创建与三角形相关联的反向索引的逻辑和从其产生的单词的逻辑,以及用于将与单词相关联的三元组与拼写检查相对应的逻辑与用于与单词相关联的三元组 使用反向索引选择。
-
公开(公告)号:US20080010316A1
公开(公告)日:2008-01-10
申请号:US11481750
申请日:2006-07-06
申请人: Ciya Liao , Shamim A. Alpha
发明人: Ciya Liao , Shamim A. Alpha
CPC分类号: G06F17/273
摘要: Systems, methods, media, and other embodiments associated with (non)contiguous n-gram based spell correction are described. One exemplary system embodiment includes logic for creating contiguous and non-contiguous trigrams, logic for creating an inverted index relating trigrams and the words from which they were generated, and logic for comparing trigrams associated with a word to spell check to trigrams associated with the words selected using the inverted index.
摘要翻译: 描述与(非)连续的基于n-gram的拼写校正相关联的系统,方法,介质和其他实施例。 一个示例性系统实施例包括用于创建连续和不连续的三元组的逻辑,用于创建与三角形相关联的反向索引的逻辑和从其产生的单词的逻辑,以及用于将与单词相关联的三元组与拼写检查相对应的逻辑与用于与单词相关联的三元组 使用反向索引选择。
-
公开(公告)号:US08433712B2
公开(公告)日:2013-04-30
申请号:US11680548
申请日:2007-02-28
申请人: Hiroshi Koide , Ciya Liao , Cindy Hsin , Meeten Bhavsar
发明人: Hiroshi Koide , Ciya Liao , Cindy Hsin , Meeten Bhavsar
CPC分类号: G06F17/30979 , G06F21/41
摘要: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety or sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.
摘要翻译: 灵活可扩展的架构允许跨企业进行安全搜索。 这样的架构可以为在企业内部(和外部)搜索安全内容的用户提供简单的类似Internet的搜索体验。 该架构允许在整个企业中爬行和搜索各种源,而不管这些源是否符合常规用户角色模型。 该体系结构进一步允许在查询时提交安全属性,例如为了提供对企业资源的实时安全访问。 用户查询也可以被转换以提供动态查询,其提供比静态查询可获得的更多当前结果列表。
-
公开(公告)号:US20110258184A1
公开(公告)日:2011-10-20
申请号:US13169688
申请日:2011-06-27
IPC分类号: G06F17/30
CPC分类号: G06F17/30867 , G06F17/30699
摘要: Search term ranking algorithms can be generated and updated based on customer settings, such as where a ranking algorithm is modeled as a combination function of different ranking factors. An end user of a search system provides personalized preferences for weighted attributes, generally or for a single instance of the query. The user also can indicate the relative importance of one or more ranking factors by specifying different weights to the factors. Ranking factors can specify document attributes, such as document title, document body, document page rank, etc. Based on the attribute weights and the received user query, a ranking algorithm function will produce the relevant value for each document corresponding to the user preferences and personalization configurations.
摘要翻译: 搜索项排序算法可以根据客户设置生成和更新,例如排序算法被建模为不同排名因素的组合函数。 搜索系统的最终用户为加权属性提供个性化偏好,一般或单个查询实例。 用户还可以通过为因素指定不同的权重来指示一个或多个排名因子的相对重要性。 排名因素可以指定文档属性,如文档标题,文档正文,文档页面排名等。基于属性权重和接收到的用户查询,排序算法函数将为每个文档生成与用户偏好相对应的相关值, 个性化配置
-
公开(公告)号:US20110231390A1
公开(公告)日:2011-09-22
申请号:US12725310
申请日:2010-03-16
申请人: Yoshiyuki Inagaki , Narayanan Sadagopan , Georges-Eric Albert Marie Robert Dupret , Ciya Liao , Anlei Dong , Yi Chang , Zhaohui Zheng
发明人: Yoshiyuki Inagaki , Narayanan Sadagopan , Georges-Eric Albert Marie Robert Dupret , Ciya Liao , Anlei Dong , Yi Chang , Zhaohui Zheng
CPC分类号: G06F17/30864
摘要: In one embodiment, access one or more query-resource pairs, wherein for each one of the query-resource pairs comprising one of one or more search queries and one of one or more network resources, the one search query is recency-sensitive with respect to a particular time period, and the one network resource is identified for the one search query, and a resource-view count and a resource-click count associated with each one of the query-resource pairs; and construct one or more first click features using the resource-view counts and the resource-click counts associated with the query-resource pairs. To construct one of the first click features in connection with one of the query-resource pairs comprises determine a only-resource-click count associated with the one query-resource pair; and calculate a ratio between the only-resource-click count and the resource-view count associated with the one query-resource pair as the one first click feature.
摘要翻译: 在一个实施例中,访问一个或多个查询 - 资源对,其中对于包括一个或多个搜索查询中的一个和一个或多个网络资源中的一个的查询 - 资源对中的每个查询 - 资源对,所述一个搜索查询对于近似度敏感 到特定时间段,并且为一个搜索查询标识一个网络资源,以及与每个查询 - 资源对相关联的资源视图计数和资源点击计数; 并使用资源视图计数和与查询 - 资源对相关联的资源点击计数构建一个或多个第一个点击功能。 为了构建与其中一个查询 - 资源对相关联的第一个点击功能之一,包括确定与一个查询 - 资源对相关联的唯一资源点击计数; 并且计算唯一的资源点击计数和与一个查询资源对相关联的资源视图计数之间的比率作为一个第一点击特征。
-
公开(公告)号:US07996392B2
公开(公告)日:2011-08-09
申请号:US11769245
申请日:2007-06-27
CPC分类号: G06F17/30867 , G06F17/30699
摘要: Search term ranking algorithms can be generated and updated based on customer settings, such as where a ranking algorithm is modeled as a combination function of different ranking factors. An end user of a search system provides personalized preferences for weighted attributes, generally or for a single instance of the query. The user also can indicate the relative importance of one or more ranking factors by specifying different weights to the factors. Ranking factors can specify document attributes, such as document title, document body, document page rank, etc. Based on the attribute weights and the received user query, a ranking algorithm function will produce the relevant value for each document corresponding to the user preferences and personalization configurations.
摘要翻译: 搜索项排序算法可以根据客户设置生成和更新,例如排序算法被建模为不同排名因素的组合函数。 搜索系统的最终用户为加权属性提供个性化偏好,一般或单个查询实例。 用户还可以通过为因素指定不同的权重来指示一个或多个排名因子的相对重要性。 排名因素可以指定文档属性,如文档标题,文档正文,文档页面排名等。基于属性权重和接收到的用户查询,排序算法函数将为每个文档生成与用户偏好相对应的相关值, 个性化配置
-
公开(公告)号:US08626794B2
公开(公告)日:2014-01-07
申请号:US13539622
申请日:2012-07-02
IPC分类号: G06F17/30
CPC分类号: H04L63/08 , G06F17/30011 , G06F17/30321 , G06F17/30477 , G06F17/30554 , G06F17/30864 , G06F17/30867 , G06F21/31 , G06F21/6227 , H04L63/0815 , H04L63/083 , H04L63/102
摘要: A web crawler indexes documents including information about document contents and metadata including information such as a URL. However, some applications rely on URL's that change frequently or are constructed to include user information so that the contents retrieved is customized to the user. An approach is provided for storing generic URL's in an index at crawl time, which are customized for the user at search time. A callback mechanism may be used to dynamically transform the generic URL into a URL that is specific to the user issuing the query and/or includes current information that may change frequently. In this way, when the query or search results are returned to the user, the user receives links that are active and valid for that particular user, directing the user to the appropriate site, application, etc. without requiring continuous updating of a very large index.
摘要翻译: 网页抓取工具索引文档,包括有关文档内容和元数据的信息,包括诸如URL之类的信息。 然而,一些应用程序依赖于频繁更改的URL或被构造为包括用户信息,以便检索到的内容是为用户定制的。 提供了一种方法,用于将通用URL存储在抓取时间的索引中,这是在搜索时为用户定制的。 可以使用回调机制来动态地将通用URL变换成特定于发布查询的用户的URL和/或包括可能频繁变化的当前信息。 以这种方式,当查询或搜索结果被返回给用户时,用户接收对该特定用户有效且有效的链接,将用户引导到适当的站点,应用等,而不需要持续更新非常大的 指数。
-
10.
公开(公告)号:US08458165B2
公开(公告)日:2013-06-04
申请号:US11770027
申请日:2007-06-28
申请人: Ciya Liao , Thomas Chang
发明人: Ciya Liao , Thomas Chang
IPC分类号: G06F17/00
CPC分类号: G06F17/30595
摘要: An enterprise-wide query relaxative support vector machine ranking algorithm approach provides enhanced functionality for query execution in a heterogeneous enterprise environment. Improved query results are obtained by adjusting ranking functions using machine learning methods to automatically train ranking functions. The improved query results are obtained using a list of document-query pairs that are modeled as a binary classification training problem, combination function which requires ranking and learning functions to be implemented representing document attributes and metadata utilizing query relaxation techniques and adjusted ranking functions. Machine learning methods implement user feedback to automatically train ranking functions.
摘要翻译: 企业级查询放松支持向量机排名算法方法为异构企业环境中的查询执行提供了增强的功能。 通过使用机器学习方法调整排名函数来自动训练排名函数,获得改进的查询结果。 改进的查询结果是使用被建模为二进制分类训练问题的文档查询对的列表获得的,组合函数需要使用查询放松技术和调整的排序函数表示文档属性和元数据的排序和学习功能。 机器学习方法实现用户反馈,自动训练排名功能。
-
-
-
-
-
-
-
-
-