-
公开(公告)号:US20080097968A1
公开(公告)日:2008-04-24
申请号:US11712346
申请日:2007-02-28
CPC分类号: G06F17/278 , G06F17/30731
摘要: Systems, methods, and other embodiments associated with extracting knowledge from application data and maintaining an ontology based on the extracted knowledge are described. One example system includes a mapping logic to store mappings between application objects and ontology classes and an information extraction (IE) logic that accesses the mapping logic to identify application data to process based on the mappings. The application data may be stored in application data repositories belonging to an enterprise and may be characterized by the application object. Having identified application data to process, the IE logic may locate data in the application data repositories and selectively manipulate an ontology based on selected application data elements.
摘要翻译: 描述了与从提取的知识中提取知识和维护基于所提取的知识的本体相关联的系统,方法和其他实施例。 一个示例性系统包括用于存储应用对象和本体类之间的映射的映射逻辑,以及访问映射逻辑以识别基于映射进行处理的应用数据的信息提取(IE)逻辑。 应用数据可以存储在属于企业的应用数据存储库中,并且可以由应用对象来表征。 在确定要处理的应用数据之后,IE逻辑可以将数据定位在应用数据存储库中,并基于所选择的应用数据元素选择性地操纵本体。
-
公开(公告)号:US20070208746A1
公开(公告)日:2007-09-06
申请号:US11680571
申请日:2007-02-28
申请人: Hiroshi Koide , Mark Ture , Muralidhar Krishnaprasad , Mark Davis , Cindy Hsin , Meeten Bhavsar , Chi-Ming Yang , Visar Nimani , Hui Ouyang , Sachin Bhatkar , Thomas Chang , Thomas Baby , Ciya Liao
发明人: Hiroshi Koide , Mark Ture , Muralidhar Krishnaprasad , Mark Davis , Cindy Hsin , Meeten Bhavsar , Chi-Ming Yang , Visar Nimani , Hui Ouyang , Sachin Bhatkar , Thomas Chang , Thomas Baby , Ciya Liao
IPC分类号: G06F17/30
CPC分类号: G06F21/6218 , G06F21/6236 , H04L63/0815
摘要: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety or sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.
摘要翻译: 灵活可扩展的架构允许跨企业进行安全搜索。 这样的架构可以为在企业内部(和外部)搜索安全内容的用户提供简单的类似Internet的搜索体验。 该架构允许在整个企业中爬行和搜索各种或多个源,无论这些源是否符合常规用户角色模型。 该体系结构进一步允许在查询时提交安全属性,例如为了提供对企业资源的实时安全访问。 用户查询也可以被转换以提供动态查询,其提供比静态查询可获得的更多当前结果列表。
-
公开(公告)号:US08601028B2
公开(公告)日:2013-12-03
申请号:US13536488
申请日:2012-06-28
IPC分类号: G06F17/30
CPC分类号: H04L63/08 , G06F17/30011 , G06F17/30321 , G06F17/30477 , G06F17/30554 , G06F17/30864 , G06F17/30867 , G06F21/31 , G06F21/6227 , H04L63/0815 , H04L63/083 , H04L63/102
摘要: It is desirable to provide a secure search mechanism to provide for searching over any and all content, such as across an enterprise. A secure search, however, requires access to the secure content repositories holding the data to be searched. In some cases the credentials required to crawl a repository may be extremely sensitive, or the user may be reluctant or unwilling to store user identification information in memory or on disk for any longer than is absolutely necessary. An approach is provided that allows a user or an administrator to provide security credentials to be stored and used only during a crawl, and to erase the credentials from the system when the crawl is complete.
摘要翻译: 期望提供一种安全搜索机制来提供对任何和所有内容的搜索,诸如跨企业的搜索。 然而,安全搜索需要访问保存要搜索的数据的安全内容存储库。 在某些情况下,爬取存储库所需的凭据可能非常敏感,或者用户可能不愿意或不愿意将用户标识信息存储在内存或磁盘上,而不是绝对必要的。 提供了一种方法,允许用户或管理员提供仅在爬网期间存储和使用的安全凭据,并在抓取完成时从系统中清除凭据。
-
公开(公告)号:US20130173582A1
公开(公告)日:2013-07-04
申请号:US13539622
申请日:2012-07-02
IPC分类号: G06F17/30
CPC分类号: H04L63/08 , G06F17/30011 , G06F17/30321 , G06F17/30477 , G06F17/30554 , G06F17/30864 , G06F17/30867 , G06F21/31 , G06F21/6227 , H04L63/0815 , H04L63/083 , H04L63/102
摘要: A web crawler indexes documents including information about document contents and metadata including information such as a URL. However, some applications rely on URL's that change frequently or are constructed to include user information so that the contents retrieved is customized to the user. An approach is provided for storing generic URL's in an index at crawl time, which are customized for the user at search time. A callback mechanism may be used to dynamically transform the generic URL into a URL that is specific to the user issuing the query and/or includes current information that may change frequently. In this way, when the query or search results are returned to the user, the user receives links that are active and valid for that particular user, directing the user to the appropriate site, application, etc. without requiring continuous updating of a very large index.
摘要翻译: 网页抓取工具索引文档,包括有关文档内容和元数据的信息,包括诸如URL之类的信息。 然而,一些应用程序依赖于频繁更改的URL或被构造为包括用户信息,以便检索到的内容是为用户定制的。 提供了一种方法,用于将通用URL存储在抓取时间的索引中,这是在搜索时为用户定制的。 可以使用回调机制来动态地将通用URL变换成特定于发布查询的用户的URL和/或包括可能频繁变化的当前信息。 以这种方式,当查询或搜索结果被返回给用户时,用户接收对该特定用户有效且有效的链接,将用户指向适当的站点,应用等,而不需要持续更新非常大的 指数。
-
公开(公告)号:US08326815B2
公开(公告)日:2012-12-04
申请号:US12725357
申请日:2010-03-16
申请人: Narayanan Sadagopan , Yoshiyuki Inagaki , Georges-Eric Albert Marie Robert Dupret , Ciya Liao , Anlei Dong , Yi Chang , Zhaohui Zheng
发明人: Narayanan Sadagopan , Yoshiyuki Inagaki , Georges-Eric Albert Marie Robert Dupret , Ciya Liao , Anlei Dong , Yi Chang , Zhaohui Zheng
CPC分类号: G06F17/30864
摘要: In one embodiment, access one or more query chains, wherein each one of the query chains comprises two or more search queries, {q1, . . . , qn}, which are recency-sensitive, are related to the same subject matter, and are issued to a search engine sequentially, and actual click-through information associated with each one of the query chains; and smooth each one of the query chains using the actual click-through information associated with the query chain. To smooth one of the query chains comprises, for each one of search queries, qj, in the query chain, where 2≦j≦n, if one of the network resources identified for qj has actually been clicked in connection with qj by the corresponding one network user, then presume that the one network resource has been clicked in connection with one or more search queries, qk, in the query chain, where 1≦k
摘要翻译: 在一个实施例中,访问一个或多个查询链,其中每个查询链包括两个或多个搜索查询{q1,..., 。 。 ,qn},它们是新近度敏感的,与相同的主题相关,并且被顺序地发布到搜索引擎,并且与每个查询链相关联的实际点击信息; 并使用与查询链相关联的实际点击信息来平滑每个查询链。 为了平滑一个查询链,对于查询链中的每个搜索查询,包括qj,其中2≦̸ j≦̸ n,如果为qj标识的一个网络资源实际上已经被qj与点对点相关联 一个网络用户,然后假设一个网络资源已被连接到查询链中的一个或多个搜索查询qk,其中1≦̸ k
-
公开(公告)号:US20120272304A1
公开(公告)日:2012-10-25
申请号:US13536488
申请日:2012-06-28
IPC分类号: G06F21/00
CPC分类号: H04L63/08 , G06F17/30011 , G06F17/30321 , G06F17/30477 , G06F17/30554 , G06F17/30864 , G06F17/30867 , G06F21/31 , G06F21/6227 , H04L63/0815 , H04L63/083 , H04L63/102
摘要: It is desirable to provide a secure search mechanism to provide for searching over any and all content, such as across an enterprise. A secure search, however, requires access to the secure content repositories holding the data to be searched. In some cases the credentials required to crawl a repository may be extremely sensitive, or the user may be reluctant or unwilling to store user identification information in memory or on disk for any longer than is absolutely necessary. An approach is provided that allows a user or an administrator to provide security credentials to be stored and used only during a crawl, and to erase the credentials from the system when the crawl is complete.
摘要翻译: 期望提供一种安全搜索机制来提供对任何和所有内容的搜索,诸如跨企业的搜索。 然而,安全搜索需要访问保存要搜索的数据的安全内容存储库。 在某些情况下,爬取存储库所需的凭据可能非常敏感,或者用户可能不愿意或不愿意将用户标识信息存储在内存或磁盘上,而不是绝对必要的。 提供了一种方法,允许用户或管理员提供仅在爬网期间存储和使用的安全凭据,并在抓取完成时从系统中清除凭据。
-
公开(公告)号:US08239414B2
公开(公告)日:2012-08-07
申请号:US13110461
申请日:2011-05-18
IPC分类号: G06F17/00
CPC分类号: H04L63/08 , G06F17/30011 , G06F17/30321 , G06F17/30477 , G06F17/30554 , G06F17/30864 , G06F17/30867 , G06F21/31 , G06F21/6227 , H04L63/0815 , H04L63/083 , H04L63/102
摘要: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety of sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security, recency, or other attributes to be submitted at query time, for example, in order to re-rank query results from enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.
摘要翻译: 灵活可扩展的架构允许跨企业进行安全搜索。 这样的架构可以为在企业内部(和外部)搜索安全内容的用户提供简单的类似Internet的搜索体验。 该架构允许在整个企业中爬行和搜索各种源,而不管这些源是否符合常规用户角色模型。 该体系结构还允许在查询时提交安全性,新近度或其他属性,例如,以便从企业资源重新排列查询结果。 用户查询也可以被转换以提供动态查询,其提供比静态查询可获得的更多当前结果列表。
-
公开(公告)号:US08027979B2
公开(公告)日:2011-09-27
申请号:US12829766
申请日:2010-07-02
申请人: Ciya Liao , Omar Alonso , Joaquin A. Delgado , Thomas H. Chang , Meeten Bhavsar
发明人: Ciya Liao , Omar Alonso , Joaquin A. Delgado , Thomas H. Chang , Meeten Bhavsar
CPC分类号: G06F17/30719
摘要: Systems, methods, and other embodiments associated with automatically summarizing a document are described. One method embodiment includes computing term scores for members of a set of terms in a document to be summarized and computing sentence scores for sentences in a set of sentences in the document. The method embodiment also includes computing a set of entries for a term-sentence matrix that relates terms to sentences. The method embodiment also includes computing a dominant topic for the document and simultaneously ranking the set of terms and the set of sentences based on the dominant topic. The method embodiment provides a summarization item(s) selected from the set of terms and/or the set of sentences.
摘要翻译: 描述与自动总结文档相关联的系统,方法和其他实施例。 一个方法实施例包括计算要汇总的文档中的一组术语的成员的术语分数,以及计算文档中一组句子中的句子的句子分数。 方法实施例还包括计算用于将术语与句子相关联的术语矩阵的条目集合。 该方法实施例还包括计算文档的主导主题,并且基于主题来同时对该组语句和一组句子进行排序。 该方法实施例提供从该组项和/或一组句子中选择的摘要项目。
-
公开(公告)号:US07783640B2
公开(公告)日:2010-08-24
申请号:US11647871
申请日:2006-12-29
申请人: Ciya Liao , Omar Alonso , Joaquin A. Delgado , Thomas H. Chang , Meeten Bhavsar
发明人: Ciya Liao , Omar Alonso , Joaquin A. Delgado , Thomas H. Chang , Meeten Bhavsar
CPC分类号: G06F17/30719
摘要: Systems, methods, and other embodiments associated with automatically summarizing a document are described. One method embodiment includes computing term scores for members of a set of terms in a document to be summarized and computing sentence scores for sentences in a set of sentences in the document. The method embodiment also includes computing a set of entries for a term-sentence matrix that relates terms to sentences. The method embodiment also includes computing a dominant topic for the document and simultaneously ranking the set of terms and the set of sentences based on the dominant topic. The method embodiment provides a summarization item(s) selected from the set of terms and/or the set of sentences.
摘要翻译: 描述与自动总结文档相关联的系统,方法和其他实施例。 一个方法实施例包括计算要汇总的文档中的一组术语的成员的术语分数,以及计算文档中一组句子中的句子的句子分数。 方法实施例还包括计算用于将术语与句子相关联的术语矩阵的条目集合。 该方法实施例还包括计算文档的主导主题,并且基于主题来同时对该组语句和一组句子进行排序。 该方法实施例提供从该组项和/或一组句子中选择的摘要项目。
-
公开(公告)号:US07756798B2
公开(公告)日:2010-07-13
申请号:US11714418
申请日:2007-03-06
IPC分类号: G06N5/00
CPC分类号: H04L51/12
摘要: Systems, methods, and other embodiments associated with identifying and selectively deleting duplicate search results are described. One example system embodiment includes logic to receive an identity indicator from a search logic. The identity indicator is associated with a search item that the search logic determines to be relevant to a search request. The example system may also include logic to determine whether the search result associated with the identity indicator is a duplicate result based on comparing the identity indicator to another identity indicator associated with another search result.
摘要翻译: 描述与识别和选择性地删除重复搜索结果相关联的系统,方法和其他实施例。 一个示例系统实施例包括从搜索逻辑接收身份指示符的逻辑。 身份指示符与搜索项目相关联,搜索逻辑确定与搜索请求相关。 该示例系统还可以包括用于基于将身份指示符与与另一搜索结果相关联的另一身份指示符进行比较来确定与身份指示符相关联的搜索结果是否是重复结果的逻辑。
-
-
-
-
-
-
-
-
-