-
Publication No.: US20090192987A1
Publication date: 2009-07-30
Application No.: US12022777
Filing date: 2008-01-30
IPC classification: G06F17/30
CPC classification: G06F16/9535, G06F16/29
Abstract: Exemplary embodiments of the present invention relate to a method for searching navigational pages within an intranet environment. The method comprises identifying a plurality of navigational pages, performing a page-level analysis upon each identified navigational page in order to determine if a navigational page can be categorized as a candidate navigational page, performing a cross-page analysis upon each determined candidate navigational page in order to generate a final set of navigational pages, associating each final navigational page with a predetermined semantic classification group, generating term variants for each navigational page, building a navigational index for each semantic classification grouping, and filtering user queries in association with a user profile of a user that is posing a query.
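The steps named in this abstract can be sketched as a small pipeline. The following is only an illustration under stated assumptions, not the patented method: the `Page` type, the short-title heuristic for page-level analysis, the shortest-URL rule for cross-page analysis, the acronym-style term variants, and the use of group names as the user profile are all hypothetical choices made for this example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Page:
    url: str
    title: str
    group: str  # predetermined semantic classification group (assumed to be given)


def is_candidate(page):
    # Page-level analysis (assumed heuristic): navigational pages have short titles.
    return 0 < len(page.title.split()) <= 4


def cross_page_filter(candidates):
    # Cross-page analysis (assumed heuristic): among pages that share a group
    # and a title, keep only the one with the shortest URL.
    best = {}
    for page in candidates:
        key = (page.group, page.title.lower())
        if key not in best or len(page.url) < len(best[key].url):
            best[key] = page
    return list(best.values())


def term_variants(title):
    # Term variants (assumed): the lowercased title plus its acronym.
    words = title.lower().split()
    variants = {" ".join(words)}
    if len(words) > 1:
        variants.add("".join(word[0] for word in words))
    return variants


def build_indexes(pages):
    # One navigational index per semantic classification group: term -> URL.
    indexes = defaultdict(dict)
    for page in pages:
        for term in term_variants(page.title):
            indexes[page.group][term] = page.url
    return indexes


def search(indexes, query, profile_groups):
    # Filter by user profile: consult only the indexes of the user's groups.
    q = query.lower().strip()
    return [indexes[g][q] for g in profile_groups if q in indexes.get(g, {})]


pages = [
    Page("http://intranet.example/us/hr/benefits/index.html", "Human Resources", "US"),
    Page("http://intranet.example/us/hr", "Human Resources", "US"),
    Page("http://intranet.example/de/hr", "Human Resources", "DE"),
    Page("http://intranet.example/us/news/q4",
         "Quarterly results announced for all business units", "US"),
]
final_pages = cross_page_filter([p for p in pages if is_candidate(p)])
indexes = build_indexes(final_pages)
print(search(indexes, "HR", ["US"]))
```

With this sample data, a user whose profile lists only the hypothetical "US" group gets the US page for the query "HR", while the same query under a "DE" profile resolves to the DE page.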
-
Publication No.: US07987416B2
Publication date: 2011-07-26
Application No.: US11939794
Filing date: 2007-11-14
IPC classification: G06F17/00
CPC classification: G06F17/30864, G06F17/241
Abstract: Embodiments of the present invention include a computer-implemented method of extracting information. In one embodiment, the present invention comprises defining a plurality of reusable operators, wherein each operator performs a predefined information extraction task different from the other operators. Composite annotators may be created by specifying a composition of the reusable operators. Each operator may receive a searchable item, such as a web page or an annotation, and may generate one or more output annotations. The output annotations may be further processed by other reusable operators and the annotations may be stored in a repository for use during a search.
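The idea of composing reusable operators into a composite annotator can be illustrated with a short sketch. It is an assumption-laden example rather than the patented implementation: the `Annotation` type, the two sample operators, the `compose` helper, and the dictionary used as a repository are hypothetical names introduced here.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Annotation:
    kind: str
    text: str


# An operator maps one searchable item (here, the text of a web page or of an
# annotation) to zero or more output annotations.
Operator = Callable[[str], List[Annotation]]


def title_operator(page: str) -> List[Annotation]:
    # Hypothetical operator: extract the HTML title of a page.
    match = re.search(r"<title>(.*?)</title>", page, re.IGNORECASE | re.DOTALL)
    return [Annotation("title", match.group(1).strip())] if match else []


def acronym_operator(text: str) -> List[Annotation]:
    # Hypothetical operator: extract runs of two or more capital letters.
    return [Annotation("acronym", a) for a in re.findall(r"\b[A-Z]{2,}\b", text)]


def compose(first: Operator, second: Operator) -> Operator:
    # Composite annotator: feed each output annotation of `first` to `second`.
    def composite(item: str) -> List[Annotation]:
        results: List[Annotation] = []
        for annotation in first(item):
            results.extend(second(annotation.text))
        return results
    return composite


title_acronyms = compose(title_operator, acronym_operator)

# A plain dictionary stands in for the annotation repository used at search time.
repository = {}
page = "<html><title>HR Benefits Portal</title><body>See IT for help.</body></html>"
repository["page-1"] = title_acronyms(page)
print(repository["page-1"])
```

Because `compose` returns another operator, composites can themselves be composed further, which is one way to read "the output annotations may be further processed by other reusable operators".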
-
Publication No.: US20090125542A1
Publication date: 2009-05-14
Application No.: US11939794
Filing date: 2007-11-14
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06F17/241
Abstract: Embodiments of the present invention include a computer-implemented method of extracting information. In one embodiment, the present invention comprises defining a plurality of reusable operators, wherein each operator performs a predefined information extraction task different from the other operators. Composite annotators may be created by specifying a composition of the reusable operators. Each operator may receive a searchable item, such as a web page or an annotation, and may generate one or more output annotations. The output annotations may be further processed by other reusable operators and the annotations may be stored in a repository for use during a search.
-
-