Method and system for form-filling crawl and associating rich keywords
    1.
    Granted invention patent
    Method and system for form-filling crawl and associating rich keywords (status: in force)

    Publication No.: US08793239B2

    Publication date: 2014-07-29

    Application No.: US12576011

    Filing date: 2009-10-08

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30864

    Abstract: Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.
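
    The crawl, match, and tag steps in this abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the grid is read as a list of ZIP codes to query, submit_form is a hypothetical stand-in for submitting a dealer-locator form, and matching is a simple normalized name-plus-ZIP lookup. The patent's actual extractor and matcher are not described at this level of detail.

```python
from dataclasses import dataclass, field


@dataclass
class Listing:
    """One entry in a (hypothetical) business listing database."""
    name: str
    zip_code: str
    tags: set = field(default_factory=set)


# Hypothetical canned responses standing in for a real dealer-locator form.
FAKE_SITE = {
    "94086": [("Acme Bikes", "94086")],
    "94087": [("Acme Bikes", "94086"), ("Bay Cycles", "94087")],
}


def submit_form(zip_code):
    """Stand-in for filling in and submitting the form for one ZIP code."""
    return FAKE_SITE.get(zip_code, [])


def crawl(grid):
    """Query every grid cell and collect de-duplicated (name, zip) records."""
    seen = set()
    for zip_code in grid:
        for record in submit_form(zip_code):
            seen.add(record)
    return seen


def normalize(name):
    """Lower-case and collapse whitespace so near-identical names match."""
    return " ".join(name.lower().split())


def tag_listings(records, listings, tag):
    """Tag listings matched by a crawled record; return unmatched records."""
    index = {(normalize(l.name), l.zip_code): l for l in listings}
    unmatched = []
    for name, zip_code in sorted(records):
        listing = index.get((normalize(name), zip_code))
        if listing is None:
            unmatched.append((name, zip_code))
        else:
            listing.tags.add(tag)
    return unmatched


listings = [Listing("ACME  Bikes", "94086")]
records = crawl(["94086", "94087", "94088"])
unmatched = tag_listings(records, listings, "brand:ExampleBrand")
```

    In this sketch, unmatched records are returned rather than discarded, so a later ingestion step could add them to the database as new listings.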

    Method and System for Form-Filling Crawl and Associating Rich Keywords
    2.
    Invention patent application
    Method and System for Form-Filling Crawl and Associating Rich Keywords (status: in force)

    Publication No.: US20110087646A1

    Publication date: 2011-04-14

    Application No.: US12576011

    Filing date: 2009-10-08

    IPC classification: G06F7/10 G06F17/30

    CPC classification: G06F17/30864

    Abstract: Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.

    System and method for indexing food providers and use of the index in search engines
    4.
    Granted invention patent
    System and method for indexing food providers and use of the index in search engines (status: in force)

    Publication No.: US08903800B2

    Publication date: 2014-12-02

    Application No.: US12792447

    Filing date: 2010-06-02

    Abstract: Methods, systems and computer-readable media are provided for indexing network resources. One method includes accessing, using one or more computer systems, a data store of menu items. The method further includes accessing identification information associated with one or more food providers from one or more data sources. One or more network resources are crawled based on the identification information to search for one or more menu items in the data store of menu items associated with corresponding ones of the food providers. Using the one or more computing systems, an index feed is generated, the index feed comprising the identification information of one or more of the food providers, and one or more menu items associated with the identification information of corresponding food providers based on the crawl and search.
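
    As a rough illustration of the indexing flow in this abstract, the Python sketch below looks up known menu items in pages associated with each food provider and emits an index feed. The data, the fetch function (a stand-in for a real crawler), and the feed layout are all hypothetical; substring matching is used here only because it is the simplest possible search and is not claimed to be the patented method.

```python
# Hypothetical data store of known menu items (lower-case).
MENU_ITEMS = {"pad thai", "green curry", "margherita pizza"}

# Hypothetical identification information for food providers.
PROVIDERS = [
    {"id": "p1", "name": "Thai Garden", "url": "http://thai.example/menu"},
    {"id": "p2", "name": "Pizza Roma", "url": "http://pizza.example/"},
    {"id": "p3", "name": "Closed Cafe", "url": "http://closed.example/"},
]

# Canned page text standing in for crawled network resources.
PAGES = {
    "http://thai.example/menu": "Our menu: Pad Thai, Green Curry, spring rolls",
    "http://pizza.example/": "Wood-fired Margherita Pizza and calzone",
}


def fetch(url):
    """Stand-in for a crawler; returns page text or an empty string."""
    return PAGES.get(url, "")


def build_index_feed(providers, menu_items):
    """Return one feed entry per provider whose page mentions known items."""
    feed = []
    for provider in providers:
        text = fetch(provider["url"]).lower()
        found = sorted(item for item in menu_items if item in text)
        if found:
            feed.append({
                "provider_id": provider["id"],
                "name": provider["name"],
                "menu_items": found,
            })
    return feed


feed = build_index_feed(PROVIDERS, MENU_ITEMS)
```

    Providers whose pages yield no known menu items are left out of the feed in this sketch; a real system might instead emit them with an empty item list.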

    System and Method for Indexing Food Providers and Use of the Index in Search Engines
    5.
    Invention patent application
    System and Method for Indexing Food Providers and Use of the Index in Search Engines (status: in force)

    Publication No.: US20110302148A1

    Publication date: 2011-12-08

    Application No.: US12792447

    Filing date: 2010-06-02

    Abstract: Methods, systems and computer-readable media are provided for indexing network resources. One method includes accessing, using one or more computer systems, a data store of menu items. The method further includes accessing identification information associated with one or more food providers from one or more data sources. One or more network resources are crawled based on the identification information to search for one or more menu items in the data store of menu items associated with corresponding ones of the food providers. Using the one or more computing systems, an index feed is generated, the index feed comprising the identification information of one or more of the food providers, and one or more menu items associated with the identification information of corresponding food providers based on the crawl and search.

    Robust wrappers for web extraction
    6.
    Granted invention patent
    Robust wrappers for web extraction (status: in force)

    Publication No.: US08762829B2

    Publication date: 2014-06-24

    Application No.: US12344076

    Filing date: 2008-12-24

    IPC classification: G06F17/22

    Abstract: A computer-implemented method to determine a robust wrapper includes developing a model indicative of the temporal history of a document, such as a web document written in a markup language. Based on the developed model, robustness characteristics are determined for a plurality of different wrappers representing associated paths to the data item in a representation of the document. Based on a result of the determining operation, a result wrapper of the plurality of wrappers is provided. The result wrapper has a desired robustness characteristic.
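
    The idea of scoring candidate wrappers against a document's history can be sketched in Python as follows. This is a deliberately simplified substitute for the model the abstract mentions: instead of a learned model of document change, it replays each candidate path over a few archived snapshots and takes the fraction of correct extractions as the robustness score. The snapshots, expected values, and candidate paths are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical archived snapshots of one page, each paired with the value
# the wrapper is supposed to extract from it.
HISTORY = [
    ("<html><body><div><span class='price'>10</span></div></body></html>",
     "10"),
    ("<html><body><div>ad</div><div><span class='price'>12</span></div>"
     "</body></html>", "12"),
    ("<html><body><div><p><span class='price'>11</span></p></div>"
     "</body></html>", "11"),
]

# Candidate wrappers, written in the XPath subset ElementTree supports.
CANDIDATES = [
    "./body/div[1]/span",        # positional path, sensitive to layout
    ".//span[@class='price']",   # attribute-based path
]


def extract(xml_text, path):
    """Apply a wrapper path to one snapshot; return the text or None."""
    element = ET.fromstring(xml_text).find(path)
    return None if element is None else element.text


def robustness(path, history):
    """Fraction of snapshots on which the wrapper returns the right value."""
    hits = sum(1 for xml_text, expected in history
               if extract(xml_text, path) == expected)
    return hits / len(history)


best = max(CANDIDATES, key=lambda path: robustness(path, HISTORY))
```

    On this toy history the attribute-based path survives both layout changes, while the positional path breaks as soon as a sibling element is inserted or the target is nested one level deeper.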

    Display entity relationship
    9.
    Granted invention patent
    Display entity relationship (status: in force)

    Publication No.: US09043360B2

    Publication date: 2015-05-26

    Application No.: US12972179

    Filing date: 2010-12-17

    IPC classification: G06F17/30 G06N5/02

    Abstract: Method, system, and programs for providing one or more explanations. An inquiry is received via a communication platform where the inquiry is about how a set of entities are related. Information is retrieved from a knowledge storage in accordance with the set of entities and such information describes a plurality of entities and relationships existing among the plurality of entities. Based on such retrieved information, one or more explanations with respect to each relationship by which the set of entities are connected are generated. The one or more explanations are then transmitted as a response to the inquiry.
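
    One way to picture the inquiry-to-explanation flow is the Python sketch below. It assumes, purely for illustration, that the knowledge storage holds subject-relation-object triples and that an explanation is the chain of triples on a shortest connecting path found by breadth-first search. The triples and function names are hypothetical.

```python
from collections import deque

# Hypothetical knowledge storage: (subject, relation, object) triples.
TRIPLES = [
    ("Alice", "directed", "Film X"),
    ("Bob", "starred in", "Film X"),
    ("Bob", "was born in", "Paris"),
]


def neighbors(entity):
    """Yield (adjacent entity, connecting triple), ignoring direction."""
    for subject, relation, obj in TRIPLES:
        if subject == entity:
            yield obj, (subject, relation, obj)
        elif obj == entity:
            yield subject, (subject, relation, obj)


def find_path(start, goal):
    """Breadth-first search for a shortest chain of triples, or None."""
    queue = deque([(start, [])])
    visited = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        for nxt, triple in neighbors(node):
            if nxt not in visited:
                visited.add(nxt)
                queue.append((nxt, path + [triple]))
    return None


def explain(start, goal):
    """Render each relationship on the connecting path as a sentence."""
    path = find_path(start, goal)
    if path is None:
        return []
    return [f"{subject} {relation} {obj}" for subject, relation, obj in path]


explanations = explain("Alice", "Bob")
```

    An inquiry about entities with no connecting path yields an empty list in this sketch, which a caller could turn into a "no known relationship" response.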

    SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING AN INTERFACE TO VIEW AND EXPLORE SOCIALLY RELEVANT CONCEPTS OF AN ENTITY GRAPH
    10.
    Invention patent application
    SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING AN INTERFACE TO VIEW AND EXPLORE SOCIALLY RELEVANT CONCEPTS OF AN ENTITY GRAPH (status: in force)

    Publication No.: US20140280108A1

    Publication date: 2014-09-18

    Application No.: US13828792

    Filing date: 2013-03-14

    IPC classification: G06F17/30

    Abstract: There are provided means for implementing an interface to view and explore socially relevant concepts of an entity graph including, for example, means of a social network system to perform operations including retrieving contextually relevant data for a plurality of concepts within an entity graph of the social network system; retrieving socially relevant data for a user's node within a social graph of the social network system; identifying intersects between the plurality of concepts within the entity graph and the socially relevant data for the user's node within the social graph; selecting one of the plurality of concepts within the entity graph based on the intersects identified; and displaying the one of the plurality of concepts within the entity graph at a user interface associated with the user's node.
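
    The intersect-and-select step can be sketched in Python under simple assumptions: each concept's contextually relevant data and the user's socially relevant data are modeled as sets of keywords, an intersect is a set intersection, and the concept with the largest intersect is selected. The data and the size-based selection rule are illustrative only; the abstract does not say how intersects are ranked.

```python
# Hypothetical contextually relevant data for concepts in an entity graph.
CONCEPT_DATA = {
    "Jazz": {"music", "concert", "saxophone"},
    "Hiking": {"trail", "mountain", "outdoors"},
}

# Hypothetical socially relevant data for one user's node, for example
# keywords drawn from friends' activity.
USER_SOCIAL_DATA = {"concert", "trail", "mountain"}


def select_concept(concept_data, social_data):
    """Return (selected concept or None, intersects per concept)."""
    intersects = {concept: data & social_data
                  for concept, data in concept_data.items()}
    # Rank by intersect size, breaking ties alphabetically.
    ranked = sorted(intersects,
                    key=lambda concept: (-len(intersects[concept]), concept))
    if not ranked or not intersects[ranked[0]]:
        return None, intersects
    return ranked[0], intersects


selected, intersects = select_concept(CONCEPT_DATA, USER_SOCIAL_DATA)
```

    The selected concept is what a user interface would then display for the user's node; when no concept shares anything with the user's social data, the sketch selects nothing.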
