Method and system for form-filling crawl and associating rich keywords
    1.
    Granted patent (status: in force)

    Publication No.: US08793239B2

    Publication date: 2014-07-29

    Application No.: US12576011

    Filing date: 2009-10-08

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30864

    Abstract: Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.
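
    The matching, ingestion, and tagging steps named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `Listing` class, the `ingest` and `normalize` helpers, the phone-number match key, and the tag format are all assumptions introduced here.

```python
from dataclasses import dataclass, field


def normalize(text: str) -> str:
    """Lowercase and keep only letters and digits so formatting differences do not block a match."""
    return "".join(ch for ch in text.lower() if ch.isalnum())


@dataclass
class Listing:
    # Hypothetical shape of one entry in the business listing database.
    name: str
    phone: str
    tags: set = field(default_factory=set)


def ingest(records, listings, tag):
    """Match extracted dealer records to listings by normalized phone number.

    Matched listings receive the metadata tag; unmatched records are
    appended as new listings and tagged as well.
    """
    by_phone = {normalize(l.phone): l for l in listings}
    for rec in records:
        key = normalize(rec["phone"])
        listing = by_phone.get(key)
        if listing is None:
            listing = Listing(rec["name"], rec["phone"])
            listings.append(listing)
            by_phone[key] = listing
        listing.tags.add(tag)
    return listings
```

    Matching on a normalized phone number is only one possible key; a production matcher would likely also compare names and addresses.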

    System and method for indexing food providers and use of the index in search engines
    3.
    Granted patent (status: in force)

    Publication No.: US08903800B2

    Publication date: 2014-12-02

    Application No.: US12792447

    Filing date: 2010-06-02

    Abstract: Methods, systems and computer readable mediums are provided for indexing network resources. One method includes accessing, using one or more computer systems, a data store of menu items. The method further includes accessing identification information associated with one or more food providers from one or more data sources. One or more network resources are crawled based on the identification information to search for one or more menu items in the data store of menu items associated with corresponding ones of the food providers. Using the one or more computing systems, an index feed is generated, the index feed comprising the identification information of one or more of the food providers, and one or more menu items associated with the identification information of corresponding food providers based on the crawl and search.
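
    As a rough illustration of the index-feed step described in the abstract, the sketch below pairs each food provider with the known menu items found in its crawled text. The function name `build_index_feed`, the record fields, and the `pages` dictionary (a stand-in for real crawl output) are assumptions made for this example, not details from the patent.

```python
def build_index_feed(menu_items, providers, pages):
    """For each provider, record which known menu items appear in its crawled page text.

    menu_items: iterable of menu item names (the data store of menu items).
    providers:  list of dicts with hypothetical keys "id", "name", "url".
    pages:      dict mapping URL -> page text, standing in for the crawl.
    """
    feed = []
    for provider in providers:
        text = pages.get(provider["url"], "").lower()
        # Simple case-insensitive substring search for each known menu item.
        found = sorted(item for item in menu_items if item.lower() in text)
        if found:
            feed.append(
                {"id": provider["id"], "name": provider["name"], "menu_items": found}
            )
    return feed
```

    Providers with no matching menu items are left out of the feed here; whether to include them is a design choice the abstract does not settle.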

    System and Method for Indexing Food Providers and Use of the Index in Search Engines
    4.
    Patent application (status: in force)

    Publication No.: US20110302148A1

    Publication date: 2011-12-08

    Application No.: US12792447

    Filing date: 2010-06-02

    Abstract: Methods, systems and computer readable mediums are provided for indexing network resources. One method includes accessing, using one or more computer systems, a data store of menu items. The method further includes accessing identification information associated with one or more food providers from one or more data sources. One or more network resources are crawled based on the identification information to search for one or more menu items in the data store of menu items associated with corresponding ones of the food providers. Using the one or more computing systems, an index feed is generated, the index feed comprising the identification information of one or more of the food providers, and one or more menu items associated with the identification information of corresponding food providers based on the crawl and search.

    Method and System for Form-Filling Crawl and Associating Rich Keywords
    5.
    Patent application (status: in force)

    Publication No.: US20110087646A1

    Publication date: 2011-04-14

    Application No.: US12576011

    Filing date: 2009-10-08

    IPC classification: G06F7/10 G06F17/30

    CPC classification: G06F17/30864

    Abstract: Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.

    Selectively adding social dimension to web searches
    7.
    Granted patent (status: in force)

    Publication No.: US08880520B2

    Publication date: 2014-11-04

    Application No.: US12764818

    Filing date: 2010-04-21

    IPC classification: G06F17/30

    CPC classification: G06F17/30867

    Abstract: Embodiments are directed towards managing a display of search results by employing a query-classification for a search query to selectively display trust search results that are displayed distinct from non-trust search results. A search query is classified into a query-class. A search is then performed over non-trust sources, and selectively over trust data sources to obtain non-trust and trust search results, respectively. The trust search results are rank ordered based on various categories of search criteria, including, for example, explicit and implicit relationships. Based on the query-class, a different number of trust search results may be displayed. Further, a position for which the trust search results may be displayed may be based on the query-class. Moreover, the non-trust search results are displayed distinct or separate from the trust search results to readily distinguish a type of source of the search results.
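
    The idea that the query-class controls both how many trust results appear and where they are placed can be sketched as below. The class names, the `POLICY` table, and the keyword-based `classify` function are hypothetical stand-ins invented for this example; the abstract does not specify them. The trust results are assumed to arrive already rank ordered.

```python
# Hypothetical per-class display policy:
# (number of trust results to show, position among the non-trust results).
POLICY = {
    "local": (3, 0),
    "product": (2, 2),
    "navigational": (0, 0),
}


def classify(query: str) -> str:
    """Toy keyword classifier; a real system would likely use a trained model."""
    q = query.lower()
    if "near me" in q or "restaurant" in q:
        return "local"
    if "buy" in q or "review" in q:
        return "product"
    return "navigational"


def layout(query, non_trust, trust):
    """Return display blocks, keeping trust results in their own labeled block."""
    count, position = POLICY[classify(query)]
    blocks = [("non-trust", r) for r in non_trust]
    chosen = trust[:count]
    if chosen:
        # Insert the trust block at the class-specific position, clamped to the list end.
        blocks.insert(min(position, len(blocks)), ("trust", chosen))
    return blocks
```

    Keeping the trust results in a single labeled block is one way to show them as distinct from the non-trust results.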

    METHOD AND SYSTEM FOR MATCHING ADVERTISEMENTS TO WEB FEEDS
    8.
    Patent application (status: under examination, published)

    Publication No.: US20100306049A1

    Publication date: 2010-12-02

    Application No.: US12475846

    Filing date: 2009-06-01

    IPC classification: G06Q30/00 G06F17/30

    CPC classification: G06Q30/0251 G06Q30/02

    Abstract: A system for serving advertisements in a networked environment includes a web feed ad server operable to receive web feed information, identify concept terms in the web feed information, match advertisements to the concept terms, and communicate the advertisement to a terminal. Concept terms are identified by comparing terms in the web feed to information in an encyclopedia database, a product listing database, and/or a bidded keyword database. Rewrites associated with the concept terms are generated by a sponsored search ad system. The concept terms and rewrites are placed in a document and communicated to a context matching ad system operable to match an advertisement to the content of the document.
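
    One simple way to picture the concept-term identification step is a phrase lookup against the term databases. The sketch below assumes each database can be reduced to a set of lowercase phrases; the function name `find_concept_terms` and the n-gram approach are illustrative assumptions, not the method claimed in the application.

```python
def find_concept_terms(feed_text, *term_sources, max_words=3):
    """Return word n-grams of the feed text that appear in any of the given term sets.

    term_sources: sets of lowercase phrases, standing in for the encyclopedia,
    product listing, and bidded keyword databases.
    Longer phrases are reported before shorter ones.
    """
    words = [w.strip(".,!?;:").lower() for w in feed_text.split()]
    known = set().union(*term_sources)
    found = []
    for n in range(max_words, 0, -1):
        for i in range(len(words) - n + 1):
            phrase = " ".join(words[i:i + n])
            if phrase in known and phrase not in found:
                found.append(phrase)
    return found
```

    The resulting terms, together with any rewrites, would then be written into the document that is passed to the context matching ad system.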

    Matching items of user-generated content to entities
    10.
    Granted patent (status: in force)

    Publication No.: US08412771B2

    Publication date: 2013-04-02

    Application No.: US12909766

    Filing date: 2010-10-21

    IPC classification: G06F15/16

    Abstract: A method, apparatus, and computer-readable medium are provided for matching items of user-generated content to entities. Items of user-generated content, such as status updates, are gathered. For each of the items, a machine determines a degree to which the item is associated with an entity. In one aspect, items are matched to an entity by matching the content of the items to attributes of the entity. In another aspect, items are matched to an entity by predicting attributes of an author of the items and determining a distance between the predicted attributes of the author and the attributes of the entity. The distance may be a physical distance between locations of the entity and user or a contextual distance between categories for the entity and posts by the author. Items matched to the entity may be displayed on an interface concurrently with information about the entity.
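
    The two matching aspects in the abstract (content-to-attribute matching and author-to-entity distance) can be combined in a small scoring sketch. Everything specific here is an assumption for illustration: the equal 0.5 weights, the 50 km cutoff, the entity record fields, and the premise that the author's location has already been predicted upstream.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two latitude/longitude points."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def match_score(item_text, author_location, entity, max_km=50.0):
    """Combine content-to-attribute overlap with author-to-entity proximity into a 0..1 score.

    entity: dict with hypothetical keys "attributes" (list of words)
    and "location" (latitude, longitude).
    """
    words = set(item_text.lower().split())
    attrs = {a.lower() for a in entity["attributes"]}
    overlap = len(words & attrs) / len(attrs) if attrs else 0.0
    km = haversine_km(*author_location, *entity["location"])
    proximity = max(0.0, 1.0 - km / max_km)  # 1.0 at the same spot, 0.0 beyond max_km
    return 0.5 * overlap + 0.5 * proximity
```

    A contextual distance between entity categories and the author's posts could replace or supplement the physical distance term in the same way.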
