专利检索 ap:("Nilesh Dalvi" OR "Raghu Ramakrishnan" OR "Vinay Kakade" OR "Arup Kumar Choudhury" OR "Sathiya Keerthi Selvaraj" OR "Philip Bohannon" OR "Mani Abrol" OR "David Ciemiewicz" OR "Arun Shankar Iyer" OR "Vipul Agarwal" OR "Alok S. Kirpal") AND inv:"Arun Shankar Iyer" 第 1 页

1.

发明授权
Method and system for form-filling crawl and associating rich keywords 有权
标题翻译：表单填充方法和系统抓取和关联丰富的关键字

公开(公告)号：US08793239B2

公开(公告)日：2014-07-29

申请号：US12576011

申请日：2009-10-08

申请人： Nilesh Dalvi , Raghu Ramakrishnan , Vinay Kakade , Arup Kumar Choudhury , Sathiya Keerthi Selvaraj , Philip Bohannon , Mani Abrol , David Ciemiewicz , Arun Shankar Iyer , Vipul Agarwal , Alok S. Kirpal

发明人： Nilesh Dalvi , Raghu Ramakrishnan , Vinay Kakade , Arup Kumar Choudhury , Sathiya Keerthi Selvaraj , Philip Bohannon , Mani Abrol , David Ciemiewicz , Arun Shankar Iyer , Vipul Agarwal , Alok S. Kirpal

IPC分类号： G06F17/30 , G06F7/00

CPC分类号： G06F17/30864

摘要： Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.

摘要翻译： 提供了技术，用于有效地定位，处理和检索从通常可通过提交到通常被称为“深”或“隐藏”网络的网页的表单查询的网页获得的本地产品信息。在一个实施例中，诸如产品信息和经销商位置信息的信息位于诸如经销商定位器形式的网页形式上。在找到合适的网页表单之后，执行编辑包装以创建自动化信息提取过程。使用自动信息提取器，执行深度网页抓取。执行单个业务记录的基于网格的提取，并且与业务列表数据库一起执行匹配和摄取。最后，元数据标签被添加到业务列表数据库中的条目。元数据标签也可以添加到其他数据库中的条目。

2.

发明申请
Method and System for Form-Filling Crawl and Associating Rich Keywords 有权
标题翻译：填写查询和关联丰富关键字的方法和系统

公开(公告)号：US20110087646A1

公开(公告)日：2011-04-14

申请号：US12576011

申请日：2009-10-08

申请人： Nilesh Dalvi , Raghu Ramakrishnan , Vinay Kakade , Arup Kumar Choudhury , Sathiya Keerthi Selvaraj , Philip Bohannon , Mani Abrol , David Ciemiewicz , Arun Shankar Iyer , Vipul Agarwal , Alok S. Kirpal

发明人： Nilesh Dalvi , Raghu Ramakrishnan , Vinay Kakade , Arup Kumar Choudhury , Sathiya Keerthi Selvaraj , Philip Bohannon , Mani Abrol , David Ciemiewicz , Arun Shankar Iyer , Vipul Agarwal , Alok S. Kirpal

IPC分类号： G06F7/10 , G06F17/30

CPC分类号： G06F17/30864

摘要： Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.

摘要翻译： 提供技术用于从通常通过提交到通常被称为“深”或“隐藏”网络的网页的表单查询的定位的网页获得的本地产品信息的有效定位，处理和检索。在一个实施例中，诸如产品信息和经销商位置信息的信息位于诸如经销商定位器形式的网页形式上。在找到合适的网页表单之后，执行编辑包装以创建自动化信息提取过程。使用自动信息提取器，执行深度网页抓取。执行单个业务记录的基于网格的提取，并且与业务列表数据库一起执行匹配和摄取。最后，元数据标签被添加到业务列表数据库中的条目。元数据标签也可以添加到其他数据库中的条目。

3.

发明申请
METHOD AND SYSTEM FOR BRAND NAME IDENTIFICATION 审中-公开
标题翻译：品牌名称识别方法与系统

公开(公告)号：US20110113063A1

公开(公告)日：2011-05-12

申请号：US12615243

申请日：2009-11-09

申请人： Bob Schulman , Sathiya Keerthi Selvaraj , Vinay Kakade , Mani Abrol , Amit Basu , Arun Shankar Iyer , Philip Bohannon

发明人： Bob Schulman , Sathiya Keerthi Selvaraj , Vinay Kakade , Mani Abrol , Amit Basu , Arun Shankar Iyer , Philip Bohannon

IPC分类号： G06F17/30

CPC分类号： G06F16/907

摘要： A method for identifying a brand name is described herein. The method involves obtaining category keywords associated with a category, designating a subgroup of the category keywords as brand name keywords for a particular brand name, receiving a search term, determining that the search term is a brand name keyword, and identifying the particular brand name corresponding to the brand name keyword.

摘要翻译： 本文描述了用于识别品牌名称的方法。该方法包括获取与类别相关联的类别关键字，将类别关键字的子组指定为特定品牌名称的品牌关键字，接收搜索词，确定搜索词是品牌名称关键字，以及识别特定品牌名称对应品牌名称关键字。

4.

发明授权
Opinion aggregation system 有权
标题翻译：意见汇总制度

公开(公告)号：US09141966B2

公开(公告)日：2015-09-22

申请号：US12646574

申请日：2009-12-23

申请人： Srujana Merugu , Arun Shankar Iyer , Ashwin Kumar V. Machanavajjhala , Sathiya Keerthi Selvaraj , Philip L. Bohannon

发明人： Srujana Merugu , Arun Shankar Iyer , Ashwin Kumar V. Machanavajjhala , Sathiya Keerthi Selvaraj , Philip L. Bohannon

IPC分类号： G06F9/44 , G06N7/02 , G06N7/06 , G06Q30/02 , G06K9/62 , G06N7/00 , G06N5/04 , G06Q10/06 , G06Q50/00

CPC分类号： G06Q30/0203 , G06K9/6277 , G06K9/6278 , G06N5/04 , G06N7/005 , G06Q10/06 , G06Q50/01

摘要： A system is disclosed for obtaining and aggregating opinions generated by multiple sources with respect to one or more objects. The disclosed system uses observed variables associated with an opinion and a probabilistic model to estimate latent properties of that opinion. With those latent properties, the disclosed system may enable publishers to reliably and comprehensively present object information to interested users.

摘要翻译： 公开了一种用于获得和聚集由多个源产生的关于一个或多个对象的意见的系统。所公开的系统使用与意见和概率模型相关联的观察变量来估计该意见的潜在属性。利用这些潜在属性，所公开的系统可以使发布者可以可靠地和全面地向感兴趣的用户呈现对象信息。

5.

发明申请
SELECTIVELY ADDING SOCIAL DIMENSION TO WEB SEARCHES 有权
标题翻译：选择性地增加网络搜索的社会尺寸

公开(公告)号：US20110264648A1

公开(公告)日：2011-10-27

申请号：US12764818

申请日：2010-04-21

申请人： Tom Gulik , Arun Shankar Iyer , Prasenjit Sarkar , Vinay Kakade , Erwin Tam

发明人： Tom Gulik , Arun Shankar Iyer , Prasenjit Sarkar , Vinay Kakade , Erwin Tam

IPC分类号： G06F17/30

CPC分类号： G06F17/30867

摘要： Embodiments are directed towards managing a display of search results by employing a query-classification for a search query to selectively display trust search results that are displayed distinct from non-trust search results. A search query is classified into a query-class. A search is then performed over non-trust sources, and selectively over trust data sources to obtain non-trust and trust search results, respectively. The trust search results are rank ordered based on various categories of search criteria, including, for example, explicit and implicit relationships. Based on the query-class, a different number of trust search results may be displayed. Further, a position for which the trust search results may be displayed may be based on the query-class. Moreover, the non-trust search results displayed distinct or separate from the trust search results to readily distinguish a type of source of the search results.

摘要翻译： 实施例旨在通过对搜索查询采用查询分类来选择性地显示与非信任搜索结果不同的显示信任搜索结果来管理搜索结果的显示。搜索查询分为查询类。然后，通过非信任源执行搜索，并选择性地超过信任数据源，以分别获取非信任和信任搜索结果。信任搜索结果基于各种类别的搜索标准进行排序，包括例如明确和隐含的关系。基于查询类，可以显示不同数量的信任搜索结果。此外，可以显示信任搜索结果的位置可以基于查询类。此外，非信任搜索结果与信任搜索结果不同或不同，以便容易地区分搜索结果的来源类型。

6.

发明授权
Selectively adding social dimension to web searches 有权
标题翻译：选择性地将社交维度添加到网络搜索

公开(公告)号：US08880520B2

公开(公告)日：2014-11-04

申请号：US12764818

申请日：2010-04-21

申请人： Tom Gulik , Arun Shankar Iyer , Prasenjit Sarkar , Vinay Kakade , Erwin Tam

发明人： Tom Gulik , Arun Shankar Iyer , Prasenjit Sarkar , Vinay Kakade , Erwin Tam

IPC分类号： G06F17/30

CPC分类号： G06F17/30867

摘要： Embodiments are directed towards managing a display of search results by employing a query-classification for a search query to selectively display trust search results that are displayed distinct from non-trust search results. A search query is classified into a query-class. A search is then performed over non-trust sources, and selectively over trust data sources to obtain non-trust and trust search results, respectively. The trust search results are rank ordered based on various categories of search criteria, including, for example, explicit and implicit relationships. Based on the query-class, a different number of trust search results may be displayed. Further, a position for which the trust search results may be displayed may be based on the query-class. Moreover, the non-trust search results displayed distinct or separate from the trust search results to readily distinguish a type of source of the search results.

摘要翻译： 实施例旨在通过对搜索查询采用查询分类来选择性地显示与非信任搜索结果不同的显示信任搜索结果来管理搜索结果的显示。搜索查询分为查询类。然后，通过非信任源执行搜索，并选择性地超过信任数据源，以分别获取非信任和信任搜索结果。信任搜索结果基于各种类别的搜索标准进行排序，包括例如明确和隐含的关系。基于查询类，可以显示不同数量的信任搜索结果。此外，可以显示信任搜索结果的位置可以基于查询类。此外，非信任搜索结果与信任搜索结果不同或不同，以便容易地区分搜索结果的来源类型。

7.

发明申请
APPARATUS AND METHODS FOR OPERATOR TRAINING IN INFORMATION EXTRACTION 有权
标题翻译：信息提取中操作员培训的装置和方法

公开(公告)号：US20100227301A1

公开(公告)日：2010-09-09

申请号：US12398126

申请日：2009-03-04

申请人： Cong Yu , Mridul Muralidharan , Arun Shankar Iyer , Philip Lewis Bohannon

发明人： Cong Yu , Mridul Muralidharan , Arun Shankar Iyer , Philip Lewis Bohannon

IPC分类号： G09B19/00

CPC分类号： G09B19/00

摘要： Disclosed are methods and apparatus for extracting information from one or more documents. A training and execution plan is received, and such plan specifies invocation of a trainer operator for initiating training of a trainee operator based on a set of training documents so as to generate a new trained operator that is to then be invoked so as to extract information from one or more unknown documents. The trainee operator is configured to extract information from one or more unknown documents, and each training document is associated with classified information. After receipt of the training and execution plan, the trainer operator is automatically executed to train the trainee operator based on the specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. After receipt of the training and execution plan, both the trainee operator are automatically retained for later use in extracting information from one or more unknown documents and the new trained operator for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.

摘要翻译： 公开了用于从一个或多个文档中提取信息的方法和装置。接收到训练和执行计划，并且该计划规定了基于一组训练文件来引导训练者操作员启动对训练操作员的训练，以便产生一个新的经过训练的操作者，然后被调用以便提取信息来自一个或多个未知文件。受训操作员被配置为从一个或多个未知文档中提取信息，并且每个训练文档与分类信息相关联。在收到培训和执行计划后，培训师操作员将根据指定的培训文件自动执行培训受训操作员，以便生成一个新的训练有素的操作员，从文档中提取信息。新受过训练的操作员是受训操作员的新版本。在接收到训练和执行计划之后，训练者操作员将被自动保留以便以后用于从一个或多个未知文件中提取信息，并且新训练的操作者用于随后用于从一个或多个未知文档中提取信息。在接收到训练和执行计划之后，新的受过训练的操作者被自动执行一个或多个未知文件，以从这样的一个或多个未知文件中提取信息。

8.

发明授权
Apparatus and methods for operator training in information extraction 有权
标题翻译：信息提取操作员训练的装置和方法

公开(公告)号：US08412652B2

公开(公告)日：2013-04-02

申请号：US12398126

申请日：2009-03-04

申请人： Cong Yu , Mridul Muralidharan , Arun Shankar Iyer , Philip Lewis Bohannon

发明人： Cong Yu , Mridul Muralidharan , Arun Shankar Iyer , Philip Lewis Bohannon

IPC分类号： G06F15/18

CPC分类号： G09B19/00

摘要： After receipt of a training and execution plan, a trainer operator is automatically trained based on specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. Both trainee operators are automatically retained for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.

摘要翻译： 在接收到训练和执行计划之后，训练员操作员将根据指定的培训文件自动进行培训，以便生成一个新的训练有素的操作员，从文档中提取信息。新受过训练的操作员是受训操作员的新版本。两名学员操作员都会自动保留以供以后使用，从一个或多个未知文件中提取信息。在接收到训练和执行计划之后，新的受过训练的操作者被自动执行一个或多个未知文件，以从这样的一个或多个未知文件中提取信息。

9.

发明申请
OPINION AGGREGATION SYSTEM 有权
标题翻译：意见汇总制度

公开(公告)号：US20110153542A1

公开(公告)日：2011-06-23

申请号：US12646574

申请日：2009-12-23

申请人： Srujana Merugu , Arun Shankar Iyer , Ashwin Kumar V. Machanavajjhala , Santhiya Keerthi Selvaraj , Philip L. Bohannon

发明人： Srujana Merugu , Arun Shankar Iyer , Ashwin Kumar V. Machanavajjhala , Santhiya Keerthi Selvaraj , Philip L. Bohannon

IPC分类号： G06N5/04 , G06N5/02

CPC分类号： G06Q30/0203 , G06K9/6277 , G06K9/6278 , G06N5/04 , G06N7/005 , G06Q10/06 , G06Q50/01

摘要： A system is disclosed for obtaining and aggregating opinions generated by multiple sources with respect to one or more objects. The disclosed system uses observed variables associated with an opinion and a probabilistic model to estimate latent properties of that opinion. With those latent properties, the disclosed system may enable publishers to reliably and comprehensively present object information to interested users.

摘要翻译： 公开了一种用于获得和聚集由多个源产生的关于一个或多个对象的意见的系统。所公开的系统使用与意见和概率模型相关联的观察变量来估计该意见的潜在属性。利用这些潜在属性，所公开的系统可以使发布者可以可靠地和全面地向感兴趣的用户呈现对象信息。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类