发明授权
US08583639B2 Method and system using machine learning to automatically discover home pages on the internet
有权
使用机器学习的方法和系统在互联网上自动发现主页
- 专利标题: Method and system using machine learning to automatically discover home pages on the internet
- 专利标题(中): 使用机器学习的方法和系统在互联网上自动发现主页
-
申请号: US12033160申请日: 2008-02-19
-
公开(公告)号: US08583639B2公开(公告)日: 2013-11-12
- 发明人: Upendra Chitnis , Wojciech Gryc , Ildar Khabibrakhmanov , Richard D. Lawrence , Prem Melville , Cezar Pendus
- 申请人: Upendra Chitnis , Wojciech Gryc , Ildar Khabibrakhmanov , Richard D. Lawrence , Prem Melville , Cezar Pendus
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: F. Chau & Associates, LLC
- 代理商 Daniel P. Morris, Esq.
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F17/30
摘要:
A method for automatically determining an Internet home page corresponding to a named entity identified by a specified descriptor including building a trained machine-learning model, generating candidate matches from the specified descriptor, wherein each candidate match includes an Internet address, extracting content-based features from websites associated with the Internet addresses of the candidate matches, determining a model score for each candidate match based on the content-based features using the trained machine-learning model, and determining a match from among the candidate matches according to the scores, wherein the match is returned as the Internet home page corresponding to the named entity.
公开/授权文献
信息查询