METHOD AND SYSTEM FOR IDENTIFYING COMPANIES WITH SPECIFIC BUSINESS OBJECTIVES
    21.
    发明申请
    METHOD AND SYSTEM FOR IDENTIFYING COMPANIES WITH SPECIFIC BUSINESS OBJECTIVES 有权
    用于识别具有特定业务目标的公司的方法和系统

    公开(公告)号:US20090204569A1

    公开(公告)日:2009-08-13

    申请号:US12028877

    申请日:2008-02-11

    IPC分类号: G06F17/30 G06F17/00

    CPC分类号: G06F17/30864

    摘要: A method for identifying companies with specific business objectives that includes using existing sources of company firmographic data to identify a broad set of companies and associated websites, crawling the websites associated with the identified companies and indexing web site content for each of the identified companies with the specific business objective to realize indexed web content. The method further includes joining the company firmographic data with the indexed web content using a business objective common identifier to generate a store of joined structured firmographic data and indexed web content and presenting a display image representation of the store of joined structured firmographic data and indexed web content for user review. The display image further receives user input to score each of said companies identified therein, and using a search interface, querying the store of scored, joined structured firmographic data and indexed web content. The method further includes augmenting the search interface, or search results from a query, with predictive, machine-leaning processes that allow rapid identification of companies possibly missed in the query.

    摘要翻译: 一种用于识别具有特定业务目标的公司的方法,其中包括使用公司隐性数据的现有来源来识别广泛的公司和相关网站,爬行与所识别的公司相关联的网站,并为每个被识别的公司索引网站内容 具体的业务目标来实现索引的Web内容。 该方法还包括使用业务目标公共标识符将公司隐含数据与索引的网页内容相加,以生成连接的结构化地图数据和索引的网页内容的存储,以及呈现连接的结构化地图数据和索引网的存储的显示图像表示 用户评论内容。 显示图像还接收用户输入,以对其中识别的每个所述公司进行评分,并使用搜索界面,查询记分,结合的结构化数据和索引的web内容的存储。 该方法还包括利用预测性机器倾斜过程增强搜索接口或来自查询的搜索结果,其允许快速识别可能在查询中遗漏的公司。

    Method and system using machine learning to automatically discover home pages on the internet
    23.
    发明授权
    Method and system using machine learning to automatically discover home pages on the internet 有权
    使用机器学习的方法和系统在互联网上自动发现主页

    公开(公告)号:US08583639B2

    公开(公告)日:2013-11-12

    申请号:US12033160

    申请日:2008-02-19

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30864

    摘要: A method for automatically determining an Internet home page corresponding to a named entity identified by a specified descriptor including building a trained machine-learning model, generating candidate matches from the specified descriptor, wherein each candidate match includes an Internet address, extracting content-based features from websites associated with the Internet addresses of the candidate matches, determining a model score for each candidate match based on the content-based features using the trained machine-learning model, and determining a match from among the candidate matches according to the scores, wherein the match is returned as the Internet home page corresponding to the named entity.

    摘要翻译: 一种用于自动确定与由指定描述符标识的命名实体相对应的因特网主页的方法,包括建立训练有素的机器学习模型,从指定的描述符生成候选匹配,其中每个候选匹配包括因特网地址,提取基于内容的特征 从与候选匹配的互联网地址相关联的网站,基于使用训练机器学习模型的基于内容的特征来确定每个候选匹配的模型分数,以及根据分数从候选匹配中确定匹配,其中 该匹配将作为与该命名实体相对应的因特网主页返回。