Assembling a tax-information data structure
    2.
    发明授权
    Assembling a tax-information data structure 有权
    组建税务信息数据结构

    公开(公告)号:US09418385B1

    公开(公告)日:2016-08-16

    申请号:US13012098

    申请日:2011-01-24

    CPC分类号: G06Q40/10

    摘要: The disclosed embodiments relate to a tax-information assembly technique, which extracts tax information and associated context information from income-tax documents, where these income-tax documents are associated with an income-tax agency, and some of the income-tax documents include the same tax information in different document formats. During this technique, semantic and structural heuristics are used to identify tax phrases in the extracted tax information. Moreover, additional tax phrases in the extracted tax information are identified using a statistical identification technique. Next, relationships between the tax phrases and the additional tax phrases are determined, and the context information is used to consolidate the tax phrases and the additional tax phrases into a tax-information data structure.

    摘要翻译: 所披露的实施例涉及一种税务信息组合技术,其从所述纳税文件中提取税收信息和相关联的上下文信息,所述收入税文件与所述纳税机构相关联,并且一些所述纳税文件包括 不同文件格式的相同税务信息。 在此技术中,语义和结构启发式用于识别提取的税务信息中的税务短语。 此外,使用统计识别技术来识别提取的税务信息中的额外税务短语。 接下来,确定税务短语和附加税单之间的关系,并使用上下文信息将税务短语和附加税务短语合并到税务信息数据结构中。