Method and system for extracting relevant entities from a text corpus

    公开(公告)号:US11276010B2

    公开(公告)日:2022-03-15

    申请号:US15463901

    申请日:2017-03-20

    申请人: Wipro Limited

    摘要: The present disclosure discloses method and system for extracting relevant entities from a text corpus. The method comprises receiving, by the entity extraction computing device, a text corpus and an entity, determining at least one feature for each block of text from the text corpus, where the at least one feature corresponds to predefined one or more feature heads, calculating a score for each block of text from the text corpus based on training of the entity extraction system, determining a template from one or more templates based on the score, where the one or more templates are generated based on the training of the entity extraction system, and extracting at least one relevant entity from the text corpus, with respect to the entity, based on the template. The method and system disclosed in the present disclosure may be used to extract relevant entities across various domains by training the system.

    METHOD AND SYSTEM FOR EXTRACTING RELEVANT ENTITIES FROM A TEXT CORPUS

    公开(公告)号:US20180253663A1

    公开(公告)日:2018-09-06

    申请号:US15463901

    申请日:2017-03-20

    申请人: Wipro Limited

    IPC分类号: G06N99/00 G06F17/30

    摘要: The present disclosure discloses method and system for extracting relevant entities from a text corpus. The method comprises receiving, by the entity extraction computing device, a text corpus and an entity, determining at least one feature for each block of text from the text corpus, where the at least one feature corresponds to predefined one or more feature heads, calculating a score for each block of text from the text corpus based on training of the entity extraction system, determining a template from one or more templates based on the score, where the one or more templates are generated based on the training of the entity extraction system, and extracting at least one relevant entity from the text corpus, with respect to the entity, based on the template. The method and system disclosed in the present disclosure may be used to extract relevant entities across various domains by training the system.