ENHANCED NAMED ENTITY RECOGNITION (NER) USING CUSTOM-BUILT REGULAR EXPRESSION (REGEX) MATCHER AND HEURISTIC ENTITY RULER
摘要:
Pre-trained models for Named Entity Recognition (NER) come with static NE classes, limited in number, and remain same irrespective of domain of the input text. Thus, domain specific training is required. Embodiments of the present disclosure provide a method and system for enhanced NER using a custom-built REGEX matcher and a heuristic entity ruler. The invention helps in discovering the NE's of the given text with pipeline-based approach with combination of models of NLP transformer, custom-built REGEX, and heuristic entity rules. The method automatically handles class resolution based on the heuristic entity ruler. The method enables a user to customize or add any new heuristic rules for entity ruler or custom regex as a knowledgebase to train the model with automatic relearning and unlearning. The extracted NEs are provided for further processing or masking in a structured format.
信息查询
0/0