MACHINE-LEARNING BASED IRRELEVANT SENTENCE CLASSIFIER

    公开(公告)号:US20230419042A1

    公开(公告)日:2023-12-28

    申请号:US17937606

    申请日:2022-10-03

    IPC分类号: G06F40/30 G06N5/02

    CPC分类号: G06F40/30 G06N5/022

    摘要: There is a need for more effective, efficient, and accurate computer text comprehension. This need is addressed by applying unique text processing techniques to identify and remove irrelevant sentences from a narrative. The text processing techniques include a machine-learning based model that is trained using automatically generated training data that is tailored to a particular circumstance. A method for machine narrative comprehension includes receiving a narrative data object comprising one or more sentences; determining, using a machine-learning based irrelevant classifier model, a relevance of at least one of the one or more sentences; responsive to a determination that at least one sentence is irrelevant, generating a pertinent summary by removing the at least one sentence from the narrative; and generating, based at least in part on the pertinent summary, an output indicia data object for the narrative data object.

    MACHINE-LEARNING BASED TRANSCRIPT SUMMARIZATION

    公开(公告)号:US20230419051A1

    公开(公告)日:2023-12-28

    申请号:US17937616

    申请日:2022-10-03

    摘要: There is a need for more effective and efficient predictive natural language summarization. This need is addressed by applying hybrid extractive and abstractive summarization techniques in a unique processing pipeline to generate a cohesive and comprehensive summary of a multi-party interaction. A method for generating the summary of a multi-party interaction includes receiving a multi-party interaction transcript data object comprising a plurality of interaction utterances from at least two participants; using an extractive summarization model to identify a key sentence of the multi-party interaction transcript data object; identifying an interaction utterance from the multi-party interaction transcript data object that corresponds to the key sentence; generating a contextual summary for the multi-party interaction transcript data object based at least in part on the interaction utterance; and generating a reported contextual summary for the multi-party interaction transcript data object based at least in part on the contextual summary.