-
1.
公开(公告)号:US20240249075A1
公开(公告)日:2024-07-25
申请号:US18425657
申请日:2024-01-29
发明人: Sally Gao , Hella-Franziska Hoffmann , Nina Hristozova , Elizabeth Roman , Nicolai Pogrebnyakov , Yue Feng , Masoud Makrehchi , Tate Sterling Avery , Shohreh Shaghaghian , Borna Jafarpour
IPC分类号: G06F40/279 , G06F3/0481 , G06F40/109 , G06F40/137 , G06F40/166 , G06F40/232 , G06F40/242 , G06F40/258 , G06F40/284 , G06F40/289 , G06V30/416
CPC分类号: G06F40/279 , G06F3/0481 , G06F40/109 , G06F40/137 , G06F40/166 , G06F40/232 , G06F40/242 , G06F40/258 , G06F40/284 , G06F40/289 , G06V30/416
摘要: The present disclosure is directed towards systems and methods for detecting deviations between documents and portions thereof, extracting information from text and detecting deviations between obligations. Information is extracted by identifying defined terms and their definitions in input text as well as by identifying portions of different input texts relevant to a point of interest and detecting deviations in those portions between the different input texts.
-
公开(公告)号:US11886814B2
公开(公告)日:2024-01-30
申请号:US17156567
申请日:2021-01-23
发明人: Sally Gao , Hella-Franziska Hoffmann , Nina Hristozova , Elizabeth Roman , Nicolai Pogrebnyakov , Yue Feng , Masoud Makrehchi , Tate Sterling Avery , Shohreh Shaghaghian , Borna Jafarpour
IPC分类号: G06F40/279 , G06F40/242 , G06F3/0481 , G06F40/166 , G06F40/289 , G06F40/258 , G06F40/284 , G06F40/109 , G06F40/137 , G06F40/232 , G06V30/416
CPC分类号: G06F40/279 , G06F3/0481 , G06F40/109 , G06F40/137 , G06F40/166 , G06F40/232 , G06F40/242 , G06F40/258 , G06F40/284 , G06F40/289 , G06V30/416
摘要: The present disclosure is directed towards systems and methods for detecting deviations between documents and portions thereof, extracting information from text and detecting deviations between obligations. Information is extracted by identifying defined terms and their definitions in input text as well as by identifying portions of different input texts relevant to a point of interest and detecting deviations in those portions between the different input texts.
-
公开(公告)号:US20220366282A1
公开(公告)日:2022-11-17
申请号:US17738057
申请日:2022-05-06
发明人: Masoud Makrehchi , Borna Jafarpour , Nicolai Pogrebnyakov , Firoozeh Sepehr , Vinod Vijaykumar Madyalkar , Seung Min Lee
IPC分类号: G06N5/04 , G06N20/00 , G06F40/211
摘要: Computer systems and computer implemented methods for training a machine learning model are provided that includes: selecting seed data from an unlabeled dataset; labeling the seed data and storing the labeled seed data in a data store; training the machine learning model in an initial iteration using the labeled seed data, where the machine learning model is trained to select a next subset of the unlabeled dataset; selecting a next subset of the unlabeled dataset; computing difficulty scores for at least the next subset of the unlabeled dataset; labeling the next subset of the unlabeled data; and training the machine learning model in a second iteration using the labeled next subset of the unlabeled dataset. The machine learning model is generally trained to select the next subset of the unlabeled dataset for a subsequent training iteration by presenting the labeled next subset of the unlabeled dataset in an order sorted based on the difficulty scores.
-
-