Patent search ap:("SAP SE") AND inv:"Rajesh Vellore Arumugam" Page 1

1.

发明公开
SCALABLE ENTITY MATCHING WITH FILTERING USING LEARNED EMBEDDINGS AND APPROXIMATE NEAREST NEIGHBOURHOOD SEARCH 审中-公开

公开(公告)号：US20240045890A1

公开(公告)日：2024-02-08

申请号：US17817388

申请日：2022-08-04

Applicant: SAP SE

Inventor： Hoang-Vu Nguyen , Li Rong Wang , Matthias Frank , Rajesh Vellore Arumugam , Stefan Klaus Baur , Sundeep Gullapudi

IPC: G06F16/28 , G06N20/00 , G06F16/2457 , G06K9/62

CPC classification number: G06F16/288 , G06N20/00 , G06F16/24578 , G06K9/6215

Abstract: Methods, systems, and computer-readable storage media for a machine learning (ML) system for matching a query entity to one or more target entities, the ML system that reducing a number of query-target entity pairs from consideration as potential matches during inference.

2.

发明授权
Efficient search for combinations of matching entities given constraints 有权

公开(公告)号：US11687575B1

公开(公告)日：2023-06-27

申请号：US17647477

申请日：2022-01-10

Applicant: SAP SE

Inventor： Hoang-Vu Nguyen , Rajesh Vellore Arumugam , Matthias Frank , Stefan Klaus Baur

IPC: G06F16/33 , G06F16/35 , G06F16/31

CPC classification number: G06F16/3347 , G06F16/325 , G06F16/3346 , G06F16/35

Abstract: Methods, systems, and computer-readable storage media for receiving a set of inference results generated by a ML model, the inference results including a set of query entities and a set of target entities, each query entity having one or more target entities matched thereto by the ML model, processing the set of inference results to generate a set of matched sub-sets of target entities by executing a search over target entities in the set of target entities based on constraints, for each problem in a set of problems, providing the problem as a tuple including an index value representative of a target entity in the set of target entities and a value associated with the query entity, the value including a constraint relative to the query entity, and executing at least one task in response to one or more matched sub-sets in the set of matched sub-sets.

3.

发明申请
LARGE LANGUAGE MODELS FOR EXTRACTING CONVERSATIONAL-STYLE EXPLANATIONS FOR ENTITY MATCHES 有权

公开(公告)号：US20250077773A1

公开(公告)日：2025-03-06

申请号：US18358225

申请日：2023-07-25

Applicant: SAP SE

Inventor： Rajesh Vellore Arumugam , Anantharaman Ravi , Matthias Frank , Sundeep Gullapudi , Yi Quan Zhou

IPC: G06F40/284 , G06F16/248 , G06F40/40

Abstract: Methods, systems, and computer-readable storage media for receiving, by an entity matching ML model, a query and target pair including a query entity and a target entity, providing, by the entity matching ML model, a query-target prediction by processing the query entity and the target entity, the query-target prediction indicating a match type between the query entity and the target entity, generating a prompt by populating a prompt template with at least a portion of the query-target prediction, inputting the prompt into a large language model (LLM), and receiving, from the LLM, an explanation that is responsive to the prompt and that describes one or more reasons for the query-target prediction output by the entity matching ML model.

4.

发明申请
GLOBAL ENTITY MATCHING MODEL WITH CONTINUOUS PERFORMANCE ENHANCEMENT USING LARGE LANGUAGE MODELS 有权

公开(公告)号：US20250117663A1

公开(公告)日：2025-04-10

申请号：US18480635

申请日：2023-10-04

Applicant: SAP SE

Inventor： Rajesh Vellore Arumugam , Donglin Ruan , Matthias Frank , Yi Quan Zhou

IPC: G06N3/0895

Abstract: Methods, systems, and computer-readable storage media for training a global matching ML model using a set of enterprise data associated with a set of enterprises, receiving a subset of enterprise data associated with an enterprise that is absent from the set of enterprises, fine tuning the global matching ML model using the subset of enterprise data to provide a fine-tuned matching ML model. deploying the fine-tuned matching ML model for inference, receiving feedback to one or more inference results generated by the fine-tuned matching ML model, receiving synthetic data from a LLM system in response to at least a portion of the feedback, and fine tuning one or more of the global matching ML model and the fine-tuned ML model using the synthetic data.

5.

发明申请
DATA-PRIVACY-PRESERVING SYNTHESIS OF REALISTIC SEMI-STRUCTURED TABULAR DATA 有权

公开(公告)号：US20250068965A1

公开(公告)日：2025-02-27

申请号：US18455775

申请日：2023-08-25

Applicant: SAP SE

Inventor： Matthias Frank , Sundeep Gullapudi , Rajesh Vellore Arumugam , Anantharaman Ravi , Prawira Putra Fadjar , Yi Quan Zhou

IPC: G06N20/00 , G06F16/22

Abstract: Methods, systems, and computer-readable storage media for receiving a real data table, providing a synthetic structured table based on the real data table, providing a sampled data table comprising a sub-set of real data of the real data table, transmitting a prompt to a LLM system, the prompt being generated based on the real data table and the synthetic structured data table, receiving synthetic unstructured data from the LLM system, providing an aggregate synthetic table that includes at least a portion of the synthetic unstructured data, and training a ML model using the aggregate synthetic table.

6.

发明授权
Enhancing accuracy of entity matching inference using large language models 有权

公开(公告)号：US12093300B1

公开(公告)日：2024-09-17

申请号：US18463519

申请日：2023-09-08

Applicant: SAP SE

Inventor： Yi Quan Zhou , Rajesh Vellore Arumugam , Raja Sekhar Juluri , Xingce Bao , Eshwin Sukhdeve

IPC: G06F7/00 , G06F16/33 , G06F16/332 , G06F16/35 , G06F40/174 , G06F40/186

CPC classification number: G06F16/35 , G06F16/3329 , G06F16/3344 , G06F40/174 , G06F40/186

Abstract: Methods, systems, and computer-readable storage media for receiving a first document including structured data and unstructured data, providing a first sub-document and a second sub-document, the first sub-document including the structured data of the first document, the second sub-document including the unstructured data of the first document, generating a prompt using the second sub-document and a second document, inputting the prompt to a LLM, receiving a response from the LLM, providing a calibrated first document by merging the response into the first sub-document, and processing the calibrated first document and the second document using a ML model to provide a prediction, the prediction indicating a matching class between the first document and the second document.

7.

发明公开
EFFICIENT SEARCH FOR COMBINATIONS OF MATCHING ENTITIES GIVEN CONSTRAINTS 审中-公开

公开(公告)号：US20230222147A1

公开(公告)日：2023-07-13

申请号：US17647477

申请日：2022-01-10

Applicant: SAP SE

Inventor： Hoang-Vu Nguyen , Rajesh Vellore Arumugam , Matthias Frank , Stefan Klaus Baur

IPC: G06F16/33 , G06F16/31 , G06F16/35

CPC classification number: G06F16/3347 , G06F16/3346 , G06F16/325 , G06F16/35

Abstract: Methods, systems, and computer-readable storage media for receiving a set of inference results generated by a ML model, the inference results including a set of query entities and a set of target entities, each query entity having one or more target entities matched thereto by the ML model, processing the set of inference results to generate a set of matched sub-sets of target entities by executing a search over target entities in the set of target entities based on constraints, for each problem in a set of problems, providing the problem as a tuple including an index value representative of a target entity in the set of target entities and a value associated with the query entity, the value including a constraint relative to the query entity, and executing at least one task in response to one or more matched sub-sets in the set of matched sub-sets.

8.

发明授权
Entity linking and filtering using efficient search tree and machine learning representations 有权

公开(公告)号：US12277148B2

公开(公告)日：2025-04-15

申请号：US17723586

申请日：2022-04-19

Applicant: SAP SE

Inventor： Sundeep Gullapudi , Rajesh Vellore Arumugam , Matthias Frank , Wei Xia

IPC: G06F16/31 , G06F16/33 , G06F16/332 , G06F16/3332 , G06F40/284

Abstract: Methods, systems, and computer-readable storage media for a ML system that reduces a number of target items from consideration as potential matches to a query item using token embeddings and a search tree.

9.

发明申请
DEPLOYMENT OF MACHINE LEARNING MODELS USING LARGE LANGUAGE MODELS AND FEW-SHOT LEARNING 有权

公开(公告)号：US20250036974A1

公开(公告)日：2025-01-30

申请号：US18358245

申请日：2023-07-25

Applicant: SAP SE

Inventor： Rajesh Vellore Arumugam , Anantharaman Ravi , Isaac New Yi Qing , Sundeep Gullapudi , Yi Quan Zhou

IPC: G06N5/04 , G06N20/20

Abstract: Methods, systems, and computer-readable storage media for providing, for a set of ML models, a set of training metrics determined using test data during a training phase, providing, for a production-use ML model, a set of inference metrics based on predictions generated by the production-use ML model, generating, by a prompt generator, a set of few-shot examples using the set of training metrics and the set of inference metrics, inputting, by the prompt generator, the set of few-shot examples to a LLM as prompts, transmitting, to the LLM a query, displaying, to a user, a recommendation that is received from the LLM and responsive to the query, receiving input from a user indicating a user-selected ML model responsive to the recommendation, and deploying a user-selected ML model to an inference runtime for production use.

10.

发明公开
ENHANCED MODEL EXPLANATIONS USING DYNAMIC TOKENIZATION FOR ENTITY MATCHING MODELS 审中-公开

公开(公告)号：US20240177053A1

公开(公告)日：2024-05-30

申请号：US18070598

申请日：2022-11-29

Applicant: SAP SE

Inventor： Sundeep Gullapudi , Rajesh Vellore Arumugam , Abhinandan Padhi

IPC: G06N20/00

CPC classification number: G06N20/00

Abstract: Methods, systems, and computer-readable storage media for receiving query data representative of query entities and target data representative of target entities, determining, by an attention ML model, a set of character-level embeddings, providing, by a sub-word-level tokenizer, a set of sub-word-level tokens, each sub-word-level token including a string of multiple characters, generating, by the attention ML model, a set of sub-word-level embeddings based on the set of sub-word-level tokens, providing, by the attention ML model, at least one attention matrix including attention scores, each attention score representative of a relative importance of a respective sub-word-level token in a predicted match, the predicted match including a match between a query entity and a target entity, and outputting an explanation based on the at least one attention matrix.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification