-
公开(公告)号:US20230315701A1
公开(公告)日:2023-10-05
申请号:US18331169
申请日:2023-06-07
Applicant: Microsoft Technology Licensing, LLC
Inventor: Meiyalagan BALASUBRAMANIAN , Lengning LIU , Aditya KUPPA , Kirk Hartmann FREIHEIT , Kalen WONG , Paula Budig GREVE , Patrick Clinton LITTLE , Lucas PRITZ , Yue WANG , Vivek Ravindranath NARASAYYA , Katchaguy AREEKIJSEREE , Yehe HE , Surajit CHAUDHURI , Gaurav Ghosh
IPC: G06F16/215 , G06F16/2455
CPC classification number: G06F16/215 , G06F16/24556
Abstract: Solutions for data unification include: receiving a data record, the data record comprising a plurality of data fields; selecting, from among the plurality of data fields, a subset of the data fields, the subset of the data fields being fewer in number than the plurality of data fields, wherein selecting the subset of the data fields comprises: applying a first rule to select at least a first one of the data fields within the data record for inclusion in the subset of the data fields; using content of the subset of the data fields, generating a stable identifier (stableID) for the data record; and inserting the stableID into a primary key data field of the data record.