NATURAL LANGUAGE PROCESSING FOR ENTITY RESOLUTION

    Publication No.: US20170091320A1

    Publication date: 2017-03-30

    Application No.: US15254714

    Filing date: 2016-09-01

    Applicant: Panjiva, Inc.

    IPC classification: G06F17/30

    Abstract: An apparatus includes a data access circuit that interprets data records, each having a number of data fields, a record parsing circuit that determines a number of n-grams from terms of each of the data records and maps the number of n-grams to a corresponding number of mathematical vectors, and a record association circuit that determines whether a similarity value between a first mathematical vector for a first data record and a second mathematical vector for a second data record is greater than a threshold similarity value, and associates the first and second data records in response to the similarity value exceeding the threshold similarity value. An example apparatus includes a reporting circuit that provides a catalog entity identifier, associates each of the first term and the second term to the catalog entity identifier, and provides a summary of activity for an entity.
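
The n-gram comparison described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the use of character trigrams, count vectors, cosine similarity, the function names, and the 0.5 threshold are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def char_ngrams(term, n=3):
    """Split one term into overlapping character n-grams (assumed tokenization)."""
    text = f" {term.lower().strip()} "
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def to_vector(record_terms, n=3):
    """Map the n-grams of a record's terms to a sparse count vector."""
    counts = Counter()
    for term in record_terms:
        counts.update(char_ngrams(term, n))
    return counts


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def should_associate(first_terms, second_terms, threshold=0.5):
    """Associate two records when similarity exceeds the (hypothetical) threshold."""
    similarity = cosine_similarity(to_vector(first_terms), to_vector(second_terms))
    return similarity > threshold
```

For example, `should_associate(["Acme Corp"], ["ACME Corp."])` compares two spellings of the same name that share most of their trigrams, so under these assumptions the two records would be associated.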

    SYSTEM, METHOD, AND APPARATUS FOR DETERMINING AND CORRECTING SHIPPING VOLUMES

    Publication No.: US20190251506A1

    Publication date: 2019-08-15

    Application No.: US16159584

    Filing date: 2018-10-12

    Applicant: Panjiva, Inc.

    Abstract: Systems and methods for processing shipping records are described. A system includes a record access circuit structured to interpret a plurality of records, each including multiple shipment description values. A container representation circuit is structured to determine a container volume value corresponding to each of the plurality of records, and to further determine the container volume value in response to a weighting between the shipment description values. A shipping volume reporting circuit is structured to update each of the records with the container volume value. A related method includes interpreting a plurality of records, each record including multiple shipment description values. The method includes determining a container volume value corresponding to each of the records, in response to a weighting between the shipment description values, and updating each of the plurality of records with the container volume value.
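
One way to read "a weighting between the shipment description values" is as a weighted average of several volume estimates derived from a record. The sketch below assumes exactly that. The field names (`teu_from_count`, `teu_from_weight`), the weights, and the averaging rule are hypothetical and do not come from the patent.

```python
def weighted_container_volume(record, weights):
    """Combine the available volume estimates in a record by weighted average.

    `weights` maps a (hypothetical) field name to its relative weight.
    Fields that are missing or None in the record are skipped.
    """
    pairs = [(record[field], weight)
             for field, weight in weights.items()
             if record.get(field) is not None]
    total_weight = sum(weight for _, weight in pairs)
    if total_weight == 0:
        return None  # nothing usable to estimate from
    return sum(value * weight for value, weight in pairs) / total_weight


def update_records(records, weights):
    """Write the computed container volume back onto every record."""
    for record in records:
        record["container_volume"] = weighted_container_volume(record, weights)
    return records
```

With weights `{"teu_from_count": 3.0, "teu_from_weight": 1.0}`, a record holding estimates of 2.0 and 4.0 would be updated with a container volume of 2.5, and a record missing one estimate falls back to the remaining one.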

    Natural language processing for entity resolution

    Publication No.: US11514096B2

    Publication date: 2022-11-29

    Application No.: US15254714

    Filing date: 2016-09-01

    Applicant: Panjiva, Inc.

    Abstract: An apparatus includes a data access circuit that interprets data records, each having a number of data fields, a record parsing circuit that determines a number of n-grams from terms of each of the data records and maps the number of n-grams to a corresponding number of mathematical vectors, and a record association circuit that determines whether a similarity value between a first mathematical vector for a first data record and a second mathematical vector for a second data record is greater than a threshold similarity value, and associates the first and second data records in response to the similarity value exceeding the threshold similarity value. An example apparatus includes a reporting circuit that provides a catalog entity identifier, associates each of the first term and the second term to the catalog entity identifier, and provides a summary of activity for an entity.

    Mtransaction processing improvements

    Publication No.: US10949450B2

    Publication date: 2021-03-16

    Application No.: US16209887

    Filing date: 2018-12-04

    Applicant: Panjiva, Inc.

    Abstract: The technology features a system and computer-implemented method for resolving a relationship between objects. A target object index is generated based on a group of target objects. One or more lookup operations are performed on each target object in the target object index for each source object in a group of source objects. A plurality of source-target object pairs is generated, each source-target object pair comprising one source object and one target object having at least one matching data value. Each source-target object pair is converted into a numeric feature vector. The numeric feature vector corresponding to each source-target object pair is classified using a binary classifier. A match score is applied to each source-target object pair based on the classification using the binary classifier. Any source-target object pair having a match score lower than a match threshold value is discarded.
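
The pipeline in this abstract (index, lookup, pair generation, feature vector, classification, threshold) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the exact-match index, the per-field equality features, and the hand-set logistic scoring function standing in for a trained binary classifier are all hypothetical.

```python
import math
from collections import defaultdict


def build_index(targets, fields):
    """Index target ids by (field, value) so sources can look them up."""
    index = defaultdict(set)
    for target_id, target in targets.items():
        for field in fields:
            if target.get(field):
                index[(field, target[field])].add(target_id)
    return index


def candidate_pairs(sources, index, fields):
    """Pair each source with every target sharing at least one data value."""
    pairs = set()
    for source_id, source in sources.items():
        for field in fields:
            for target_id in index.get((field, source.get(field)), ()):
                pairs.add((source_id, target_id))
    return pairs


def features(source, target, fields):
    """Numeric feature vector: 1.0 where a field matches exactly, else 0.0."""
    return [1.0 if source.get(f) and source.get(f) == target.get(f) else 0.0
            for f in fields]


def match_score(vector, coefficients, intercept):
    """Logistic score; a stand-in for a trained binary classifier."""
    z = intercept + sum(c * x for c, x in zip(coefficients, vector))
    return 1.0 / (1.0 + math.exp(-z))


def resolve(sources, targets, fields, coefficients, intercept, threshold):
    """Score every candidate pair and discard those below the threshold."""
    index = build_index(targets, fields)
    kept = {}
    for source_id, target_id in candidate_pairs(sources, index, fields):
        vector = features(sources[source_id], targets[target_id], fields)
        score = match_score(vector, coefficients, intercept)
        if score >= threshold:
            kept[(source_id, target_id)] = score
    return kept
```

The index keeps the comparison from being quadratic: only targets that share a value with a source are ever scored. In practice the coefficients would come from a classifier trained on labeled pairs rather than being set by hand.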

    MTRANSACTION PROCESSING IMPROVEMENTS

    Publication No.: US20190171655A1

    Publication date: 2019-06-06

    Application No.: US16209887

    Filing date: 2018-12-04

    Applicant: Panjiva, Inc.

    Abstract: The technology features a system and computer-implemented method for resolving a relationship between objects. A target object index is generated based on a group of target objects. One or more lookup operations are performed on each target object in the target object index for each source object in a group of source objects. A plurality of source-target object pairs is generated, each source-target object pair comprising one source object and one target object having at least one matching data value. Each source-target object pair is converted into a numeric feature vector. The numeric feature vector corresponding to each source-target object pair is classified using a binary classifier. A match score is applied to each source-target object pair based on the classification using the binary classifier. Any source-target object pair having a match score lower than a match threshold value is discarded.