Methods and apparatus for creating domain-specific intended-meaning natural language processing pipelines

    公开(公告)号:US11681878B2

    公开(公告)日:2023-06-20

    申请号:US17982760

    申请日:2022-11-08

    IPC分类号: G06F17/00 G06F40/30

    CPC分类号: G06F40/30

    摘要: A method includes receiving a dataset that includes a plurality of input texts. Each input text from the plurality of texts is associated with a content category from a plurality of content categories based on a comparison between that input text and an intended meaning that is common for each comparison. For each model in a plurality of models, and for each content category from the plurality of content categories, that model is executed on each input text from the plurality of input texts to generate an average similarity/dissimilarity score for that content category. At least one model from the plurality of models is selected, based on the average similarity score for each content category from the plurality of content categories for each model in the plurality of models, to determine whether an input text is similar/dissimilar to the intended meaning.

    BIOSIGNATURE-BASED TOKENIZATION OF ASSETS IN A BLOCKCHAIN

    公开(公告)号:US20200351094A1

    公开(公告)日:2020-11-05

    申请号:US16654720

    申请日:2019-10-16

    摘要: An apparatus includes a tester to detect a biological signature of a biological sample, a processor, and a memory operably coupled to the processor. The memory stores instructions to cause the processor to receive an indication of the biological signature from the tester, and to generate, using a smart contract and through communication with a distributed ledger, a cryptographic token including a digital identifier based on the biological signature. The cryptographic token is transmitted to a remote processor for verification of the biological sample, in response to receiving the cryptographic token. The tester can detect the biological signature within a predetermined test duration that is less than a DNA sequencing duration associated with the biological sample, and the biological signature has a data precision sufficient to uniquely identify the biological sample from a plurality of biological samples.

    Biosignature-based tokenization of assets in a blockchain

    公开(公告)号:US11206138B2

    公开(公告)日:2021-12-21

    申请号:US16654720

    申请日:2019-10-16

    摘要: An apparatus includes a tester to detect a biological signature of a biological sample, a processor, and a memory operably coupled to the processor. The memory stores instructions to cause the processor to receive an indication of the biological signature from the tester, and to generate, using a smart contract and through communication with a distributed ledger, a cryptographic token including a digital identifier based on the biological signature. The cryptographic token is transmitted to a remote processor for verification of the biological sample, in response to receiving the cryptographic token. The tester can detect the biological signature within a predetermined test duration that is less than a DNA sequencing duration associated with the biological sample, and the biological signature has a data precision sufficient to uniquely identify the biological sample from a plurality of biological samples.

    MACHINE LEARNING BASED FILE RANKING METHODS AND SYSTEMS

    公开(公告)号:US20200327407A1

    公开(公告)日:2020-10-15

    申请号:US16381505

    申请日:2019-04-11

    IPC分类号: G06N3/08 G06N3/04

    摘要: A multi-label ranking method includes receiving, at a processor and from a first set of artificial neural networks (ANNs), multiple signals representing a first set of ANN output pairs for a first label. A signal representing a second set of ANN output pairs for a second label different from the first label is received at the processor from a second set of ANNs different from the first set of ANNs, substantially concurrently with the first set of ANN output pairs. A first activation function is solved based on the first set of ANN output pairs, and a second activation function is solved based on the second set of ANN output pairs. Loss values are calculated based on the solved activations, and a mask is generated based on at least one ground truth label. A signal, including a representation of the mask, is sent from the processor to each of the sets of ANNs.

    MACHINE LEARNING BASED EXTRACTION OF PARTITION OBJECTS FROM ELECTRONIC DOCUMENTS

    公开(公告)号:US20200327373A1

    公开(公告)日:2020-10-15

    申请号:US16790945

    申请日:2020-02-14

    摘要: An object-extraction method includes generating multiple partition objects based on an electronic document, and receiving a first user selection of a data element via a user interface of a compute device. In response to the first user selection, and using a machine learning model, a first subset of partition objects from the multiple partition objects is detected and displayed via the user interface. A user interaction, via the user interface, with one of the partition objects is detected, and in response, a weight of the machine learning model is modified, to produce a modified machine learning model. A second user selection of the data element is received via the user interface, and in response and using the modified machine learning model, a second subset of partition objects from the multiple partition objects is detected and displayed via the user interface, the second subset different from the first subset.

    MACHINE LEARNING BASED EXTRACTION OF PARTITION OBJECTS FROM ELECTRONIC DOCUMENTS

    公开(公告)号:US20210166074A1

    公开(公告)日:2021-06-03

    申请号:US17169825

    申请日:2021-02-08

    摘要: An object-extraction method includes generating multiple partition objects based on an electronic document, and receiving a first user selection of a data element via a user interface of a compute device. In response to the first user selection, and using a machine learning model, a first subset of partition objects from the multiple partition objects is detected and displayed via the user interface. A user interaction, via the user interface, with one of the partition objects is detected, and in response, a weight of the machine learning model is modified, to produce a modified machine learning model. A second user selection of the data element is received via the user interface, and in response and using the modified machine learning model, a second subset of partition objects from the multiple partition objects is detected and displayed via the user interface, the second subset different from the first subset.