SYSTEMS AND METHODS FOR SELECTING NEURAL NETWORK MODELS FOR BUILDING A CUSTOM ARTIFICIAL INTELLIGENCE STACK

    公开(公告)号:US20250005276A1

    公开(公告)日:2025-01-02

    申请号:US18498886

    申请日:2023-10-31

    Abstract: Embodiments described herein provide a system for selecting a neural network based natural language processing (NLP) model for building a custom artificial intelligence (AI) stack for a user. The system includes a communication interface that established connections to one or more external servers hosting one or more neural network based NLP models, a memory; and a processor executing operations including: selecting a source document based on a custom NLP application; generating, by a first language model, a summary of the source document; generating, by a second language model, one or more questions based on at least one of the summary or the source document; transmitting, via the communication interface, the one or more questions to the one or more neural network based NLP models; receiving, via the communication interface, one or more answers generated by the one or more neural network based NLP models.

    SYSTEMS AND METHODS FOR UNSUPERVISED TRAINING IN TEXT RETRIEVAL TASKS

    公开(公告)号:US20240202530A1

    公开(公告)日:2024-06-20

    申请号:US18303313

    申请日:2023-04-19

    CPC classification number: G06N3/084 G06F40/20 G06F40/40 G06N3/0455 G06N3/088

    Abstract: Embodiments described herein provide systems and methods for training a text retrieval model. A system may generate queries associated with provided documents. The queries may be generated in one or more different manners. Examples of query generation may include extracting relevant spans of text from the documents, prompting a language model for a topic, title, abstractive summary, and/or extractive summary based on the documents. Metadata such as title or other HTML tags may be used as queries. Using the one or more queries, the text retrieval model may be trained using contrastive learning, using the generated query, and positive and negative sample documents. A fine-tuning training phase may be performed using domain-specific data which may also be done with generated query pairs, or may be done in a supervised fashion with provided queries. The text retrieval model may be used to locate documents given an input query.

Patent Agency Ranking