SYSTEMS AND METHODS FOR SHARED LATENT SPACE PROMPT TUNING

    公开(公告)号:US20230419027A1

    公开(公告)日:2023-12-28

    申请号:US18060411

    申请日:2022-11-30

    CPC classification number: G06F40/20 G06N3/084

    Abstract: Embodiments described herein provide a prompt-based transfer learning method that employs shared latent space prompt tuning). Specifically, a shared latent space is assumed, among all source and target tasks, where each vector in the space captures a basis skill to do a particular task. Given an instance (from either a source task or a target task), it is first encoded into an instance representation vector and then queries the latent space, which yields a skill vector for this instance. This vector modulates a frozen model, via soft prompts which are a simple prompt transformation (the prompt generator in FIG. 3) of the basis skill vector, to generate an answer for the instance. The latent space and prompt transformation are learned end-to-end in upstream pre-training on source tasks.

    Systems and methods for a conversational framework of program synthesis

    公开(公告)号:US12079602B2

    公开(公告)日:2024-09-03

    申请号:US17889998

    申请日:2022-08-17

    CPC classification number: G06F8/35 G06F8/10 G06N20/00

    Abstract: Embodiments described herein provide a program synthesis framework that generates code programs through a multi-turn conversation between a user and a system. Specifically, the description to solve a target problem is factorized into multiple steps, each of which includes a description in natural language (prompt) to be input into the generation model as a user utterance. The model in turn synthesizes functionally correct subprograms following the current user utterance and considering descriptions and synthesized subprograms at previous steps. The subprograms generated at the multiple steps are then combined to form an output of program in response to the target problem.

    SYSTEMS AND METHODS FOR SELECTING NEURAL NETWORK MODELS FOR BUILDING A CUSTOM ARTIFICIAL INTELLIGENCE STACK

    公开(公告)号:US20250005276A1

    公开(公告)日:2025-01-02

    申请号:US18498886

    申请日:2023-10-31

    Abstract: Embodiments described herein provide a system for selecting a neural network based natural language processing (NLP) model for building a custom artificial intelligence (AI) stack for a user. The system includes a communication interface that established connections to one or more external servers hosting one or more neural network based NLP models, a memory; and a processor executing operations including: selecting a source document based on a custom NLP application; generating, by a first language model, a summary of the source document; generating, by a second language model, one or more questions based on at least one of the summary or the source document; transmitting, via the communication interface, the one or more questions to the one or more neural network based NLP models; receiving, via the communication interface, one or more answers generated by the one or more neural network based NLP models.

    Systems and methods for long document summarization

    公开(公告)号:US11941346B2

    公开(公告)日:2024-03-26

    申请号:US17589650

    申请日:2022-01-31

    CPC classification number: G06F40/166 G06F40/284

    Abstract: Embodiments described herein provide methods and systems for effectively and efficiently summarizing long documents. A transformer is provided with bottom-up and top-down inference combined to effectively capture long-range dependency. In the bottom-up inference, each token only attends to nearby tokens within a window of a specified size. In the top-down inference, full self-attention is given using units with coarser granularity. The bottom-up-inferred token representations are then updated with the top-down representations, which is achieved with cross-attention between the top and token levels. Multiple levels of top-down representations with increasingly coarser granularity can be used if documents are extremely long.

    SYSTEMS AND METHODS FOR A CONVERSATIONAL FRAMEWORK OF PROGRAM SYNTHESIS

    公开(公告)号:US20230280985A1

    公开(公告)日:2023-09-07

    申请号:US17889998

    申请日:2022-08-17

    CPC classification number: G06F8/35 G06F8/10 G06N20/00

    Abstract: Embodiments described herein provide a program synthesis framework that generates code programs through a multi-turn conversation between a user and a system. Specifically, the description to solve a target problem is factorized into multiple steps, each of which includes a description in natural language (prompt) to be input into the generation model as a user utterance. The model in turn synthesizes functionally correct subprograms following the current user utterance and considering descriptions and synthesized subprograms at previous steps. The subprograms generated at the multiple steps are then combined to form an output of program in response to the target problem.

Patent Agency Ranking