Automated conversational response generation

    公开(公告)号:US11736423B2

    公开(公告)日:2023-08-22

    申请号:US17307175

    申请日:2021-05-04

    CPC classification number: H04L51/04 G06F11/302 G06F18/2178 G06F40/30

    Abstract: Systems, computer-implemented methods, and/or computer program products facilitating a process to identify and respond to a primary electronic message are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can include a determination component can determine that a primary electronic message has not received a response electronic message. An analysis component can generate a generated electronic message addressing the informational or emotional content of the primary electronic message. In one or more embodiments, an updating component can update the analytical model based on one or more feedbacks to the generated electronic message, where the analytical model can remain active while being updated. The one or more feedbacks can comprise a feedback from an entity-in-the-loop monitoring outputs of the analytical model including the generated electronic message.

    Neural-Symbolic Action Transformers for Video Question Answering

    公开(公告)号:US20230027713A1

    公开(公告)日:2023-01-26

    申请号:US17381408

    申请日:2021-07-21

    Abstract: Mechanisms are provided for performing artificial intelligence-based video question answering. A video parser parses an input video data sequence to generate situation data structure(s), each situation data structure comprising data elements corresponding to entities, and first relationships between entities, identified by the video parser as present in images of the input video data sequence. First machine learning computer model(s) operate on the situation data structure(s) to predict second relationship(s) between the situation data structure(s). Second machine learning computer model(s) execute on a received input question to predict an executable program to execute to answer the received question. The program is executed on the situation data structure(s) and predicted second relationship(s). An answer to the question is output based on results of executing the program.

    Compositional Action Machine Learning Mechanisms

    公开(公告)号:US20230360364A1

    公开(公告)日:2023-11-09

    申请号:US17737535

    申请日:2022-05-05

    CPC classification number: G06V10/764 G06V10/7753 G06V10/806

    Abstract: Mechanisms are provided for performing machine learning (ML) training of a ML action recognition computer model which involves processing an original input dataset to generate an object feature bank comprising object feature data structures for a plurality of different objects. For an input video, a verb data structure and an original object data structure are generated and a candidate object feature data structure is selected from the object feature bank for generation of pseudo composition (PC) training data. The PC training data is generated based on the selected candidate object feature data structure and comprises a combination of the verb data structure and the candidate object feature data structure. The PC training data represents a combination of an action and an object not represented in the original input dataset. ML training of the ML action recognition computer model is performed based on an unseen combination comprising the PC training data.

    TRAINING A POSE ESTIMATION MODEL TO DETERMINE ANATOMY KEYPOINTS IN IMAGES

    公开(公告)号:US20240404106A1

    公开(公告)日:2024-12-05

    申请号:US18327608

    申请日:2023-06-01

    Abstract: Provided are a computer program product, system, and method for training a pose estimation model to determine anatomy keypoints in images. A teacher network, implementing machine learning, processes images representing anatomies to produce heatmaps representing keypoints of the anatomies. An anatomy parsing network, implementing machine learning, processes the images to produce segmentation representations labeling anatomies represented in the images. The segmentation representations from the anatomy parsing network and the heatmaps from the teacher network are concatenated to produce mixed heatmaps. A pose estimation model, implementing machine learning, is trained to process the images to output predicted heatmaps to minimize a loss function of the output predicted heatmaps from the pose estimation model and the mixed heatmaps.

    Audio Understanding with Fixed Language Models

    公开(公告)号:US20240127001A1

    公开(公告)日:2024-04-18

    申请号:US17964633

    申请日:2022-10-12

    CPC classification number: G06F40/40 G10L15/26

    Abstract: Techniques for audio understanding using fixed language models are provided. In one aspect, a system for performing audio understanding tasks includes: a fixed text embedder for, on receipt of a prompt sequence having (e.g., from 0-10) demonstrations of an audio understanding task followed by a new question, converting the prompt sequence into text embeddings; a pretrained audio encoder for converting the prompt sequence into audio embeddings; and a fixed autoregressive language model for answering the new question using the text embeddings and the audio embeddings. A method for performing audio understanding tasks is also provided.

Patent Agency Ranking