IMAGE BASED HUMAN-COMPUTER INTERACTION METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20240338962A1

    公开(公告)日:2024-10-10

    申请号:US18747599

    申请日:2024-06-19

    CPC classification number: G06V30/414 G06V30/418

    Abstract: The present disclosure provides an image based human-computer interaction method, which includes: acquiring a to-be-analyzed image, and determining image layout information and image content information of the to-be-analyzed image, where the to-be-analyzed image includes a variety of modal data, the image layout information represents distribution of image elements with preset granularity in the to-be-analyzed image, and the image content information represents a content expressed by the modal data in the to-be-analyzed image; and determining, in response to acquiring question information, response information corresponding to the question information according to the image layout information and the image content information, where the question information represents a question proposed by a user for the to-be-analyzed image, and the response information represents a reply answer corresponding to the question information. By extracting layout information and content information from an image, the accuracy of answering a question and user experience of human-computer interaction are improved.

    Summary generation model training method and apparatus, device and storage medium

    公开(公告)号:US12093297B2

    公开(公告)日:2024-09-17

    申请号:US17577561

    申请日:2022-01-18

    CPC classification number: G06F16/345 G06F40/51 G06F40/56

    Abstract: The present disclosure provides a summary generation model training method and apparatus, a device and a storage medium, and relates to the field of computer technologies, and in particular, to the field of artificial intelligence such as natural language processing and deep learning. The summary generation model training method includes: acquiring a document representation corresponding to a document sample; constructing, based on the document representation, a summary representation corresponding to the document representation, the summary representation including a positive summary representation and a negative summary representation; and constructing a total contrastive loss function based on the document representation, the positive summary representation and the negative summary representation, and training a summary generation model based on the total contrastive loss function. The present disclosure may improve accuracy of the summary generation model.

    METHOD AND APPARATUS FOR PROCESSING MODEL GENERATION RESULT, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20240303430A1

    公开(公告)日:2024-09-12

    申请号:US18667504

    申请日:2024-05-17

    CPC classification number: G06F40/20

    Abstract: A technical solution for processing a model generation result, which relates to the field of artificial intelligence technologies is disclosed. An implementation includes: disassembling a text generation result of a generative large model to obtain a plurality of result logic units; wherein each result logic unit includes a segment in the text generation result; each segment is capable of independently identifying one premise or conclusion in a logical inference relationship of the text generation result; and the text generation result is a response result generated by the generative large model based on text input information; generating a logical inference graph capable of characterizing a logical inference relationship among the plurality of result logic units based on the plurality of result logic units; and determining whether logical inference of generation of the text generation result by the generative large model is correct or not based on the logical inference graph.

    METHOD OF PROCESSING TEXT, TRAINING METHOD, GENERATING METHOD AND ELECTRONIC DEVICE

    公开(公告)号:US20240281602A1

    公开(公告)日:2024-08-22

    申请号:US18651183

    申请日:2024-04-30

    Inventor: Yaqing WANG

    CPC classification number: G06F40/20

    Abstract: A method of processing a text, a training method, a method of generating a knowledge graph, an electronic device, and a non-transitory computer-readable storage medium are provided, which relates to a field of artificial intelligence, particularly to fields such as deep learning, natural language processing, computer vision, and speech processing. The method of processing a text includes: encoding a text to be processed to obtain a feature information; identifying a plurality of entity information from the text, based on the feature information; generating a word relation tensor based on the feature information; and determining a relation between the plurality of entity information by using the word relation tensor, so as to generate a plurality of relational triplets related to the text.

    ENTITY RECOGNITION METHOD, MODEL TRAINING METHOD, ELECTRONIC DEVICE, AND MEDIUM

    公开(公告)号:US20240273297A1

    公开(公告)日:2024-08-15

    申请号:US18642593

    申请日:2024-04-22

    CPC classification number: G06F40/295

    Abstract: An entity recognition method, a model training method, an electronic device, and a medium, which relate to fields of artificial intelligence, information acquiring technologies. The entity recognition method includes: extracting specified entities from a text in a source file of a webpage to be recognized, and acquiring a text encoding result for each specified entity; determining a text block formed by each specified entity in the webpage, and encoding a relative layout information between each two text blocks, to obtain a position encoding result; constructing a triple by the position encoding result for each two text blocks and the text encoding results for respective specified entities of the two text blocks; and performing a graph convolution on each triple to obtain a relation recognition result for the webpage to be recognized, where the relation recognition result indicates whether an association exists between each two text blocks in the webpage.

    METHOD OF FUSING IMAGE FEATURE, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20240265687A1

    公开(公告)日:2024-08-08

    申请号:US18020396

    申请日:2022-04-22

    CPC classification number: G06V10/806 G06V10/7715

    Abstract: A method of fusing an image feature, an electronic device, and a storage medium are provided, which relate to the field of artificial intelligence, in particular to fields of computer vision and depth learning, and may be applied to scenarios such as image processing and image recognition. The method includes: inputting an image into a first image processing model among N serially connected image processing models, to obtain an output feature of the first image processing model, an i-th image processing model includes a first shared layer to an i-th shared layer, i=1, . . . , N, and N is a natural number greater than or equal to 2; inputting an output feature of a j-th image processing model into a (j+1)-th image processing model, to obtain an output feature of the (j+1)-th image processing model, j=1, . . . , N−1; and fusing the output features of the N image processing models.

Patent Agency Ranking