HUMAN-OBJECT INTERACTION DETECTION
    161.
    发明申请

    公开(公告)号:US20230051232A1

    公开(公告)日:2023-02-16

    申请号:US17976673

    申请日:2022-10-28

    Abstract: A human-object interaction detection method, a neural network and a training method therefor is provided. The human-object interaction detection method includes: performing first target feature extraction on an image feature of an image; performing first interaction feature extraction on the image feature; processing a plurality of first target features to obtain target information of a plurality of detected targets; processing one or more first interaction features to obtain motion information of a motion, human information of a human target corresponding to each motion, and object information of an object target corresponding to each motion; matching the plurality of detected targets with one or more motions; and updating human information of a corresponding human target based on target information of a detected target matching the corresponding human target, and updating object information of a corresponding object target based on target information of a detected target matching the corresponding object target.

    METHOD OF DETERMINING REGIONAL LAND USAGE PROPERTY, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230024680A1

    公开(公告)日:2023-01-26

    申请号:US17957275

    申请日:2022-09-30

    Abstract: A method of determining a regional land usage property, an electronic device and a storage medium, which relate to a field of an information technology, in particular to a field of a deep learning. The method includes: acquiring a human interaction information between a plurality of regions at a specified time; updating an initial representation vector of each of the regions according to the human interaction information, so as to obtain an embedding representation vector of each of the regions; selecting a target region from the regions, and selecting a plurality of static neighbor regions within a preset range around the target region; generating a feature map of the target region according to the embedding representation vector of the target region and the embedding representation vectors of the plurality of static neighbor regions; and predicting a land usage property of the target region by using the feature map.

    METHOD FOR MANAGING FUNCTION BASED ON ENGINE, ELECTRONIC DEVICE AND MEDIUM

    公开(公告)号:US20230023290A1

    公开(公告)日:2023-01-26

    申请号:US17814666

    申请日:2022-07-25

    Inventor: Bin He

    Abstract: Disclosed are a method for managing a function based on an engine, an electronic device and a medium, which relate to a field of computer technologies, and particularly to a field of artificial intelligence (AI) technologies such as cloud computing, big data and deep learning. The technical solution includes: generating a function creating request, in which the function creating request comprises Java Archive File (JAR) package path information; sending the function creating request to a coordinate machine node of the engine; obtaining, by the coordinate machine node based on the JAR package path information, a JAR package; copying the JAR package to a plug-in directory corresponding to each worker node of at least one worker node of the engine; and performing, by a daemon thread, registration and loading of a function corresponding to the JAR package in the plug-in directory.

    METHOD OF RECOGNIZING TEXT, DEVICE, STORAGE MEDIUM AND SMART DICTIONARY PEN

    公开(公告)号:US20230020022A1

    公开(公告)日:2023-01-19

    申请号:US17885882

    申请日:2022-08-11

    Abstract: A method of recognizing a text, which relates to a field of an artificial intelligence technology, in particular to a field of computer vision and deep learning technology, and may be applied to optical character recognition or other applications. The method includes: acquiring a plurality of image sequences by continuously scanning a document; performing an image stitching, so as to obtain a plurality of successive frames of stitched images corresponding to the plurality of image sequences respectively, an overlapping region exists between each two successive frames of stitched images; performing a text recognition based on the plurality of successive frames of stitched images, so as to obtain a plurality of corresponding recognition results; and performing a de-duplication on the plurality of recognition results based on the overlapping region between each two successive frames of stitched images, so as to obtain a text recognition result for the document.

    TRANSLATION METHOD, CLASSIFICATION MODEL TRAINING METHOD, DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20230015313A1

    公开(公告)日:2023-01-19

    申请号:US17656160

    申请日:2022-03-23

    Abstract: Disclosed are a translation method, a classification model training method, a device and a storage medium, which relate to the field of computer technologies, particularly to the field of artificial intelligence such as natural language processing and deep learning. The translation method includes: obtaining a current processing unit of a source language text based on a segmented word in the source language text; determining a classification result of the current processing unit with a classification model; and in response to determining that the classification result is the current processing unit being translatable separately, translating the current processing unit to obtain translation result in a target language corresponding to the current processing unit.

    MULTIMODAL DATA PROCESSING
    169.
    发明申请

    公开(公告)号:US20230010160A1

    公开(公告)日:2023-01-12

    申请号:US17945415

    申请日:2022-09-15

    Abstract: Disclosed are a method for processing multimodal data using a neural network, a device, and a medium, and relates to the field of artificial intelligence and, in particular to multimodal data processing, video classification, and deep learning. The neural network includes: an input subnetwork configured to receive the multimodal data to output respective first features of a plurality of modalities; a plurality of cross-modal feature subnetworks, each of which is configured to receive respective first features of two corresponding modalities to output a cross-modal feature corresponding to the two modalities; a plurality of cross-modal fusion subnetworks, each of which is configured to receive at least one cross-modal feature corresponding to a corresponding target modality and other modalities to output a second feature of the target modality; and an output subnetwork configured to receive respective second features of the plurality of modalities to output a processing result of the multimodal data.

    METHOD FOR TRAINING SEMANTIC REPRESENTATION MODEL, DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20230004721A1

    公开(公告)日:2023-01-05

    申请号:US17655770

    申请日:2022-03-21

    Abstract: Disclosed are a method for training a semantic representation model, a device and a storage medium, which relate to the field of computer technologies, and particularly to the field of artificial intelligence, such as a natural language processing technology, a deep learning technology, or the like. The method for training a semantic representation model includes: obtaining an anchor sample based on a sentence, and obtaining a positive sample and a negative sample based on syntactic information of the sentence; processing the anchor sample, the positive sample and the negative sample using the semantic representation model respectively, so as to obtain an anchor-sample semantic representation, a positive-sample semantic representation and a negative-sample semantic representation; constructing a contrast loss function based on the anchor-sample semantic representation, the positive-sample semantic representation, and the negative-sample semantic representation; and training the semantic representation model based on the contrast loss function.

Patent Agency Ranking