IMAGE SEGMENTATION METHOD AND APPARATUS, AND DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230394671A1

    公开(公告)日:2023-12-07

    申请号:US18251228

    申请日:2021-09-27

    Inventor: Tao KONG Ya JING Lei LI

    CPC classification number: G06T7/11 G06T3/40 G06T2207/20221

    Abstract: Provided are an image segmentation method and apparatus, a device, and a storage medium. The image segmentation method includes: fusing a visual feature corresponding to an original image with a text feature corresponding to a description language to obtain a multimodal feature, where the description language is used for specifying a target object to be segmented in the original image; determining a visual region of the target object according to an image corresponding to the multimodal feature and recording an image corresponding to the visual region as a response heat map; and determining a segmentation result of the target object according to the image corresponding to the multimodal feature and the response heat map.

    METHOD, DEVICE AND MEDIUM FOR OPERATING ROBOT ARM

    公开(公告)号:US20250108507A1

    公开(公告)日:2025-04-03

    申请号:US18774064

    申请日:2024-07-16

    Abstract: Methods, devices, and media for operating a robot arm are provided. In one method, receive a language description for specifying a target implemented by the robot arm; obtain a current state of the robot arm; and determine, according to an action model, an action to be performed by the robot arm based on the language description and the current state. With the example implementation of the present disclosure, the problem of insufficient training data of the robot arm may be alleviated. Further, the pre-trained action model may obtain the basic knowledge about the association relationship between the language description and the person action, may obtain a more accurate action model, and further obtain the action of the robot arm matching the language description in a more efficient manner.

    INFORMATION PROCESSING METHOD, TASK EXECUTION METHOD, APPARATUS, DEVICE AND MEDIUM

    公开(公告)号:US20250018567A1

    公开(公告)日:2025-01-16

    申请号:US18900502

    申请日:2024-09-27

    Abstract: The present application discloses an information processing method, a task execution method, an apparatus, a device and a medium. The method includes: processing, through a target visual encoding model in a target analysis model, obtained image information to be analyzed, to obtain a corresponding target sequence; fusing, through a target feature fusion model in the target analysis model, the target sequence and obtained text information to be analyzed, to obtain a target fusion result; processing the target fusion result through a target task analysis model in the target analysis model to obtain target task information; and controlling the action execution apparatus to perform an action corresponding to the target task information. The target analysis model is obtained by training an initial analysis model and the initial analysis model comprises an initial visual encoding model and an initial feature fusion model.

    IMAGE PROCESSING METHOD, STORAGE MEDIUM AND COMPTUER DEVICE

    公开(公告)号:US20250157202A1

    公开(公告)日:2025-05-15

    申请号:US18839645

    申请日:2023-08-10

    Inventor: Ya JING Tao KONG

    Abstract: An image processing method, a storage medium and a computer device are provided. The method includes obtaining observation information acquired by a target robot within a target observation space, wherein the observation information includes observation images, depth images, and sensor pose information; obtaining a three-dimensional semantic distribution map based on the observation information; learning an exploration policy of the target robot based on conditions of a semantic distribution inconsistency and a class distribution uncertainty according to the three-dimensional semantic distribution map; obtaining, based on at least one condition of the semantic distribution inconsistency and the class distribution uncertainty, hard sample images from the target observation images corresponding to the exploration trajectory; adjusting a perception model of the target robot based on the hard sample images.

    METHOD AND APPARATUS FOR ROBOT CONTROL, DEVICE, ROBOT, AND MEDIUM

    公开(公告)号:US20240375287A1

    公开(公告)日:2024-11-14

    申请号:US18659595

    申请日:2024-05-09

    Abstract: Embodiments of the disclosure relate to a method and apparatus for controlling a robot, a device, a robot, and a medium. The method for controlling a robot according to the embodiments of the disclosure includes determining, based on a real-time image captured by the robot at a first moment, a reference motion parameter and a reference control force corresponding to the real-time image. The method further includes determining a target pose and a target control force of the robot at a second moment after the first moment according to the reference motion parameter and the reference control force. The method further includes determining a target action of the robot at the second moment according to the target pose and the target control force.

Patent Agency Ranking