Publication No.: US20250018567A1
Publication Date: 2025-01-16
Application No.: US18900502
Filing Date: 2024-09-27
Inventor: Hanbo ZHANG, Xinghang LI, Minghuan LIU, Jie XU, Hongtao WU, Ya JING, Chilam CHEANG, Tao KONG, Hang LI
IPC: B25J9/16, G06F40/284, G06V10/80
Abstract: The present application discloses an information processing method, a task execution method, an apparatus, a device and a medium. The method includes: processing obtained image information to be analyzed through a target visual encoding model in a target analysis model, to obtain a corresponding target sequence; fusing the target sequence with obtained text information to be analyzed through a target feature fusion model in the target analysis model, to obtain a target fusion result; processing the target fusion result through a target task analysis model in the target analysis model, to obtain target task information; and controlling an action execution apparatus to perform an action corresponding to the target task information. The target analysis model is obtained by training an initial analysis model, and the initial analysis model comprises an initial visual encoding model and an initial feature fusion model.
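The four steps in the abstract (visual encoding, feature fusion, task analysis, action control) can be pictured as a simple data flow. The following is a minimal sketch only, not the claimed implementation: the abstract does not specify any model architecture, so every function here is a hypothetical toy stand-in (row averaging for the visual encoder, concatenation for fusion, a first-word/last-word rule for task analysis), and names such as `run_pipeline`, `TaskInfo`, and `LoggingActuator` are assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union

Token = Tuple[str, Union[float, str]]


@dataclass
class TaskInfo:
    """Hypothetical container for the 'target task information'."""
    action: str
    target: str


def visual_encode(image: List[List[float]]) -> List[float]:
    # Toy stand-in for the visual encoding model:
    # one token per image row (the row mean) forms the "target sequence".
    return [sum(row) / len(row) for row in image]


def tokenize(text: str) -> List[str]:
    # Toy text tokenizer: lowercase and split on whitespace.
    return text.lower().split()


def fuse(visual_tokens: List[float], text_tokens: List[str]) -> List[Token]:
    # Toy stand-in for the feature fusion model:
    # tag each token with its modality and concatenate into one sequence.
    return [("vis", v) for v in visual_tokens] + [("txt", t) for t in text_tokens]


def analyze_task(fused: List[Token]) -> TaskInfo:
    # Toy stand-in for the task analysis model (assumed rule, not from the source):
    # first word is the action, last word is the target object.
    words = [str(value) for kind, value in fused if kind == "txt"]
    action = words[0] if words else "noop"
    target = words[-1] if len(words) > 1 else ""
    return TaskInfo(action=action, target=target)


class LoggingActuator:
    """Hypothetical action execution apparatus that only records commands."""

    def __init__(self) -> None:
        self.log: List[str] = []

    def execute(self, task: TaskInfo) -> None:
        self.log.append(f"{task.action}:{task.target}")


def run_pipeline(image: List[List[float]], text: str, actuator: LoggingActuator) -> TaskInfo:
    sequence = visual_encode(image)          # step 1: image -> target sequence
    fused = fuse(sequence, tokenize(text))   # step 2: sequence + text -> fusion result
    task = analyze_task(fused)               # step 3: fusion result -> task information
    actuator.execute(task)                   # step 4: control the apparatus
    return task


if __name__ == "__main__":
    actuator = LoggingActuator()
    print(run_pipeline([[0.0, 1.0], [1.0, 1.0]], "Pick up the cup", actuator))
    print(actuator.log)
```

In a real system each stand-in would be a trained model component, and the training of the initial analysis model mentioned in the abstract is not covered by this sketch.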