-
公开(公告)号:US20240233311A1
公开(公告)日:2024-07-11
申请号:US18562784
申请日:2021-06-30
Applicant: Microsoft Technology Licensing, LLC
Inventor: Shiqi JIANG , Yuanchun LI , Yuanchao SHU , Yunxin LIU
CPC classification number: G06V10/25 , G06T3/40 , G06T7/11 , G06T7/62 , G06V2201/07
Abstract: Implementations of the present disclosure provide a solution for object detection. In this solution, object distribution information and performance metrics are obtained. The object distribution information indicates a size distribution of detected objects in a set of historical images captured by a camera. The performance metric indicates corresponding performance levels of a set of predetermined object detection models. At least one detection plan is further generated based on the object distribution information and the performance metric. The at least one detection plan indicates which of the set of predetermined object detection models is to be applied to each of at least one sub-image in a target image to be captured by the camera. Additionally, the at least one detection plan is provided for object detection on the target image. In this way, a balance between the detection latency and the detection accuracy may be improved.
-
公开(公告)号:US20230095006A1
公开(公告)日:2023-03-30
申请号:US17042060
申请日:2020-05-25
Applicant: Microsoft Technology Licensing, LLC
Inventor: Oriana RIVA , Yuanchun LI
IPC: G06F9/451 , G06F8/38 , G06F16/953 , G06F16/951 , G06F3/0483 , G06F40/40 , G06F40/20 , G06F40/30 , G06F3/0484
Abstract: For a given input query specifying a task to be performed on a website, the correct sequence of actions (or UI script) is machine learned without having any previous knowledge about the website or the query. To learn the correct UI script, a task agent is created that performs multiple task agent runs comprising different sequences of actions of UI elements on the website (e.g., buttons, text fields, menus, and the like). The states of the webpages are monitored after each action of a UI element is performed. Tasklets are created that include the performed sequences of actions for each task agent as well as their assigned scores, and the correct UI script is chosen from the tasklets based on the scores (e.g., tasklet with the highest score).
-
公开(公告)号:US20240370280A1
公开(公告)日:2024-11-07
申请号:US18663006
申请日:2024-05-13
Applicant: Microsoft Technology Licensing, LLC
Inventor: Oriana RIVA , Yuanchun LI
IPC: G06F9/451 , G06F3/0483 , G06F3/0484 , G06F8/38 , G06F16/951 , G06F16/953 , G06F40/20 , G06F40/30 , G06F40/40
Abstract: For a given input query specifying a task to be performed on a website, the correct sequence of actions (or UI script) is machine learned without having any previous knowledge about the website or the query. To learn the correct UI script, a task agent is created that performs multiple task agent runs comprising different sequences of actions of UI elements on the website (e.g., buttons, text fields, menus, and the like). The states of the webpages are monitored after each action of a UI element is performed. Tasklets are created that include the performed sequences of actions for each task agent as well as their assigned scores, and the correct UI script is chosen from the tasklets based on the scores (e.g., tasklet with the highest score).
-
-