-
公开(公告)号:US11945106B2
公开(公告)日:2024-04-02
申请号:US18158072
申请日:2023-01-23
Applicant: Google LLC
Inventor: Michael Quinlan , Sean Kirmani
IPC: G06V10/44 , B25J5/00 , G06F18/214 , G06V10/40 , G06V10/764 , G06V10/776 , G06V10/82 , G06V10/94 , G06V20/10 , G06V20/64
CPC classification number: B25J5/007 , G06F18/214 , G06V10/40 , G06V10/454 , G06V10/764 , G06V10/776 , G06V10/82 , G06V10/95 , G06V10/955 , G06V20/10 , G06V20/64
Abstract: A method includes receiving image data representing an environment of a robotic device from a camera on the robotic device. The method further includes applying a trained dense network to the image data to generate a set of feature values, where the trained dense network has been trained to accomplish a first robot vision task. The method additionally includes applying a trained task-specific head to the set of feature values to generate a task-specific output to accomplish a second robot vision task, where the trained task-specific head has been trained to accomplish the second robot vision task based on previously generated feature values from the trained dense network, where the second robot vision task is different from the first robot vision task. The method also includes controlling the robotic device to operate in the environment based on the task-specific output generated to accomplish the second robot vision task.
-
公开(公告)号:US20240153314A1
公开(公告)日:2024-05-09
申请号:US18417414
申请日:2024-01-19
Applicant: Google LLC
Inventor: Sean Kirmani , Michael Quinlan , Sarah Coe
CPC classification number: G06V40/23 , B25J9/163 , B25J9/1697 , G05B13/0265 , G06V10/40 , G06V20/10 , G06V40/103
Abstract: A method includes receiving, from a camera disposed on a robotic device, a two-dimensional (2D) image of a body of an actor and determining, for each respective keypoint of a first subset of a plurality of keypoints, 2D coordinates of the respective keypoint within the 2D image. The plurality of keypoints represent body locations. Each respective keypoint of the first subset is visible in the 2D image. The method also includes determining a second subset of the plurality of keypoints. Each respective keypoint of the second subset is not visible in the 2D image. The method further includes determining, by way of a machine learning model, an extent of engagement of the actor with the robotic device based on (i) the 2D coordinates of keypoints of the first subset and (ii) for each respective keypoint of the second subset, an indicator that the respective keypoint is not visible.
-
公开(公告)号:US11766783B2
公开(公告)日:2023-09-26
申请号:US17817076
申请日:2022-08-03
Applicant: Google LLC
Inventor: Sean Kirmani , Guy Satat , Michael Quinlan
Abstract: A method includes receiving sensor data representing a first object in an environment and generating, based on the sensor data, a first state vector that represents physical properties of the first object. The method also includes generating, by a first machine learning model and based on the first state vector and a second state vector that represents physical properties of a second object previously observed in the environment, a metric indicating a likelihood that the first object is the same as the second object. The method further includes determining, based on the metric, to update the second state vector and updating, by a second machine learning model configured to maintain the second state vector over time and based on the first state vector, the second state vector to incorporate into the second state vector information concerning physical properties of the second object as represented in the first state vector.
-
公开(公告)号:US11769269B2
公开(公告)日:2023-09-26
申请号:US17878535
申请日:2022-08-01
Applicant: Google LLC
Inventor: Guy Satat , Michael Quinlan , Sean Kirmani , Anelia Angelova , Ariel Gordon
CPC classification number: G06T7/593 , B25J13/089 , G05D1/0231 , G06T3/20 , H04N13/128 , G06T2207/10028 , H04N2013/0081
Abstract: A method includes receiving a first depth map that includes a plurality of first pixel depths and a second depth map that includes a plurality of second pixel depths. The first depth map corresponds to a reference depth scale and the second depth map corresponds to a relative depth scale. The method includes aligning the second pixel depths with the first pixel depths. The method includes transforming the aligned region of the second pixel depths such that transformed second edge pixel depths of the aligned region are coextensive with first edge pixel depths surrounding the corresponding region of the first pixel depths. The method includes generating a third depth map. The third depth map includes a first region corresponding to the first pixel depths and a second region corresponding to the transformed and aligned region of the second pixel depths.
-
公开(公告)号:US20240217091A1
公开(公告)日:2024-07-04
申请号:US18605464
申请日:2024-03-14
Applicant: Google LLC
Inventor: Michael Quinlan , Sean Kirmani
IPC: B25J5/00 , G06F18/214 , G06V10/40 , G06V10/44 , G06V10/764 , G06V10/776 , G06V10/82 , G06V10/94 , G06V20/10 , G06V20/64
CPC classification number: B25J5/007 , G06F18/214 , G06V10/40 , G06V10/454 , G06V10/764 , G06V10/776 , G06V10/82 , G06V10/95 , G06V10/955 , G06V20/10 , G06V20/64
Abstract: A method includes receiving image data representing an environment of a robotic device from a camera on the robotic device. The method further includes applying a trained dense network to the image data to generate a set of feature values, where the trained dense network has been trained to accomplish a first robot vision task. The method additionally includes applying a trained task-specific head to the set of feature values to generate a task-specific output to accomplish a second robot vision task, where the trained task-specific head has been trained to accomplish the second robot vision task based on previously generated feature values from the trained dense network, where the second robot vision task is different from the first robot vision task. The method also includes controlling the robotic device to operate in the environment based on the task-specific output generated to accomplish the second robot vision task.
-
公开(公告)号:US11915523B2
公开(公告)日:2024-02-27
申请号:US17815361
申请日:2022-07-27
Applicant: Google LLC
Inventor: Sean Kirmani , Michael Quinlan , Sarah Coe
CPC classification number: G06V40/23 , B25J9/163 , B25J9/1697 , G05B13/0265 , G06V10/40 , G06V20/10 , G06V40/103
Abstract: A method includes receiving, from a camera disposed on a robotic device, a two-dimensional (2D) image of a body of an actor and determining, for each respective keypoint of a first subset of a plurality of keypoints, 2D coordinates of the respective keypoint within the 2D image. The plurality of keypoints represent body locations. Each respective keypoint of the first subset is visible in the 2D image. The method also includes determining a second subset of the plurality of keypoints. Each respective keypoint of the second subset is not visible in the 2D image. The method further includes determining, by way of a machine learning model, an extent of engagement of the actor with the robotic device based on (i) the 2D coordinates of keypoints of the first subset and (ii) for each respective keypoint of the second subset, an indicator that the respective keypoint is not visible.
-
-
-
-
-