FINE-TUNING POLICIES TO FACILITATE CHAINING

Invention Publication

US20230280726A1 FINE-TUNING POLICIES TO FACILITATE CHAINING 审中-公开

Please log in to see more content

Patent Title: FINE-TUNING POLICIES TO FACILITATE CHAINING
Application No.: US17684245

Application Date: 2022-03-01
Publication No.: US20230280726A1

Publication Date: 2023-09-07
Inventor: Yuke Zhu , Anima Anandkumar , Youngwoon Lee
Applicant: NVIDIA Corporation
Applicant Address: US CA Santa Clara
Assignee: NVIDIA Corporation
Current Assignee: NVIDIA Corporation
Current Assignee Address: US CA Santa Clara
Main IPC: G05B19/418
IPC: G05B19/418

FINE-TUNING POLICIES TO FACILITATE CHAINING

Abstract:

A manipulation task may include operations performed by one or more manipulation entities on one or more objects. This manipulation task may be broken down into a plurality of sequential sub-tasks (policies). These policies may be fine-tuned so that a terminal state distribution of a given policy matches an initial state distribution of another policy that immediately follows the given policy within the plurality of policies. The fine-tuned plurality of policies may then be chained together and implemented within a manipulation environment.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G05	控制；调节
G05B	一般的控制或调节系统；这种系统的功能单元；用于这种系统或单元的监视或测试装置（应用流体作用的一般流体压力执行器或系统入F15B；阀门本身入F16K；仅按机械特征区分的入G05G；传感元件见相应小类，例如G12B，G01、H01的小类；校正单元见相应的小类，例如H02K）
G05B19/00	程序控制系统（特殊应用见有关位置，例如A47L15/46；附带或内装有在预定时间间隔操作任一器件的装置的时钟入G04C23/00；记录或读取数字信息的记录载体入G06K；信息存储器入G11；在程序执行完了后自动终止其运行的时间或时间程序开关入H01H43/00）
G05B19/02	.电的
G05B19/418	..全面工厂控制，即集中控制许多机器，例如直接或分布数字控制（DNC）、柔性制造系统（FMS）、集成制造系统（IMS）、计算机集成制造（CIM）