专利检索 ap:("International Business Machines Corporation" OR "University of Illinois at Urbana-Champaign") AND inv:"Bo Li" 第 1 页

1.

发明授权
State-augmented reinforcement learning 有权

公开(公告)号：US11880765B2

公开(公告)日：2024-01-23

申请号：US17074054

申请日：2020-10-19

申请人： International Business Machines Corporation , University of Illinois at Urbana-Champaign

发明人： Pin-Yu Chen , Yada Zhu , Jinjun Xiong , Kumar Bhaskaran , Yunan Ye , Bo Li

IPC分类号： G06Q30/00 , G06N3/08 , G06F40/279 , G06Q40/06

CPC分类号： G06N3/08 , G06F40/279 , G06Q40/06

摘要： A processor training a reinforcement learning model can include receiving a first dataset representing an observable state in reinforcement learning to train a machine to perform an action. The processor receives a second dataset. Using the second dataset, the processor trains a machine learning classifier to make a prediction about an entity related to the action. The processor extracts an embedding from the trained machine learning classifier, and augments the observable state with the embedding to create an augmented state. Based on the augmented state, the processor trains a reinforcement learning model to learn a policy for performing the action, the policy including a mapping from state space to action space.

2.

发明申请
STATE-AUGMENTED REINFORCEMENT LEARNING 有权

公开(公告)号：US20220121921A1

公开(公告)日：2022-04-21

申请号：US17074054

申请日：2020-10-19

申请人： International Business Machines Corporation , University of Illinois at Urbana-Champaign

发明人： Pin-Yu Chen , Yada Zhu , Jinjun Xiong , Kumar Bhaskaran , Yunan Ye , Bo Li

IPC分类号： G06N3/08 , G06Q40/06 , G06Q10/10 , G06F40/279

摘要： A processor training a reinforcement learning model can include receiving a first dataset representing an observable state in reinforcement learning to train a machine to perform an action. The processor receives a second dataset. Using the second dataset, the processor trains a machine learning classifier to make a prediction about an entity related to the action. The processor extracts an embedding from the trained machine learning classifier, and augments the observable state with the embedding to create an augmented state. Based on the augmented state, the processor trains a reinforcement learning model to learn a policy for performing the action, the policy including a mapping from state space to action space.

3.

发明授权
Using dynamic information to refine control flow graphs 有权

公开(公告)号：US10360004B2

公开(公告)日：2019-07-23

申请号：US15443038

申请日：2017-02-27

申请人： International Business Machines Corporation

发明人： Si Bin Fan , Bo Li , Nai Jie Li , Jia Sen Wu , Zi Ying Xin , Xiao Zhen Zhu

IPC分类号： G06F9/44 , G06F8/41 , G06F8/75

摘要： A system, method and computer program product to refine an original complex CFG into a simpler CFG showing interesting paths and reducing interfering paths with dynamic input for the state of program. The method receives/encodes dynamic user input in the form of annotations which encodes user's special interests or knowledge of the program at run time, e.g., some assumptions of any variables appeared, which can be equations of variable and value or relationships between variables. The method then simplifies all the branching points in a generated AST (Abstract Syntax Tree) whenever possible by querying a SMT (Satisfiability Modulo Theories) solver with branching condition and the user annotations and by evaluating immediate values of expressions or eliminate unreachable parts in the CFG. Finally, the method generates a simplified CFG by simplified AST. This can assist a programmer to understand the code and facilitates correlating different basic blocks under a same scenario.

4.

发明公开
MODEL-AGNOSTIC INPUT TRANSFORMATION FOR NEURAL NETWORKS 审中-公开

公开(公告)号：US20230214705A1

公开(公告)日：2023-07-06

申请号：US17566624

申请日：2021-12-30

申请人： International Business Machines Corporation , The Board of Trustees of the University of Illinois

发明人： Pin-Yu Chen , Nandhini Chandramoorthy , Karthik V Swaminathan , Jinjun Xiong , Devansh Paresh Shah , Bo Li

IPC分类号： G06N20/00 , G06N5/04

CPC分类号： G06N20/00 , G06N5/04

摘要： An input transformation function that transforms input data for a second machine learning system is learned using a first machine learning system, the learning being based on minimizing a summation of a task loss and a post-activation density loss. The input data is transformed using the learned input transformation function to alter the post-activation density to reduce an amount of energy consumed for an inferencing task and the inferencing task is carried out on the transformed input data using the second machine learning system.