Patent application
- Title: SYSTEM TO PROVE SAFETY OF ARTIFICIAL GENERAL INTELLIGENCE VIA INTERACTIVE PROOFS
- Application number: US17855712; filing date: 2022-06-30
- Publication number: US20230008689A1; publication date: 2023-01-12
- Inventor: Kristen William Carlson
- Applicant: Kristen William Carlson
- Applicant address: US MA Concord
- Patentee: Kristen William Carlson
- Current patentee: Kristen William Carlson
- Current patentee address: US MA Concord
- Main classification: G06N5/02
- IPC classification: G06N5/02; G06N7/00; G06N5/04; H04L9/32
Abstract:
A method to prove the safety (e.g., value-alignment) and other properties of artificial intelligence systems possessing general and/or super-human intelligence (together, AGI). The method uses probabilistic proofs in interactive proof systems (IPS), in which a Verifier queries a computationally more powerful Prover and reduces the probability of the Prover deceiving the Verifier to any specified low probability (e.g., 2^-100). IPS-based procedures can be used to test AGI behavior control systems that incorporate hard-coded ethics or value-learning methods. An embodiment of the method, mapping the axioms and transformation rules of a behavior control system to a finite set of prime numbers, makes it possible to validate safe behavior via IPS number-theoretic methods. Other IPS embodiments can prove an unlimited number of AGI properties. Multi-prover IPS, program-checking IPS, and probabilistically checkable proofs extend the power of the paradigm. The method applies to value-alignment between future AGI generations of disparate power.
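The abstract's claim that the deception probability can be driven to any specified bound rests on the standard repetition argument for interactive proofs: if a cheating Prover survives one independent round with probability at most e, it survives k rounds with probability at most e^k. The following is a minimal sketch of that arithmetic only, not of the claimed method. The function name `rounds_needed` and the per-round error of 1/2 are illustrative assumptions; the abstract does not state a per-round error, only the example target of 2^-100.

```python
from fractions import Fraction


def rounds_needed(per_round_error: Fraction, target_error: Fraction) -> int:
    """Return the smallest k such that per_round_error**k <= target_error.

    Hypothetical helper: assumes the rounds are independent, so the chance
    that a cheating Prover passes all k rounds is per_round_error**k.
    """
    if not (0 < per_round_error < 1) or not (0 < target_error < 1):
        raise ValueError("both error bounds must lie strictly between 0 and 1")
    k, error = 0, Fraction(1)
    # Exact rational arithmetic avoids floating-point underflow near 2**-100.
    while error > target_error:
        error *= per_round_error
        k += 1
    return k


# Assumed per-round cheating probability of 1/2; target taken from the abstract.
print(rounds_needed(Fraction(1, 2), Fraction(1, 2**100)))  # prints 100
```

Under these assumptions the Verifier's work grows only linearly in the number of bits of assurance requested, which is why a very small bound such as 2^-100 remains practical.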