Logical Inductors

paper

2016·arXiv·arxiv.org/abs/1609.03543

Authors

Scott Garrabrant·Tsvi Benson-Tilsen·Andrew Critch·Nate Soares·Jessica Taylor

Credibility Rating

3/5

Good(3)

Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.

Rating inherited from publication venue: arXiv

A foundational MIRI paper addressing how bounded reasoners can assign coherent probabilities to logical statements; highly relevant to decision theory, embedded agency, and the theoretical underpinnings of AI alignment research.

Paper Details

Citations

1 influential

Year

2016

arXiv:1609.03543 Semantic Scholar

Metadata

Importance: 85/100arxiv preprintprimary source

Abstract

We present a computable algorithm that assigns probabilities to every logical statement in a given formal language, and refines those probabilities over time. For instance, if the language is Peano arithmetic, it assigns probabilities to all arithmetical statements, including claims about the twin prime conjecture, the outputs of long-running computations, and its own probabilities. We show that our algorithm, an instance of what we call a logical inductor, satisfies a number of intuitive desiderata, including: (1) it learns to predict patterns of truth and falsehood in logical statements, often long before having the resources to evaluate the statements, so long as the patterns can be written down in polynomial time; (2) it learns to use appropriate statistical summaries to predict sequences of statements whose truth values appear pseudorandom; and (3) it learns to have accurate beliefs about its own current beliefs, in a manner that avoids the standard paradoxes of self-reference. For example, if a given computer program only ever produces outputs in a certain range, a logical inductor learns this fact in a timely manner; and if late digits in the decimal expansion of $π$ are difficult to predict, then a logical inductor learns to assign $\approx 10\%$ probability to "the $n$th digit of $π$ is a 7" for large $n$. Logical inductors also learn to trust their future beliefs more than their current beliefs, and their beliefs are coherent in the limit (whenever $φ\implies ψ$, $\mathbb{P}_\infty(φ) \le \mathbb{P}_\infty(ψ)$, and so on); and logical inductors strictly dominate the universal semimeasure in the limit. These properties and many others all follow from a single logical induction criterion, which is motivated by a series of stock trading analogies. Roughly speaking, each logical sentence $φ$ is associated with a stock that is worth \$1 per share if [...]

Summary

This paper introduces logical inductors, a computable algorithm that assigns and refines probabilities to logical statements in formal languages over time, satisfying numerous desirable epistemic properties. The approach resolves longstanding challenges in assigning coherent prior probabilities to logical statements, including self-referential claims, using a criterion motivated by the impossibility of exploiting the algorithm via stock trading strategies. All desirable properties—including timely pattern learning, calibration, and dominating universal semimeasures—follow from a single logical induction criterion.

Key Points

•Provides a computable algorithm assigning coherent probabilities to all statements in formal languages like Peano arithmetic, including self-referential beliefs and undecidable claims.
•Learns patterns in logical truth values in polynomial time, even before having resources to directly evaluate statements.
•Avoids standard self-reference paradoxes while achieving accurate beliefs about the algorithm's own current probability assignments.
•All desirable properties derive from one unified logical induction criterion, framed as resistance to exploitation by polynomial-time trading strategies.
•Relevant to AI safety as it provides foundations for bounded, self-aware reasoning agents with coherent probabilistic beliefs under computational constraints.

Cited by 1 page

Page	Type	Quality
Machine Intelligence Research Institute (MIRI)	Organization	50.0

Cached Content Preview

HTTP 200Fetched Apr 9, 202698 KB

[1609.03543] Logical Induction 
 
 
 
 
 
 
 
 
 
 
 

 
 

 
 
 
 
 
 
 
 theorem]Definition

 
 
 \publishingnote 
 See https://intelligence.org/files/LogicalInductionAbridged.pdf for an abridged version of this paper.

 
 Logical Induction

 
 
 
Scott Garrabrant
 
 
 
 
 Tsvi Benson-Tilsen
 
 
 
 
 Andrew Critch
 
 
 
 
 Nate Soares
 
 
 
 
 
 
 
 
 
 Jessica Taylor
 {scott
 
 
 
 
 tsvi
 
 
 
 
 critch
 
 
 
 
 nate
 
 
 
 
 jessica}@intelligence.org
 Machine Intelligence Research Institute
 
 
 
 

 
 Abstract

 We present a computable algorithm that assigns probabilities to every logical statement in a given formal language, and refines those probabilities over time. For instance, if the language is Peano arithmetic, it assigns probabilities to all arithmetical statements, including claims about the twin prime conjecture, the outputs of long-running computations, and its own probabilities. We show that our algorithm, an instance of what we call a logical inductor , satisfies a number of intuitive desiderata, including: (1) it learns to predict patterns of truth and falsehood in logical statements, often long before having the resources to evaluate the statements, so long as the patterns can be written down in polynomial time; (2) it learns to use appropriate statistical summaries to predict sequences of statements whose truth values appear pseudorandom; and (3) it learns to have accurate beliefs about its own current beliefs, in a manner that avoids the standard paradoxes of self-reference. For example, if a given computer program only ever produces outputs in a certain range, a logical inductor learns this fact in a timely manner; and if late digits in the decimal expansion of π 𝜋 \pi are difficult to predict, then a logical inductor learns to assign ≈ 10 % absent percent 10 \approx 10\% probability to “the n 𝑛 n th digit of π 𝜋 \pi is a 7” for large n 𝑛 n . Logical inductors also learn to trust their future beliefs more than their current beliefs, and their beliefs are coherent in the limit (whenever ϕ → ψ → italic-ϕ 𝜓 \phi\rightarrow\psi , ℙ ∞  ( ϕ ) ≤ ℙ ∞  ( ψ ) subscript ℙ italic-ϕ subscript ℙ 𝜓 \mathbb{P}_{\infty}(\phi)\leq\mathbb{P}_{\infty}(\psi) , and so on); and logical inductors strictly dominate the universal semimeasure in the limit.

 These properties and many others all follow from a single logical induction criterion , which is motivated by a series of stock trading analogies. Roughly speaking, each logical sentence ϕ italic-ϕ \phi is associated with a stock that is worth $1 per share if ϕ italic-ϕ \phi is true and nothing otherwise, and we interpret the belief-state of a logically uncertain reasoner as a set of market prices, where ℙ n  ( ϕ ) = 50 % subscript ℙ 𝑛 italic-ϕ percent 50 \mathbb{P}_{n}(\phi)=50\% means that on day n 𝑛 n , shares of ϕ italic-ϕ \phi may be bought or sold from the reasoner for 50¢. The logical induction criterion says (very roughly) that there should not be any polynomial-time computabl

... (truncated, 98 KB total)

Resource ID: dc4b84a089564392 | Stable ID: sid_2udTjVchgg