A SHORT NOTE ON THE SIGNIFICANCE OF THE PENROSE-HALTING THEOREM

NICK HUGGETT

doi:10.1017/bsl.2025.10135

A SHORT NOTE ON THE SIGNIFICANCE OF THE PENROSE-HALTING THEOREM

Part of: Computability and recursion theory

Published online by Cambridge University Press: 03 December 2025

NICK HUGGETT

Show author details

NICK HUGGETT*: Affiliation:
UNIVERSITY OF ILLINOIS CHICAGO UNITED STATES
*: E-mail: huggett@uic.edu

Article contents

Abstract
The Penrose-halting theorem
Discussion
Footnotes
References

Rights & Permissions

Abstract

In The Emperor’s New Mind [7], Roger Penrose proves a variant of the halting problem, and uses it to argue that humans have cognitive capacities beyond the computable. In this short note, I explicate his argument, and show how it fails, via a corollary of his result. My response to Penrose is in fact of a kind with a number of prior responses: he assumes human powers, that (as the corollary shows) no computer could have. However, as far as I am aware, no one has previously addressed this specific form of the argument, in the direct way that I will.

Keywords

halting problem computationalism computability

MSC classification

Primary: 03D10: Turing machines and related notions 03D80: Applications of computability and recursion theory

Information

Type: Article
Information: Bulletin of Symbolic Logic , First View , pp. 1 - 8

DOI: https://doi.org/10.1017/bsl.2025.10135 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of The Association for Symbolic Logic

1 The Penrose-halting theorem

What we’ll call the ‘Penrose-halting theorem’ concerns the ‘self-halting’ problem: to compute whether any given computer will halt when its own machine number is input. To state and prove the theorem we first make some definitions.Footnote ¹

By Turing Machine (TM) I mean any mechanical computational device implementing an algorithm, within the scope of the Church–Turing thesis, not only those of the type explicitly introduced by Turing. Let T $_m$ be the mth TM in a particular computable enumeration of some class of TMs (assumed computationally equivalent to Turing’s).

For clarity of presentation I will use a ‘black box’ scheme for defining TMs: for instance, machines to compute $x+1$ and $2\times x$ could be represented by

respectively. Such boxes can be compounded to define new TMs, each component taking as its input the output of the preceding box. For instance, the TM Q defined as

obviously computes $2x+1$ .

Let $A[m$ ] denote the result of TM A acting on input $m\in \mathbb {N}$ : $A[m$ ] will either be the natural number output, or that $A[m]$ fails to halt (in which case the value of $A[m$ ] is undefined). So, for our example TM, $Q[m]=2m+1$ (defined for all $m\in \mathbb {N}$ ).

A partial self-halting computer (PSHC) is any TM with the following description:

where 0 and 1 are the only possible outputs. That is, if P halts then it accurately informs us whether or not T $_m$ is a ‘self-halter’. Of course, if P were to halt for all values of m then it would, per impossibile, solve the self-halting problem, so it must be the case that every PSHC fails to halt for some $m\in \mathbb {N}$ .Footnote ²

For any TM

define

where a diamond represents a decision based on its input (here, whether or not the input is 0), and loop is some specific routine that loops endlessly. Note that the only output that A* can have is 0; otherwise it fails to halt.

With these definitions in place, we can prove the Penrose-halting theorem Footnote ³ [Reference Penrose7, pp. 64–66]. Let P be a PSHC, and consider P*, with input p*, its own machine number. If P[p*] is 1 then (i) P*[p*] halts by definition of P, but (ii) by definition of A*, P*[p*] ends in loop and so does not halt; a contradiction. If P[p*] is 0 then (i) P*[p*] does not halt by definition of P, but (ii) by definition of A*, P*[p*] halts on its 0 out arrow; a contradiction. Since P[p*] is neither 1 nor 0, and these are its only possible outputs, for any PSHC P, P[p*] does not halt, hence P*[p*] does not halt either. □

On the basis of (his version of) this theorem Penrose argues (in our terms) against a computationalist theory of cognition: ‘we have a well-defined procedure, whichever [PSHC P] is given to us, for finding a corresponding k for which we know that T $_k(k)$ defeats P, and for which we can therefore do better than the algorithm’ [Reference Penrose7, p. 65]. (By ‘defeats’ he means that P[p*] fails to halt.) That is, we can determine on the basis of the theorem that for all PSHC P, P*[p*] fails to halt. But no PSHC computes for all such P whether P*[p*] halts; in particular, it follows from the theorem that for any PSHC P, P[p*] is undefined—but inputting p* into P is to attempt to determine whether P*[p*] halts. Apparently, then, we can do something that no TM can do!

But this argument is question begging: our ability to see that P*[p*] does not halt is predicated on the information that P is a PSHC, so that the theorem applies. But the PSHC is simply given P*’s machine number p*, and no information that it is the machine number of a PSHC. To compare abilities, both TM and person must be presented with the same task: for instance, given some m—and no other information about T $_m$ —determine whether T $_m$ [m] halts. Of course, that is enough (in principle) to determine concretely which TM in the enumeration T $_m$ is, since it’s computable. Thus to apply the theorem we would have to determine whether T $_m$ is a P* for some PSHC P. Not surprisingly this problem is uncomputable: no general mechanical procedure to determine whether a TM is a PSHC exists, as we shall now show.

We prove that PSHC-hood is not computable by showing that if it were, then there would be a PSHC that can exploit the Penrose-halting theorem to determine that its own P*[p*] halts, contrary to the theorem! That is, we suppose (for reductio) that the following TM exists:

First, recalling that loop is a specific routine, we observe that whether T $_m$ is T $_n^*$ for some T $_n$ is computable: simply determine whether the lone arrow on which T $_m$ halts is of the form

Moreover, since the enumeration of TMs is computable, the value of n is also computable: it is the machine number of the machine obtained from T $_m$ by removing this circuit. Thus the following TM exists:

It thus follows from our supposition that

exists. Call it X.

Suppose that T $_m$ is T $^*_n$ and T $_n$ is a PSHC: by definition the B subroutine outputs $n\neq 0$ , which is input to the S subroutine; and since $T_n$ is a PSHC, S outputs 1 by definition—hence X outputs 0. Otherwise X does not halt: either $T_m$ is not T $^*_n$ for any n, so B outputs 0, and X enters an endless loop; or it is, but T $_n$ is not a PSHC, so S outputs 0 and again X enters an endless loop.

(i) That is, X outputs 1 if T $_m$ is T $^*_n$ and T $_n$ is a PSHC, otherwise it does not halt—X would be a positive solution of the ‘PSHC*-hood problem’.Footnote ⁴

(ii) If X outputs 1 for input m then T $_m[m]$ halts; trivially, since the antecedent is false for any m.

(iii) If X outputs 0 for input m then the S subroutine gave output 1. Thus (a) T $_n$ is a PSHC by definition of S; and (b) the B subroutine output was n, so T $_m$ is T $^*_n$ by definition of B. Hence from the Penrose-halting theorem, T $_m[m]$ does not halt.

(iv) From (ii) and (iii) X is a PSHC.

Let x* be the machine number of X*.

(v) From (i) and (iv), if x* is input into X then the output of X is 0.

(vi) From (iv) and the Penrose-halting theorem, if x* is input into X then X does not halt.

But this is a contradiction, so neither X* nor X exist, so S does not exist (since B does), and contrary to our supposition PSHC-hood is not computable. □

Since according to this corollary to the Penrose-halting theorem no machine has this ability, it is question begging to assume that we possess such an ability, in an argument attempting to show that our cognitive capacities outstrip the computable. But without such an ability we are in no position to use the Penrose-halting theorem to determine that an arbitrary TM A self-halts; to appeal to the theorem one first has to determine that A is P* for some PSHC P, and that in turn first requires that P be identified as a PSHC. So there is no reason to think that our ability to recognize self-halting includes the ability to determine for an arbitrary PSHC P—when not already identified as such—whether P*[p*] halts; no reason to think that we can outperform a TM in this regard.

2 Discussion

Readers familiar with arguments of the type given by Penrose will recognize a familiar pattern in the foregoing. Appeal to Gödel’s incompleteness theorems and their correlates in incomputability results goes back even earlier than [Reference Lucas6] famous argument (Penrose believes that his argument succeeds where Lucas’s fails). The reply given here is in essence that given in response in [Reference Benacerraf1] (if my argument is correct, then Penrose’s does not fare better then Lucas’).Footnote ⁵ But although my point has been made (numerous times) previously, I believe my formulation provides a very direct and clear account of what goes wrong in the concrete form that Penrose’s argument takes.

To support that claim, let me conclude by showing how to deal with two responses that can be made on Penrose’s behalf. First, Penrose himself goes on to recognize a similar point to ours; there is ‘an algorithm for generating [P*], given [PSHC P]’ [Reference Penrose7, p. 65], thereby ‘improving’ on P. That is, given a PSHC P, the machine number of a TM (namely P*) that does not self-halt, yet ‘defeats’ P, is computable. However, Penrose is ambivalent about what to make of this observation; on the one hand it seems to undermine his argument, while on the other perhaps such a TM does not know that the machine halts. Certainly, the Penrose-halting theorem means that such a computation cannot be incorporated in a PSHC P that identifies P* as a non-self-halter! But once again, who is to say that we can? For us to determine that P* is a non-self-halter via the theorem would require us to identify P as a PSHC, and it is question begging to suppose we can. Moreover, no reason is given to think that there is any behavior of which we are capable given the machine number of P*, that a TM is not equally capable. So I find this response unconvincing.

Second, Molyneux, in correspondence, offered a different way of taking Penrose’s original argument, which I will reformulate, making explicit a hidden assumption, as follows. First, for reductio, suppose that in regards to determining self-halting I am a PSHC, P: I can’t always tell, but given a TM, if I say it halts on its own machine number then it does, and if I say it doesn’t then it doesn’t. So if I know that p is my machine number, and if I further know that I am a PSHC, then I infer from the Penrose-halting theorem that P*[p*] does not halt. However, it follows from the theorem that P[p*] does not halt, so P does not recognize that P*[p*] fails to halt: and since by supposition I ‘am’ P for purposes of determining self-halting, I do not recognize that P*[p*] does not halt. Contradiction: therefore I am not P as supposed, for any P.

But of course the argument makes two suppositions, not one. Instead of concluding that I am not a PSHC, I can instead conclude that I do not know that I am, for then I cannot apply the theorem to myself. And such self-ignorance is what the argument entails for TMs, which we can see as follows. Suppose that I am a TM. First, note that the corollary we proved earlier does not immediately rule out my knowing that P is a PSHC; the non-existence of TMs solving the general problem of identifying PSHCs does not preclude TMs that partially solve the problem. So second, we model the attempted use of such a partial solution by my PSHC to apply the Penrose-halting theorem, and see that it must fail. Suppose that a subroutine of my algorithm P for determining self-halting (a) computes whether input a is c* for some TM, and if so (b) attempts to compute whether C is a PHSC, and if so (c) outputs ‘A is a non-self-halter’ (then halts), in accordance with Penrose-halting. That is completely possible, and consistent with P’s being a PSHC. So assuming it is, then the theorem tells us that if p* is input into P, then P does not halt. However, p* is c* for a PHSC, namely for p, so the computation would have halted if steps (a)–(c) were successfully completed. Since steps (a) and (c) are computable, the only way the algorithm can fail to give an output is if step (b) fails to halt, which therefore must happen when p* is input into the subroutine; which is to say when I attempt to compute whether P is a PSHC. Or in yet other words, neither I nor any other TM can compute whether our own algorithm for determining self-halting is a PSHC, if that information is used as part of our computation of self-halting. (In essence, this is a second corollary of the Penrose-halting theorem.)Footnote ⁶

The question, of course, is whether or not I am a computer (specifically with respect to determining self-halting). The argument we are considering derives a contradiction by assuming that I can determine that P*[p*] fails to halt by appeal to the Penrose-halting theorem; even though, in accordance with the same theorem, my algorithm P cannot. But I can only appeal to the theorem if I can successfully determine that P is in fact a PSHC; the contradiction entails that a TM cannot, as we just saw. Is there any reason to suppose that I can? None is given in [Reference Penrose7].Footnote ⁷ And since a TM cannot, and what is at stake is whether I am a TM, it is again question begging to assume that I can.

So we have seen the flaw in two attempts to leverage the Penrose-halting theorem for an argument against a computationalist account of cognition. They both fail for the same reason; the theorem entails that a TM is incapable of determining whether a TM is a PSHC, and so given what is at stake it is question begging to assume that we can. Quite possibly such arguments could be formulated in other ways, but the argument here is strong evidence that they will fail for much the same reason.Footnote ⁸

Footnotes

1 The following formulation of the proof I learned from Daine Danielson, who learned it as a student of Bernard Molyneux. Thanks to them both for discussion and feedback.

2 Two points: (i) Bear in mind that being a PSHC means for a specific enumeration of TMs. What a TM outputs for any given input is of course independent of the enumeration. (ii) We make the standard assumption that a TM that will sometimes misidentify a self-halter as a non-self-halter (or vice versa) is not computing the self-halting problem at all. To support this assumption, note that no TM can incorrectly solve the self-halting problem for only a finite (or other computable) subset of the natural numbers, else the self-halting problem would be soluble. Imperfect solutions to the problem get it wrong a lot!

3 What follows restricts to the self-halting problem, not the general halting problem considered by Penrose (the problem to determine for all $(m,n)$ whether T $_m[n]$ halts). Since the former is a special case of the latter, insolubility results for the former also apply to the latter.

4 Note that TMs which compute the same function may have different machine numbers, since TMs are defined in terms of their composition, in some specific scheme for constructing TMs. X then does not compute whether T $_m$ is computationally equivalent to P* for some PSHC P, but rather whether it is constructed from a PSHC according to the definition.

5 See [Reference Boolos, Boyle, Breuel, Butterfield, Chalmers, Davis, Dennett, Doyle, Eagleson, Eccles, Ettinger, Garnham, Gigerenzer, Gilden, Glymour, Higginbotham, Hodgkin, Houston, Hubbard, Johnson, Kelly, Kentridge, Lappin, Libet, Lutz, MacLennan, Madsen, Manaster-Ramer, McDermott, Mortensen, Niall, Penrose, Perlis, Pustejovsky, Roper, Roskies, Savitch, Smithers, Stanovich, Taylor, Tsotsos, Varela, Waltz, Wilensky, Zadrozny and Zytkow2] for further responses of this type to Penrose’s argument, and his replies. See also [Reference Koellner3] for a recent damning assessment of the validity of arguments of the Lucas type. The sequel [Reference Koellner4] addresses the ‘new’ argument found in [Reference Penrose8], which has a structural similarity to that considered here, but is rather more abstract.

6 Moreover, it is a instance of a a more general result found in [Reference LaForte, Hayes and Ford5], on the basis of Kleene’s (second) recursion theorem.

7 Penrose does address this issue elsewhere (e.g., [Reference Boolos, Boyle, Breuel, Butterfield, Chalmers, Davis, Dennett, Doyle, Eagleson, Eccles, Ettinger, Garnham, Gigerenzer, Gilden, Glymour, Higginbotham, Hodgkin, Houston, Hubbard, Johnson, Kelly, Kentridge, Lappin, Libet, Lutz, MacLennan, Madsen, Manaster-Ramer, McDermott, Mortensen, Niall, Penrose, Perlis, Pustejovsky, Roper, Roskies, Savitch, Smithers, Stanovich, Taylor, Tsotsos, Varela, Waltz, Wilensky, Zadrozny and Zytkow2]), but like [Reference LaForte, Hayes and Ford5] I find these arguments unconvincing. In any case, they lie outside the scope of this paper.

8 For instance, suppose the input is not a machine number m but an indexical expression, such as ‘the * of your self-halting algorithm’. Assume that it denotes P* for some PSHC P (if not, then the PSHC is simply not being asked the right question), then everything applies as before: by the Penrose-halting theorem P will not recognize that ‘the * of your self-halting algorithm’ is a non-self-halter. But neither would you be able to use the theorem to infer it, unless you recognize that you are a PSHC; so again you can consistently be a PSHC as long as you don’t know it. (Thanks to Molyneux for raising this question.)

References

Benacerraf, P., God, the devil, and Gödel . The Monist, vol. 51 (1967), pp. 9–32.Google Scholar

Boolos, G., Boyle, F., Breuel, T. M., Butterfield, J., Chalmers, D. J., Davis, M., Dennett, D. C., Doyle, J., Eagleson, R., Eccles, J. C., Ettinger, R. H., Garnham, A., Gigerenzer, G., Gilden, D. L., Glymour, C., Higginbotham, J., Hodgkin, D., Houston, A. I., Hubbard, T. L., Johnson, J. L., Kelly, K., Kentridge, R. W., Lappin, J. S., Libet, B., Lutz, R., MacLennan, B., Madsen, M. S., Manaster-Ramer, A., McDermott, D., Mortensen, C., Niall, K. K., Penrose, R., Perlis, D., Pustejovsky, J., Roper, T., Roskies, A., Savitch, W. J., Smithers, T., Stanovich, K. E., Taylor, M. M., Tsotsos, J. K., Varela, F. J., Waltz, D., Wilensky, R., Zadrozny, W., and Zytkow, J. M., An open peer commentary on the emperor’s new mind . Behavioral and Brain Sciences, vol. 13 (1990), pp. 643–705.Google Scholar

Koellner, P., On the question of whether the mind can be mechanized, i: From Gödel to Penrose . Journal of Philosophy, vol. 115 (2018), no. 7, pp. 337–360.Google Scholar

Koellner, P., On the question of whether the mind can be mechanized, ii: Penrose’s new argument . Journal of Philosophy, vol. 115 (2018), no. 9, pp. 453–484.Google Scholar

LaForte, G., Hayes, P. J., and Ford, K. M., Why Gödel’s theorem cannot refute computationalism . Artificial Intelligence, vol. 104 (1998), nos. 1–2, pp. 265–286.Google Scholar

Lucas, J. R., Minds, machines and Gödel . Philosophy, vol. 36 (1961), no. 137, pp. 112–127.Google Scholar

Penrose, R., The Emperor’s New Mind, Penguin, New York, 1989.Google Scholar

Penrose, R., Shadows of the Mind: A Search for the Missing Science of Consciousness, Oxford University Press, Oxford and New York, 1994.Google Scholar

Article contents

A SHORT NOTE ON THE SIGNIFICANCE OF THE PENROSE-HALTING THEOREM

Abstract

Keywords

MSC classification

Information

1 The Penrose-halting theorem

2 Discussion

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests