Enumerative Induction and Semi-uniform Convergence to the Truth

Lin, Hanti

doi:10.1007/978-3-662-55665-8_25

Hanti Lin¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10455))

Included in the following conference series:

International Workshop on Logic, Rationality and Interaction

1127 Accesses

Abstract

I propose a new definition of identification in the limit, also called convergence to the truth, as a new success criterion that is meant to complement, but not replace, the classic definition due to Putnam (1963) and Gold (1967). The new definition is designed to explain how it is possible to have successful learning in a kind of scenario that the classic account ignores—the kind of scenario in which the entire infinite data stream to be presented incrementally to the learner is not presupposed to completely determine the correct learning target. For example, suppose that a scientists is interested in whether all ravens are black, and that she will never observe a counterexample in her entire life. This still leaves open whether all ravens (in the universe) are black. From a purely mathematical point of view, the proposed definition of convergence to the truth employs a convergence concept that generalizes net convergence and sits in between pointwise convergence and uniform convergence. Two results are proved to suggest that the proposed definition provides a success criterion that is by no means weak: (i) Between the proposed identification in the limit and the classic one, neither implies the other. (ii) If a learning method identifies the correct target in the limit in the proposed sense, any U-shaped learning involved therein has to be essentially redundant. I conclude that we should have (at least) two success criteria that correspond to two senses of identification in the limit: the classic one and the one proposed here. They are complementary: meeting any one of the two is good; meeting both at the same time, if possible, is even better.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Induction: A Logical Analysis

Article Open access 08 July 2020

A Solution to Wiehagen’s Thesis

Article 21 April 2016

The Futility of Bias-Free Learning and Search

Notes

1.
(Directedness) Each partially ordered set $(\mathcal {I}(w), \le )$ is directed, namely $i, j \in \mathcal{I}_{w}$ implies $i, j \le k$ for some $k \in \mathcal{I}_{w}$.
2.
For the problems and learning methods defined here to be interesting in computer science, we need to require that the hypotheses in $\mathcal {H}$ and the information states in $\mathcal {I}$ can in principle be encoded by natural numbers. But the results in this paper hold generally whether or not we add this requirement. Furthermore, this definition can be generalized by allowing a learning method to output not just hypotheses in $\mathcal {H}$ but also their Boolean combinations.
3.
Definitions 1–3 are essentially the order-theoretic counterparts of the topologically formulated definitions proposed in Baltag et al. (2015) and in Kelly et al. (2016), provided that we include the generalizations mentioned in preceding footnotes: allowing $(\mathcal {I}(w), \le )$ to be directed, and allowing a learning method to output Boolean combinations of hypotheses.
4.
Sketch of Proof. Suppose that M solves the hard raven problem in the limit semi-uniformly. Then, since $\mathcal {I}(\texttt {yes})$ is the set of all finite sequences of 0s linearly ordered by extension, M has to identify the truth $\texttt {yes}$ in the limit at world $(\texttt {yes}, {(0, 0, \ldots , 0, \ldots )})$. So M has to fail to identify the truth $\texttt {no}$ in the limit at world $(\texttt {no}, { (0, 0, \ldots , 0, \ldots )})$.
5.
For example, consider the following problem $\mathcal {P}= \big ( \mathcal {H}, \mathcal {I}, \le , \mathcal {W}, |\!\cdot \!|) $ with $\mathcal {W}= \{w_0, w_1, w_2\}$, $\mathcal {I}= \{0, 1, 2\}$, $0< 1 < 2$, $|0| = \{w_0, w_1, w_2\}$, $|1| = \{w_1, w_2\}$, $|2| = \{w_2\}$, $\mathcal {H}= \{H_\text {even}, H_\text {odd}\}$, $|H_\text {even}| = \{w_0, w_2\}$, $|H_\text {odd}| = \{w_1\}$. Consider this method $M: 0 \mapsto \texttt {?}, 1 \mapsto H_\text {odd}, 2 \mapsto H_\text {even}$. This solves the problem in the limit semi-uniformly. But we should not be satisfied with this method, for there is a better one: $M': 0 \mapsto H_\text {even}, 1 \mapsto H_\text {odd}, 2 \mapsto H_\text {even}$, which solves the problem both semi-uniformly and classically. I thank Konstantin Genin for bringing this example to my attention.
6.
See Kelley (1991) for a review.

References

Baltag, A., Gierasimczuk, N., Smets, S.: A study in epistemic topology. In: Ramanujam, R. (ed.) Proceedings of the 15th Conference on Theoretical Aspects of Rationality and Knowledge (TARK 2015). ACM 2015 (ILLC Prepublication Series PP-2015-13) (2015)
Google Scholar
Carlucci, L., Case, J.: On the necessity of U-shaped learning. Topics Cogn. Sci. 5(1), 56–88 (2013)
Article Google Scholar
Carlucci, L., Case, J., Jain, S., Stephan, F.: Non u-shaped vacillatory and team learning. In: Jain, S., Simon, H.U., Tomita, E. (eds.) ALT 2005. LNCS, vol. 3734, pp. 241–255. Springer, Heidelberg (2005). doi:10.1007/11564089_20
Chapter Google Scholar
Gold, E.M.: Language identification in the limit. Inf. Control 10(5), 447–474 (1967)
Article MathSciNet MATH Google Scholar
Kelley, J.L.: General Topology. Springer, New York (1991)
MATH Google Scholar
Kelly, T.K., Genin, K., Lin, H.: Realism, rhetoric, and reliability. Synthese 193(4), 1191–1223 (2016)
Article MathSciNet MATH Google Scholar
Putnam, H.: Degree of confirmation and inductive logic. In: Schilpp, P.A. (ed.) The Philosophy of Rudolf Carnap. Open Court, La Salle (1963)
Google Scholar
Valiant, L.: A theory of the learnable. Commun. ACM 27(11), 1134–1142 (1984)
Article MATH Google Scholar

Download references

Acknowledgements

I thank Kevin Kelly, Konstantin Genin, and two anonymous referees for their helpful comments and suggestions.

Author information

Authors and Affiliations

UC Davis, 1 Shields Ave, Davis, CA, 95616, USA
Hanti Lin

Authors

Hanti Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hanti Lin .

Editor information

Editors and Affiliations

Institute for Logic, Language and Computation, University of Amsterdam , Amsterdam, Noord-Holland, The Netherlands
Alexandru Baltag
Department of Philosophy, The University of Auckland, Auckland, Auckland, New Zealand
Jeremy Seligman
Graduate School of Letters, Hokkaido University , Sapporo, Japan
Tomoyuki Yamada

A Convergence Generalized

As mentioned in the introduction, the classic definition of identification in the limit only employs the most familiar concept of convergence—pointwise convergence of sequences—while in analysis and topology mathematicians have already worked out more elaborated concepts of convergence. This section provides a quick review of those concepts.

Definition 9

(Sequence Convergence). Let $f: \omega \rightarrow Y$ be a sequence of points in a topological space Y. Say that f(n) converges to y as n travels upward in $(\omega , \le )$ just in case:

for any open neighborhood U of y in Y,
there exists $n \in \omega $ such that
$f(m) \in U$ for every $m \ge n$ in $\omega $.

The next step is to go from convergence of a sequence to convergence of a “generalized” sequence, a.k.a. “net”. A sequence is defined on a very special set of indices, namely $\omega $, linearly ordered by $\le $. Let us have a more general set I of indices with a weaker ordering structure:

Definition 10

(Directed Poset). A poset $(I, \le )$ consists of a set I partially ordered by $\le $. It is directed just in case $i, j \in I$ implies $i, j \le k$ for some $k \in I$.

Definition 11

(Generalized Sequence, or Net). A generalized sequence or net $f: (I, \le ) \rightarrow Y$, is a function from a nonempty directed poset $(I, \le )$ to a topological space Y.

Definition 12

(Net Convergence). Let $f: (I, \le ) \rightarrow Y$ be a net. Say that f(i) converges to y as i travels in $(I, \le )$ just in case:

for any open neighborhood U of y in Y,
there exists i in I such that
$f(i') \in U$ for every $i' \ge i$ in I.

Note that the definition of net convergence is formally identical to that of sequence convergence, except that the underlying spaces of indices are generalized. Net convergence has a number of equivalent formulations, some of which are very sophisticated (and stated in terms of, say, filters).^{Footnote 6} Let me introduce the following one:

Proposition 3

(Net Convergence Redefined by “Cofinal Upper”). Let $f: (I, \le ) \rightarrow Y$ be a net. Then net convergence of f can be equivalently redefined in terms of cofinal upper subsets as follows. f(i) converges to y as i travels in $(I, \le )$ if and only if:

for any open neighborhood U of y,

there exists a cofinal upper subset S of $(I, \le )$ such that

$f(i) \in U$ for all $i \in S$.

Proof

The ($\Rightarrow $) side follows immediately from the order-theoretic result that every principal upper subset of a nonempty directed poset is cofinal. The ($\Leftarrow $) side follows immediately from the order-theoretic result that every cofinal upper subset of a nonempty directed poset is nonempty (because of cofinality) and thus (by upward closure) includes a principal upper subset. $\square $

With the above formulation of net convergence, we can generalize further by relaxing the restriction to directed posets and allowing any nonempty posets:

Definition 13

(Generalized Convergence and Limit). Let f be a function from a nonempty poset $(I, \le )$ to an arbitrary topological space Y. Say that f(i) converges to y as i travels in $(I, \le )$ (in the sense of generalized convergence) just in case:

for any open neighborhood U of y,
there exists a cofinal upper subset S of $(I, \le )$ such that
$f(i) \in U$ for all $i \in S$.

If such a y exists uniquely, also say that y is the generalized limit of f(i) as i travels in $(I, \le )$.

The above is the concept of convergence that underlines the definition of semi-uniform identification in the limit: the index i travels in a partially ordered set I in general, a convergence zone is required to be cofinal and upper, and the underlying topology of the codomain is discrete.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lin, H. (2017). Enumerative Induction and Semi-uniform Convergence to the Truth. In: Baltag, A., Seligman, J., Yamada, T. (eds) Logic, Rationality, and Interaction. LORI 2017. Lecture Notes in Computer Science(), vol 10455. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-55665-8_25

Download citation

DOI: https://doi.org/10.1007/978-3-662-55665-8_25
Published: 24 August 2017
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-55664-1
Online ISBN: 978-3-662-55665-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the Association of Logic, Language and Information. (opens in a new tab)

Enumerative Induction and Semi-uniform Convergence to the Truth

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Induction: A Logical Analysis

A Solution to Wiehagen’s Thesis

The Futility of Bias-Free Learning and Search

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

A Convergence Generalized

A Convergence Generalized

Definition 9

Definition 10

Definition 11

Definition 12

Proposition 3

Proof

Definition 13

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships