Abstract
I propose a new definition of identification in the limit, also called convergence to the truth, as a new success criterion that is meant to complement, but not replace, the classic definition due to Putnam (1963) and Gold (1967). The new definition is designed to explain how it is possible to have successful learning in a kind of scenario that the classic account ignores—the kind of scenario in which the entire infinite data stream to be presented incrementally to the learner is not presupposed to completely determine the correct learning target. For example, suppose that a scientists is interested in whether all ravens are black, and that she will never observe a counterexample in her entire life. This still leaves open whether all ravens (in the universe) are black. From a purely mathematical point of view, the proposed definition of convergence to the truth employs a convergence concept that generalizes net convergence and sits in between pointwise convergence and uniform convergence. Two results are proved to suggest that the proposed definition provides a success criterion that is by no means weak: (i) Between the proposed identification in the limit and the classic one, neither implies the other. (ii) If a learning method identifies the correct target in the limit in the proposed sense, any U-shaped learning involved therein has to be essentially redundant. I conclude that we should have (at least) two success criteria that correspond to two senses of identification in the limit: the classic one and the one proposed here. They are complementary: meeting any one of the two is good; meeting both at the same time, if possible, is even better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
(Directedness) Each partially ordered set \((\mathcal {I}(w), \le )\) is directed, namely \(i, j \in \mathcal{I}_{w}\) implies \(i, j \le k\) for some \(k \in \mathcal{I}_{w}\).
- 2.
For the problems and learning methods defined here to be interesting in computer science, we need to require that the hypotheses in \(\mathcal {H}\) and the information states in \(\mathcal {I}\) can in principle be encoded by natural numbers. But the results in this paper hold generally whether or not we add this requirement. Furthermore, this definition can be generalized by allowing a learning method to output not just hypotheses in \(\mathcal {H}\) but also their Boolean combinations.
- 3.
Definitions 1–3 are essentially the order-theoretic counterparts of the topologically formulated definitions proposed in Baltag et al. (2015) and in Kelly et al. (2016), provided that we include the generalizations mentioned in preceding footnotes: allowing \((\mathcal {I}(w), \le )\) to be directed, and allowing a learning method to output Boolean combinations of hypotheses.
- 4.
Sketch of Proof. Suppose that M solves the hard raven problem in the limit semi-uniformly. Then, since \(\mathcal {I}(\texttt {yes})\) is the set of all finite sequences of 0s linearly ordered by extension, M has to identify the truth \(\texttt {yes}\) in the limit at world \((\texttt {yes}, {(0, 0, \ldots , 0, \ldots )})\). So M has to fail to identify the truth \(\texttt {no}\) in the limit at world \((\texttt {no}, { (0, 0, \ldots , 0, \ldots )})\).
- 5.
For example, consider the following problem \(\mathcal {P}= \big ( \mathcal {H}, \mathcal {I}, \le , \mathcal {W}, |\!\cdot \!|) \) with \(\mathcal {W}= \{w_0, w_1, w_2\}\), \(\mathcal {I}= \{0, 1, 2\}\), \(0< 1 < 2\), \(|0| = \{w_0, w_1, w_2\}\), \(|1| = \{w_1, w_2\}\), \(|2| = \{w_2\}\), \(\mathcal {H}= \{H_\text {even}, H_\text {odd}\}\), \(|H_\text {even}| = \{w_0, w_2\}\), \(|H_\text {odd}| = \{w_1\}\). Consider this method \(M: 0 \mapsto \texttt {?}, 1 \mapsto H_\text {odd}, 2 \mapsto H_\text {even}\). This solves the problem in the limit semi-uniformly. But we should not be satisfied with this method, for there is a better one: \(M': 0 \mapsto H_\text {even}, 1 \mapsto H_\text {odd}, 2 \mapsto H_\text {even}\), which solves the problem both semi-uniformly and classically. I thank Konstantin Genin for bringing this example to my attention.
- 6.
See Kelley (1991) for a review.
References
Baltag, A., Gierasimczuk, N., Smets, S.: A study in epistemic topology. In: Ramanujam, R. (ed.) Proceedings of the 15th Conference on Theoretical Aspects of Rationality and Knowledge (TARK 2015). ACM 2015 (ILLC Prepublication Series PP-2015-13) (2015)
Carlucci, L., Case, J.: On the necessity of U-shaped learning. Topics Cogn. Sci. 5(1), 56–88 (2013)
Carlucci, L., Case, J., Jain, S., Stephan, F.: Non u-shaped vacillatory and team learning. In: Jain, S., Simon, H.U., Tomita, E. (eds.) ALT 2005. LNCS, vol. 3734, pp. 241–255. Springer, Heidelberg (2005). doi:10.1007/11564089_20
Gold, E.M.: Language identification in the limit. Inf. Control 10(5), 447–474 (1967)
Kelley, J.L.: General Topology. Springer, New York (1991)
Kelly, T.K., Genin, K., Lin, H.: Realism, rhetoric, and reliability. Synthese 193(4), 1191–1223 (2016)
Putnam, H.: Degree of confirmation and inductive logic. In: Schilpp, P.A. (ed.) The Philosophy of Rudolf Carnap. Open Court, La Salle (1963)
Valiant, L.: A theory of the learnable. Commun. ACM 27(11), 1134–1142 (1984)
Acknowledgements
I thank Kevin Kelly, Konstantin Genin, and two anonymous referees for their helpful comments and suggestions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
A Convergence Generalized
A Convergence Generalized
As mentioned in the introduction, the classic definition of identification in the limit only employs the most familiar concept of convergence—pointwise convergence of sequences—while in analysis and topology mathematicians have already worked out more elaborated concepts of convergence. This section provides a quick review of those concepts.
Definition 9
(Sequence Convergence). Let \(f: \omega \rightarrow Y\) be a sequence of points in a topological space Y. Say that f(n) converges to y as n travels upward in \((\omega , \le )\) just in case:
-
for any open neighborhood U of y in Y,
-
there exists \(n \in \omega \) such that
-
\(f(m) \in U\) for every \(m \ge n\) in \(\omega \).
The next step is to go from convergence of a sequence to convergence of a “generalized” sequence, a.k.a. “net”. A sequence is defined on a very special set of indices, namely \(\omega \), linearly ordered by \(\le \). Let us have a more general set I of indices with a weaker ordering structure:
Definition 10
(Directed Poset). A poset \((I, \le )\) consists of a set I partially ordered by \(\le \). It is directed just in case \(i, j \in I\) implies \(i, j \le k\) for some \(k \in I\).
Definition 11
(Generalized Sequence, or Net). A generalized sequence or net \(f: (I, \le ) \rightarrow Y\), is a function from a nonempty directed poset \((I, \le )\) to a topological space Y.
Definition 12
(Net Convergence). Let \(f: (I, \le ) \rightarrow Y\) be a net. Say that f(i) converges to y as i travels in \((I, \le )\) just in case:
-
for any open neighborhood U of y in Y,
-
there exists i in I such that
-
\(f(i') \in U\) for every \(i' \ge i\) in I.
Note that the definition of net convergence is formally identical to that of sequence convergence, except that the underlying spaces of indices are generalized. Net convergence has a number of equivalent formulations, some of which are very sophisticated (and stated in terms of, say, filters).Footnote 6 Let me introduce the following one:
Proposition 3
(Net Convergence Redefined by “Cofinal Upper”). Let \(f: (I, \le ) \rightarrow Y\) be a net. Then net convergence of f can be equivalently redefined in terms of cofinal upper subsets as follows. f(i) converges to y as i travels in \((I, \le )\) if and only if:
-
for any open neighborhood U of y,
there exists a cofinal upper subset S of \((I, \le )\) such that
\(f(i) \in U\) for all \(i \in S\).
Proof
The (\(\Rightarrow \)) side follows immediately from the order-theoretic result that every principal upper subset of a nonempty directed poset is cofinal. The (\(\Leftarrow \)) side follows immediately from the order-theoretic result that every cofinal upper subset of a nonempty directed poset is nonempty (because of cofinality) and thus (by upward closure) includes a principal upper subset. \(\square \)
With the above formulation of net convergence, we can generalize further by relaxing the restriction to directed posets and allowing any nonempty posets:
Definition 13
(Generalized Convergence and Limit). Let f be a function from a nonempty poset \((I, \le )\) to an arbitrary topological space Y. Say that f(i) converges to y as i travels in \((I, \le )\) (in the sense of generalized convergence) just in case:
-
for any open neighborhood U of y,
-
there exists a cofinal upper subset S of \((I, \le )\) such that
-
\(f(i) \in U\) for all \(i \in S\).
If such a y exists uniquely, also say that y is the generalized limit of f(i) as i travels in \((I, \le )\).
The above is the concept of convergence that underlines the definition of semi-uniform identification in the limit: the index i travels in a partially ordered set I in general, a convergence zone is required to be cofinal and upper, and the underlying topology of the codomain is discrete.
Rights and permissions
Copyright information
© 2017 Springer-Verlag GmbH Germany
About this paper
Cite this paper
Lin, H. (2017). Enumerative Induction and Semi-uniform Convergence to the Truth. In: Baltag, A., Seligman, J., Yamada, T. (eds) Logic, Rationality, and Interaction. LORI 2017. Lecture Notes in Computer Science(), vol 10455. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-55665-8_25
Download citation
DOI: https://doi.org/10.1007/978-3-662-55665-8_25
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-55664-1
Online ISBN: 978-3-662-55665-8
eBook Packages: Computer ScienceComputer Science (R0)