Problems of Control and Information Theory 4. (Budapest, 1975)

1975 / 1. szám - Csibi, S.: Learning under computational constraints from weakly dependent samples

4 CSIBI: LEARNING UNDER COMPUTATIONAL CONSTRAINTS Suppose X and Y = {Yn\ n = 1, 2,.. .} to be related in a memoryless manner, in the following sense P(F„ = j I Xn, Zn l) = P(T„ = j I Xn) = ITj(Xn) for j = 0, 1. (Zn = {Zj, i <Z n}. We call, according to the usual terminology, II j= {rij(x), x £6X } a posteriori probability function with respect to //,. In this paper we show how this problem, concerning weakly dependent observations, may be reduced to disciplines of learning from totally indepen­dent sample sequences. In so doing we first extend two well-known theorems, due to Braverman and Rozonoer [5], in a way appropriate for our further purposes. Next we illustrate how to use these results for learning from weakly dependent samples under a set of usual computational constraints. 2. Extending stability theorems We adopt, concerning a pair of sequences S = {Sn; n = 1,2,...} and Z of random variables, the following well-known [8] notions: Definition A [8]. We say S to be adopted to Z if, for any n, S„ is measur­able with respect to §{Zn). (Here and in what follows we refer by A, B, .. . and 1,2,... to cited and new notions, results, etc. respectively.) Definition В [8]. A sequence S of random variables, adapted to Z, is centered (with respect to Z) if E(Sn+1 | Zn) = 0 for any n. (Here and in what follows E stands for the expectation and E(.|.), specifically, for the condi­tional expectation.) Let us now turn, concerning Braverman’s and Rozonoer’s stability studies [5], to a fundamental result which underlies a number of theorems on learning either functions or classification rules from labelled samples. Theorem A [5]. Given a sequence G = {Gn, n = 1, 2, . . .} of nonnegative valued random variables adapted to Z. Let (i) \L(!y < °o and (ii) E(Gn+1|Zn)<G„ + Ä(n0). (1) (S(0) = {Sn; n = 1,2,...} stands for a sequence of nonnegative valued ran­dom variables adapted to Z, such that ^ E.S'}^ < °o.) Then there exists a /1 = 1 real valued random variable q for which P(limG„ = p) = 1. /1—00 Let us extend Theorem A according to our further purposes.

Next