AMethod for the Recovery of Gaps in General Analytic Signals

Gapped (or missing) data is encountered when it is hard to obtain contiguous measurements of a function for a long time. The recovery of the missing function values is possible, at least in theory, given a certain a priori knowledge about the function, for example, analyticity on a certain interval. A real-valued function f , defined on a closed interval I on the real line, is called analytic on I , if there exists an analytic extension of f onto some open set G of the complex plane C that contains I [1]. That is, there is a unique single-valued analytic function, defined on G, that coincides with f on I . Let f be analytic on I . When the values of f on a subinterval of I are given, f can be determined on I , by means of analytic continuation. In many missing data problems, the functions are assumed to be bandlimited. Let L2(R) denote the Hilbert space of all square integrable functions defined on the real line R. For τ > 0, and a function f ∈ L2(R), we define the operators Pτ and Qτ as follows:


Introduction
Gapped (or missing) data is encountered when it is hard to obtain contiguous measurements of a function for a long time.The recovery of the missing function values is possible, at least in theory, given a certain a priori knowledge about the function, for example, analyticity on a certain interval.A real-valued function f , defined on a closed interval I on the real line, is called analytic on I, if there exists an analytic extension of f onto some open set G of the complex plane C that contains I [1].That is, there is a unique single-valued analytic function, defined on G, that coincides with f on I. Let f be analytic on I.When the values of f on a subinterval of I are given, f can be determined on I, by means of analytic continuation.
In many missing data problems, the functions are assumed to be bandlimited.Let L 2 (R) denote the Hilbert space of all square integrable functions defined on the real line R.For τ > 0, and a function f ∈ L 2 (R), we define the operators P τ and Q τ as follows: (1) dt, and, F −1 f be the Fourier transform and inverse Fourier transform of f , respectively.For σ > 0, and a function f ∈ L 2 (R), we define the bandlimiting operator P σ as follows: where ( The bandlimited signal extrapolation is defined as follows.
Find f having g = P τ f and knowing that f = P σ f .Various methods were proposed for solving the bandlimited extrapolation problem; [2] contains a detailed comparison of some of the methods.The iterative bandlimited signal extrapolation algorithm of Papoulis and Gerchberg [3,4] is attractive due to its relative simplicity, requiring only Fourier transform and inverse Fourier transform in each iteration.
Based on the identity Papoulis [3] and Gerchberg [4] proposed the following algorithm: ( A proof for the convergence of the algorithm was given by Papoulis [3] using signal expansions in terms of the prolate spheroidal wave functions.Gerchberg [4] actually solved the dual problem of extrapolating the spectrum of a finite object beyond its diffraction limits, leading to superresolution and image enhancement. A bandlimited function f (t), defined on the real-line R, is an entire function of exponential type when t is extended to C [5].Hence, bandlimited signals are a very special case of analytic functions, and other analytic signals may be encountered in practice.In this paper, we propose a method for the recovery of gaps in general analytic signals using a generalization of the Papoulis-Gerchberg algorithm, based on signal expansions in terms of the Chebyshev polynomials of the first kind.In Section 2, we briefly review the properties of Chebyshev polynomials.We generalize in Section 3 the Papoulis-Gerchberg algorithm from continuous-time bandlimited signals to continuous-time signals in polynomials spaces and general analytic signals.We then focus, in Section 4, on the discrete implementation of the continuoustime algorithm, based on a suitable nonuniform sampling scheme and the discrete cosine transform.The performance of the recovery algorithm is demonstrated, in Section 6, by a numerical example.

Chebyshev Polynomials
The Chebyshev polynomial T n (x) of the first kind is a polynomial in x of degree n, defined by the relation Without loss of generality, we focus hereafter on the interval ) be the Hilbert space of all real-valued square integrable functions on [−1, 1] with respect to the nonnegative weight function This is the space of functions f such that the norm is finite.The associated inner product is The set {T i (x), i = 0, 1, . ..} forms a complete orthogonal polynomial system in L 2 w ([−1, 1]).The orthogonality relation is given by [6] T where The N zeroes of the Chebyshev polynomial T N (x) are These zeroes are listed in decreasing order in x.The (N −1)th degree polynomial p N−1 (x), interpolating a function f (x) in the zeroes of T N , can be written as a sum of Chebyshev polynomials in the form where the coefficients c n in ( 14) are given by the explicit formula where It follows [7][8][9] that which is the well-known discrete cosine transform (type II) [10] applied on the samples of the function f (x) taken at a nonuniform sampling grid corresponding to the zeroes of T N .Efficient algorithms exist for computing the discrete cosine transform (see, e.g., [11] and references therein).

Gap Recovery in General Analytic Signals Based on A Chebyshev Polynomial Series Expansion
Let P N−1 denote the subspace of all polynomials of degree where ).We denote by P S and Q S the orthogonal projection operators onto subspace C S and C + S , respectively.It follows that Q S = Id − P S , where Id is the identity operator in L 2 w ([−1, 1]).If we assume that f = P N−1 f , then we have: The continuous-time recovery problem can be stated as follows.Find f having g = P S f and knowing that f = P N−1 f .We propose the following Papoulis-Gerchberg type algorithm for recovering gaps in polynomial signals in The PGP Algorithm.
The iteration process is stopped when f m − f m−1 ≤ , where > 0 is a user specified threshold.f m or P N−1 f m is taken as the result of the recovery algorithm after m iterations.
If f ∈ P N−1 , then f = P N−1 f , and it follows that We know from ( 22) and (23) that Therefore, which proves that the PGP algorithm has the property that it reduces the error energy during the iteration process.
Lemma 1.A signal f ∈ P N−1 can be uniquely determined from P S f by using the PGP algorithm (21), and (22).
Proof.Iterating (22), we obtain Substituting ( 23) in (26), we obtain According to von Neumann's theorem on alternating projections [12], for every where f c is the projection of f onto the closed subspace 1] and can vanish in an interval if and only if it is the zero function.Hence, the subspace C + S ∩ P N−1 contains only the zero function.We obtain that f c = 0 and that the PGP converges in norm to the required function f .P N−1 P S P N−1 is a bounded positive self-adjoint compact operator.Hence, there exists an orthonormal basis of P N−1 consisting of eigenfunctions φ n , n = 0, 1, . . ., N − 1, of the operator P N−1 P S P N−1 , with corresponding real positive eigenvalues λ n , such that 1 Lemma 2. Let f = φ k be an eigenfunction of P N−1 P S P N−1 with corresponding eigenvalue λ k , and let g = P S f be the given values of f .Then, Proof.We will prove (29) by induction.It is true for m = 0 that where we used the identity P N−1 φ k = φ k (since φ k ∈ P N−1 ).Suppose that it is true for some m ≥ 1.By using (22), we obtain Substituting P N−1 P S P N−1 φ k = λ k φ k and using the induction assumption, we obtain In ( 31) and (32), we used the idempotent property of the orthogonal projection operator P N−1 (i.e., P N−1 = P N−1 P N−1 ).It is easy to see that (33) Hence, which completes the proof.

ISRN Signal Processing
Any f ∈ P N−1 can be represented as where Using ( 29) and (35), we obtain that after m iterations of the PGP algorithm The energy of error is given by It follows that the rate of convergence of the PGP algorithm is dependent on the spectral representation of the signal in terms of the eigenfunctions of the operator P N−1 P S P N−1 .
If the signal contains only significant components which correspond to relatively high eigenvalues, the convergence is fast.If this is not the case, the PGP will be slowly convergent.
The PGP can also be viewed as an example of signal restoration by the method of projections onto convex sets (POCS) [14,15].The two convex sets, or constraints, are the affine set V 1 , which contains all the functions f ∈ L 2 w ([−1, 1]) with g = P S f and the subspace we have two inconsistent convex constraints.The convergence of a POCS algorithm with two inconsistent convex constraints is a well-studied problem [16,17].In this case, the PGP algorithm iterates P N−1 f m converge to the signal q ∈ P N−1 closest to V 1 , that is, with the minimum distance d, where d is given by Thus, the PGP algorithm is exact for all f ∈ P N−1 , and a certain "best polynomial approximation" in the sense of (39) is obtained when f / ∈ P N−1 .

The expansion of analytic function in terms of the Chebyshev polynomials converges rapidly (exponentially). It can be proved that if the function f (x) can be extended to a function f (z) analytic on the ellipse
. This property of rapid convergence combined with the approximation property (39) makes the proposed algorithm a practical gap recovery tool for general analytic functions.

Discrete Implementation of the Continuous-Time-Recovery Algorithm
A Chebyshev polynomial-based series expansion allows a convenient and accurate discrete implementation by a nonuniform sampling scheme, where the samples are taken at time locations that are Chebyshev polynomial zeroes and a discrete cosine transform (DCT) applied on the nonuniform samples [8,9].Equation ( 15) is a Gauss-Chebyshev quadrature for the original integral (19), which is known to be a very accurate numerical method [6].In addition, if the maximum absolute error is used as the optimality criterion, then the interpolating polynomial given by ( 14) provides optimal approximation among polynomials of degree = N − 1 to the function f [18].Let R N denote the real N-dimensional space.The l 2 inner product is given by and the induced norm is given by Let C be the N × N unitary DCT matrix defined as The DCT of the vector y ∈ R N (arranged as a column vector) is given by y = Cy.The inverse DCT of the vector y is given by y = C T y, where C T is the transpose of C. A discrete signal y is bandlimited if its DCT y vanishes on some fixed set, that is, if where S is a fixed nonempty proper subset of {0, 1, . . ., N −1}.
The set of signals bandlimited to a specific set S is a linear subspace of R N , of dimension equal to the cardinal of the complement of S.
Let y[k] = f (x k ), k = 0, . . ., N − 1 be a vector in R N , whose elements are the samples of the analytic function f (x) taken in the zeroes of T N (13).Since only P S f is available, only certain Q < N samples are known.Assuming that f ∈ P M−1 and M ≤ Q, the discrete recovery problem is to determine the DCT coefficients y[m] for m = 0, 1, . . ., M − 1 from the Q known samples.Using the DCT coefficients (with an appropriate simple scaling) in (14), we obtain the (M − 1)th degree polynomial that approximates f (x).
Let S 1 be a subset of size Q in [0, 1, 2, . . ., N − 1] containing the indices of the known samples, let S 2 denote the subset of indices [0, 1, . . ., M − 1], and denote D = C −1 = C T .The required DCT coefficients can be determined by solving the least squares problem It can be proved [19] that any square submatrix (M = Q) of the matrix D in (44) is the product of a Vandermonde and upper triangular matrix, and therefore, it is invertible.It follows that the submatrix D ki , i ∈ S 2 , k ∈ S 1 , is a full column rank matrix when M ≤ Q.The resulting least squares problem (44) can be solved by many methods (e.g., singular value decomposition, QR factorization, conjugate-gradient method for solving the normal equations, etc.) [20,21].A simple popular option (not necessarily the best in terms of computational complexity) is a version of the discrete Papoulis-Gerchberg algorithm to be described below.
Let the operator P A map a signal y ∈ R N into another, P A y = Ay, by multiplying y with a diagonal matrix A containing only zeroes or ones.A mm = 1, if m ∈ S 1 , and A mm = 0, otherwise.It is easy to see that this operation is idempotent and self-adjoint.Hence, P A is the orthogonal projector onto the subspace of R N signals with zero-valued elements at the indices not belonging to S 1 .
Let the operator P B map a signal y into another P B y = C T BC y, where B a diagonal matrix containing only zeroes or ones.B mm = 1, if m ∈ S 2 , and B mm = 0, otherwise.Using the identities B 2 = B and B T = B, it follows that P B is idempotent and self-adjoint for every y, h ∈ R N .Hence, P B is the orthogonal projector onto the subspace of bandlimited (in terms of the DCT and A discrete version of the PGP algorithm is obtained by working in the finite-dimensional space R N and using the operators P A instead of P S and P B instead of P N−1 .The discrete version of the PGP is also a projection onto convex sets algorithm, and it follows that the iterates P B y m converge to the required least squares solution (see the discussion before (39)).

Discussion
Our model is based on two assumptions.First, the original analytic signal can be accurately approximated by a linear combination of a few Chebyshev polynomials (exponential convergence), and, secondly, that the Gauss-Chebyshev quadrature (15) provides a good approximation to the original integral (19).When these assumptions are not (approximately) satisfied (e.g., due to noise), the algorithm may converge (it always converges) to a crude approximation of the original signal.In such cases, a higher-order polynomial approximation and a denser sampling grid would provide an improvement.If the model assumptions are (approximately) satisfied, the resulting approximation is very accurate and minimizes a certain objective function in the sense of (39).
A Chebyshev polynomial series expansion is a Fourier cosine series with a change of variable x = cos θ for f (cos θ) [6].While f (x) itself is not periodic in x, the function f (cos θ) is periodic in θ.The smoother the function is, the more rapidly its Fourier coefficients decrease.Since f (cos θ) is periodic and analytic, its Fourier series must have exponential convergence.For a Fourier cosine series expansion for the nonperiodic function f (x), the first derivative of the function is discontinuous at the interval's borders and for a Fourier series expansion, the function itself is discontinuous at the interval's borders.It follows that a Chebyshev polynomial series expansion of an analytic function provides a better approximation than a Fourier series expansion (using a discrete Fourier transform in discrete implementations [22]) with the same number of terms, which makes it attractive to recovery problems.The improved approximation comes, however, at the price of a nonuniform sampling grid.The sampling points are clustered near the boundaries of the interval [−1, 1].It follows that the proposed algorithm is more suited to the recovery of gaps that occur in the middle of the signal or nearby.In such cases, we benefit from both the excellent approximation properties of the Chebyshev polynomials and the fact that less missing samples have to be recovered.

Numerical Example
We demonstrate the performance of the discrete implementation of the recovery algorithm by a gap recovery example.The function f (x) defined on the interval [−1, 1] It is analytic on [−1, 1] but has a pole at x = ±0.2j, where j = √ −1.We assume that the piece [−0.4,0.4] is missing and that signal is well approximated by 64 Chebyshev polynomials.The samples are taken at the 128 zeroes of the polynomial T 128 , and it follows that 34 samples are missing.The proposed algorithm is used to recover the missing samples: the l 2 norm of the error is 0.16, while the norm of the missing samples is 3.5244.We see that the recovery algorithm succeeds in approximating the missing data.Finally, the function and the resulting polynomial approximation are given at a uniform grid of 128 points in the interval [−1, 1] in Figure 1.