Minimizing the Bright / Shadow Focal Spot Size with Controlled Side-Lobe Increase in High-Numerical-Aperture Focusing Systems

Minimizing the bright/shadow focal spot size for differently polarized incident waves through the additional apodization of the focusing system output pupil by use of an optical element with the vortex phase dependence on angle and the polynomial amplitude dependence on radius is studied. The coefficients of the radial polynomial were optimized with the aim of fulfilling certain conditions such as the energy efficiency preservation and keeping the side lobes under control. The coefficients were chosen so as to minimize the functional using Brent’s method.


Introduction
Recent years have seen the publication of a large number of articles dealing with obtaining a smaller transverse size of the focal spot using a high NA focusing system [1][2][3][4][5].The smallest transverse size of the focal spot has been obtained for a radially polarized beam in which the longitudinal component makes the maximal contribution to the total intensity.
There is a variety of techniques to gain the contribution of the longitudinal component to the total intensity, including the use of an annular aperture which allows only the peripheral part of radiation to pass through the lens [1,2] and the use of additional phase optical elements, which provides a higher energy efficiency [3][4][5].
However, this study is concerned with not simply controlling the contribution of different electric field components into the focal spot region [5], but rather minimizing the (bright/dark) focal spot size by optimally selecting the pupil's transmission function.
In [6], it was proposed that the focal spot formed by a high NA focusing system should be reduced in size using radially polarized Laguerre-Gauss modes of higher radial order, devoid of vortex phase component.
The positive role of the vortex phase function which allows obtaining a smaller focal spot in individual components of the sharply focused electric field at different polarization types was analyzed in [4], whereas the possibility of obtaining a smaller focal spot in terms of total intensity by introducing additional variations of radius was shown in [5].The use of the transmission functions in the form of Zernike polynomials, including those containing vortex phase dependence, was considered in [7].Thus, it has become possible to simultaneously introduce vortex phase dependence and amplitude variations on radius.Note that it has also been shown that it is possible to reduce not only the bright spot but the shadow region as well.
In areas such as optical trapping and micromanipulation [8], STED-microscopy [9], and shadow microscopy [10], there has also been a problem of focusing into a shadow spot or a bright ring with a zero intensity region tightly localized at the center.The described focal distribution is implemented through introducing the phase singularity into the beam under focusing.
In this work, we look into minimizing the bright/shadow focal spot size for differently polarized beams through the additional apodization of the focusing system using an 2 Advances in Optical Technologies optical element with the vortex phase dependence on angle and the polynomial amplitude dependence on radius.The order and direction of the phase vortex component were chosen with regard to the polarization of the focusing system incident beam [11], whereas the coefficients of the low-degree polynomial were optimized so as to fulfill certain conditions, in particular, the energy efficiency preservation and keeping the side lobes at a desired level.The coefficients were chosen so as to minimize the functional using Brent's method [12].

Scalar Diffraction Limit
In the scalar case, the result of focusing a bounded plane wave of wavelength  and wavenumber  = 2/ with a lens of radius  and focal length  is given by the complex amplitude as follows: where (, ) are the polar coordinates in the focal plane and   () is the Bessel function of the th order.Note that the integral ( 1) is calculated precisely.The focal spot radius can be derived from the first zero of the first-order Bessel function as follows: where NA is the lens numerical aperture.
A widely known approach to obtaining a tighter focal spot is by replacing the full aperture by a narrow peripheral ring of width .In this case, the resulting spectrum is associated with the zero-order Bessel function (for small , the integrand can be calculated at midpoint) as follows: with the focal spot radius derived from the first zero of the zero-order Bessel function as follows: The comparison between (1) and (3) suggests that the smaller transverse size of the focal spot is achieved at the cost of lower focal spot intensity ( = 0).At  ∼ , the focal spot intensity will drop in direct proportion to the focal length squared.
Also of great interest is the problem of focusing into a shadow spot or a bright ring with the zero intensity tightly localized at the center.The described focal pattern can be generated by introducing a vortex phase singularity into the beam under focusing.
In the scalar case, the result of focusing of a bounded plane wave with the first-order vortex phase singularity is given by The integral in ( 5) is normally expressed through a superposition of the Bessel and Struve functions [13] in the form which is inconvenient to analyze.Because of this, the integral in ( 5) can be expressed through a hypergeometric function [13] as follows: where Thus, (5) can be written as By retaining in (8) only the first two terms and equating the derivative to zero, the estimate of the focal ring radius is given by The calculation of the hypergeometric function (e.g., using Matlab) or the calculation of the integral in (8) gives a larger value of radius (Figure 1, dashed line) as follows: The focal spot size can also be made smaller by applying a narrow annular diaphragm allowing only the peripheral rays to be transmitted as follows: Advances in Optical Technologies In this case, the light ring radius is derived from the maximal value of the first-order Bessel function (Figure 1, solid line) as follows: The smaller size is attained at the sacrifice of the energy focused in the central ring (intensity is proportional to  2 ) and, hence, a lower intensity of the light barrier.Note, however, that the compactness of the light vortex may prove to be a more important parameter than its height.The above-described computations suggest that, with the aberration-free full-aperture focusing system (NA = 1), the scalar diffraction-limited radius of the light ring is   ≈ 0.39 (with the ring's internal radius defined by the same value at intensity FWHM), whereas, with the narrow annular aperture, we have   ≈ 0.29.
Thus, the diffraction-limited radius of the light ring is smaller than that of the light spot [14].
With high NA focusing systems, the achievement of the diffraction-limited size is problematic because different electric field components contribute to the focal region [4,5].This problem is addressed with the use of specially polarized beams, thus enabling the contribution of an individual component to be maximized.For example, with a radially polarized incident beam, the diffraction-limited focal spot size can be achieved with the aid of a narrow annular aperture [1,2] or using full-aperture transmission functions characterized by higher energy efficiency [3-7, 11, 14-16].The full-aperture apodization can be most easily implemented by complementing the focusing system with phase optical elements [3][4][5][14][15][16] and focusing higher-order laser modes [6,7].In the latter case, the intensity distribution at the system's output pupil can be represented as a polynomial.Below, we discuss the possibility of minimizing the transverse focal spot with the aid of low-order polynomials under the fulfillment of certain conditions, including the energy efficiency preservation and keeping the side-lobe level under control.

Model of a High NA Focusing System in Debye's Approximation
For a high NA focusing system, the near-focus vector electric field in a homogeneous dielectric medium can be described in Debye's approximation [17], which holds if the focal length of the optical system is much larger than the incident wavelength as follows: where (, , ) are the cylindrical coordinates in the focal region, (, ) are the spherical angular coordinates of the focusing system's output pupil, Θ is the maximal value of the azimuthal angle related to the system's NA, (, ) is the transmission function, () = √ cos  is the pupil's apodization function for aplanatic systems, P(, ) is the polarization matrix, and  is the focal length.Below, we analyze sharply focusing differently polarized laser beams with the transmission function (, ) defined by the following superposition: where   are the polynomial coefficients at a fixed value of vortex order .
With the above-defined incident beam, ( 13) can be essentially simplified for the widely used types of polarization.
Then, the vector electric field in the focal plane can be calculated using a 1D integral [4] as follows: where () = exp( cos ) √ cos  ∑  =0   sin +1 , with the form of Q  (, , ) depending on the input field polarization.

Optimization of the Transmission Function of the Focusing System
After choosing a definite type of polarization and the phase vortex order , the coefficients in (14) were optimized through minimizing a functional relating the output distribution intensity in (15) and a target function (, ) in the focusing region as follows: The target function is a superposition of functions that define the conditions which the focal intensity distribution Advances in Optical Technologies needs to satisfy.In particular, these conditions require that the intensity be localized in a definite region or imply a penalty if the normalized intensity becomes higher than a threshold on a certain interval.
For a fixed polar angle , the constituent functions of the functional (16) will only depend on the radius, making it possible to visually illustrate in which way the above-described conditions are imposed on the intensity distribution in (15), as seen in Figure 2. To achieve a definite trade-off between the minimal focal spot size and suppressed side lobes, it is required to impose the condition that the energy should be localized in a focal plane's circle of radius   and provide a definite level of side lobes, , in a ring of radii [  ,  Δ ].In the region of radius larger than  Δ , the intensity distribution is not controlled.
The functional ( 15) was minimized at a fixed value of the polar angle  using Brent's method [12].The method was implemented using the public-domain software from ALGLIB [18].[1][2][3][4]6] that the smallest transverse size of the focal spot is provided by a radially polarized incident beam, as in this case the contribution of the longitudinal component to the total intensity on the optical axis is maximal.

Minimizing the Focal Spot. It has been known
For the radially polarized wave, the field in (15), which depends on the laser beam's vortex component exp(), takes the (Cartesian) form, It can be seen from ( 17) that at  = 0 the maximal contribution to the total intensity on the optical axis is coming from the longitudinal component; however, the focal spot size is also affected by the transverse components which contribute to the near-axis region (Table 1, row 1).
The use of a narrow annular diaphragm [1,2] of the radius close to that of the focusing lens and the relative width  (normalized with respect to the aperture radius) minimizes the contribution of the transverse components, with the focal region intensity being defined only by the longitudinal component [7] as follows: whence it follows that the central focal spot radius is   ≈ 0, 38, which corresponds to the diffraction-limited size in (4).Its intensity is very low because the relative width of the annular aperture is  ≪ 1 (Table 1, row 2).Introducing a phase jump of  radian [19] along the middle radius of the narrow annular aperture (which is equivalent to the destructive interference of two beams) yields the intensity distribution generated by the (major) longitudinal component in the focal plane (NA = 1) as follows: From (20), it is seen that there is a nonzero intensity region at the focal spot center.The radius of the lowintensity light spot can be estimated using the Bessel function approximation as follows: By decreasing the annular diaphragm width, we can get a focal spot size smaller than (4): Note that (21) predicts an even smaller size for a larger diaphragm width; however, in this case the approximation used to derive the expression becomes invalid.Table 1 (row 3) shows the results of focusing with use of the annular diaphragms of width  = 0, 2 and a phase -jump introduced along the diaphragm's medium radius.The focal spot size is seen to become smaller, FWHM = 0.329, and, though the side lobes become higher, their intensity is smaller than 30% of the major peak.Such side lobes can be efficiently filtered out [20,21].We also note that for the side-lobe intensity which is lower than a threshold, their influence can be eliminated due to nonlinear interaction of light with the medium [22].
With a full-aperture transmission function with the power dependence on the radial coordinate, the longitudinal component takes the following form: Advances in Optical Technologies Then, the near-axis intensity is approximately given by [7]   0 (, ,  = 0) whence it follows that the central focal spot radius is determined by the power  as follows: With increasing , the central light spot radius will tend to   →  → ∞ / ≈ 0.32.Thus, with use of the full-aperture power function, a tighter focal spot than that with a narrow annular diaphragm (  ≈ 0.38) can be obtained, so that the resulting focal spot is of practically the same size as that obtained with the annular phase jump ( ℎ ≈ 0.38).
However, such an effect is achieved when the power  tends to infinity, actually resulting in the concentration of the incident beam energy on a peripheral lens portion.In this way, the power function serves as an approximation of the narrow annular diaphragm.
Below, a different approach to obtaining a tighter focal spot based on the destructive interference is analyzed.In this case, the coefficients in the superposition in (14) should be selected in a special way [3].
At  = 0 and in view of ( 14), (17) in the focal plane takes the following form: where it becomes evident that there is a similar dependence for the same-parity terms.In this case, the Zernike polynomials will be described by simpler relationships [7].
With only zero-and second-power terms of the radial polynomial retained, we obtain From (28), it is seen that the contribution of the longitudinal component to the focus ( = 0) is zero at  2 = −3 0 /2.Meanwhile, the contribution of the transverse components is zero at  2 = −11 0 /6.Thus, at  2 = −2 0 , the Zernike polynomial (2, 0) provides near-optimal maximization of the longitudinal component contribution [7].
With the analytical computations becoming too cumbersome for a larger number of terms, it appears more convenient to conduct the optimization of the coefficients using a numerical procedure under certain conditions.
In particular, the focal spot can be arbitrarily decreased in size based on a manyfold growth of the side lobes, with practically the entire energy being redirected from the focus [23,24] (Table 1, row 4).To achieve a trade-off between a smaller focal spot and the amount of light energy coming to the focus, we impose a condition that the energy should be concentrated in the focal plane within a circle of radius   and a desired level  of side lobes' intensity is provided within a ring with radii [  ,  Δ ] (Figure 2).The corresponding functional ( 16) was minimized by fitting the coefficients in ( 14) using Brent's method [12].Note that beyond the radius  Δ the intensity level is not controlled and can be essentially higher than that at the center.Alongside the energy loss, an increase in the region under control usually results in a larger focal spot.
Row 5 of Table 1 shows the focusing results for the full-aperture transmission function described by a thirdorder polynomial whose coefficients were chosen so as to concentrate the energy in a circle of radius   = 0.05, while keeping the side-lobe intensity in the ring of radii [0.05, 2] not higher than  = 0.3 of the focal spot intensity.In this case, the focal spot is smaller in size when compared with a focal spot obtained with a narrow annular aperture, whereas the focal intensity in the former case is two times higher.
In [6], it was proposed that the focal spot should be made smaller with the aid of the transmission function proportional to the Laguerre-Gauss (LG) modes.In this case, the focusing results depend on the waist radius and the LG mode radial number.Note that the use of a higher-order LG mode does not have to lead to a smaller focal spot.In particular, for  = /2, the focusing results were the following: FWHM = 0.437 (for the LG mode radial number  = 1), FWHM = 0.875 ( = 2), FWHM = 0.317 ( = 3, see Table 1, row 6), FWHM = 0.384 ( = 4), and FWHM = 0.475 ( = 5).
In this case, essential growth of the side lobes is also observed, with their intensity approaching  = 0.86 of the total maximum (Table 1, row 6).
With the LG modes, the result can be improved with the side-lobe level attaining  = 0.5 and the focal spot size remaining unchanged (FWHM = 0.313) by applying the optimization algorithm to the superposition of (14); see Table 1, row 7.

Minimizing the Light Spot Radius.
If the radially polarized incident wave contains first-order vortex components (|| = 1), the transverse components will take nonzero values on the optical axis, whereas the longitudinal component, on the contrary, will form an annular distribution pattern [7].Note that the contribution of the longitudinal component into the total intensity is comparable with that of the transverse components.It is most essential that this factor should be taken into consideration when dealing with the interaction of light with the media being selectively sensitive to the different electric field components [25].
The longitudinal wave component generates a distribution given by For the even-valued powers of sine ( = 2), we will use the formula [26] ) whereas for the odd-valued powers ( = 2+1), a more general formula needs to be used [26] as follows: ) . ( With no transmission functions whatsoever (which corresponds to  = 0), we have the following: 7/4 () . (32) If the Bessel function at small argument values is approximated by the right-hand side of (32) in the optical axis vicinity can be estimated as (4/21)(1 − (() 2 /11)).The shadow spot's radius can be derived by setting the derivative with respect to  equal to zero as follows: For the odd-valued powers , it is also possible to estimate the right-hand side of (31).For the near-axis region, we can write down and deduce the shadow spot's radius as follows: It can be seen from relation (36) that as the value of  increases, the shadow spot's radius decreases, tending to the diffraction-limited value However, based on the superposition in ( 14), a better result can be achieved for the polynomials at lower-order  due to mutual interference.
With a narrow annular diaphragm, (29) takes a simple form independent of the radial polynomials as follows: In this case, the shadow spot's radius is   ≈ 0.29.Thus, the spot is smaller in size than that obtained with a conventional full-aperture focusing system but larger than that achieved based on the use of the radial polynomial.
Table 2 gives the numerically simulated results for the radially polarized beam containing a first-order phase vortex.In this case, the parameter FWHM describes the shadow spot size as a full-width half-maximum intensity at the light barrier's inner walls.
Table 2 suggests that the presence of the transverse components is a hindrance to obtaining a "pure" result in the total intensity (Table 2, row 1).
When focusing light with the aid of a narrow annular diaphragm, the minimal light ring is achieved with the transverse components' contribution being minimal (Table 2, row 2).However, an essential energy loss is observed.
Based on the polynomials, a trade-off can be achieved between the decrease of the light ring's size and the energy loss in the central part: row 3 of Table 2 shows the result for a near-minimal ring with the 2.5 times intensity.
As a rule, a tighter focal spot is achieved by redistributing the energy to the peripheral lens part (approximation of a narrow annular slit) or destructive interference between the beams generated by different lens regions (similarly to a diaphragm with a phase jump, (19)).In the former case, obtaining a still narrower slit requires increasing the polynomial order.In the latter case, the lens pupil can be divided into radial zones, with low-order polynomials being considered for each zone.
The optimization results for a three-zone pupil function-a small-radius central (shielding) zone and two zones containing third-order polynomials-are shown in Table 2, row 4. In this case, the focal spot is being tightened more effectively but, as in the previous case, at the expense of higher side lobes, which were kept at a level of no higher than  = 0.9 on the radius  Δ = 1.2.Note that, for the shadow focal spot, the side lobes' effect is of lower significance when compared with the light spot because it is the region found within the light barrier which serves the useful purpose.

Azimuthal Polarization
For azimuthal polarization, the field in the focal plane ( = 0) takes the followimg form: whence it follows that the -component is zero at any .Upon sharp focusing, the electric field on the optical axis ( = 0) will contain nonzero components just for one singularity order, || = 1.For all the rest values, || ̸ = 1, an absolute intensity zero will be formed at the axis points, because all components will be equal to zero.Because of this, the azimuthal polarization is best suited for generating a shadow spot (Table 3, row 1).
At  = 0 and in view of ( 14), ( 38) is reduced to a compact form as follows: The integrals in (39) can be taken with the aid of hypergeometric functions whose analysis [7] has shown that, while depending on the power , the radius of the resulting light ring is limited by  min ≈ 0.26 (lower estimate).After introducing a narrow annular diaphragm, the integral in (39) takes a simple, polynomial-independent form as follows: that is, the light ring's radius is determined by the first maximum of the first-order Bessel function   ≈ 0.293, which corresponds to the scalar limit in (10) (Table 3, row 2).The intensity of the focal ring drops with a decrease in the annular diaphragm width .
Also note that introducing an additional -jump of phase along the medium radius of the diaphragm enables obtaining a smaller focal spot of FWHM = 0.329 at the penalty of further reduction of the ring intensity (Table 3, row 3).
Based on the superposition of ( 14), an attempt can be made to find a trade-off between attaining a smaller focal ring and the energy loss at the center.In particular, this can be implemented by keeping the side-lobe intensity relatively low when compared with the central focal ring intensity for a comparatively wide region (Table 3, row 4).
By allowing the side-lobe intensity to reach that of the central ring (Table 3, row 5), it becomes possible to break the diffraction limit, obtaining the central ring intensity 2 times the intensity obtained with a narrow annular diaphragm.
The central focal spot can be made still smaller only at the penalty of the considerable energy loss (Table 3, row 6).

Circular Polarization
The circularly polarized beams are attractive because they can be easily obtained from linearly polarized light emitted by the majority of laser sources.Unlike the previously discussed radially and azimuthally polarized beams, no complex or expensive devices are required to obtain them.
For a circularly polarized beam, there is a correlation between the polarization sign and the vortex order sign.If the vortex order || ≤ 2 has the opposite sign to the polarization sign, there will be nonzero components on the optical axis ( = 0).This factor will have an adverse effect on an attempt to obtain a circular distribution of the total intensity.For certainty, "+"-sign circular polarization is analyzed below.Then, at  = 0 and  = −2, the transverse components will make a nonzero contribution to the intensity on the optical axis, whereas the longitudinal component will form a diffraction ring.We note that at  = −2 the transverse components' energy on the axis is lower than it is at  = 0.At  = −1, the situation is the opposite, but for the high NA focusing systems, the contribution from the longitudinal components is normally large.Because of this, for a shadow spot to be formed, the variant with  = −2 is better-suited.Then, the field in the focal region takes the following form: where  =  sin .With a narrow annular diaphragm employed for the longitudinal component, the result will be the same as in (36).However, the contribution of the nonzero longitudinal components in the focus also becomes significant.As a result, in terms of the total intensity, the ring becomes fuzzier in structure than in the case of the full NA system, as seen from the comparison of row 1 and row 2 of Table 4.The use of the optimization procedure improves the situation just insignificantly when compared with the situation when no additional apodization of the focusing system's output pupil is used (Table 4, row 3).
Thus, an attempt to obtain a clearly observed light ring in terms of the total intensity has proved to be unsuccessful.
Because of this, we analyze a circularly polarized "+"-wave with the first-order optical vortex of the same sign as follows: where  =  sin .
From (42), it is seen that, in the general case, a smaller diffraction ring will be formed for the transverse components in comparison with that formed for the longitudinal component (Table 5).Hence, in this case, the use of a narrow annular diaphragm is inefficient, only leading to the extraction of the longitudinal component (Table 5, row 2).
The use of the superposition of ( 14), with the coefficients calculated so as to minimize the light ring, enables the light ring size to be decreased in terms of the total intensity (see Table 5, row 3), but the results are not better than those obtained for the azimuthally polarized beam.
Thus, for widely used and easy to realize polarization types (circular or linear), the optimal results can be only achieved for individual electric field components-the longitudinal or transverse ones.In the total intensity, the optimization effect is deteriorated owing to the combined contribution of different components.In this case, the contribution of the undesirable component can be eliminated with the use of materials selectively sensitive to different electric field components [25].

Conclusion
We have looked into the ways of minimizing the light/shadow focal spot size for differently polarized incident beams under the controlled side-lobe growth by means of an optimal apodization of the focusing system.The apodization that has been proposed involves the introduction of a vortex phase dependence on the angle and the polynomial amplitude dependence on the radius.The optimization of the polynomial coefficients has been performed using Brent's method with the aim of preserving the energy efficiency and keeping the side lobes at desired level.The study has shown that the radial polarization can be efficiently used for generating a tighter focal spot and the azimuthal polarization-for the light ring.
In this case, the diffraction-limited size predicted by the scalar theory is achieved by merely introducing a narrow annular diaphragm shielding off the central part of the focusing system.The efficiency can be enhanced with the use of a wider annular diaphragm with a radial phase jump.With such optical element, it is also possible to obtain a tighter focal spot than that predicted by the scalar theory of diffraction.
The only way to make the light/shadow focal spot essentially smaller is by redirecting the energy to the side lobes.In this case, the growth of the side lobes in a near-focus region can be controlled by means of the amplitude distribution defined by a radial polynomial, which can be optimized, for example, using Brent's method.
The achievement of a trade-off between obtaining a smaller focal spot and retaining the energy in the center depends on the area of the focal region under control.Beyond the region under control, the intensity level can be essentially higher than that in the center.However, alongside the lower energy loss, the expansion of the region under control usually results in a larger focal spot.Thus, the results obtained are in full agreement with the theory of Toraldo di Francia.
For widely used and easy to realize polarization types, in particular, circularly and linearly polarized beams, analogous results can be only achieved for individual electric field components-the longitudinal or transverse ones.In the total intensity, the optimization effect is deteriorated owing to the combined contribution of different components.In this case, the contribution of the undesirable component can be eliminated with the use of materials selectively sensitive to different electric field components.

Figure 2 :
Figure 2: Illustration of the target function in the focal plane at a fixed value of the polar angle .