3

The Uncertainty Principle in Optics

3.1 Approximate Derivation of the Uncertainty Principle

Heisenberg’s uncertainty principle (Heisenberg 1927) is of fundamental importance to optics and to laser optics in particular. Here, optical arguments are applied to outline an approximate derivation of the uncertainty principle.

3.1.1 The Wave Character of Particles

The quantum energy of a particle is given by the well-known quantum energy equation

E=hv(3.1)

where:

h is Planck’s constant

Equating this to the relativistic energy E = mc2 and using the identity λ = c/ν, an expression for the momentum of a particle can be given as

p=hλ(3.2)

which, using the identity

k=2πλ(3.3)

can be restated as

p=ħk(3.4)

This momentum equation was applied to particles, such as electrons, by de Broglie (Haken 1981). Thus, wave properties such as frequency and wavelength were attributed to the motion of particles.

3.1.2 The Diffraction Identity and the Uncertainty Principle

In his discussion of position and momentum, Feynman considers the relative uncertainty in the wavelength that can be measured with a given grating (Feynman et al., 1965). Using such approach, he arrives at an expression for the resolving power of a diffraction grating: Δλ/λ. His discussion is then extended to include the length of a wave train in order to derive an identity for Δλ/λ2. Here, the origin of this expression is examined from an interferometric perspective.

The generalized one-dimensional interferometric equation derived using Dirac’s notation is given by (Duarte 1991)

|x|s|2=j=1NΨ(rj)2+2j=1NΨ(rj)[m=j+1NΨ(rm)cos(Ωm-Ωj)](3.5)

The interference term in this equation can be expressed as (Duarte 1997)

cos[(θm-θj)±(ϕm-ϕj)]=cos(|lm-lm-1|k1±|Lm-Lm-1|k2)(3.6)

where:

k1=2πn1λv

and

k2=2πn2λv

are the wave numbers for the two optical regions, defined in Figure 3.1 (see Chapter 2 for further details). Also, |lm - lm-1| and |Lm - Lm-1| are the corresponding path differences. Since it can be shown that maxima can occur at

(|lm-lm-1|n1±|Lm-Lm-1|n2)2πλv=Mπ(3.7)

where:

M = 0, ±2, ±4, ±6, …

it can be shown that

dm(n1sinΘm±n2sinΦm)2πλv=Mπ(3.8)

which, for n1 = n2 = 1 and λ = λv, reduces to

dm(sinΘm±sinΦm)k=Mπ(3.9)

that can be expressed as the well-known grating equation

dm(sinΘm±sinΦm)=mλ(3.10)

where:

m = 0, ±1, ±2, ±3, ...

Images

FIGURE 3.1 Boundary at a transmission grating depicting corresponding path differences. Further details on the geometry of the transmission grating are given in Chapter 2.

For a grating deployed in the reflection domain, and at Littrow configuration, Θm = Φm = Θ so that the grating equation reduces to

2dsinΘ=mλ(3.11)

Using this equation, one can consider an expanded light beam incident on a reflection grating as illustrated in Figure 3.2. For an infinitesimal change in wavelength

λ1=2dm(Δx1l)(3.12)

λ2=2dm(Δx2l)(3.13)

where:

l is the grating length

Δx is the path difference

From the geometry of Figure 3.2 and the corresponding grating equation for the Littrow configuration (Equation 3.11),

2dm=lλΔx(3.14)

Images

FIGURE 3.2 Path differences in a diffraction grating of the reflective class in Littrow configuration. From the geometry, sin Θ = (Δx/l).

Then, it follows that

Δλ=lλΔx(Δx1-Δx2l)(3.15)

In order to distinguish between a maximum and a minimum, the difference in path differences should be equal to a single wavelength, that is,

(Δx1-Δx2)λ(3.16)

Hence, Equation 3.15 reduces to the well-known diffraction identity

Δλλ2Δx(3.17)

which in turn leads to

ΔvcΔx(3.18)

Assuming a grating composed of uniform and equally spaced slits of width d, separated themselves by a distance d, the total length of the grating can be stated as l ≈ 2Nd, where N is the total number of slits. Hence, the path difference can be written as ΔxmNλ so that Equation 3.17 can also be expressed as

Δλλ21mNλ(3.19)

and multiplication by λ yields

Δλλ1mN(3.20)

which is known as the resolving power of a diffraction grating. Now, considering p = ħk for two slightly different wavelengths leads to

p1-p2=h(λ1-λ2)λ1λ2(3.21)

and since the difference between λ1 and λ2 is infinitesimal, this equation can be restated as

ΔphΔλλ2(3.22)

Substitution of Equation 3.22 into 3.17 leads to

ΔpΔxh(3.23)

which is known as Heisenberg’s uncertainty principle (Dirac 1978). This approach offers a geometrical perspective on the foundations of the uncertainty principle. It should also be noted that many authors also express the uncertainty principle in its minimum form where the right-hand side is given by ħ/2 rather than h:

ΔpΔxh2(3.24)

This minimal version of the uncertainty principle, in addition to a generalized version, is discussed in Duarte (2014).

3.1.3 Alternative Versions of the Uncertainty Principle

In addition to Δp Δxh, the uncertainty principle can be expressed in various useful versions. Assuming an independent derivation, and using p = ħk, it can be expressed in its wavelength-spatial form:

ΔλΔxλ2(3.25)

which can also be stated in its frequency-spatial version:

ΔvΔxc(3.26)

which are Equations 3.17 and 3.18 in an alternative format. Using E = mc2, the uncertainty principle can also be written in its energy–time form:

ΔEΔth(3.27)

which, using the quantum energy E = hν, can be transformed to its frequency–time version:

ΔvΔt1(3.28)

This result is very important in the area of pulsed lasers and implies that a laser emitting a pulse of a given duration Δt has a minimum spectral linewidth Δν. It also implies that by measuring the width of the spectral emission the duration of that pulse can be determined. The time

Δt1Δv(3.29)

is also known as the coherence time. From this time, the coherence length can be defined as

ΔxcΔv(3.30)

which is an alternative form of Equation 3.26. This equation has gigantic physical implications as it illustrates the fact that a single photon, with an infinitesimal narrow linewidth Δν, can have an enormous coherence length.

3.2 Applications of the Uncertainty Principle in Optics

In this section, some useful applications of the uncertainty principle in beam propagation and intracavity optics are considered.

3.2.1 Beam Divergence

If the uncertainty principle is assumed to be derived from independent and rigorous methods, then it can be used to derive some useful identities in optics. For example, starting from

ΔpΔxh

the application of p = ħk yields

ΔkΔx2π(3.31)

which leads directly to

Δλλ2Δx

For a diffraction-limited beam traveling in the z-direction

kx=ksinθ(3.32)

and/or a very small angle θ, we can write

kxkθ(3.33)

Using ΔkxΔx ≈ 2π, it is readily seen that the beam has an angular divergence given by

ΔθλΔx(3.34)

This equation indicates that the angular spread of a propagating beam of wavelength λ is inversely proportional to its original width, that is, narrower beams exhibit a larger angular spread or divergence (see Figure 3.3). Also it states that the light of shorter wavelength experiences less beam divergence, which is a well-known experimental fact in laser physics (see Figure 3.4).

Images

FIGURE 3.3 Beam divergence for two different apertures at wavelength λ. (a) An expanded laser beam is incident on a microhole of diameter 2w. (b) The same laser beam is incident on a microhole of diameter 4w.

Images

FIGURE 3.4 Beam divergence for (a) λ = 450 nm and (b) λ = 650 nm. In both cases, an expanded laser beam is incident on identical microholes of diameter 2w.

Equation 3.34 has the same form of the classical equation for beam divergence (Duarte 1990):

Δθ=λπw[1+(LRB)2+(ALRB)2]1/2(3.35)

where:

w is the beam waist

L = (πw2/λ) is the Rayleigh length

A and B are spatial propagation parameters defined in Chapter 6

For well-chosen experimental conditions, the second and third terms in the parentheses of Equation 3.35 approach zero so that

Δθλπw(3.36)

and the beam divergence is said to approach its diffraction limit. The equivalence of Equations 3.34 and 3.36 is self evident.

The generalized interference equation

|x|s|2=j=1NΨ(rj)2+2j=1NΨ(rj)[m=j+1NΨ(rm)cos(Ωm-Ωj)]

was previously used to derive Δλ ≈ λ2x, which eventually leads to ΔpΔxh and subsequently leads to an expression to the diffraction limited beam divergence.

3.2.1.1 Example

Here, we estimate the divergence of a coherent beam as it propagates in space using the one-dimensional generalized interferometric equation. For instance, at λ = 632.82 nm, a beam with an original dimension of 2w0 = 200 μm can be estimated via Equation 3.5 to spread to 2w ≈ 1.89 mm (full width at half maximum [FWHM]) following a propagation of 0.5 m using an approximate graphical method (see Figure 3.5). The same beam can be estimated to increase further to 2w ≈ 18.9 mm (FWHM) following a propagation of 5 m. Using the dimensions of the beam waist w, and simple geometry, this yields an approximate beam divergence of Δθ ≈ 1.9 mrad. Alternatively, using Equation 3.36, the diffraction-limited beam divergence can be determined directly (for w0 100 nm and λ = 632.82 nm) as Δθ ≈ 2.01 mrad.

3.2.2 Beam Divergence and Astronomy

One application of the beam divergence equation relates to the angular resolution limit of telescopes used in astronomical observations. Reflection telescopes such as the Newtonian and Cassegrain telescopes are depicted in Figure 3.6 and discussed further in Chapter 6. The angular resolution that can be accomplished with these

Images

FIGURE 3.5 Beam divergence determined from the generalized interference equation. (a) Beam profile following propagation through a distance of 0.5 m. (b) Beam profile following propagation through a distance of 5 m. Here, w0 = 100 μm and λ = 632.82 nm. Using the difference in beam waist w over the propagation distance, the beam divergence is determined to be Δθ ≈ 1.9 mrad.

telescopes under ideal conditions is approximately described by the diffraction limit given in Equation 3.36. That is, the smallest angular discrimination, or resolution limit, of a telescope with a diameter D = 2w is given by

Δθ2λπD(3.37)

Images

FIGURE 3.6 Reflection telescopes used in astronomical observations: (a) Newtonian telescope; (b) Cassegranian telescope.

This equation indicates that the two avenues to increase the angular resolution of a telescope are either to observe at shorter wavelengths or to increase the diameter of the telescope. The latter option has been adopted by optical designers that have built very large telescopes. At λ = 500 nm, the angular resolution for a telescope, with D = 10 m, is Δθ ≈ 3.18 × 10-8 rad. For a diameter of D = 100 m, the angular resolution at the same wavelength becomes Δθ ≈ 3.18 × 10-9 rad. Furthermore, for an hypothetical diameter of D = (1000/π) m, the angular resolution at λ = 500 nm becomes Δθ ≈ 1.00 × 10-9 rad. In addition to better angular resolutions, large aperture telescopes provide increased signal since the area of collection increases substantially. In future, these ultralarge telescopes might become important in the search of exoplanets and in the study of fundamental physics phenomena during the very early stages of the universe.

Albeit great progress has been made in the construction of very large segmented mirrors, it might be a long time before telescopes of the extraordinary diameters mentioned here are built. In the meantime, interferometric telescopic arrays (see Chapter 11) can be used to significantly increase the light collection area and resolving power of individual telescopes. Such an array is the Very Large Telescope located at the Atacama Desert in northern Chile, which is composed of four telescopes, each telescope with a diameter of 8.2 m. As mentioned in Chapter 11, the first interferometric telescope was the Hanbury Brown–Twiss interferometer that, as emphasized by Feynman, is indeed a quantum optics instrument. We should also keep in mind that, as explicitly explained in this chapter, the angular resolution of telescopes can be explained either via quantum interferometric principles or via the uncertainty principle itself. Thus, it should not be surprising to see other quantum concepts adopted in the improvement of telescopic resolution.

3.2.2.1 Laser Guide Star

An important feature of large and very large terrestrial telescopes is the use of laser guide star techniques, in conjunction with adaptive optics, to correct for atmospheric turbulence. Atmospheric turbulence can seriously affect the optical homogeneity of the transmission medium as described and discussed in Chapter 10 from an interferometric perspective.

In guide star applications, the laser is tuned to the well-known atomic sodium 32S1/2 - 32P3/2 transition at λ = 588.9963 nm (also known as the sodium D2 line; Mitchel and Zemansky 1971) and its beam is projected 90–100 km above the planetary surface. Emission from the sodium guide star induced by the illumination from the laser is collected by a Cassegrain telescope and used to characterize the atmospheric-induced distortions resulting from turbulence. The spatial information thus collected is used to drive the adaptive optics and compensate for atmospheric turbulence. In this application, three most important parameters of the laser are optical power, laser linewidth (Δν), and beam divergence (Δθ). More specifically, the laser should yield a diffraction-limited beam and single-longitudinal-mode emission (see Chapter 7).

Laser development for guide star applications originated with work on high-power tunable dye lasers (Everett 1989; Primmerman et al. 1991) and includes both pulsed and continuous-wave (CW) tunable lasers (Bass et al. 1992; Pique and Farinotti 2003). Alternatives to direct emission tunable organic dye lasers are frequency-doubled diode-pumped fiber Raman lasers (Feng et al. 2009; Henry et al. 2013).

The use of Amici prisms, or compounds prisms, for atmospheric dispersion corrections in astronomical applications is discussed in Chapter 4.

3.2.2.2 Example

Consider the propagation of a diffraction-limited laser beam over a distance of 95 km at λ = 589 nm for guide star applications in astronomy. A multiple-prism grating tunable laser oscillator (Duarte and Piper 1984; Duarte 1999) can be designed to provide single-longitudinal-mode oscillation at λ ≈ 589 nm in a laser with a beam divergence, which is nearly diffraction limited. Thus, its divergence is characterized by Δθ ≈ (λ/πw). For a beam waist w ≈ 125 μm, the corresponding beam divergence becomes Δθ ≈ 1.5 mrad. If the beam of this oscillator were to be propagated over a distance of 95 km, it would illuminate a circle of nearly 285 m in diameter. This would yield a very weak power density. If, however, the beam is expanded without introducing further divergence by a factor of M ≈ 200, then the new beam divergence becomes Δθ ≈ (λ/Mπw). Now, as a consequence of the beam expansion, the beam diameter at the required propagation distance becomes about 1.42 m, which is close to the dimensions needed to achieve the necessary power densities with existing high-power tunable lasers. It should be mentioned that the beam magnification factors of about 200 are not difficult to attain. Further, for this type of application, the narrow-linewidth oscillator emission is augmented at amplifier stages, where the beam also increases its dimensions, thus reducing the requirements on M.

3.3 The Interferometric Equation and the Uncertainty Principle

Previously, it has been shown that from the Dirac principle

x|s=j=1Nx|jj|s

the generalized interferometric equation

|x|s|2=j=1NΨ(rj)2+2j=1NΨ(rj)[m=j+1NΨ(rm)cos(Ωm-Ωj)]

can be obtained (Duarte 1991, 1993). This equation, originally derived while considering single-photon propagation (Duarte 1991), has also been shown to accurately predict interferometric distributions generated by the interaction of ensembles of indistinguishable photons with N-slit arrays (Duarte 1993). Furthermore, as shown by Duarte (1992), and explained in Chapter 2, the phase term of this generalized interferometric equation can be used to derive the cavity linewidth equation:

ΔλΔθ(λθ)-1

where:

Δθ is the beam divergence

λθ = (∂θ/∂θ), which is the intracavity angular dispersion

This equation has been used extensively to determine the emission linewidth in pulsed narrow-linewidth dispersive laser oscillators (Duarte 2003). As shown earlier in this chapter, the beam divergence term

Δθλπw

can be distilled from the uncertainty principle

ΔpΔxh

which itself can also be obtained in an approximate approach, starting from the phase term of the interferometric equation, thus emphasizing a fundamental connection between these two principles. Besides, as shown by example, an estimate of Δθ can be obtained directly from the interferometric distributions provided via the interferometric equation. The phase term of the interferometric equation is in itself created by the interaction of probability amplitudes via the mechanics of the basic Dirac quantum principle

x|s=j=1Nx|jj|s

where the probability amplitudes are represented by wave functions of ordinary wave optics (Dirac 1978):

j|s=Ψ(rj,s)e-iθj

x|j=Ψ(rx,j)e-iϕj

This exposition indicates that interference, as described by the Dirac notation, has a central and crucial position at the foundations of quantum mechanics and its significance might even be deeper than the significance of the uncertainty principle itself. The importance of interference to quantum mechanical principles was first highlighted by Dirac (1978) and then by Feynman et al. (1965).

3.3.1 Quantum Cryptography

Quantum secure communications and quantum cryptography are subjects of intense research activity. In this regard, it should be mentioned that Heisenberg’s uncertainty principle is often associated with the essence of quantum cryptography (Bennett et al. 1992).

An alternative approach to describe the mechanics of quantum cryptography is to use the probability amplitude of entangled polarizations of quanta traveling in opposite directions (Ekert 1991). The probability amplitude for entangled polarizations of quanta traveling in opposite directions was first introduced in the form of (Ward 1949)

|s=(|x1|y2-|y1|x2)(3.38)

which following straightforward normalization becomes

|Ψ-=12(|x1|y2-|y1|x2)(3.39)

which is the ubiquitous probability amplitude depicting quantum entanglement given in numerous research papers and textbooks (see, e.g., Mandel and Wolf 1995). This probability amplitude can be derived via Ward’s argument (Ward 1949), a Hamiltonian approach (Duarte 2014), or via a neat and transparent approach from an N-slit interferometric perspective (Duarte 2013, 2014) again utilizing the Dirac principle:

x|s=j=1Nx|jj|s

Briefly, the probability amplitude for a single photon to propagate from a source s to the interferometric plane configured by a detector d via apertures 1 and 2 is given by 〈d | s

d|s=d|22|s+d|11|s(3.40)

Images

FIGURE 3.7 Generic experimental configuration of a source emitting two indistinguishable quanta traveling in opposite directions with entangled polarizations. This experimental arrangement is a simplified rendition of the original configuration introduced by Pryce and Ward.

For an experiment with a central photon source emitting indistinguishable quanta toward identical detectors (d1 = d2 d) positioned in opposite directions via polarization analyzers p1 and p2 (see Figure 3.7), the corresponding probability amplitude can be described as

d|s=d|p2p2|s+d|p1p1|s(3.41)

In abstract form, this probability amplitude can be expressed as

|s=|p2p2|s+|p1p1|s(3.42)

Defining

|C1=|p1p1|s(3.43)

and

|C2=|p2p2|s(3.44)

Equation 3.42 can be expressed as

|s=|C2+|C1(3.45)

Using the Dirac identity applicable to the state of several similar particles

|X=|a1|b2|c3...|gn(3.46)

for quanta 1 and quanta 2, we can write

|C2=|x1|y2(3.47)

and

|C1=|y1|x2(3.48)

Substituting back in Equation 3.45, and following normalization, leads to

|Ψ+=12(|x1|y2+|y1|x2)(3.49)

and its linear combination

|Ψ-=12(|x1|y2-|y1|x2)

An extension of the Ward probability amplitude is the probability amplitude for n entangled particles propagating in two different directions (Duarte 2013):

|Ψ=12(|a1|b2|c3...|gn±|a1|b2|c3...|gn)(3.50)

For N path alternatives, the overall probability amplitude is given by

|s=j=1N|CN+1-j(3.51)

where the first term of the path probability amplitude series can be written as

CN+1-j=m=1n|γm(3.52)

These are a series of probability amplitudes starting at CN+1−j, with j = 1, and ending at C1, which is reached when j = N. Here, n is the total number of quanta, which is an even number since quanta participate in pairs. For each pair (1, 2), (3, 4)...(n - 1, n), γ represents a pair of orthogonal polarization alternatives such as x and y, and ϕ and ϕ′.

For an even number of paths, the normalized probability amplitudes, designated by CR, where R is a Roman numeral, have the form:

CR=1Nm=1N(±)Cm(3.53)

However, for N = 3,

CI=13(C3+C2+iC1)(3.54)

CII=13(C3-C2+iC1)(3.55)

CIII=13(iC3+jC2+kC1)(3.56)

where:

i, j, and k are defined as Hamilton’s quaternions

A detailed discussion of the Ward probability amplitude for entangled quanta and its corresponding experimental arrangement is given by Duarte (2014).

3.3.1.1 Example

For four particles (n = 4), that is, two pairs, and four paths (N = 4), as illustrated in Figure 3.8,

|s=|C4+|C3+|C2+|C1(3.57)

or

|s=|C1+|C2+|C3+|C4(3.58)

so that

|s=|x1|y2|(ϕ3|(ϕ4+|y1|x2|(ϕ3|(ϕ4+|(ϕ1|(ϕ2|x3|y4+|(ϕ1|(ϕ2|y3|x4(3.59)

Images

FIGURE 3.8 Experimental configuration of a source emitting two pairs (n = 4) of indistinguishable quanta traveling in four different directions (N = 4) with entangled polarizations. In the theoretical development, it is assumed that all detectors are identical (d1 = d2 = d3 = d4 = d).

C1=|x1|y2|(ϕ3|(ϕ4(3.60)

C2=|y1|x2|ϕ3|ϕ4(3.61)

C3=|ϕ1|ϕ2|x3|y4(3.62)

C4=|ϕ1|ϕ2|y3|x4(3.63)

The normalization condition

1=|CI|2+|CII|2+|CIII|2+|CW|2(3.64)

requires that

CI=14(C1+C2+C3+C4)(3.65)

CII=14(C1+C2-C3-C4)(3.66)

CIII=14(C1-C2+C3-C4)(3.67)

CW=14(C1-C2-C3+C4)(3.68)

which lead to the following explicit normalized probability amplitudes

|Ψ=14(|x1|y2|ϕ3|ϕ4+|y1|x2|ϕ3|ϕ4+|(ϕ1|(ϕ2|x3|y4+|(ϕ1|(ϕ2|y3|x4)(3.69)

|Ψ=14(|x1|y2|(ϕ3|(ϕ4+|y1|x2|(ϕ3|(ϕ4-|ϕ1|ϕ2|x3|y4-|ϕ1|ϕ2|y3|x4)(3.70)

|Ψ=14(|x1|y2|(ϕ3|(ϕ4-|y1|x2|(ϕ3|(ϕ4+|ϕ1|ϕ2|x3|y4-|ϕ1|ϕ2|y3|x4)(3.71)

|Ψ=14(|x1|y2|ϕ3|ϕ4-|y1|x2|ϕ3|ϕ4-|(ϕ1|(ϕ2|x3|y4+|(ϕ1|(ϕ2|y3|x4)(3.72)

This is a complete symmetrical configuration where the number of path alternatives (N = 4) is matched by the number of particles (n = 4), which corresponds to two pairs of particles propagating via two pairs of alternative paths. Once this mechanics is established, one can readily extend the multiparticle probability amplitude to other similar higher symmetrical configurations.

Problems

3.1 Calculate the diffraction-limited beam divergence at FWHM for (1) a laser beam with a w = 100 μm radius at λ = 590 nm and (2) a laser beam with a w = 500 μm radius at λ = 590 nm.

3.2 Repeat the calculations of Problem 3.1 for λ = 308 nm. Comment.

3.3 Calculate the single-pass dispersive cavity linewidth for a high-power tunable laser yielding a diffraction-limited beam divergence, w = 100 μm in radius, at λ = 590 nm. Assume that an appropriate beam expander illuminates a 3300 lines/mm grating deployed in the first order. The grating has a 50 mm length perpendicular to the grooves.

3.4 (a) For a high-power pulsed laser delivering a Δν ≈ 350 MHz laser line-width, estimate its shortest possible pulse width at FWHM. (b) For a laser emitting Δt ≈ 10 fs pulses, estimate its broadest possible spectral width in nanometers centered around λ = 600 nm.

3.5 Show that Equation 3.34 follows from Equation 3.33.

3.6 Estimate the angular resolution of a single 8.2 m diameter telescope at λ = 500 m.

3.7 Write explicit normalized probability amplitudes for N = 3 propagation channels and n = 6 quanta, that is, three pairs of photons with entangled polarizations.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.146.37.35