β

Casimir effect

This article's lead section may be too long for the length of the article. Please help by moving some material from it into the body of the article. Please read the layout guide and lead section guidelines to ensure the section will still be inclusive of all essential details. Please discuss this issue on the article's talk page.(March 2017)

The typical example is of the two uncharged conductive plates in a vacuum, placed a few nanometers apart. In a classical description, the lack of an external field means that there is no field between the plates, and no force would be measured between them.[1] When this field is instead studied using the quantum electrodynamic vacuum, it is seen that the plates do affect the virtual photons which constitute the field, and generate a net force[2] – either an attraction or a repulsion depending on the specific arrangement of the two plates. Although the Casimir effect can be expressed in terms of virtual particles interacting with the objects, it is best described and more easily calculated in terms of the zero-point energy of a quantized field in the intervening space between the objects. This force has been measured and is a striking example of an effect captured formally by second quantization.[3][4]

The treatment of boundary conditions in these calculations has led to some controversy. In fact, "Casimir's original goal was to compute the van der Waals force between polarizable molecules" of the conductive plates. Thus it can be interpreted without any reference to the zero-point energy (vacuum energy) of quantum fields.[5]

Because the strength of the force falls off rapidly with distance, it is measurable only when the distance between the objects is extremely small. On a submicron scale, this force becomes so strong that it becomes the dominant force between uncharged conductors. In fact, at separations of 10 nm – about 100 times the typical size of an atom – the Casimir effect produces the equivalent of about 1 atmosphere of pressure (the precise value depending on surface geometry and other factors).[6]

Contents

The Casimir effect can be understood by the idea that the presence of conducting metals and dielectrics alters the vacuum expectation value of the energy of the second quantized electromagnetic field.[7][8] Since the value of this energy depends on the shapes and positions of the conductors and dielectrics, the Casimir effect manifests itself as a force between such objects.

A water wave analogue of the Casimir effect. Two parallel plates are submerged into colored water contained in a sonicator. When the sonicator is turned on, waves are excited imitating vacuum fluctuations; as a result, the plates are forced together.

Any medium supporting oscillations has an analogue of the Casimir effect. For example, beads on a string[9][10] as well as plates submerged in noisy water[11] or gas[12] illustrate the Casimir force.

Dutch physicists Hendrik Casimir and Dirk Polder at Philips Research Labs proposed the existence of a force between two polarizable atoms and between such an atom and a conducting plate in 1947, this special form is called the Casimir–Polder force. After a conversation with Niels Bohr, who suggested it had something to do with zero-point energy, Casimir alone formulated the theory predicting a force between neutral conducting plates in 1948 which is called the Casimir effect in the narrow sense.

Predictions of the force were later extended to finite-conductivity metals and dielectrics, and recent calculations have considered more general geometries. Experiments before 1997 had observed the force qualitatively, and indirect validation of the predicted Casimir energy had been made by measuring the thickness of liquid helium films. However it was not until 1997 that a direct experiment by S. Lamoreaux quantitatively measured the force to within 15% of the value predicted by the theory,.[14] Subsequent experiments approach an accuracy of a few percent.

The causes of the Casimir effect are described by quantum field theory, which states that all of the various fundamental fields, such as the electromagnetic field, must be quantized at each and every point in space. In a simplified view, a "field" in physics may be envisioned as if space were filled with interconnected vibrating balls and springs, and the strength of the field can be visualized as the displacement of a ball from its rest position. Vibrations in this field propagate and are governed by the appropriate wave equation for the particular field in question. The second quantization of quantum field theory requires that each such ball-spring combination be quantized, that is, that the strength of the field be quantized at each point in space. At the most basic level, the field at each point in space is a simple harmonic oscillator, and its quantization places a quantum harmonic oscillator at each point. Excitations of the field correspond to the elementary particles of particle physics. However, even the vacuum has a vastly complex structure, so all calculations of quantum field theory must be made in relation to this model of the vacuum.

The vacuum has, implicitly, all of the properties that a particle may have: spin, or polarization in the case of light, energy, and so on. On average, most of these properties cancel out: the vacuum is, after all, "empty" in this sense. One important exception is the vacuum energy or the vacuum expectation value of the energy. The quantization of a simple harmonic oscillator states that the lowest possible energy or zero-point energy that such an oscillator may have is

Summing over all possible oscillators at all points in space gives an infinite quantity. Since only differences in energy are physically measurable (with the notable exception of gravitation, which remains beyond the scope of quantum field theory), this infinity may be considered a feature of the mathematics rather than of the physics. This argument is the underpinning of the theory of renormalization. Dealing with infinite quantities in this way was a cause of widespread unease among quantum field theorists before the development in the 1970s of the renormalization group, a mathematical formalism for scale transformations that provides a natural basis for the process.

When the scope of the physics is widened to include gravity, the interpretation of this formally infinite quantity remains problematic. There is currently no compelling explanation as to why it should not result in a cosmological constant that is many orders of magnitude larger than observed.[15] However, since we do not yet have any fully coherent quantum theory of gravity, there is likewise no compelling reason as to why it should.[16]

Alternatively, a 2005 paper by Robert Jaffe of MIT states that "Casimir effects can be formulated and Casimir forces can be computed without reference to zero-point energies. They are relativistic, quantum forces between charges and currents. The Casimir force (per unit area) between parallel plates vanishes as alpha, the fine structure constant, goes to zero, and the standard result, which appears to be independent of alpha, corresponds to the alpha approaching infinity limit," and that "The Casimir force is simply the (relativistic, retarded) van der Waals force between the metal plates."[17] Casimir and Polder's original paper used this method to derive the Casimir-Polder force. In 1978, Schwinger, DeRadd, and Milton published a similar derivation for the Casimir Effect between two parallel plates.[18] In fact, the description in terms of van der Waals forces is the only correct description from the fundamental microscopic perspective,[19][20] while other descriptions of Casimir force are merely effective macroscopic descriptions.

A third way of understanding Casimir forces has been suggested, based on canonical macroscopic quantum electrodynamics. In this interpretation, there exists a ground (vacuum) state of the coupled system of matter and fields, which determines the ground-state properties of the electromagnetic field, giving rise to a force. The Casimir force is fundamentally a property of the coupled system of matter and fields, in which the interaction between the plates is mediated by the zero-point fields. In more traditional interpretations, however, the emphasis has fallen either on the electromagnetic field or the fluctuating material in the plates.[21]

Casimir's observation was that the second-quantized quantum electromagnetic field, in the presence of bulk bodies such as metals or dielectrics, must obey the same boundary conditions that the classical electromagnetic field must obey. In particular, this affects the calculation of the vacuum energy in the presence of a conductor or dielectric.

Consider, for example, the calculation of the vacuum expectation value of the electromagnetic field inside a metal cavity, such as, for example, a radar cavity or a microwavewaveguide. In this case, the correct way to find the zero-point energy of the field is to sum the energies of the standing waves of the cavity. To each and every possible standing wave corresponds an energy; say the energy of the nth standing wave is En{\displaystyle E_{n}}. The vacuum expectation value of the energy of the electromagnetic field in the cavity is then

with the sum running over all possible values of n enumerating the standing waves. The factor of 1/2 is present because the zero-point energy of the n'th mode is En/2{\displaystyle E_{n}/2}, where En{\displaystyle E_{n}} is the energy increment for the n'th mode. (It is the same 1/2 as appears in the equation E=ℏω/2{\displaystyle E=\hbar \omega /2}.) Written in this way, this sum is clearly divergent; however, it can be used to create finite expressions.

In particular, one may ask how the zero-point energy depends on the shape s of the cavity. Each energy level En{\displaystyle E_{n}} depends on the shape, and so one should write En(s){\displaystyle E_{n}(s)} for the energy level, and ⟨E(s)⟩{\displaystyle \langle E(s)\rangle } for the vacuum expectation value. At this point comes an important observation: the force at point p on the wall of the cavity is equal to the change in the vacuum energy if the shape s of the wall is perturbed a little bit, say by δs{\displaystyle \delta s}, at point p. That is, one has

Attraction between the plates can be easily understood by focusing on the one-dimensional situation. Suppose that a moveable conductive plate is positioned at a short distance a from one of two widely separated plates (distance L apart). With a << L, the states within the slot of width a are highly constrained so that the energy E of any one mode is widely separated from that of the next. This is not the case in the large region L, where there is a large number (numbering about L / a) of states with energy evenly spaced between E and the next mode in the narrow slot – in other words, all slightly larger than E. Now on shortening a by da (< 0), the mode in the narrow slot shrinks in wavelength and therefore increases in energy proportional to −da/a, whereas all the L /a states that lie in the large region lengthen and correspondingly decrease their energy by an amount proportional to da/L (note the denominator). The two effects nearly cancel, but the net change is slightly negative, because the energy of all the L/a modes in the large region are slightly larger than the single mode in the slot. Thus the force is attractive: it tends to make a slightly smaller, the plates attracting each other across the thin slot.

In the original calculation done by Casimir, he considered the space between a pair of conducting metal plates at distance a{\displaystyle a} apart. In this case, the standing waves are particularly easy to calculate, because the transverse component of the electric field and the normal component of the magnetic field must vanish on the surface of a conductor. Assuming the plates lie parallel to the xy-plane, the standing waves are

where ψ{\displaystyle \psi } stands for the electric component of the electromagnetic field, and, for brevity, the polarization and the magnetic components are ignored here. Here, kx{\displaystyle k_{x}} and ky{\displaystyle k_{y}} are the wave numbers in directions parallel to the plates, and

kn=nπa{\displaystyle k_{n}={\frac {n\pi }{a}}}

is the wave-number perpendicular to the plates. Here, n is an integer, resulting from the requirement that ψ vanish on the metal plates. The frequency of this wave is

where c is the speed of light. The vacuum energy is then the sum over all possible excitation modes. Since the area of the plates is large, we may sum by integrating over two of the dimensions in k-space. The assumption of periodic boundary conditions yields,

where A is the area of the metal plates, and a factor of 2 is introduced for the two possible polarizations of the wave. This expression is clearly infinite, and to proceed with the calculation, it is convenient to introduce a regulator (discussed in greater detail below). The regulator will serve to make the expression finite, and in the end will be removed. The zeta-regulated version of the energy per unit-area of the plate is

In the end, the limit s→0{\displaystyle s\to 0} is to be taken. Here s is just a complex number, not to be confused with the shape discussed previously. This integral/sum is finite for sreal and larger than 3. The sum has a pole at s=3, but may be analytically continued to s=0, where the expression is finite. The above expression simplifies to:

where polar coordinatesq2=kx2+ky2{\displaystyle q^{2}=k_{x}^{2}+k_{y}^{2}} were introduced to turn the double integral into a single integral. The q{\displaystyle q} in front is the Jacobian, and the 2π{\displaystyle 2\pi } comes from the angular integration. The integral converges if Re[s] > 3, resulting in

The sum diverges at s in the neighborhood of zero, but if the damping of large-frequency excitations corresponding to analytic continuation of the Riemann zeta function to s=0 is assumed to make sense physically in some way, then one has

The analytic continuation has evidently lost an additive positive infinity, somehow exactly accounting for the zero-point energy (not included above) outside the slot between the plates, but which changes upon plate movement within a closed system. The Casimir force per unit area Fc/A{\displaystyle F_{c}/A} for idealized, perfectly conducting plates with vacuum between them is

The force is negative, indicating that the force is attractive: by moving the two plates closer together, the energy is lowered. The presence of ℏ{\displaystyle \hbar } shows that the Casimir force per unit area Fc/A{\displaystyle F_{c}/A} is very small, and that furthermore, the force is inherently of quantum-mechanical origin.

NOTE: In Casimir's original derivation [1], a moveable conductive plate is positioned at a short distance a from one of two widely separated plates (distance L apart). The 0-point energy on both sides of the plate is considered. Instead of the above ad hoc analytic continuation assumption, non-convergent sums and integrals are computed using Euler–Maclaurin summation with a regularizing function (e.g., exponential regularization) not so anomalous as |ωn|−s{\displaystyle \vert \omega _{n}\vert ^{-s}} in the above.[23]

Casimir's analysis of idealized metal plates was generalized to arbitrary dielectric and realistic metal plates by Lifshitz and his students.[24][25] Using this approach, complications of the bounding surfaces, such as the modifications to the Casimir force due to finite conductivity, can be calculated numerically using the tabulated complex dielectric functions of the bounding materials. Lifshitz's theory for two metal plates reduces to Casimir's idealized 1/a4 force law for large separations a much greater than the skin depth of the metal, and conversely reduces to the 1/a3 force law of the London dispersion force (with a coefficient called a Hamaker constant) for small a, with a more complicated dependence on a for intermediate separations determined by the dispersion of the materials.[26]

Lifshitz' result was subsequently generalized to arbitrary multilayer planar geometries as well as to anisotropic and magnetic materials, but for several decades the calculation of Casimir forces for non-planar geometries remained limited to a few idealized cases admitting analytical solutions.[27] For example, the force in the experimental sphere–plate geometry was computed with an approximation (due to Derjaguin) that the sphere radius R is much larger than the separation a, in which case the nearby surfaces are nearly parallel and the parallel-plate result can be adapted to obtain an approximate R/a3 force (neglecting both skin-depth and higher-order curvature effects).[27][28] However, in the 2000s a number of authors developed and demonstrated a variety of numerical techniques, in many cases adapted from classical computational electromagnetics, that are capable of accurately calculating Casimir forces for arbitrary geometries and materials, from simple finite-size effects of finite plates to more complicated phenomena arising for patterned surfaces or objects of various shapes.[27][29]

One of the first experimental tests was conducted by Marcus Sparnaay at Philips in Eindhoven (Netherlands), in 1958, in a delicate and difficult experiment with parallel plates, obtaining results not in contradiction with the Casimir theory,[30][31] but with large experimental errors. Some of the experimental details as well as some background information on how Casimir, Polder and Sparnaay arrived at this point[32] are highlighted in a 2007 interview with Marcus Sparnaay.

The Casimir effect was measured more accurately in 1997 by Steve K. Lamoreaux of Los Alamos National Laboratory,[33] and by Umar Mohideen and Anushree Roy of the University of California, Riverside.[34] In practice, rather than using two parallel plates, which would require phenomenally accurate alignment to ensure they were parallel, the experiments use one plate that is flat and another plate that is a part of a sphere with a large radius.

In order to be able to perform calculations in the general case, it is convenient to introduce a regulator in the summations. This is an artificial device, used to make the sums finite so that they can be more easily manipulated, followed by the taking of a limit so as to remove the regulator.

for three-dimensional cavities. The infinite part of the sum is associated with the bulk constant C which does not depend on the shape of the cavity. The interesting part of the sum is the finite part, which is shape-dependent. The Gaussian regulator

is better suited to numerical calculations because of its superior convergence properties, but is more difficult to use in theoretical calculations. Other, suitably smooth, regulators may be used as well. The zeta function regulator

is completely unsuited for numerical calculations, but is quite useful in theoretical calculations. In particular, divergences show up as poles in the complex s plane, with the bulk divergence at s=4. This sum may be analytically continued past this pole, to obtain a finite part at s=0.

Not every cavity configuration necessarily leads to a finite part (the lack of a pole at s=0) or shape-independent infinite parts. In this case, it should be understood that additional physics has to be taken into account. In particular, at extremely large frequencies (above the plasma frequency), metals become transparent to photons (such as X-rays), and dielectrics show a frequency-dependent cutoff as well. This frequency dependence acts as a natural regulator. There are a variety of bulk effects in solid state physics, mathematically very similar to the Casimir effect, where the cutoff frequency comes into explicit play to keep expressions finite. (These are discussed in greater detail in Landau and Lifshitz, "Theory of Continuous Media".)

The Casimir effect can also be computed using the mathematical mechanisms of functional integrals of quantum field theory, although such calculations are considerably more abstract, and thus difficult to comprehend. In addition, they can be carried out only for the simplest of geometries. However, the formalism of quantum field theory makes it clear that the vacuum expectation value summations are in a certain sense summations over so-called "virtual particles".

More interesting is the understanding that the sums over the energies of standing waves should be formally understood as sums over the eigenvalues of a Hamiltonian. This allows atomic and molecular effects, such as the van der Waals force, to be understood as a variation on the theme of the Casimir effect. Thus one considers the Hamiltonian of a system as a function of the arrangement of objects, such as atoms, in configuration space. The change in the zero-point energy as a function of changes of the configuration can be understood to result in forces acting between the objects.

The dynamical Casimir effect is the production of particles and energy from an accelerated moving mirror. This reaction was predicted by certain numerical solutions to quantum mechanics equations made in the 1970s.[36] In May 2011 an announcement was made by researchers at the Chalmers University of Technology, in Gothenburg, Sweden, of the detection of the dynamical Casimir effect. In their experiment, microwave photons were generated out of the vacuum in a superconducting microwave resonator. These researchers used a modified SQUID to change the effective length of the resonator in time, mimicking a mirror moving at the required relativistic velocity. If confirmed this would be the first experimental verification of the dynamical Casimir effect.[37][38]

There are few instances wherein the Casimir effect can give rise to repulsive forces between uncharged objects. Evgeny Lifshitz showed (theoretically) that in certain circumstances (most commonly involving liquids), repulsive forces can arise.[40] This has sparked interest in applications of the Casimir effect toward the development of levitating devices. An experimental demonstration of the Casimir-based repulsion predicted by Lifshitz was carried out by Munday et al.[41] Other scientists have also suggested the use of gain media to achieve a similar levitation effect,[42] though this is controversial because these materials seem to violate fundamental causality constraints and the requirement of thermodynamic equilibrium (Kramers–Kronig relations). Casimir and Casimir-Polder repulsion can in fact occur for sufficiently anisotropic electrical bodies; for a review of the issues involved with repulsion see Milton et al.[43]

It has been suggested that the Casimir forces have application in nanotechnology,[44] in particular silicon integrated circuit technology based micro- and nanoelectromechanical systems, silicon array propulsion for space drives, and so-called Casimir oscillators.[45]

The Casimir effect shows that quantum field theory allows the energy density in certain regions of space to be negative relative to the ordinary vacuum energy, and it has been shown theoretically that quantum field theory allows states where the energy can be arbitrarily negative at a given point,[46] Many physicists such as Stephen Hawking,[47]Kip Thorne,[48] and others[49][50][51] therefore argue that such effects might make it possible to stabilize a traversable wormhole. Miguel Alcubierre has suggested[52] using the effect to obtain the negative energy density required for his Alcubierre Drive.

^See The Alcubierre Warp Drive by John G. Cramer, where Cramer notes that "Alcubierre, following the lead of wormhole theorists, argues that quantum field theory permits the existence of regions of negative energy density under special circumstances, and cites the Casimir effect as an example."

Barrow, J. D. (2000). The Book of Nothing: Vacuums, Voids, and the Latest Ideas About the Origins of the Universe. Pantheon Books. ISBN0-09-928845-1. (Also includes discussion of French naval analogy.)