Molecular binding is an interaction between molecules that results in a stable association between those molecules. Cooperative binding occurs if the number of binding sites of a macromolecule that are occupied by a specific type of ligand is a non-linear function of this ligand's concentration. This can be due, for instance, to an affinity for the ligand that depends on the amount of ligand bound. Cooperativity can be positive (supra-linear) or negative (infra-linear). Cooperative binding is most often observed in proteins but nucleic acids can also exhibit cooperative binding, for instance of transcription factors. Cooperative binding has been shown to be the mechanism underlying a large range of biochemical and physiological processes.

History and mathematical formalisms

Christian Bohr and the concept of cooperative binding

In 1904, Christian Bohr studied hemoglobin binding to oxygen under different conditions.[1] When plotting hemoglobin saturation with oxygen as a function of the partial pressure of oxygen, he obtained a sigmoidal (or "S-shaped") curve, see figure 1. This indicates that the more oxygen is bound to hemoglobin, the easier it is for more oxygen to bind - until all binding sites are saturated. In addition, Bohr noticed that increasing CO2 pressure shifted this curve to the right - i.e. higher concentrations of CO2 make it more difficult for hemoglobin to bind oxygen.[1] This latter phenomenon, together with the observation that hemoglobin's affinity for oxygen increases with increasing pH, is known as the Bohr effect.

Figure 1. Original figure from Christian Bohr, showing the sigmoidal increase of oxyhemoglobin as a function of the partial pressure of oxygen.

A receptor molecule is said to exhibit cooperative binding if its binding to ligand scales non-linearly with ligand concentration. Cooperativity can be positive (if binding of a ligand molecule increases the receptor's apparent affinity, and hence increases the chance of another ligand molecule binding) or negative (if binding of a ligand molecule decreases affinity and hence makes binding of other ligand molecules less likely). Figure 1 is a chart of the "fractional occupancy" Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}}
of a receptor with a given ligand, which is defined as the quantity of ligand-bound binding sites divided by the total quantity of ligand binding sites:

If Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}=0}
, then the protein is completely unbound, and if Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}=1}
, it is completely saturated. If the plot of Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}}
at equilibrium as a function of ligand concentration is sigmoidal in shape, as observed by Bohr for hemoglobin, this indicates positive cooperativity. If it is not, no statement can be made about cooperativity from looking at this plot alone.

The concept of cooperative binding only applies to molecules or complexes with more than one ligand binding sites. If several ligand binding sites exist, but ligand binding to any one site does not affect the others, the receptor is said to be non-cooperative. Cooperativity can be homotropic, if a ligand influences the binding of ligands of the same kind, or heterotropic, if it influences binding of other kinds of ligands. In the case of hemoglobin, Bohr observed homotropic positive cooperativity (binding of oxygen facilitates binding of more oxygen) and heterotropic negative cooperativity (binding of CO2 reduces hemoglobin's facility to bind oxygen.)

Throughout the 20th century, various frameworks have been developed to describe the binding of a ligand to a protein with more than one binding site and the cooperative effects observed in this context (reviewed by Wyman, J. and Gill, 1990[2]).

The Hill equation

The first description of cooperative binding to a multi-site protein was developed by A.V. Hill.[3] Drawing on observations of oxygen binding to hemoglobin and the idea that cooperativity arose from the aggregation of hemoglobin molecules, each one binding one oxygen molecule, Hill suggested a phenomenological equation that has since been named after him:

Figure 2. Hill plot of the Hill equation in red, showing the slope of the curve being the Hill coefficient and the intercept with the x-axis providing the apparent dissociation constant. The green line shows the non-cooperative curve.

The "Hill plot" is obtained by plotting Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \log \frac{\bar{Y}}{1-\bar{Y}}}
versus Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \log [X]}
. In the case of the Hill equation, it is a line with slope Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle n_H}
and intercept Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle log(K_d)}
(see figure 2). This means that cooperativity is assumed to be fixed, i.e. it does not change with saturation. It also means that binding sites always exhibit the same affinity, and cooperativity does not arise from an affinity increasing with ligand concentration.

The Adair equation

G.S. Adair found that the Hill plot for hemoglobin was not a straight line, and hypothesized that cooperativity was not a fixed term, but dependent on ligand saturation.[4] Having demonstrated that hemoglobin contained four hemes (and therefore binding sites for oxygen), he worked from the assumption that fully saturated hemoglobin is formed in stages, with intermediate forms with one, two, or three bound oxygen molecules. The formation of each intermediate stage from unbound hemoglobin can be described using an apparent macroscopic association constant Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_i}
. The resulting fractional occupancy can be expressed as:

where n denotes the number of binding sites and each Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_i}
is a combined association constant, describing the binding of i ligand molecules.

The Klotz equation

Working on calcium binding proteins, Irving Klotz deconvoluted Adair's association constants by considering stepwise formation of the intermediate stages, and tried to express the cooperative binding in terms of elementary processes governed by mass action law.[5][6] In his framework, Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_1}
is the association constant governing binding of the first ligand molecule, Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_2}
the association constant governing binding of the second ligand molecule (once the first is already bound) etc. For Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}}
, this gives:

It is worth noting that the constants Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_1}
, Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_2}
and so forth do not relate to individual binding sites. They describe how many binding sites are occupied, rather than which ones. This form has the advantage that cooperativity is easily recognised when considering the association constants. If all ligand binding sites are identical with a microscopic association constant Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K}
, one would expect Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_1=nK, K_2=\frac{n-1}{2}K, \ldots K_n=\frac{1}{n}K}
(that is Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_i=\frac{n-i+1}{i}K}
) in the absence of cooperativity. We have positive cooperativity if Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_i}
lies above these expected values for Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle i>1}
.

The Klotz equation (which is sometimes also called the Adair-Klotz equation) is still often used in the experimental literature to describe measurements of ligand binding in terms of sequential apparent binding constants.[5]

Pauling equation

By the middle of the 20th century, there was an increased interest in models that would not only describe binding curves phenomenologically, but offer an underlying biochemical mechanism. Linus Pauling reinterpreted the equation provided by Adair, assuming that his constants were the combination of the binding constant for the ligand (Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K}
in the equation below) and energy coming from the interaction between subunits of the cooperative protein (Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \alpha}
below).[7] Pauling actually derived several equations, depending on the degree of interaction between subunits. Based on wrong assumptions about the localisation of hemes, he opted for the wrong one to describe oxygen binding by hemoglobin, assuming the subunit were arranged in a square. The equation below provides the equation for a tetrahedral structure, which would be more accurate in the case of hemoglobin:

The KNF model

Based on results showing that the structure of cooperative proteins changed upon binding to their ligand, Daniel Koshland and colleagues[8] refined the biochemical explanation of the mechanism described by Pauling.[7] The Koshland-Némethy-Filmer (KNF) model assumes that each subunit can exist in one of two conformations: active or inactive. Ligand binding to one subunit would induce an immediate conformational change of that subunit from the inactive to the active conformation, a mechanism described as "induced fit".[9] Cooperativity, according to the KNF model, would arise from interactions between the subunits, the strength of which varies depending on the relative conformations of the subunits involved. For a tetrahedric structure (they also considered linear and square structures), they proposed the following formula:

Where Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_X}
is the constant of association for X, Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_t}
is the ratio of B and A states in the absence of ligand ("transition"), Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_{AB}}
and Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle K_{BB}}
are the relative stabilities of pairs of neighbouring subunits relative to a pair where both subunits are in the A state (Note that the KNF paper actually presents Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle N_s}
, the number of occupied sites, which is here 4 times Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}}
).

The MWC model

Figure 3. Reaction scheme of a Monod-Wyman-Changeux model of a protein made up of two protomers. The protomer can exist under two states, each with a different affinity for the ligand. L is the ratio of states in the absence of ligand, c is the ratio of affinities.

Figure 4. Energy diagram of a Monod-Wyman-Changeux model of a protein made up of two protomers. The larger affinity of the ligand for the R state means that the latter is preferentially stabilized by the binding.

The Monod-Wyman-Changeux (MWC) model for concerted allosteric transitions[10] went a step further by exploring cooperativity based on thermodynamics and three-dimensional conformations. It was originally formulated for oligomeric proteins with symmetrically arranged, identical subunits, each of which has one ligand binding site. According to this framework, two (or more) interconvertible conformational states of an allosteric protein coexist in a thermal equilibrium. The states - often termed tense (T) and relaxed (R) - differ in affinity for the ligand molecule. The ratio between the two states is regulated by the binding of ligand molecules that stabilizes the higher-affinity state. Importantly, all subunits of a molecule change states at the same time, a phenomenon known as "concerted transition". The MWC model is illustrated in figure 3.

The allosteric isomerisation constant L describes the equilibrium between both states when no ligand molecule is bound: Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle L=\frac{\left[T_0\right]}{\left[R_0\right]}}
. If L is very large, most of the protein exists in the T state in the absence of ligand. If L is small (close to one), the R state is nearly as populated as the T state. The ratio of dissociation constants for for the ligand from the T and R states is described by the constant c: Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle c = \frac{K_d^R}{K_d^T}}
. If Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle c=1}
, both R and T states have the same affinity for the ligand and the ligand does not affect isomerisation. The value of c also indicates how much the equilibrium between T and R states changes upon ligand binding: the smaller c, the more the equilibrium shifts towards the R state after one binding. With Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \alpha = \frac{[X]}{K_d^R}}
, fractional occupancy is described as:

The sigmoid Hill plot of allosteric proteins (shown in figure 5) can then be analysed as a progressive transition from the T state (low affinity) to the R state (high affinity) as the saturation increases (see figure 4). The Hill coefficient also depends on saturation, with a maximum value at the inflexion point. The intercepts between the two asymptotes and and the y-axis allow to determine the affinities of both states for the ligand.

Figure 5. Hill plot of the MWC binding function in red, of the pure T and R state in green. As the conformation shifts from T to R, so does the binding function. The intercepts with the x-axis provide the apparent dissociation constant as well sas the microscopic dissociation constants of R and T states.

In proteins, conformational change is often associated with activity, or activity towards specific targets. Such activity is often what is physiologically relevant or what is experimentally measured. The degree of conformational change is described by the state function Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{R}}
, which denotes the fraction of protein present in the Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle R}
state. As the energy diagram illustrates, Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{R}}
increases as more ligand molecules bind. The expression for Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{R}}
is:

A crucial aspect of the MWC model is that the curves for Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{Y}}
and Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://api.formulasearchengine.com/v1/":): {\displaystyle \bar{R}}
do not coincide,[11] i.e. fractional saturation is not a direct indicator of conformational state (and hence, of activity). Moreoever, the extents of the cooperativity of binding and the cooperativity of activation can be very different: an extreme case is provide by the bacteria flagella motor with a Hill coefficient of 1.7 for the binding and 10.3 for the activation.[12][13] The supra-linearity of the response is sometimes called ultrasensitivity.

If an allosteric protein binds to a target that also has a higher affinity for the R state, then target binding further stabilises the R state, hence increasing ligand affinity. If, on the other hand, a target preferentially binds to the T state, then target binding will have a negative effect on ligand affinity. Such targets are called allosteric modulators.

Since its inception, the MWC framework has been extended and generalised. Variations have been proposed, for example to cater for proteins with more than two states,[14] proteins that bind to several types of ligands [15][16] or several types of allosteric modulators [16] and proteins with non-identical subunits or ligand-binding sites.[17]

Examples of cooperative binding

The list of molecular assemblies that exhibit cooperative binding of ligands is very large, but some examples are particularly notable for their historical interest, their unusual properties, or their physiological importance.

Figure 6. Cartoon representation of the protein hemoglobin in its two conformations: "tensed (T)" on the left corresponding to the deoxy form (derived from PDB id:11LFL) and "relaxed (R)" on the right corresponding to the oxy form (derived from PDB id:1LFT).

As described in the historical section, the most famous example of cooperative binding is hemoglobin. Its quaternary structure, solved by Max Perutz using X-ray diffraction,[18] exhibits a pseudo-symmetrical tetrahedron carrying four binding sites (hemes) for oxygen (see figure 6). Many other molecular assemblies exhibiting cooperative binding have been studied in great detail.

Multimeric enzymes

The activity of many enzymes is regulated by allosteric effectors. Some of these enzymes are multimeric and carry several binding sites for the regulators.

Threonine deaminase was one of the first enzymes suggested to behave like hemoglobin[19] and shown to bind ligands cooperatively.[20] It was later shown to be a tetrameric protein.[21]

Ion Channels

Most ion channels are formed of several identical or pseudo-identical monomers or domains, arranged symmetrically in biological membranes. Several classes of such channels whose opening is regulated by ligands exhibit cooperative binding of these ligands.

It was suggested as early as 1967[25] (when the exact nature of those channels was still unknown) that the nicotinic acetylcholine receptors bound acetylcholine in a cooperative manner due to the existence of several binding sites. The purification of the receptor[26] and its characterization demonstrated a pentameric structure with binding sites located at the interfaces between subunits, confirmed by the structure of the receptor binding domain.[27]

Multi-site molecules

Although most proteins showing cooperative binding are multimeric complexes of homologous subunits, some proteins carry several binding sites for the same ligand on the same polypeptide. One such example is calmodulin. One molecule of calmodulin binds four calcium ions cooperatively.[30] Its structure presents four EF-hand domains,[31] each one binding one calcium ion. Interestingly, the molecule does not display a square or tetrahedron structure, but is formed of two lobes, each carrying two EF-hand domains.

Figure 7. Cartoon representation of the protein Calmodulin in its two conformation: "closed" on the left (derived from PDB id: 1CFD) and "open" on the right (derived from PDB id: 3CLN). The open conformation is represented bound with 4 calcium ions (orange spheres).

Transcription factors

Cooperative binding of proteins onto nucleic acids has also been shown. A classical example is the binding of the lambda phage repressor to its operators, which occurs cooperatively.[32][33] Other examples of transcription factors exhibit positive cooperativity when binding their target, such as the repressor of the TtgABC pumps[34] (n=1.6).

Conversely, examples of negative cooperativity for the binding of transcription factors were also documented, as for the homodimeric repressor of the Pseudomonas putidacytochrome P450cam hydroxylase operon[35] (n=0.56).

Conformational spread and binding cooperativity

Early on, it has been argued that some proteins, especially those consisting of many subunits, could be regulated by a generalized MWC mechanism, in which the transition between R and T state is not necessarily synchronized across the entire protein.[36] In 1969, Wyman [37] proposed such a model with "mixed conformations" (i.e. some protomers in the R state, some in the T state) for respiratory proteins in invertebrates.

Following a similar idea, the conformational spread model by Duke and colleagues[38] subsumes both the KNF and the MWC model as special cases. In this model, a subunit does not automatically change conformation upon ligand binding (as in the KNF model), nor do all subunits in a complex change conformations together (as in the MWC model). Conformational changes are stochastic with the likelihood of a subunit switching states depending on whether or not it is ligand bound and on the conformational state of neighbouring subunits. Thus, conformational states can "spread" around the entire complex.

^ abKLOTZ IM (1946) The application of the law of mass action to binding by proteins; interactions with calciumArch Biochem 9:109-17 [PMID: 21009581]Cite error: Invalid <ref> tag; name "Klotz1946a" defined multiple times with different content