Psychology & Neuroscience Stack Exchange is a question and answer site for practitioners, researchers, and students in cognitive science, psychology, neuroscience, and psychiatry. It only takes a minute to sign up.

$$R = e^{-\large \frac{t}{s}}$$ where $R$ is retention, $t$ is time and $s$ is strength can be found on Wikipedia. And $$b = \frac{100k}{c\log(t) +k}$$ where $b$ is retention, $t$ is time, $k = 1.48$ and $c = 1.25$, can be found in Ebbinghaus's study chapter 7. I've also seen it here and here also.

My first question is which forgetting curve is correct? And if both or the latter is, where did the constants $k$ or $c$ come from? What do they represent?

2 Answers
2

The question which of these two descriptions is correct? is perhaps natural in the context of, say, someone studying for an examination. Epistemologists might suggest that a better formulation would be is either of these correct?

However, as stated here there are clear reasons for preferring the first formulation to the second. I shall first explain why, and then attempt to reconstruct a more useful form for the second approach.

By substituting the value $t=0$, a time value (however the time variable is scaled) at which we would expect retention to be a maximum, we obtain:
$$ R_0 = 1 \\
b_0 = 0
$$
Another peculiarity of the second equation is that if $T=0.4297...$ we have:
$$
b_T = \pm \infty
$$
The latter is shorthand for the fact that as $t$ approaches $T$ from below then $b_t \rightarrow -\infty$, whilst $b$ then flips to $+\infty$ as soon as $t \gt T$ and thereafter decreases.

The factor of $100$ in the numerator of the fraction in equation $2$ suggests this was intended to be adapted to percentage correct, and should be dropped to make the two formulations more easily comparable.

The least changed version of eq. $2$ which makes some kind of sense would be:
$$ R_t = \frac1{1 + a \; ln(1+ \lambda t)}$$

This yields $R_0 = 1$, whilst $R_t$ is now a convex, strictly decreasing function of $t$. Surely both of these features are desiderata for any expression for retention.

Note that now (what we may term) model $1$ ($M1$) is a 1-parameter model, whereas $M2$ has two parameters, which might be preferred for an empirical data-fitting exercise. Both models have a linear approximation for small values of the time variable, namely:

$$ (M1) \; \; \; R_t = 1 -\frac{t}{s} $$

$$(M2) \; \; \; R_t = 1 - a \lambda t $$

So that we require $ \frac1s = a \lambda$ for compatibility. This suggests that a more useful form of $M2$, retaining the strength factor from the first model, is:
$$ R_t = \frac1{1 + \frac1{\lambda s} ln(1 + \lambda t)}$$

Where now $\lambda$ is clearly seen to be a scaling factor which must be applied to both variables, time and strength. This is perhaps a slight conceptual clarification, since the intervening variable strength is shown to be dimensionally equivalent to time. A strong memory is one that lasts for years, whereas, with the possible exception of the type of mnemonist studied by Professor Luria, few of Ebbinghaus' experimental subjects would retain any memory of his deliberately unmemorable nonsense syllables after the passage of more than a few weeks.

This observation also has a bearing on experimental methodology. Some nonsense syllables might be more memorable for a given subject - for example if, by chance, the syllable happened to match the subject's own initials etc. Such effects should, however, merely contribute a small addition of random noise to the experimental data.