According to Cambridge University Press, it's Birch--Swinnerton-Dyer. (Sorry, this should be a comment, but I'm meritless.)
–
MephistoOct 7 '10 at 1:06

4

This really depends on the language in which you're writing; I assume, you're referring to English here? (In German, e.g. the em dash --- usually isn't used at all, parenthetical comments are set in between en dashes --.
–
doncherryApr 16 '11 at 14:28

This is explained thoroughly on page 4 of the TeXbook, in case you want some more information on this topic.
–
PieterOct 6 '10 at 16:38

1

When referring to papers with multiple authors, say Atiyah and Bott, should it be Atiyah-Bott or Atiyah--Bott?
–
Kevin H. LinOct 6 '10 at 18:25

1

@Kevin: I don't generally use that constructions, but I'd guess a hyphen because you're using the authors as a compound identifier for the paper.
–
dmckeeOct 6 '10 at 18:30

9

@Kevin: There's a pretty strong preference for Atiyah--Bott, since Atiyah-Bott is how you would write the double-barelled surname. Then en dash suggests more distance.
–
Charles StewartOct 6 '10 at 19:19

2

@nuttyaboutnatty To indicate all the numbers from three to seven.
–
dmckeeApr 29 '13 at 23:31

The answers and comments I have seen so far are incomplete and not entirely accurate in some regards. Rather than hiding all the information in comments, here is a summary. Note that I'm listing only the cases that occur most frequently and that seem to raise the most questions. I am not saying that my answer is exhaustive. (For example, I'm omitting discussion of the various uses of dashes in dialogues or quoted speech because (1) they pop up rarely in the (La)TeX world as they are mostly relevant to copyeditors of fiction (who can be assumed to have learned the rules otherwise) and (2) their usage is rare and thus not necessarily governed by hard-and-fast, rigid rules.)

The em-dash has very few recommended use cases nowadays.

The most common generally accepted possible use case is before a name in a quotation that is attributed only to a person (that is: not quoted bibliographically), say, at the beginning of a paper or chapter.

In US typography, parenthetical phrases are traditionally set off with em-dashes without surrounding spaces. (Some sources recommend "hair spaces" around the em-dash for this practice, but other sources recommend against that, and I don't see this commonly done by newspapers and magazines in the US.) This practice is recommended against by some style guides these days (typographer Robert Bringhurst indirectly calls the em-dash "Victorian"), and I agree. While you could describe this as a question of visual taste, one definite problem with em-dashes without surrounding spaces is that tracking (aka letter-spacing aka "stretching" in some TeX lingo) doesn't play well with this: interword spacing will increase uniformly for words separated by spaces but will remain fixed for words separated by an em-dash without surrounding spaces. Of course that would not apply to em-dashes surrounded by spaces (the NYTimes style guide recommends this, for this reason), but keep in mind that this makes for really fat typographic separation.

(Note that usage of either em- or en-dashes for parenthetical purposes includes many cases where you visually see only one em/en-dash, simply because of the implicit orthographic rule that the beginning or end of a sentence "eats" one member of the "dash-dash parenthesis symbol pair".)

The en-dash has two frequent usages:

As a modern punctuation mark for parenthetical phrases. The en-dash in this usage is always surrounded by spaces. This also works well with kerning (see above). For this parenthetical usage, the em-dash is more common in US typography, while the en-dash is more common in UK typography.

As a hyphen-replacement, functioning as a semantic linking element in a compound word (most frequently: a compound noun) that binds together two elements more loosely than a normal-space-in-that-same-compound. This is very important: When I have a compound such as "pre-World War II", I really want it to appear as pre-–World␣War␣II (too bad Unicode en-dashes don't render correctly on this forum!) with ␣ standing in for a fixed (non-stretchable) space (this question is about how to produce such fixed-width spaces), because semantically the compound has the structure [pre [World War II]], because the "pre" modifies the entire compound word/expression "World War II", not just the word "World". That is: Don't look at simple rules telling you about hyphenated adjectives, prefixes, etc. for nouns (these are too complex); instead pay attention to the semantic structure, because this is the only thing that ultimately counts. We want a separator that separates orthographic words (= clusters of contiguous letters) a tad more than an ordinary space while still indicating that things belong together. If people were writing "pre-World War II" directly (with a hyphen, which is normally (not clearly on this forum though) shorter than or of equal length as an ordinary space), the immediate visual parsing experience would be to understand this structurally as [[pre World] War II], which is semantically incorrect. Note that linguistically the fact that English orthography doesn't (unlike German orthography) treat all linguistic words as orthographic words ("swimming pool" is one linguistic word but two English orthographic words) is what essentially necessitates this usage of an en-dash, which is entirely absent from German orthography. For example, we write things like "Konrad-Adenauer-Stiftung"; here ordinary hyphens represent all levels of inner-compound connections. Even though the division between "Konrad" and "Adenauer" is tighter than the one between "Adenauer" and "Stiftung" (the semantic structure here is [[Konrad Adenauer]-Stiftung]), there is at least no illusion that "Adenauer" binds closer to "Stiftung" than to "Konrad" (which there would in English, if you used a space for the first gap and only a hyphen instead of an en-dash for the second gap). Note, the point here is that the en-dash has this effect, because it must be visually slightly wider than an ordinary space.

Q: Are there constraints against line-breaking around dashes?

A:parenthetical dashes: I believe to have seen line breaks before and after parenthetical dashes in the past, but when I recently checked a bunch of US magazines (such as the New Yorker, the Atlantic, and Harper's), I saw only line-final dashes in any column. This came to me as a surprise, but either there is variation or I was previously wrong about this. I don't see any reason against line-initial parenthetical dashes, but if they don't occur or only rarely occur, let's hereby note this observation. If other people have further insight on this, please contribute.

A:en-dashes functioning as a hyphen-replacement: These of course cannot occur line-initially.

(Now you might ask: Why would anyone want to forbid line breaks on either or both sides of a dash? I can think of two possible reasons: (1) Perhaps someone would want to treat all dashes like hyphens, by visual analogy. (2) One could think of the two hyphens in a parenthetical pair as an "opening hyphen" and a "closing hyphen" and by analogy with other parenthesis types forbid the former line-finally and the latter line-initially. But such a distinction is never made in practice.)

Wikipedia claims en-dash usage for "relationships and connections", but Chicago does not recognize that; this "relationships and connections" usage is IMO newfangled and not generally accepted. But then, note as a disclaimer that I don't agree with all of Chicago. (Because many of its statements are prescriptivist recommendations, but I am trying to be descriptivist. Prescriptivism can be good, but many recommendations I've read - here and elsewhere - are neither descriptively accurate nor backed up by argument.)

Hyphens:

For whenever most people think hyphens are used. (I could attempt to make an exhaustive list, but I think this doesn't belong here.)

Some thoughts and observations about page ranges:

I've read in various places that an en-dash is prescribed for page ranges (e.g. pp.~100--200), but note that such usage is by no means universal even in the US. Subjectively speaking, I think it is used more in English than in German, though. With this in mind, German usage could be changing, and I also wouldn't be surprised if technical subjects' literature is influenced by (La)TeX conventions. (The (La)TeX community tends to recommend an en-dash for page ranges.)

While some sources recommend an en-dash for page ranges, many sources are silent on this matter. Because an ordinary person might not notice an en-dash (vs a hyphen) in page ranges, I think that de-facto far more people use a hyphen (though this admittedly doesn't prove that it's better to do so).

In Germany I can say that for sure an ordinary hyphen is predominantly used for page ranges (though this does admittedly leave open the question of what "ideal" typographic practice would be).

While DIN 5008 supposedly prescribes an en-dash ("Halbgeviertstrich") for ranges, there is no mention of such usage in either the "Grammatik" volume of Duden or the official rules for the reformed orthography. Descriptively speaking, en-dashes between digits are overwhelmingly not used.

I believe that DIN 1505-2 doesn't mention or require an en-dash around digits

The en-dash is not taught to be used for juxtaposition of words of equal category (and in English this is also definitely fringe usage), so it shouldn't be obviously so for digit groups either.

Note that Unicode provides a so-called figure-dash (U+2012), which is designed to be of the same width as a digit. According to some people, this is what everybody "should" be using, but I would like to remind everyone that ultimately usage depends on the information level and acceptance of the various target communities). While I don't know how widespread the figure-dash is (I am fairly sure it is not widespread as of right now), I'd be curious to learn about this community's stance on this.

The Chicago Manual of Style's (16th ed.) does recommend an en-dash for ranges (6.78) [I disagree with this aesthetically], but its recommendations are consistent with the view that en-dashes (or figure-dashes) do not belong in between numbers in other cases: "A hyphen is used to separate numbers that are not inclusive, such as telephone numbers, social security numbers, and ISBNs." (6.77)

Note that Wikipedia offers some "prescriptivist poppycock" (that expression is taken from the Language Log blog, which I highly recommend): "The figure dash is used when a dash must be used within numbers (e.g. phone number 555‒0199). It does not indicate a range, for which the en dash is used [...]." It is actually quite unclear where these prescriptions come from. I definitely can say that such usage is not widespread, and I do not see an obvious reason: the hyphen doesn't have the same semantic status as a digit, neither in a digit string (where it acts as a separator) nor in a range (of page numbers, units in physics, ...). Semantically the only intuitive rule is that the separator should be wider than the digit separation (that is, wider than zero) and that it should ideally be narrower than other surrounding orthographic divisions, to make clear that the digit group is a close unit (on the phrase structure level). That is, it would be odd to use an em-dash in a page range (since a sentence containing that page range would then have the largest visual separator token be just that em-dash), but by the same reasoning an en-dash is also not appropriate. So, a hyphen seems entirely fine, and I am somewhat surprised that the oft-heard (in the (La)TeX community) statement that ranges necessitate an en-dash isn't supported by either solid references or a good linguistic argument that talks about tokens, orthography, and phrase structure. As for non-range digit groupings: dots and colons can also function as separating elements in a sequence of digits, but noone would ever think about wanting to set them in a monospaced font with equal-to-digit width.

Being German I cannot confirm that “an ordinary hyphen is used for page ranges”. What I do know, though, is that more than once I read that a Halbgeviertstrich is recommended [1, 2, 3]. Supposedly it is also recommended by the DIN 5008 [4].
–
clemensJun 16 '12 at 19:57

1

Personally I like the en-dash a a range indicator. To me a hyphen connects two parts of something to a whole, like parts of a telephone number, whereas the en-dash somehow (to me) reflects the nature of the range: two ends (the numbers) and something in between that's left out. (I hope I made myself clear - English's not my native language...)
–
clemensJun 16 '12 at 20:06

@cgnieder Many thanks for your concrete references! I was judging from checking a few books (not TeX-related or technical ones, but other genres) and things looked like a hyphen, but it is very good to know about some more professional recommendations. (Though I'd still say that I don't know which type of hyphen is used most often in practice in professionally published books. I think descriptive surveys of actual use in typesetting aren't as common or well-known/publicized as I'd like them to be.)
–
Lover of StructureJun 18 '12 at 6:00

@cgnieder As for your second comment (about the en-dash for ranges): Having that preference for en-dashes is fine, though I am not offended by just using a normal hyphen for that simple purpose.
–
Lover of StructureJun 18 '12 at 6:05

Usage of the em dash, since that's what dmckee's explanation doesn't tackle: it's nicest to think of the em dash as a dramatic pause in a sentence, and it need not be used parenthetically. Chicago (6.87) calls it "the most commonly used and versatile of the dashes", and lists among its uses:

Amplifying or explaining, e.g., Chicago's example: "She secured the strategy --- a strategy that would, she hoped, secure the peace."

Separating subject from pronoun: "Tex --- that was the typesetting program they would use."

Used in place of other punctuation: this is the idea that it is a low-precedence subsitute for hyphens, commas, colons or parentheses. It also has other nice properties, such as that it can be used as a comma, together with an exclamation or question mark. Chicago's example: "Only if --- heaven forbid! --- you lose your passport should you call home."

To introduce speech, as per Brent's answer.

Don't overuse it! Em dashes stand out more than any other piece of normal punctuation and too many on a page can make the text look absurd. Use it only to add force to your writing, or to solve tricky to parse punctuation problems — and those are often best solved by finding a less complex way of making your point.

One typically does not use full spaces around em dashes. One thing that works well and apparently traces back to Knuth (although, I'm not sure exactly how) \def\dash---{\kern.16667em---\penalty\exhyphenpenalty\hskip.16667em\relax} However, I typically remove the --- after \dash and stick in an \unskip.
–
TH.Oct 7 '10 at 12:16

1

@TH.: Right. I like to space em dashes, where I have the freedom, because it cranks up their impact a notch higher. It would be ideal if one could separate this typographical issue from text composition, but that's a bit tricky in practice. Your favourite editor's search and replace function should be up to the job, instead.
–
Charles StewartOct 7 '10 at 12:39

One potential problem with spaced dashes is that you need to be sure you use a nonbreakable space before the dash (hence the \kern in the \dash macro). It looks pretty bad to have the dash appear on the next line.
–
TH.Oct 7 '10 at 12:51

@TH.: Hmmm. I don't think I agree: with a parenthetical dash, one wants it to behave rather like a parenthesis (i.e. break before opening, after closing). And it clearly matters far less than for other kinds of punctuation. I'm going to have a look at some examples.
–
Charles StewartOct 7 '10 at 20:38

I'm pretty sure that's wrong, but regardless, my "potential problem" remains in that you need to ensure a nonbreakable space somewhere. It's clear that Knuth disagrees with you about where the break should occur around an em dash. Try the following experiment: \vbox{\hsize1.5em \noindent\hfill 0---0}\bye Note that --- is 1em wide and 0 is .5em wide in cmr10 [TeXbook page 60]. Thus the dash and exactly one of the 0s fits in the box. Note that you can add \penalty9999 before the dash and TeX will break before the dash.
–
TH.Oct 8 '10 at 4:05

As much as I love bringhurst, this endash/emdash prescriptivism is going too far, I think. Whether to use an en or em dash in sentences is as much dictated by the style of the text and the design of the font.
–
Will RobertsonOct 7 '10 at 10:30

1

@Will: Of course you're right; but some of us (like me) don't have as much artistic instinct as others, so it's nice to have a "base role model".
–
Brent.LongboroughOct 18 '10 at 8:27