NOTE: I replaced the "name" attribute with "id" because "name" is old and tired. EDIT: The above stuff is all based on the assumption that the <b>, <span>, and font-family/size stuff is identical in all of the original endnote code instances. You'd need to make judicious use of (.*?) if not.
(and I had a mistake in the first edition of this post that I corrected)

With the old regex engine, I could use '\x20' to specify a space in the replacement pattern, but that no longer works in the current version.

Other than using a literal space, how do I specify a space character in the replace field? (I don't want to use a literal space, because I often save my s/r patterns in a development notes file, and they're hard to see in plain text.)

It's just a small question. To select letters intended to become dropcaps, I use this part of a Regex:
([A-Z])

However, I realize this does not select accented capitals that do exist in French (like ╔, └, ď and so on). Of course, I can just suppress their accents. But if I wish to make a drop-cap out of an accented capital, what would be the code?

It's just a small question. To select letters intended to become dropcaps, I use this part of a Regex:
([A-Z])

However, I realize this does not select accented capitals that do exist in French (like ╔, └, ď and so on). Of course, I can just suppress their accents. But if I wish to make a drop-cap out of an accented capital, what would be the code?

([.]) is a catch-all. Have you better?

([A-Z╔└ď])

the dash just means range. the normal is any one of these. You can use both as I have

However, I realize this does not select accented capitals that do exist in French (like ╔, └, ď and so on). Of course, I can just suppress their accents. But if I wish to make a drop-cap out of an accented capital, what would be the code?

([.]) is a catch-all. Have you better?

Code:

\p{Lu}

Will catch all upper-case letters (including unicode characters), if that's what you're looking for. Add parentheses to make it a capture group if desired, of course.

Thanks very much for your replies. As this regex is intended to be used for French texts, I will use theducks' proposal. I just did not know one could add letters this way as I did not see any example of it.

Thanks very much for your replies. As this regex is intended to be used for French texts, I will use theducks' proposal. I just did not know one could add letters this way as I did not see any example of it.

Just so you know, it doesn't matter what language it is. If it's a valid uppercase letter (including unicode characters with acute, grave, breve umlauts—any valid diacritic, really), (\p{Lu}) will capture it. But whatever you're comfortable with is the way to go.

I wish to clean an html text which suffers from recurrent mistakes from an OCR engine (Cuneiform).

When I meet one the mistakes, I make a replacement and I note it. After some pages, I met most of the mistakes and now I intend to build a regex, adding as many as 15 successive simple search and replace like the following two.
A@ → Ó
B@ → š
I do not know how to perform these 15 F&R within a simple regex.Suppose I would like to build it for the two above, what should I write?