Humanist Discussion Group, Vol. 10, No. 884.
Center for Electronic Texts in the Humanities (Princeton/Rutgers)
Centre for Computing in the Humanities, King's College London
Information at http://www.princeton.edu/~mccarty/humanist/

I have been recieving an alarming amount of junk mail recently including
unsollicited subscriptions to e-mail newsletters that are really hardware
ads. Over the year it has gone from one such message a month to one every
other day. One note offered even to sell me millions of e-mail addresses
and the technology to use the list! I assume that, as it becomes easier to
buy such lists and spam the net, the problem will only get worse. Am I the
only one thus affected? What are you doing about it? Is it time to have a
"private" e-mail address or to learn how to use the filters in Eudora?

To help in the testing of a general purpose encoding conversion tool, I
am soliciting the contribution of snippits of any size of texts encoded
for the use of Tact or Micro OCP. I am especially interested in instances
of dense, complicated, or unusual encoding. I would also be interested in
snippits of Beta Code encoded texts which aren't owned by TLG or PHI
(which I could include as a sample in an eventual distribution).

My loftier goal was to design a general purpose tool for converting
languages that conform to a 'type 3' grammar (e.g. a regular language,
which can be described by a regular expression) into a language that
conforms to a 'type 2', or context-free, grammar, by adding hierarchical
information, in the form of tags, to the text. In practice, the tool
is at present only able to convert COCOA and Beta-Code encoded texts to
different flavors of SGML.

Please send pointers or attached texts to

gjmurphy@Princeton.EDU

Also, please indicate whether I may include the snippit in an eventual
distribution of the conversion tool, as a sample.