Greetings, and thank you for your reply.
While this is nearly the same output
I received running the Perl script I posted.
The script merely indicated that Unicode::UCD couldn't properly map "\x99" (0099) | "&8482;" (in Decimal),
to a Unicode symbol/entity. In all likelyhood, it was because the document wasn't properly encoded(windows-1252-1|ISO-8859-1), instead of UTF-8|UTF8. I've examined enough of the documents
to know that they aren't "junk", but rather UTF-8 encoded files that weren't saved accordingly.
So, knowing that Perl is quite Unicode|UTF-8 savvy, I was hoping I could find
a way to let Perl discover it's current incorrect encoding -- say ISO-8859-1, and either
convert the embedded symbols to their Decimal equivalent, or, if it's safe, to save it as UTF-8.
In fact, after saving that same document as UTF-8, and running that script on it, caused the script to emit that error. Reading that same document with the embedded symbols/characters in it, while being
ISO-8859-1 with that script emitted: