pdenisowski
<P><PRE>It's still missing a correct 'secondary sort' <b>(for the edge case when the diacritic-stripped words are identical);</b></PRE><P>
(laughs) That's hardly an "edge case" in Vietnamese: there are <b>thousands</b> of minimal pairs where the only difference between the words is the diacritics. While it's possible to read and understand Vietnamese typed in (7-bit) ASCII without too much ambiguity (i.e., you can figure out which word is meant from the context), this obviously wouldn't work for a dictionary, where each headword stands alone with no context.
<P>
The other issue is that the words in the dictionary need to be sorted in the "correct" order for me to detect duplicates, etc.
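<P>
For illustration only, here is a minimal Python sketch of the two-level sort being discussed: the primary key is the diacritic-stripped word, and the full word breaks ties. The function names (<code>strip_diacritics</code>, <code>sort_key</code>, <code>find_duplicates</code>) are made up for this example, and the tie-break uses plain code-point order, which is an assumption and not the conventional Vietnamese tone or alphabet order. It is only enough to make the ordering deterministic so that true duplicates end up next to each other.

```python
import unicodedata


def strip_diacritics(word: str) -> str:
    """Return the word with tone and vowel marks removed (hypothetical helper)."""
    # NFD splits each letter into a base character plus combining marks;
    # dropping category "Mn" (nonspacing marks) leaves only the base letters.
    decomposed = unicodedata.normalize("NFD", word)
    stripped = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    # The barred d has no Unicode decomposition, so it is mapped by hand.
    return stripped.replace("đ", "d").replace("Đ", "D")


def sort_key(word: str):
    """Primary key: stripped, lowercased word. Secondary key: the full word."""
    # Normalizing to NFC first makes precomposed and decomposed spellings
    # of the same word compare equal.
    nfc = unicodedata.normalize("NFC", word)
    # ASSUMPTION: code-point order as the tie-break, not dictionary tone order.
    return (strip_diacritics(nfc).lower(), nfc)


def find_duplicates(words):
    """Return entries that are immediately followed by an identical entry once sorted."""
    ordered = sorted(words, key=sort_key)
    return [a for a, b in zip(ordered, ordered[1:]) if sort_key(a) == sort_key(b)]


if __name__ == "__main__":
    entries = ["má", "ma", "mà", "ma"]
    print(sorted(entries, key=sort_key))
    print(find_duplicates(entries))
```

Because the full word is part of the key, minimal pairs such as "ma", "mà" and "má" stay distinct and only exact repeats are reported. A real dictionary would presumably replace the secondary key with a proper Vietnamese collation.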
<P>
I'll try out your suggestion later today - thanks again!