A problem is that whenever I make a change to the translator, most of the dictionary is changed and I existing texts will become impossible to translate back. It could be frozen at some point, of course. The generator of words is already a part of wesnoth 1.13 (used as the new name generator), it would just need to add some more determinism. Maybe if all changes were frozen afterwards, it could make sense.

It currently picks the translation so that the word is hashed, the hash is the seed sent to the generator to generate a word, if a duplicity happens, the seed is increased by a constant and retried. The problem is that any change to the word generator causes different words to appear with a different hash. Letter to group of letters would generate overly long words, I can't figure out any other way.

It would be totally doable for a campaign to contain blackspeech lines that are replaced by normal language if you have something. The author would just have to add some blackspeech lines along with normal language and have some logic pick which one will be displayed.

I decided not to use umlauts because it would look more German. I don't want to do this to Germans. They have done nothing to deserve it. Deutsch ist ein bisschen schöne Sprache! And if you go to Norway, you'll realise that German is a nice language. And Norwegian does not use umlauts. Turkish is also pretty hard on ears. Swedes are also very proud of their beautiful language. On the other hand, umlauts could help create enough syllables to make a different approach to translating words.
t->ruv
a->vorg
th->razg
at->bur
that->razgbur (razgvorgruv is longer, the shorter possibility is picked)
Common letter groups getting their syllables, making the language systematic and independent of seeds and dictionaries. This could solve a lot of problems if there was enough blackspeech syllables for most groups. Umlauts could help achieve it. Without it, this approach leads to horribly long words.
Interestingly, UTF-8 has also some unusual letters with umlauts, like ẅ, ẗ, ÿ, ḧ or ẍ, not sure who uses it except for some heavy metal bands (like Queensrÿche).

Dugi wrote:It would be totally doable for a campaign to contain blackspeech lines that are replaced by normal language if you have something. The author would just have to add some blackspeech lines along with normal language and have some logic pick which one will be displayed.

Yeah that would be cool!

Dugi wrote:
I decided not to use umlauts because it would look more German. I don't want to do this to Germans. They have done nothing to deserve it. Deutsch ist ein bisschen schöne Sprache! And if you go to Norway, you'll realise that German is a nice language. And Norwegian does not use umlauts. Turkish is also pretty hard on ears. Swedes are also very proud of their beautiful language.

yeah thats no problem i just thought to ask

Dugi wrote:
Interestingly, UTF-8 has also some unusual letters with umlauts, like ẅ, ẗ, ÿ, ḧ or ẍ, not sure who uses it except for some heavy metal bands (like Queensrÿche).

I will make an experiment to check how often would it be possible to replace digraphs or trigraphs can be replaced by blackspeech syllables. If enough of them is formed, the created words might be short enough to be more scary than funny. The üm̈läüẗs̈ could help it greatly. At the cost of rendering the pronunciation ambiguous.

This approach could make blackspeech work without the limiting need for a dictionary.

If you want it hard to decipher i would suggest starting from a completely different method of communication such as sign language, which is based on cascading/chaining concepts rather than serially transcribing speech or just transliterating english.

tr0ll wrote:If you want it hard to decipher i would suggest starting from a completely different method of communication such as sign language, which is based on cascading/chaining concepts rather than serially transcribing speech or just transliterating english.

_________
I am trying a novel method of translating to blackspeech, without the need for a massive dictionary of randomly generated words (where game and games are totally different words on the other end). I have made a statistical analysis of English language, finding most common letter groups and replacing them by blackspeech syllables systematically (making this a cypher rather than a language). To achieve this, I needed much more syllables, so I had to add some umlauts and a few other letters. This way, the generated text's length is about 174%158% of the same text in English, which is slightly worsebetter than the current blackspeech, which produces texts with 162% of the length of English text (the more length-inefficient languages need about 140% of the size of English text).

EDIT:
Okay, implemented. All previous messages in blackspeech are made untranslatable back unless I re-enable the old one, I am sorry.

On the other hand, this one has those demanded umlauts and other symbols related to nordic languages, slightly shorter words and does not need a heavy dictionary. It technically isn't a language this time, but a cypher. It works on any words, however it's coded to produce shorter words in English, not in any other language.