1. There is still an error with:
{{懶洋洋}} [01 lan3 yang2 yang2] malvigla; langvora

Before it gave this error:
{{懶洋洋}} 1-0-B [extra {{}} ]

Now it gives this error: 1-0-B [the 懶洋洋 characters are missing]

The 懶洋洋 is in {{ and }}, so I think it should not get indexed ... or am I wrong?
The empty index entry before 1-0-B might be your BOM character; I have not fully investigated that yet, though. (The BOM character would do no harm in the index.)

Quote

2. If there are 2 words in the left column, then only the 2nd word gets indexed:
馬虎 / 马虎 {{[01 mahu]}} malzorga

Both "馬虎" and "马虎" should be indexed. But only "马虎" is indexed. Here is the index:
馬虎 / 马虎 1-152-B
马虎 1-152-S

I thought if there is a space between words, then they will both be indexed. This is probably a different problem than the {{}} problem.
I added this line to DictionaryForMIDs.properties, but it didn't fix the problem:
dictionaryGenerationLanguage1ExpressionSplitString: /

馬虎 / 马虎 1-152-B
马虎 1-152-S
looks OK to me at first sight: "馬虎" is part of the index, although still combined into "馬虎 / 马虎" (DictionaryGeneration treats this as a 'phrase' unless you set dictionaryGenerationLanguage1ExpressionSplitString).

And you are right, you really need to set dictionaryGenerationLanguage1ExpressionSplitString here!

With
dictionaryGenerationLanguage1ExpressionSplitString: /
the index should look like
馬虎 1-152-B
马虎 1-152-S
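To make the intended effect of the split string concrete, here is a minimal sketch in Python. It is not the DictionaryGeneration code: the function name `index_keys` is made up for illustration, and it only models how an expression is cut into keys, not how the 1-152-B / 1-152-S entries are produced.

```python
def index_keys(expression, split_string=None):
    """Return the keys one left-column expression would contribute.

    Hypothetical helper, not part of DictionaryForMIDs.
    """
    if not split_string:
        # Without a split string the whole expression stays one phrase.
        return [expression.strip()]
    # With a split string, each non-empty part becomes its own key.
    parts = (part.strip() for part in expression.split(split_string))
    return [part for part in parts if part]


print(index_keys("馬虎 / 马虎"))       # one combined key
print(index_keys("馬虎 / 马虎", "/"))  # two separate keys
```

Under these assumptions, the first call keeps "馬虎 / 马虎" as a single key, while the second yields "馬虎" and "马虎" separately, which is the behaviour the split string is supposed to give.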

The 懶洋洋 is in {{ and }}, so I think it should not get indexed ... or am I wrong?
The empty index entry before 1-0-B might be your BOM character; I have not fully investigated that yet, though. (The BOM character would do no harm in the index.)

You're right, since the word is in {{}} it won't be indexed. So the extra " 1-0-B" won't cause any problems in the index.
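As an illustration of the {{ }} rule, the sketch below removes everything between {{ and }} before a line is considered for indexing. This is only an assumed model of the behaviour described above, written in Python; `indexable_text` is a hypothetical name, and the real DictionaryGeneration logic may differ.

```python
import re

# Non-greedy match for one {{...}} span (assumed "do not index" markup).
NO_INDEX = re.compile(r"\{\{.*?\}\}")


def indexable_text(line):
    """Drop {{...}} spans and collapse the whitespace they leave behind."""
    return " ".join(NO_INDEX.sub(" ", line).split())


print(indexable_text("{{懶洋洋}} [01 lan3 yang2 yang2]"))
print(indexable_text("馬虎 / 马虎 {{[01 mahu]}}"))
```

With this model, 懶洋洋 disappears from the first line because it sits inside {{ }}, and only "馬虎 / 马虎" remains of the second.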

I changed the first line to this (I removed the {{}}):
懶洋洋 [01 lan3 yang2 yang2] malvigla; langvora

Now 懶洋洋 is not in the index. So I think the BOM must be causing a problem.
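To check the BOM theory, here is a small Python sketch of what may be happening. It assumes the file starts with U+FEFF and that the first key is taken up to the first space; both are assumptions, and `strip_bom` is a hypothetical helper, not something from DictionaryForMIDs.

```python
BOM = "\ufeff"  # U+FEFF, the byte order mark as a decoded character


def strip_bom(line):
    """Drop a leading BOM so it cannot become part of the first key."""
    return line[1:] if line.startswith(BOM) else line


first_line = BOM + "懶洋洋 [01 lan3 yang2 yang2] malvigla; langvora"

# The BOM is invisible when printed, so repr() is used to expose it.
print(repr(first_line.split(" ")[0]))             # key still carries the BOM
print(repr(strip_bom(first_line).split(" ")[0]))  # clean key
```

If the key really is stored with the invisible BOM in front, a search for plain 懶洋洋 would not match it. In Python, reading the file with `encoding="utf-8-sig"` removes the BOM while decoding, so the first line would not need special handling.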

Quote

With
dictionaryGenerationLanguage1ExpressionSplitString: /
the index should look like
馬虎 1-152-B
马虎 1-152-S

Yes, the index should look like that. I added the SplitString, but the index still looks like this:
馬虎 / 马虎 1-90-B
马虎 1-90-S

At some point I will exclude the BOM character from the index (I will do it for all dictionaries, not only for the Chinese one). But there is no reason to wait for this, because the indexing algorithm also works fine with the BOM character in it.

Quote

Yes, the index should look like that. I added the SplitString, but the index still looks like this:
馬虎 / 马虎 1-90-B
马虎 1-90-S

I will also look at that when I get a chance. It will likely take a few weeks though, sorry for that.