Nouns are not correctly recognized in Dutch: Replies
IBM Connections - Discussion Forum
https://www.ibm.com/developerworks/community/forums/atom/replies?topicUuid=77777777-0000-0000-0000-000014958936
2013-03-26T09:59:35.495Z

Re: Nouns are not correctly recognized in Dutch
SystemAdmin, 2013-03-26T09:59:35.495Z
If you want compound words to appear as single tokens in the rules editor so that you can write complex rules on top of them, I suggest creating a rule that annotates any sequence of nouns carrying the feature "isConnectedToPrevious" as a "CompoundNoun", and then using this new type in subsequent rules. This is a bit like creating a shallow parsing grammar for your model.

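To make the idea of the compound-noun rule concrete, here is a minimal sketch in Python. It is not ICA rule syntax and does not use any ICA API: the `Token` and `CompoundNoun` classes, the `pos` values, and the `is_connected_to_previous` field are hypothetical stand-ins for the token annotations and the "isConnectedToPrevious" feature mentioned above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Token:
    text: str
    pos: str  # assumed part-of-speech label, e.g. "noun" or "verb"
    # Hypothetical stand-in for the "isConnectedToPrevious" feature.
    is_connected_to_previous: bool = False


@dataclass
class CompoundNoun:
    text: str         # the rejoined surface form
    parts: List[str]  # the decomposed noun parts


def annotate_compound_nouns(tokens: List[Token]) -> List[CompoundNoun]:
    """Group runs of connected noun tokens into CompoundNoun annotations."""
    compounds: List[CompoundNoun] = []
    current: List[Token] = []

    def flush() -> None:
        # Only a run of two or more nouns counts as a compound.
        if len(current) > 1:
            parts = [t.text for t in current]
            compounds.append(CompoundNoun("".join(parts), parts))
        current.clear()

    for tok in tokens:
        if tok.pos == "noun" and tok.is_connected_to_previous and current:
            # This noun continues the run started by the previous noun.
            current.append(tok)
        else:
            flush()
            if tok.pos == "noun":
                # A noun that is not connected starts a new candidate run.
                current.append(tok)
    flush()
    return compounds


tokens = [
    Token("het", "det"),
    Token("oog", "noun"),
    Token("segment", "noun", is_connected_to_previous=True),
    Token("is", "verb"),
]
print(annotate_compound_nouns(tokens))
```

Later rules would then match on the `CompoundNoun` annotations instead of the individual noun tokens, which is the shallow-parsing layer described above.
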
If you really want to turn off decomposition, that is an advanced usage, and you need to contact IBM through the support channel from which you bought the ICA license.

Turning off decomposition is possible, but please be aware of the side effects, mainly that part-of-speech tagging precision may degrade.

Re: Nouns are not correctly recognized in Dutch
SystemAdmin, 2013-03-25T15:00:43.579Z
Is there a way to switch off the "decomposition paradigm"? Working with a custom dictionary will be a hell of a job, because we will have a lot of "compound" words.

Re: Nouns are not correctly recognized in Dutch
SystemAdmin, 2013-03-20T14:54:11.326Z
This is due to the decomposition paradigm, which decomposes "oogsegment" into "oog" and "segment".
You can add this type of compound word to a custom dictionary and use it in your model if you need to.
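
As a rough illustration of why a custom dictionary entry helps, the toy Python sketch below splits a word into two known parts unless the whole word is listed in a custom dictionary. This is only an assumption about the general idea, not ICA's actual decomposition algorithm; the `decompose` function, the `lexicon` set, and the `custom_dictionary` set are all hypothetical.

```python
from typing import List, Set


def decompose(word: str, lexicon: Set[str], custom_dictionary: Set[str]) -> List[str]:
    """Toy two-part decompounder in which custom dictionary entries win."""
    # A word listed in the custom dictionary is kept as a single token.
    if word in custom_dictionary:
        return [word]
    # Otherwise try every split point and accept the first pair of known words.
    for i in range(1, len(word)):
        head, tail = word[:i], word[i:]
        if head in lexicon and tail in lexicon:
            return [head, tail]
    # No split found: leave the word untouched.
    return [word]


lexicon = {"oog", "segment"}
print(decompose("oogsegment", lexicon, set()))
print(decompose("oogsegment", lexicon, {"oogsegment"}))
```

The first call splits the word into its two parts, while the second keeps it whole because the custom dictionary takes precedence.
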