tadej: is currently updating its-tools, looking at use of non-its annotations

16:50:56 [tadej]

daveL: right now we have a mechanism to identify which data category it applies to, allowing for user-defined names

16:51:59 [tadej]

daveL: ... since you're borrowing the mechanism anyway, you're out of conformance anyway

16:52:13 [tadej]

daveL: we could remove it, since we don't have a formal extension mechanism

16:54:11 [Marcis]

I hear you, I just cannot say anything

16:54:14 [tadej]

tadej: if we define a per-datacategory confidence attribute, how to express multi-valued attributes?

16:54:54 [Marcis]

I mean, if the domains are automatically identified, then you will have a confidence (if the systems will return probabilistic results)

16:55:22 [Marcis]

As tadej said - the weighted mechanism says that there is a confidence

16:56:16 [tadej]

tadej: It boils down to whether that number is useful for the consumer

16:57:38 [Marcis]

The categories (not in exact names...) that I see requiring the confidence are: MT, Terminology, Domain segmentation tools (are there any currently used by the MT use cases?), Named Entity Recognition (currently in Disambiguation, right?), others (?)

16:58:03 [tadej]

action: daveL to ask for use cases of data category-specific confidence scores

16:58:03 [trackbot]

Created ACTION-281 - Ask for use cases of data category-specific confidence scores [on David Lewis - due 2012-11-12].

16:59:06 [Ankit]

w.r.t. confidence scores in MT, they are mainly used in a post-editing environment, i.e. when a human translator uses these scores to determine which outputs of an MT system they want to correct.

17:00:35 [tadej]

tadej: disambiguation can produce scores, but they are not commonly used

17:02:33 [tadej]

daveL: its:tools has its own element, the its:standOffList - we should describe how it works within a script element, so it's as similar as possible to the XML markup.