| FIRST USE: Lexicon based match: we chose a very simple metric: matching between words in T and H based on a path of distance at most 2 in the WordNet graph, using any links (hyponymy, hypernymy, meronymy, pertainymy, etc.)<br/>

| FIRST USE: Antonymy relation to detect contradiction. In order to broaden the domain of the antonymy relation, we consider a combination of synonyms and antonyms. Used in combination with VerbOcean.<br/>

+

SECOND USE: Synonymy, hyponymy and hypernymy for nouns and adjectives. Used in combination with eXtended WordNet relations.

| When using WordNet, we assume that a term is semantically interchangeable with its exact occurrence, its synonyms, and its hypernyms. In extracting hypernyms, we exclude the hypernyms that are more distant than two links to the original terms in WordNet synsets.

+

| Two ablation tests performed. The first for Wordnet alone, the second for both WordNet and Framenet. Null impact of the resource(s) on two-way task for both ablations.

| No precise evaluation of the resource has been carried out. In our second run we used a combined system (EDITSneg + EDITSallbutneg), and we had an improvement of 0.6% in accuracy with respect to the first run in which only EDITSneg was used. EDITSallbutneg exploits lexical similarity (WordNet similarity), but we can’t affirm with precision that the improvement is due only to the use of WordNet

+

| No separate evaluation

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

−

| UIUC

+

| DLSIUAES

−

| RTE3

−

|

−

| Semantic distance between words

−

|

−

|- bgcolor="#ECECEC" align="left"

−

| AUEB

| RTE4

| RTE4

|

|

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

−

| DLSIUAES

+

| EMORY

| RTE4

| RTE4

|

|

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

−

| EMORY

+

| FbkIrst

| RTE4

| RTE4

−

|

+

| 3.0

−

|

+

| Lexical similarity

−

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

| No precise evaluation of the resource has been carried out. In our second run we used a combined system (EDITSneg + EDITSallbutneg), and we had an improvement of 0.6% in accuracy with respect to the first run in which only EDITSneg was used. EDITSallbutneg exploits lexical similarity (WordNet similarity), but we can’t affirm with precision that the improvement is due only to the use of WordNet

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| FSC

| FSC

Line 86:

Line 233:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| IIT

| IIT

Line 92:

Line 240:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| IPD

| IPD

Line 98:

Line 247:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| OAQA

| OAQA

Line 104:

Line 254:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| QUANTA

| QUANTA

Line 110:

Line 261:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| SAGAN

| SAGAN

Line 116:

Line 268:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| Stanford

| Stanford

Line 122:

Line 275:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| UAIC

| UAIC

Line 127:

Line 281:

|

|

|

|

−

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

| Ablation test performed: +3% precision on two-way task.

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| UMD

| UMD

Line 134:

Line 289:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| UNED

| UNED

Line 140:

Line 296:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| Uoeltg

| Uoeltg

Line 146:

Line 303:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

|- bgcolor="#ECECEC" align="left"

|- bgcolor="#ECECEC" align="left"

| UPC

| UPC

Line 152:

Line 310:

|

|

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

| ''Data taken from the RTE4 proceedings. Participants are recommended to add further information.''

+

+

|- bgcolor="#ECECEC" align="left"

+

| AUEB

+

| RTE3

+

| 2.1

+

| Synonymy resolution

+

| Replacing the words of H with their synonyms in T: on RTE3 data sets 2% improvement

FIRST USE. Impact of the resource on two-way task: -0.5%/+1% accuracy respectively on run1 and run2.
SECOND USE. Impact of the resource on two-way task: +1.33%/-0.33% accuracy respectively on run1 and run2.

QUANTA

RTE5

Several relations from wordnet, such as synonyms, hyponym, hypernym et al.

FIRST USE: Lexicon based match: we chose a very simple metric: matching between words in T and H based on a path of distance at most 2 in the WordNet graph, using any links (hyponymy, hypernymy, meronymy, pertainymy, etc.)

When using WordNet, we assume that a term is semantically interchangeable with its exact occurrence, its synonyms, and its hypernyms. In extracting hypernyms, we exclude the hypernyms that are more distant than two links to the original terms in WordNet synsets.

Two ablation tests performed. The first for Wordnet alone, the second for both WordNet and Framenet. Null impact of the resource(s) on two-way task for both ablations.

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

EMORY

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

FbkIrst

RTE4

3.0

Lexical similarity

No precise evaluation of the resource has been carried out. In our second run we used a combined system (EDITSneg + EDITSallbutneg), and we had an improvement of 0.6% in accuracy with respect to the first run in which only EDITSneg was used. EDITSallbutneg exploits lexical similarity (WordNet similarity), but we can’t affirm with precision that the improvement is due only to the use of WordNet

FSC

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

IIT

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

IPD

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

OAQA

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

QUANTA

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

SAGAN

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

Stanford

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

UAIC

RTE4

Ablation test performed: +3% precision on two-way task.

UMD

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

UNED

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

Uoeltg

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

UPC

RTE4

Data taken from the RTE4 proceedings. Participants are recommended to add further information.

AUEB

RTE3

2.1

Synonymy resolution

Replacing the words of H with their synonyms in T: on RTE3 data sets 2% improvement