Notes

Earlier versions

Earlier version of the Sequoia corpus (before deep syntax annotation) are described here and are available for download here.

Version numbers

In the 2015 release, the version number was set to 7.0 to be align with previous release of the Sequoia corpus (before introduction of deep-dependencies).

UD Versions

Since 2017, the Sequoia corpus (surface only) is also available in Universal Dependency format and is released as one of the UD corpora named
UD_French-Sequoia.

Encoding of fixed expressions

In version 7.0 and previous, fixed expressions are encoded in a single token with _ symbol as a word separator.
Since version 8.0 these expressions are represented by several tokens linked with dep_cpd relations.
For instance, in the two figures below, the sentence annodis.er_00106 is given with its annotation in Sequoia 7.0 and 8.0.