Research Resources

The Survey of English Usage carries out research
in English language Corpus Linguistics. We construct corpora, develop
tools and methodologies, and carry out original research into the
English language itself. Our recent and current research is summarised
on our Research Projects pages.
This section of our website is concerned with the dissemination
of reference material and other products of our research.

Parsed Corpora

The Research Projects part of the site summarises two major parsed
corpora of English.

ICE-GB is the British
Component of the International Corpus of English, containing samples
of written and spoken contemporary (early 1990s) English. ICE-GB
is now available in a Release 2 version with updated software and
optionally, aligned digital audio.

DCPSE is the new parsed
corpus of spoken English, containing samples from the late 1960s
to early 1990s.

Grammatical Query Methodologies

The FTF pages describe Fuzzy Tree
Fragments in some detail as well as explain how they can be used
in carrying out experiments in
grammar using ICECUP. Supporting linguistic experimentation in software,
using FTFs, is the subject of a research
project.

Sean Wallis has also published a number of articles on statistical
methods for corpus linguists on his blog, corp.ling.stats.