Recent Achievements

Jianguo, Kirk and Chris have a paper at AACL (American
Association for Corpus Linguistics, 2008)

Chris is the local arrangements chair for ACL 2008, in Columbus.

Anton has filed his thesis, graduated, and has a postdoctoral
research position at CASL

Anna has filed her thesis and moved to a tenure-track job at
Montclair State University.

Chris was the chair for NLP of HLT-EMNLP 2005, which is one of
the largest and most prestigious conferences in language technology,
with about 400 participants and 130 tightly-refereed research
presentations.

Jihyun had a paper on her psycholinguistic work accepted for
both the main session and the student research workshop of COLING/ACL
2006

Jianguo had a paper on lexical acquisition from sparse data
accepted for both the main session and the student research workshop of
COLING/ACL 2006

Papers in proceedings

Peer reviewed

Brew, Chris 1992. Letting the Cat out of the Bag: generation
for shake-and-bake MT. In Proceedings of the 14th International
Conference on Computational Linguistics, pages 610–616, Nantes, France.

Brew, Chris 1995. Stochastic HPSG. In Proceedings of the 7th
Conference of the European Chapter of the Association for Computational
Linguistics, pages 83–89, Dublin, Ireland, March 28–31. University
College.

Brew, Chris and David McKelvie. 1996. Word-pair extraction
for lexicography. In Kemal Oflazer and Harold Somers, editors,
Proceedings of the Second International Conference on New Methods in
Language Processing, pages 45–55, Ankara, September. Bilkent University.

Brew. Chris and Markus Dickinson and W. Detmar Meurers 2005
“Language and Computers”: Creating an Introduction for a General
Undergraduate Audience Proceedings of the Workshop on “Effective Tools
and Methodologies for Teaching Natural Language Processing And
Computational Linguistics” 43rd Annual Meeting of the Association for
Computational Linguistics (ACL-05) Ann Arbor, MI, USA

Feldman, Anna, Jiri Hana, and Chris Brew 2006 A
cross-language approach to rapid creation of new morpho-syntactically
annotatedresources. In Proceedings of the fifth International
conference on Language Resources and Evaluation (LREC 2006) . Genoa,
Italy

Feldman, Anna, Jiri Hana, and Chris Brew. 2005 Buy one, get
one free or what to do when your linguistic resources are limited”. To
appear in Proceedings of the third international seminar on Computer
Treatment of Slavic and East-European Languages (Slovko 2005).
Bratislava, Slovakia.

Jiri Hana, Anna Feldman, Luiz Amaral, and Chris Brew. 2006
Tagging Portuguese with a Spanish Tagger Using Cognates. In Proceedings
of the Workshop on Cross-language Knowledge Induction hosted in
conjunction with the 11th Conference of the European Chapter of the
Association for Computational Linguistics (EACL-2006). Trento, Italy.

McDonald, Scott and Chris Brew. 2001. A rational analysis of
semantic processing by the left cerebral hemisphere. In First Workshop
on Cognitively Plausible Models of Semantic Processing, (5 pages),
Edinburgh, July.

Wong, Wai Yi Peggy, Chris Brew, Shui-Duen Chan, and
Mary Beckman. 2002. Using the Segmentation Corpus to define an
inventory of concatenative units for Cantonese speech synthesis”. In
Proceedings of the First SIGHAN Workshop on Chinese Language Processing
in conjunction with the 19th International Conference on Computational
Linguistics 2002. (Papers presented at the First SIGHAN Workshop on
Chinese Language Processing), pages 119–123. Taipei, Taiwan.

Bard, E. G., Cooper, L., Kowtko, J. C., and Brew,
C. (1991). Psycholinguistic studies on the incremental recognition of
speech: A revised and extended introduction to the messy and the
sticky. DYANA Report R1.3.B, University of Edinburgh.

Broad coverage dictionaries and
ontologies for natural language processing (NLP) are difficult and
costly to create and maintain by hand. It is therefore desirable to
learn them from distributional information, such as can be obtained
from unlabeled or sparsely labeled text corpora. Many linguistic and
psycholinguistic theories are distributional, but emphasize local
neighborhood structure more than do previous NLP approaches.
Successful visualization techniques such as keyword-in-context also
rely on the preservation of neighborhood structure. A similar
emphasis is present in emerging techniques for data reduction, such
as LLE and min-cut algorithms. While the immediate goal of the
project is to gain a better understanding of lexical tuning and
acquisition, the resulting dictionaries, ontologies and mapping
techniques have the potential to help information professionals (such
as librarians, translators, patent examiners and paralegal
researchers) to navigate through corpora, to understand the
significance of the data that they see, and to incorporate insights
derived from the data into their working practice.