Gaois research group

Parallel corpus

About the parallel corpus

This site is a search engine for an aligned parallel corpus of legislative texts. The corpus contains c. 54.8 million words: c. 28.1 million in Irish and c. 26.7 million in English. Terms and phrases can be searched in either English or Irish, and the search returns a list of every segment in the corpus that contains the term or phrase. The user can then view these segments side by side with their equivalent segments in the other language.
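The search described above can be pictured with a minimal sketch. It is an illustration only, not the site's actual implementation: the `AlignedSegment` type, the `search_segments` function, the case-insensitive substring matching and the two sample sentences are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlignedSegment:
    """One aligned pair (hypothetical type): an Irish segment and its English equivalent."""
    ga: str  # Irish text
    en: str  # English text


def search_segments(corpus, phrase, lang):
    """Return every aligned pair whose `lang` side contains `phrase`.

    Matching is a case-insensitive substring test, a simplification
    chosen for illustration rather than the site's real matching rules.
    """
    if lang not in ("ga", "en"):
        raise ValueError("lang must be 'ga' or 'en'")
    needle = phrase.casefold()
    return [seg for seg in corpus if needle in getattr(seg, lang).casefold()]


# Hypothetical sample data, not taken from the corpus.
corpus = [
    AlignedSegment(
        ga="Tiocfaidh an Rialachán seo i bhfeidhm.",
        en="This Regulation shall enter into force.",
    ),
    AlignedSegment(
        ga="Beidh feidhm ag an Treoir seo.",
        en="This Directive shall apply.",
    ),
]

# Show each matching segment side by side with its equivalent.
for seg in search_segments(corpus, "regulation", "en"):
    print(f"{seg.en} | {seg.ga}")
```

Because each pair is stored as a single record, a match on one language side always carries its equivalent segment with it, which is what makes the side-by-side view possible.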

The corpus was created as an internal resource for the editorial staff of the Gaois research group, Fiontar & Scoil na Gaeilge, to facilitate terminological work on the LEX project. Permission was received from the European Commission and from the Government of Ireland to make aligned segments from the following available to the public:

European Union Legislation (Regulations and Directives): Irish-English aligned material from European Union Regulations and Directives.

Other non-legislative European documents: Non-legislative documents from the European Union (preparatory acts and other non-legislative texts).

Constitution of Ireland: 1937 Constitution.

Acts of the Oireachtas: Primarily comprising acts from the period 1922-2003, along with a selection of acts aligned in subsequent years.

Order of Business (Dáil Éireann/Seanad Éireann): Non-legislative documents from the Translation Section, Office of the Houses of the Oireachtas. Primarily comprising press releases, titles, motions, resolutions, financial resolutions and statements.