Author: 
Machine Translation team of the European Parliament's Directorate-General for Translation (DGTRAD)
Description: 
The Digital Corpus of the European Parliament (DCEP) contains the majority of the documents published on the European Parliament's official website. It comprises a variety of document types, from press releases to session and legislative documents related to European Parliament's activities and bodies.
Resource type: 
corpus
Resource availability: 
available for commercial use
available for research purposes
free
Can the resource be directly downloaded?: 
Yes
Modality: 
text
Production date: 
2001-2012
Format explanation: 
DCEP is available as full-text documents and as sentence-aligned data. DCEP includes alignment information for the full documents, as well as for sentences, produced separately for each language pair. DCEP is accompanied by tools that allow to produce sentence-aligned corpora separately for each of the 276 language pairs. The sentence-aligned data is in plain text format, i.e. XML/TMX output is not supported.