Source name:
Description:
A French-Dutch parallel corpus of 5 million sentence pairs in the legislative domain, of which 2.4 million are unique. It was produced by downloading official documents from the online version of the Belgisch Staatsblad/Moniteur Belge (the official publication of the Belgian authorities) and automatically aligning the sentences of documents available in both languages. The corpus covers documents published between 1997 and 2006.
Language(s):
French, Dutch
Resource link:
Name of contact person:
Resource type:
corpus
Resource availability:
available for research purposes
free
Can the resource be directly downloaded?:
No
Modality:
text
Size:
2.4 million unique sentence pairs
Production date:
2010
Format explanation:
Original documents in HTML. Pairs of files with French and Dutch sentences, in which equivalent sentences share the same line number. Sentence pairs in TMX (Translation Memory eXchange) format.
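The two sentence-level formats described above could be read roughly as follows. This is a minimal sketch, not the corpus's own tooling: the function names (read_aligned, unique_pairs, read_tmx), the language codes "fr" and "nl", and the sample sentences are all assumptions made for illustration, and the record does not say how the TMX files are actually tagged. Deduplication here means exact matching on both sides of a pair, which may differ from how the 2.4 million unique pairs were counted.

```python
import io
import xml.etree.ElementTree as ET

# TMX 1.4 marks the language of each <tuv> with xml:lang; ElementTree
# exposes that attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Made-up TMX fragment; the language codes are assumed, not taken from the corpus.
SAMPLE_TMX = b"""<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header/>
  <body>
    <tu>
      <tuv xml:lang="fr"><seg>Article 2.</seg></tuv>
      <tuv xml:lang="nl"><seg>Artikel 2.</seg></tuv>
    </tu>
  </body>
</tmx>"""


def read_aligned(fr_lines, nl_lines):
    """Pair line i of the French input with line i of the Dutch input."""
    fr = [line.rstrip("\n") for line in fr_lines]
    nl = [line.rstrip("\n") for line in nl_lines]
    if len(fr) != len(nl):
        raise ValueError("inputs are not line-aligned")
    return list(zip(fr, nl))


def unique_pairs(pairs):
    """Drop exact duplicate pairs, keeping the first occurrence of each."""
    return list(dict.fromkeys(pairs))


def read_tmx(source):
    """Yield one {language: text} dict per <tu> element of a TMX document."""
    root = ET.parse(source).getroot()
    for tu in root.iter("tu"):
        segments = {}
        for tuv in tu.iter("tuv"):
            # Older TMX versions use a plain "lang" attribute instead.
            lang = tuv.get(XML_LANG) or tuv.get("lang")
            seg = tuv.find("seg")
            if lang and seg is not None:
                segments[lang] = "".join(seg.itertext())
        yield segments


if __name__ == "__main__":
    # Made-up sentences standing in for a pair of line-aligned files.
    fr_file = io.StringIO("Article 1er.\nArticle 1er.\nArticle 2.\n")
    nl_file = io.StringIO("Artikel 1.\nArtikel 1.\nArtikel 2.\n")
    pairs = read_aligned(fr_file, nl_file)
    print(len(pairs), "pairs,", len(unique_pairs(pairs)), "unique")
    for unit in read_tmx(io.BytesIO(SAMPLE_TMX)):
        print(unit)
```

With real data, read_aligned would take two open file handles instead of the in-memory strings, and read_tmx would take a file path. For TMX files of this size, a streaming parser such as ET.iterparse would likely be a better fit than loading the whole tree.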