Source name: 
Author: 
University of Zagreb - Natural Language Processing Group
Description: 
The parallel document pair candidates were automatically extracted from the hrWaC corpus, a web corpus collected from the .hr top-level domain.
Resource type: 
corpus
Resource availability: 
available for commercial use
available for research purposes
free
Can the resource be directly downloaded?: 
Yes
Modality: 
text
Format: 
Size: 
2 languages, total number of files: 2 total number of tokens: 4.96M total number of sentence fragments: 0.20M
Production date: 
2009