Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/24292
Title: Resources for Machine Translation of the Macedonian Language
Authors: Stolić, Milosh; Zdravkova, Katerina
Keywords: Natural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical Analysis, Language Resources
Issue Date: 2009
Conference: ICT Innovations 2009
Abstract: This paper focuses on creating new linguistic resources for the Macedonian language. It presents a new parallel corpus of the Macedonian and Serbian languages, built around the digitized version of George Orwell's "1984" developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, which are published in the public domain. The paper describes the retrieval, conversion, preprocessing, filtering, and sentence alignment of the corpus, then discusses and evaluates the alignment results.
URI: http://hdl.handle.net/20.500.12188/24292
Appears in Collections: Faculty of Computer Science and Engineering: Conference papers
Files in This Item:
File | Description | Size | Format
---|---|---|---
Resources_for_Machine_Translation_of_the_Macedonia.pdf | | 117.94 kB | Adobe PDF