This is a list of my academic publications.


Hanneforth, T., Kapanadze, O. and Kotzé, G. Accepted. Applying Computer Technologies to the Georgian Language: From a Treebank to a Syntactic Parser [Abstract]. To be presented at the Thirteenth International Tbilisi Symposium on Language, Logic and Computation, Batumi, Georgia, 16-20 September 2019.


Kotzé, G., Vandeghinste, V., Martens, S. and Tiedemann, J. 2017. Large aligned treebanks for syntax-based machine translation. Language Resources and Evaluation. Vol. 51(2). Springer. doi:10.1007/s10579-016-9369-0. [Abstract] [BibTeX]

Kotzé, G. and Wolff, F. 2017. Developing and evaluating a pipeline for Setswana OCR. Proceedings of the 2017 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), pp. 236-241. IEEE. [Abstract] [BibTeX]


Kotzé, G. 2016. Refining semi-automatic parallel corpus creation for Zulu to English statistical machine translation. In Proceedings of the 2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), pp. 48-53. IEEE. [Abstract] [BibTeX]


Kotzé, G. and Wolff, F. 2015. Syllabification and parameter optimisation in Zulu to English machine translation. South African Computer Journal. No. 57, pp. 1-23. South African Institute of Computer Scientists and Information Technologists (SAICSIT). [Abstract] [BibTeX]


Wolff, F. and Kotzé, G. 2014. Experiments with syllable-based Zulu-English machine translation. In Puttkammer, M. and Eiselen, R. (eds.): Proceedings of the 2014 PRASA, RobMech and AfLaT International Joint Symposium. pp. 217-222. Cape Town, South Africa. [pdf] [Abstract] [BibTeX]

Kotzé, G. and Wolff, F. 2014. Experiments with syllable-based English-Zulu alignment. In Proceedings of the SaLTMiL Workshop on free/open-source language resources for the machine translation of less-resourced languages (at LREC 2014), pp. 7-11, Reykjavík, Iceland. [pdf] [Abstract] [BibTeX]


Kotzé, G. 2013. Complementary Approaches to Tree Alignment: Combining Statistical and Rule-Based Methods. PhD thesis. University of Groningen. [pdf] [Abstract] [BibTeX]

Vandeghinste, V., Martens, S., Kotzé, G., Tiedemann, J., Van den Bogaert, J., De Smet, K., Van Eynde, F., and Van Noord, G. 2013. Parse and Corpus-based Machine Translation. In Peter Spyns and Jan Odijk (eds.): Essential Speech and Language Technology for Dutch. pp. 305-319. Springer. [Abstract] [BibTeX]


Kotzé, G. 2012. Transformation-based tree-to-tree alignment. In Computational Linguistics in the Netherlands Journal. Vol. 2, pp. 71-96. [pdf] [Abstract] [BibTeX]

Kotzé, G., Vandeghinste, V., Martens, S. and Tiedemann, J. 2012. Large aligned treebanks for syntax-based machine translation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pp. 467-473, Istanbul, Turkey. [pdf] [Abstract] [BibTeX]


Kotzé, G. 2011. Finding Statistically Motivated Features Influencing Subtree Alignment Performance. In Bolette Sandford Pedersen, Gunta Nešpore and Inguna Skadiņa (eds.): Proceedings of the 18th Nordic Conference of Computational Linguistics NODALIDA 2011, May 11-13, Riga, Latvia. NEALT Proceedings Series, Vol. 11, pp. 332-335. Tartu: Tartu University Library. [pdf] [Abstract] [BibTeX]

Kotzé, G. 2011. Improving syntactic tree alignment through rule-based error correction. In Proceedings of ESSLLI 2011 Student Session, pp. 122-127, Ljubljana, Slovenia. [pdf] [Abstract] [BibTeX]

Kotzé, G. 2011. Rule-induced error correction of aligned parallel treebanks. In Proceedings of the International Conference "Corpus Linguistics - 2011", pp. 35-40, Saint Petersburg, Russia. [pdf] [Abstract] [BibTeX]

Vandeghinste, V., Van den Bogaert, J., Martens, S., and Kotzé, G. 2011. PaCo-MT: Parse and Corpus-based Machine Translation. In Forcada, M.L., Depraetere, H., and Vandeghinste, V. (eds.): Proceedings of the 15th International Conference of the European Association for Machine Translation, p. 347. Leuven, Belgium. [pdf] [Abstract] [BibTeX]


Tiedemann, J. and Kotzé, G. 2009. A Discriminative Approach to Tree Alignment. In Ilisei, I., Pekar, V. and Bernardini, S. (eds.): Proceedings of the International Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography and Language Learning (in connection with RANLP'09), pp. 33-39. Borovets, Bulgaria. [pdf] [Abstract] [BibTeX]

Tiedemann, J. and Kotzé, G. 2009. Building a Large Machine-Aligned Parallel Treebank. In Passarotti, M., Przepiórkowski, A., Raynaud, S. and Van Eynde, F. (eds.): Proceedings of the 8th International Workshop on Treebanks and Linguistic Theories (TLT'08), pp. 197-208. Milan, Italy. [pdf] [Abstract] [BibTeX]


Kotzé, G. 2008. Development of an Afrikaans wordnet: methodology and integration / Ontwikkeling van 'n Afrikaanse woordnet: metodologie en integrasie. Literator: Journal of Literary Criticism, Comparative Linguistics and Literary Studies: Human language technology for South African languages: Special Issue 1, Volume 29(1). pp. 163-184. [pdf] [Abstract] [BibTeX]


Kotzé, G. 2006. Building a WordNet for Afrikaans: Preliminary research in the form of an inquiry into the feasibility and optimal methodology for the development of a wordnet database. Master's thesis. Free University of Amsterdam. [pdf] [Abstract] [BibTeX]