Ondřej Svoboda — Curriculum Vitae (česky)
computational linguist, programmer, hacker, creator
Personal information
- Born
- 1989
- Nationality
- Czech
- my first name; at this domain
- Driver’s licence
- cars up to 3.5 tonnes
Education
- Alma mater
- Masaryk University, Brno, Czech Republic
- personal page
- Field of study
- Czech Language with Orientation on Computational Linguistics
- at the Faculty of Arts (in cooperation with the Faculty of Informatics)
- Master’s degree
- 09/2014 — 02/2017
- thesis: Corpus tagging accuracy as affected by changes to its rich tagset (in English)
- Erasmus+
- 09/2015 — 02/2016
- Department of Russian & Slavonic Studies, University of Sheffield, United Kingdom
- Bachelor’s degree
- 09/2011 — 06/2014
- thesis: Automatic Recognition of Czech Word-order Positions and Elements of Functional Sentence Perspective (in Czech)
- papers: An Experiment with Theme–Rheme Identification, Semi-automatic Theme-Rheme Identification, both with Karel Pala
- Other studies attempted
- 09/2009 — 06/2011 (bachelor’s, not completed)
- Artificial Intelligence and Natural Language Processing at the Faculty of Informatics
Jobs and references
- open-source
- GitHub profile
- Red Hat (oVirt project) – improvements and fixes to a (network) configuration daemon for hypervisors; unit and system testing
- Master’s thesis:
tagsetbench
source code (MIT licensed) - contributor to OSD Lyrics and maintainer of the Arch Linux package
- Czech translation of OpenTTD (regularly until 2009) and lrcShow-X
- Music on Console patches: MPRIS support, and a few more
- bug reporter: helped make Altap Salamander work under Wine
- Jablocom – design and implementation of a statistical framework (frequency distribution, over time)
- Heuréka.cz – heuristics for assessing the quality of a product’s description; refactoring and improvement of a crawler and a parser
- Tycoonez.com:munity – programmed TTD and OpenTTD sites (frontend and backend)
- Svoboda Fonservis – various applications for Windows, current temperature gadget, repair status
Knowledge and skills
- Computer science
- efficient data structures, grasp of Prolog and the functional programming paradigm, principles of speech recognition and synthesis, basics of digital signal and image processing
- Linguistics
- corpus linguistics using CQL, basic knowledge of internals of a corpus manager, ability to create own tools, grasp of syntactic analysis frameworks, interest in morphology and development of languages
- Natural language processing
- batch processing of text corpora, chart-based syntactic analysis (including a basic Czech grammar designed with the concept of word-order positions in mind), derivational morphology
- Development
- Python (data processing, automation, testing, GUI), AJAX, SQL, C#/.NET, advanced regular expressions, user-space C/C++ and Win32 (basics), Visual Basic 6, PHP
- Computing environment capability
- keyboard-centric power user of Arch Linux, seasoned user of Git
- Other
- private teaching (linguistics, programming)
Personality
- insightful, thorough
Languages
- Czech; Slovak
- native; very advanced
- English
- very advanced
- French
- intermediate (but mostly passive)
- Russian, Polish, German
- beginner
- all of the world languages
- interested!
Interests
- programming, development of convenience tools (for time and knowledge management)
- development, etymology, phonology, morphology, syntax, semantics, and scripts of the world languages
- natural language understanding (someday)
- biking, walking, trekking, parkour
- singing, just enjoying a piano or a guitar
- occasional contributing to Wikipedia and Wiktionary
Last updated on 10 March 2017.