Towards electronic lexicography for the Kurdish language

Ahmadi, Sina; Hassani, Hossein; McCrae, John P.

View/Open

ahmadi2019kurdishlex.pdf (944.2Kb)

Date

2019-10-01

Author

Ahmadi, Sina

Hassani, Hossein

McCrae, John P.

Metadata

Show full item record

Usage

This item's downloads: 370 (view details)

Recommended Citation

Ahmadi, Sina, Hassani, Hossein & McCrae, John P. (2019). Towards electronic lexicography for the Kurdish language. Paper presented at the eLex 2019 (sixth biennial conference on electronic lexicography), Sintra, Portugal, 01-03 October.

Published Version

https://elex.link/elex2019/proceedings-download/

Abstract

This paper describes the development of lexicographic resources for Kurdish and provides a lexical model for this language. Kurdish is considered a less-resourced language, and currently, lacks machine-readable lexical resources. The unique potential which Linked Data and the Semantic Web offer to e-lexicography enables interoperability across lexical resources by elevating the traditional linguistic data to machine-processable semantic formats. Therefore, we present our lexicon in Ontolex-Lemon ontology as a standard model for sharing lexical information on the Semantic Web. The research covers the Sorani, Kurmanji, and Hawrami dialects of Kurdish. This research suggests that although Kurdish is a less-resourced language, in terms of documented lexicons, it has a wide range of resources, but because they are not machine-readable they could not contribute to the language processing. The outcome of this project, which is made publicly available, assists scholars in their efforts towards making Kurdish a resource-rich language.

URI

http://hdl.handle.net/10379/15513

Collections

Data Science Institute (Conference Papers)

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland