Some OntoLex-Lemon

Originally posted as part of the CMPE58H course by Susan Üsküdarlı in the fall semester of 2022 at https://medium.com/@furkanakkurt7642/some-ontolex-lemon-798779392f0a.


OntoLex-Lemon1 is a model for representing lexicographical ontologies, lemon being short for lexicon model for ontologies. It was developed because ontology languages, OWL and RDF(S), didn’t allow furnishing ontologies with linguistic information so a resource like WordNet wasn’t possible. They mention in their community report that “OWL and RDF(S) rely on a property [named] rdfs:label” to show a relation between a vocabulary element and its lexicalization. This single property is not in the least sufficient for a human language.1

Thanks to OntoLex-Lemon, WordNet has powered so many NLP applications of English and made them possible. Many languages lack a wordnet as creation of one requires manual annotation of thousands upon thousands entries as well as consistent and accurate lexical data collection. Because human languages are results of the human brain, they share a lot in common and all of their needs can be fulfilled by a single lexicon model. Also, the OntoLex-Lemon model is open to extensions and its vocabulary can be augmented on an application basis. Human languages are very similar but since linguistic theories vary, the model has been made sure to be abstracted from specific theories.1

Core Model

A representative image of the model is provided in their document, which I’ve provided just above. In the image, boxes represent classes, filled arrows represent object properties and non-filled arrows are for subclass relations. Main class of the model is Lexical Entry which is further specialized into three: word, multiword expression and an affix (prefix, suffix, etc.). For a word example, we can represent the word cat in Turtle RDF Syntax as the triple: :cat a ontolex:Word.1

For more information, please see the community report, dated 5/10/2016.1

References

  1. https://www.w3.org/2016/05/ontolex/