TY - JOUR AB - There is a range of large knowledge bases, such as Freebase and DBpedia, as well as linked data sets available on the web, but they typically lack lexical information stating how the properties and classes they comprise are realized lexically. Often only one label is attached, if at all, thus lacking rich linguistic information, e.g. about morphological forms, syntactic arguments or possible lexical variants and paraphrases. While ontology lexicon models like lemon allow for defining such linguistic information with respect to a given ontology, the cost involved in creating and maintaining such lexica is substantial, requiring a high manual effort. Towards lowering this effort we present ATOLL, a framework for the automatic induction of ontology lexica, based both on existing labels and dependency paths extracted from a text corpus. We instantiate ATOLL\ with respect to DBpedia\ as dataset and Wikipedia as corresponding corpus, and evaluate it by comparing the automatically generated lexicon with a manually constructed one. Our results clearly corroborate that our approach shows a high potential to be applied in a semi-automatic fashion in which a lexicon engineer can validate, reject or refine the automatically generated lexical entries, thus having a clear potential to contributing to the reduction the overall cost of creating ontology lexica. DA - 2014 DO - 10.1016/j.datak.2014.09.003 KW - natural language processing LA - eng M2 - 148 PY - 2014 SN - 0169-023X SP - 148-162 T2 - Data & Knowledge Engineering TI - ATOLL - A framework for the automatic induction of ontology lexica UR - https://nbn-resolving.org/urn:nbn:de:0070-pub-26958695 Y2 - 2024-11-21T23:03:16 ER -