A domain categorisation of vocabularies based on a deep learning classifier.

Nogales Moyano, Alberto; Sicilia, Miguel Ángel; García Tejedor, Álvaro José

doi:10.1177/01655515211018170

A domain categorisation of vocabularies based on a deep learning classifier.

A domain categorization of vocabularies based on a Deep Learning classifier - EDITED (Copia en conflicto de ceiecubuntu 2018-12-13).pdf (303.55 KB)

Identifiers

URI: https://hdl.handle.net/10641/3129

ISSN: 0165-5515

DOI: 10.1177/01655515211018170

Publication date

2021

Authors

Nogales Moyano, Alberto

Sicilia, Miguel Ángel

García Tejedor, Álvaro José

Metrics

Share

Export

Abstract

The publication of large amounts of open data has become a major trend nowadays. This is a consequence of pro-jects like the Linked Open Data (LOD) community, which publishes and integrates datasets using techniques like Linked Data. Linked Data publishers should follow a set of principles for dataset design. This information is described in a 2011 document that describes tasks as the consideration of reusing vocabularies. With regard to the latter, another project called Linked Open Vocabularies (LOV) attempts to compile the vocabularies used in LOD. These vocabularies have been classified by domain following the subjective criteria of LOV members, which has the inherent risk introducing personal biases. In this paper, we present an automatic classifier of vocabularies based on the main categories of the well-known knowledge source Wikipedia. For this purpose, word-embedding models were used, in combination with Deep Learning techniques. Results show that with a hybrid model of regular Deep Neural Network (DNN), Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN), vocabularies could be classified with an accuracy of 93.57 per cent. Specifically, 36.25 per cent of the vocabularies belong to the Culture category.

Keywords

Linked Data, Deep Learning, Document Categorisation

Collections

ESCUELA POLITÉCNICA SUPERIOR

Full item page

Depósito Digital UFV

A domain categorisation of vocabularies based on a deep learning classifier.

Identifiers

Publication date

Start date of the public exhibition period

End date of the public exhibition period

Authors

Advisors

Journal Title

Journal ISSN

Volume Title

Publisher

Metrics

Share

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Doctoral program

Description

Keywords

Citation

Collections