Skip to main content

Research Repository

Advanced Search

A pilot investigation of Information Extraction in the semantic annotation of archaeological reports

Vlachidis, Andreas; Tudhope, Douglas

A pilot investigation of Information Extraction in the semantic annotation of archaeological reports Thumbnail


Authors

Douglas Tudhope



Abstract

The paper discusses a prototype investigation of semantic annotation, a form of metadata assigning conceptual entities to textual instances; in the case of archaeological grey literature. The use of Information Extraction (IE), a Natural Language Processing (NLP) technique, is central to the annotation process while the use of Knowledge Organization System (KOS) is explored for the association of semantic annotation with both ontological and terminological references. The annotation process follows a rule-based information extraction approach using the GATE NLP toolkit, together with the CIDOC CRM ontology, its CRM-EH archaeological extension and English Heritage thesauri and glossaries. Results are reported from an initial evaluation, which suggest that these information extraction techniques can be applied to archaeological grey literature reports. Further work is discussed drawing on the evaluation and consideration of the characteristics of the archaeology domain. Copyright © 2012 Inderscience Enterprises Ltd.

Citation

Vlachidis, A., & Tudhope, D. (2012). A pilot investigation of Information Extraction in the semantic annotation of archaeological reports. International Journal of Metadata, Semantics and Ontologies, 7(3), 222-235. https://doi.org/10.1504/IJMSO.2012.050183

Journal Article Type Article
Acceptance Date Jan 1, 2012
Publication Date Nov 1, 2012
Publicly Available Date Jun 8, 2019
Journal International Journal of Metadata, Semantics and Ontologies
Print ISSN 1744-2621
Electronic ISSN 1744-263X
Publisher Inderscience
Peer Reviewed Peer Reviewed
Volume 7
Issue 3
Pages 222-235
DOI https://doi.org/10.1504/IJMSO.2012.050183
Keywords NLP, natural language processing, KOS, knowledge organisation systems, semantic annotation, information extraction, GATE, digital archaeology, grey literature, CIDOC CRM ontology, archaeological reports, metadata
Public URL https://uwe-repository.worktribe.com/output/956169
Publisher URL http://dx.doi.org/10.1504/IJMSO.2012.050183
Additional Information Additional Information : This is the author's accepted manuscript. The final publiashed version is available at: http://dx.doi.org/10.1504/IJMSO.2012.050183.

Files





You might also like



Downloadable Citations