Andreas Vlachidis Andreas.Vlachidis@uwe.ac.uk
Senior Lecturer in Computer Science
Automatic metadata generation in an archaeological digital library: Semantic annotation of grey literature
Vlachidis, Andreas; Binding, Ceri; May, Keith; Tudhope, Douglas
Authors
Ceri Binding
Keith May
Douglas Tudhope
Abstract
This paper discusses the automatic generation of rich metadata from excavation reports from the Archaeological Data Service library of grey literature (OASIS). The work is part of the STAR project, in collaboration with English Heritage. An extension of the CIDOC CRM ontology for the archaeological domain acts as a core ontology. Rich metadata is automatically extracted from grey literature, directed by the CRM, via a three-phase process of semantic enrichment employing the GATE toolkit augmented with bespoke rules and knowledge resources. The paper demonstrates the potential of combining knowledge-based resources (ontologies and thesauri) in information extraction, and techniques for delivering the automatically extracted metadata as XML annotations coupled with the grey literature reports and as RDF graphs decoupled from content. Examples from two consuming applications are discussed: the Andronikos web portal, which serves the annotated XML files for visual inspection, and the STAR project research demonstrator, which offers unified search across archaeological excavation data and grey literature via the core ontology CRM-EH.
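The two delivery forms mentioned in the abstract, XML annotations coupled with the report text and RDF statements decoupled from it, can be illustrated with a minimal sketch. This is not the authors' GATE pipeline: a plain dictionary lookup stands in for the rule-based extraction, and the gazetteer entries, element names, the `refersTo` predicate, and all `example.org` URIs are hypothetical placeholders rather than actual CRM-EH identifiers.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical mini-gazetteer: term -> (entity type label, concept URI).
# Labels and URIs are illustrative placeholders, not real CRM-EH identifiers.
GAZETTEER = {
    "pit": ("Context", "http://example.org/concept/pit"),
    "ditch": ("Context", "http://example.org/concept/ditch"),
    "pottery": ("Find", "http://example.org/concept/pottery"),
}

# Whole-word, case-insensitive match on any gazetteer term.
_PATTERN = re.compile(
    r"\b(" + "|".join(map(re.escape, GAZETTEER)) + r")\b", re.IGNORECASE
)


def find_mentions(text):
    """Return (start, end, term) for each gazetteer match, in text order."""
    return [(m.start(), m.end(), m.group(0).lower()) for m in _PATTERN.finditer(text)]


def to_inline_xml(text):
    """Coupled form: wrap each mention in an element inside the report text."""
    out, cursor = [], 0
    for start, end, term in find_mentions(text):
        etype, uri = GAZETTEER[term]
        out.append(escape(text[cursor:start]))
        out.append(
            '<{0} concept="{1}">{2}</{0}>'.format(etype, uri, escape(text[start:end]))
        )
        cursor = end
    out.append(escape(text[cursor:]))
    return "".join(out)


def to_ntriples(text, doc_uri):
    """Decoupled form: one N-Triples line per mention, keyed to the document URI."""
    lines = []
    for i, (_start, _end, term) in enumerate(find_mentions(text)):
        _etype, uri = GAZETTEER[term]
        lines.append(
            "<{0}#mention{1}> <http://example.org/vocab/refersTo> <{2}> .".format(
                doc_uri, i, uri
            )
        )
    return lines


if __name__ == "__main__":
    sentence = "The pit contained pottery."
    print(to_inline_xml(sentence))
    print("\n".join(to_ntriples(sentence, "http://example.org/report/1")))
```

In the coupled form the annotations cannot be read without the report text, which suits visual inspection; in the decoupled form the statements stand alone and can be merged with triples from other sources, which suits cross-resource search.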
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 1, 2013 |
Publication Date | Jan 1, 2013 |
Deposit Date | Feb 6, 2018 |
Publicly Available Date | Feb 6, 2018 |
Journal | Computational Linguistics |
Print ISSN | 0891-2017 |
Electronic ISSN | 1530-9312 |
Publisher | Massachusetts Institute of Technology Press (MIT Press) |
Peer Reviewed | Peer Reviewed |
Pages | 187-202 |
Keywords | automatic, metadata, generation, archaeological, digital library, semantic, annotation, grey, literature |
Public URL | https://uwe-repository.worktribe.com/output/936443 |
Publisher URL | https://www.mitpressjournals.org/loi/coli |
Contract Date | Feb 6, 2018 |
Files
Vlachidis_Automatic Metadata Generation Extended.pdf
(269 Kb)
PDF