Converted from a Word document
Archiving a TEI project FAIRly
A. Creamer [https://orcid.org/0000-0002-5286], G. Lembi [https://orcid.org/0000-0001-8962],
E. Mylonas [https://orcid.org/0000-0002-0215],
M. Satlow [https://orcid.org/0000-0001-7692]
Organization(s): Brown University, United States of America
The Inscriptions of Israel Palestine Project is an online corpus of inscriptions from Israel and Palestine, written in Hebrew, Greek, Latin and Aramaic, dating roughly from the Persian Period to the Arab Conquest. As of spring 2019, it has collected and encoded more than 4000 inscriptions, out of some 10000 relevant texts: we aim to create an exhaustive and easily accessible collection and to enable users to carry out a variety of searches and extensive textual analysis.
The FAIR Principles aim to enhance the ability of machines to automatically find and
use digital objects, in addition to supporting their reuse by individuals. The
principles are organized under four areas intended to ensure digital objects are findable, accessible, interoperable,
and re-usable. Following
epigraphy.info’s mission statement we are applying the
FAIR Principles to guide our development of archival formats and processes for our corpus.
As IIP prepared to deposit files in the Brown Digital Repository, we defined formats for ensuring that our files will be as informative, self-documenting and re-usable as possible. Each inscription is contained in a single, XML file, encoded in the well-documented Epidoc subset of the TEI. These files, however, linked to externally maintained controlled vocabularies (using the xi:include feature) and bibliography (using Zotero), in order to facilitate the work of our encoders and ensure consistency. One of our challenges was to incorporate these external data into the robust , stand-alone, archival format.
The archival format of the IIP files is the result of a transformation that writes all the applicable <classifications> directly into the <profileDesc> and also puts complete bibliographic entries derived from Zotero into each file. We will continue to encode using external reference files to ensure consistency, but the archival format should not rely on any external sources.
We will introduce the FAIR Guiding Principles and FAIR Metrics as they apply to epigraphic corpora and TEI encoding, discuss the roadmap for implementation, and look at archival practices beyond FAIR when it comes to preservation of data as well as re-use. While the first steps to making a digital corpus findable and accessible seem straightforward—IIP texts have been ingested into the Brown Digital Repository, have unique and persistent identifiers, rich metadata, and are freely available, we can still improve on both facets. Simple interoperability and re-usability are available through the IIP API in both the production and the archival versions of the corpus, however, it will be important to do further work on controlled vocabularies, shared concepts, and encoding practices in order to enhance both of these facets.
Bibliography
--, 2014. Guiding Principles for Findable, Accessible, Interoperable and Re-usable
Data Publishing version b1.0 [WWW Document]. FORCE11.
URL
https://www.force11.org/fairprinciples
(accessed 5.10.19).
--, Epigraphy.info [WWW Document], n.d.
URL
http://epigraphy.info/
(accessed 7.31.19).
Feraudi-Gruénais, F., Grieshaber, F., 2016. Digital Epigraphy am Scheideweg? /
Digital Epigraphy at a crossroads? Presented at the Nachnutzung und Nachnutzbarkeit
der Forschung im Akademienprogramm Workshop der Nordrhein-Westfälischen Akademie der
Wissenschaften und der Künste und der Union der deutschen Akademien der
Wissenschaften AG „eHumanities“,
Düsseldorf.
https://doi.org/DOI:10.11588/heidok.00022141
Implementing FAIR Data Principles: The Role of Libraries, 2017. . LIBER.
URL
https://libereurope.eu/blog/2017/12/08/implementing-fair-data-principles-role-libraries/
(accessed 5.10.19).
Satlow, M., 2002. Inscriptions of Israel/Palestine [WWW Document].
URL
https://library.brown.edu/iip/index/
(accessed 7.31.19).
https://doi.org/10.26300/pz1d-st89
Wilkinson, M.D., Dumontier, M., Aalbersberg, Ij.J., et al., 2016. The FAIR Guiding
Principles for scientific data management and stewardship. Scientific Data 3,
160018.
https://doi.org/10.1038/sdata.2016.18