Archiving a TEI project FAIRly. Creamer, Andrew; Lembi, Gaia; Mylonas, Elli; Satlow, Michael. Zentrum für Informationsmodellierung - Austrian Centre for Digital Humanities, Karl-Franzens-Universität Graz, Austria. GAMS - Geisteswissenschaftliches Asset Management System. Creative Commons BY-NC 4.0. 2019, Graz. o:tei2019.125


Papers tei2019

Keywords: Judaic Studies, digital repositories, FAIR principles, digital epigraphy (en; record dated 2019-09-03T08:16:43Z)

Archiving a TEI project FAIRly

A. Creamer [https://orcid.org/0000-0002-5286], G. Lembi [https://orcid.org/0000-0001-8962], E. Mylonas [https://orcid.org/0000-0002-0215], M. Satlow [https://orcid.org/0000-0001-7692]

Organization(s): Brown University, United States of America

The Inscriptions of Israel/Palestine (IIP) project is an online corpus of inscriptions from Israel and Palestine, written in Hebrew, Greek, Latin and Aramaic, dating roughly from the Persian Period to the Arab Conquest. As of spring 2019, it has collected and encoded more than 4,000 inscriptions out of some 10,000 relevant texts. We aim to create an exhaustive and easily accessible collection and to enable users to carry out a variety of searches and extensive textual analysis.

The FAIR Principles aim to enhance the ability of machines to automatically find and use digital objects, in addition to supporting their reuse by individuals. The principles are organized under four areas intended to ensure that digital objects are findable, accessible, interoperable, and re-usable. Following epigraphy.info’s mission statement, we are applying the FAIR Principles to guide our development of archival formats and processes for our corpus.

As IIP prepared to deposit files in the Brown Digital Repository, we defined formats for ensuring that our files will be as informative, self-documenting and re-usable as possible. Each inscription is contained in a single XML file, encoded in the well-documented EpiDoc subset of the TEI. These files, however, link to externally maintained controlled vocabularies (using the XInclude xi:include feature) and bibliography (using Zotero), in order to facilitate the work of our encoders and ensure consistency. One of our challenges was to incorporate these external data into a robust, stand-alone archival format.

The archival format of the IIP files is the result of a transformation that writes all the applicable <classifications> directly into the <profileDesc> and also puts complete bibliographic entries derived from Zotero into each file. We will continue to encode using external reference files to ensure consistency, but the archival format should not rely on any external sources.
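The kind of transformation described above can be illustrated with a minimal Python sketch. It is not the project's actual pipeline: the file name `vocabulary.xml`, the bibliography key `IIP-001`, the sample entry, and the helper names are all hypothetical, and the external files are simulated with in-memory dictionaries rather than fetched from a server or from Zotero. The sketch resolves xi:include references with the standard library's `xml.etree.ElementInclude` and replaces bibliographic pointers with full entries. As a simplification, it inlines the whole included fragment rather than selecting only the applicable classifications.

```python
import xml.etree.ElementTree as ET
from xml.etree import ElementInclude

TEI_NS = "http://www.tei-c.org/ns/1.0"
XI_NS = "http://www.w3.org/2001/XInclude"

# Hypothetical stand-in for an externally maintained vocabulary file.
EXTERNAL_FILES = {
    "vocabulary.xml": (
        f'<textClass xmlns="{TEI_NS}">'
        "<keywords><term>funerary</term></keywords>"
        "</textClass>"
    ),
}

# Hypothetical stand-in for bibliographic entries exported from Zotero.
BIBLIOGRAPHY = {
    "IIP-001": "Example, A. 1990. A Hypothetical Corpus of Inscriptions.",
}

# A simplified working file: it points outward for vocabulary and bibliography.
WORKING_FILE = f"""<TEI xmlns="{TEI_NS}" xmlns:xi="{XI_NS}">
  <teiHeader>
    <profileDesc><xi:include href="vocabulary.xml"/></profileDesc>
  </teiHeader>
  <text><body><listBibl><bibl><ptr target="IIP-001"/></bibl></listBibl></body></text>
</TEI>"""


def in_memory_loader(href, parse, encoding=None):
    """Return included content from EXTERNAL_FILES instead of the file system."""
    text = EXTERNAL_FILES[href]
    return ET.fromstring(text) if parse == "xml" else text


def make_standalone(xml_text):
    """Inline xi:include targets and expand bibliography pointers."""
    root = ET.fromstring(xml_text)
    # Replace every xi:include element with the content it refers to.
    ElementInclude.include(root, loader=in_memory_loader)
    # Replace each <ptr target="..."/> with the full bibliographic entry.
    for bibl in list(root.iter(f"{{{TEI_NS}}}bibl")):
        ptr = bibl.find(f"{{{TEI_NS}}}ptr")
        if ptr is not None and ptr.get("target") in BIBLIOGRAPHY:
            entry = BIBLIOGRAPHY[ptr.get("target")]
            bibl.remove(ptr)
            bibl.text = entry
    return root


if __name__ == "__main__":
    archival = make_standalone(WORKING_FILE)
    print(ET.tostring(archival, encoding="unicode"))
```

Running the script prints a single document with no xi:include elements and no bibliography pointers, which is the property the archival format requires: everything needed to interpret the file travels with it.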

We will introduce the FAIR Guiding Principles and FAIR Metrics as they apply to epigraphic corpora and TEI encoding, discuss the roadmap for implementation, and look at archival practices beyond FAIR when it comes to preservation of data as well as re-use. While the first steps to making a digital corpus findable and accessible seem straightforward (IIP texts have been ingested into the Brown Digital Repository, have unique and persistent identifiers and rich metadata, and are freely available), we can still improve on both facets. Simple interoperability and re-usability are available through the IIP API in both the production and the archival versions of the corpus; however, it will be important to do further work on controlled vocabularies, shared concepts, and encoding practices in order to enhance both of these facets.

Bibliography

--, 2014. Guiding Principles for Findable, Accessible, Interoperable and Re-usable Data Publishing version b1.0 [WWW Document]. FORCE11. URL https://www.force11.org/fairprinciples (accessed 5.10.19).

--, n.d. Epigraphy.info [WWW Document]. URL http://epigraphy.info/ (accessed 7.31.19).

Feraudi-Gruénais, F., Grieshaber, F., 2016. Digital Epigraphy am Scheideweg? / Digital Epigraphy at a crossroads? Presented at the Nachnutzung und Nachnutzbarkeit der Forschung im Akademienprogramm, Workshop der Nordrhein-Westfälischen Akademie der Wissenschaften und der Künste und der Union der deutschen Akademien der Wissenschaften, AG „eHumanities“, Düsseldorf. https://doi.org/10.11588/heidok.00022141

Implementing FAIR Data Principles: The Role of Libraries, 2017. LIBER. URL https://libereurope.eu/blog/2017/12/08/implementing-fair-data-principles-role-libraries/ (accessed 5.10.19).

Satlow, M., 2002. Inscriptions of Israel/Palestine [WWW Document]. URL https://library.brown.edu/iip/index/ (accessed 7.31.19). https://doi.org/10.26300/pz1d-st89

Wilkinson, M.D., Dumontier, M., Aalbersberg, Ij.J., et al., 2016. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data 3, 160018. https://doi.org/10.1038/sdata.2016.18