TEI/XML
TEI/XML is the base format for our data storage. There are three main units of TEI/XML used in CoReMA:
- The hyperdiplomatic transcription of the cooking recipe collections (the project internal term used is "glyph version"). See Editorial Model. Example: o:corema.b2
- The semantic annotation of the cooking recipe collections (the project internal term used is "annotated detail version"). See Semantic Model. Example: o:corema.b2.recipes
- The single recipe which was derived from the semantic annotation of the cooking recipe collections (the project internal term used is "split version"). See Recipe Index. Example: o:corema.b2.1
All TEI/XML source files can be accessed by adding
"/TEI_SOURCE" to the URL of the respective digital object. Example: https://gams.uni-graz.at/o:corema.b2.1/TEI_SOURCE, https://gams.uni-graz.at/o:corema.b2.recipes/TEI_SOURCE and https://gams.uni-graz.at/o:corema.b2/TEI_SOURCE.
Furthermore,
the source files can be downloaded via the "Metadata button" that can be found on
the right side of the bottom toolbar.
RDF/XML
All the structured data of the single recipes is also stored as RDF in our triple-store (Blazegraph). The serialization we use for the API is RDF/XML.
All RDF/XML files can be accessed by adding "/RDF" to the URL of a single recipe. Example: https://gams.uni-graz.at/o:corema.b2.1/RDF.
Furthermore, the RDF/XML files can be downloaded via the "Metadata button" that can be found on the right side of the bottom toolbar.
Plaintext
As a service for easy re-use of the project's transcriptions (e.g. for NLP methods) there are PLAINTEXT versions available for all digital objects.
All PLAINTEXT files can be accessed by adding "/PLAINTEXT" to the URL of the respective digital object. Example: https://gams.uni-graz.at/o:corema.b2.1/PLAINTEXT, https://gams.uni-graz.at/o:corema.b2.recipes/PLAINTEXT and https://gams.uni-graz.at/o:corema.b2/PLAINTEXT.
Furthermore, the PLAINTEXT files can be downloaded via the "Metadata button" that can be found on the right side of the bottom toolbar.
Indices
Indices can be accessed via the web-interface. The selected rows (via the checkboxes) can be downloaded in the formats "JSON", "XML", "CSV", "TXT", "SQL" and "MS-Excel":
Furthermore most of the indices can be accessed directly via the so called "query object". These objects are stored in the GAMS repository and represent a SPARQL query. The results of these queries can be accessed as "JSON" or "XML". Example: /archive/objects/query:corema.toolindex/methods/sdef:Query/getJSON and /archive/objects/query:corema.toolindex/methods/sdef:Query/getXML
Here is a full list of indices as well as statistics (analysis results) that can be accessed in this way:
- query:corema.recipeindex
- query:corema.ingredientindex
- query:corema.toolindex
- query:corema.dishindex
- query:corema.ingredientquantities
- query:corema.toolquantities
- query:corema.dishquantities
- query:corema.faocategoryquantities
- query:corema.colwithexactsameing
- query:corema.colwithexactsametitle
- query:corema.titlesequences
- query:corema.titlesequencesabsolute
- query:corema.titlecount
- query:corema.recwithexactsameing
- query:corema.collectionindex
Some of the indices as well as statistics can only be accessed via the web-interface as it was necessary to use JavaScript to manipulate the results before displaying them or parameters were used for options chooseable by the web-interface users.
Downloads
Additional downloads like python and xslt scripts as well as the schema files are available in the CoReMA Git repository at https://github.com/zimgraz/corema