PeriodO – Technical Overview

Linked Data

The PeriodO period gazetteer documents definitions of historical periods. Each entry of the gazetteer identifies the definition of a single period. To be included in the gazetteer, a period must

give the period a name,
impose some temporal bounds on the period,
have some implicit or explicit association with a geographical region, and
have been formally or informally published in some citable source.

Much care has been put into giving periods stable identifiers that can be resolved to RDF representations of periods. PeriodO models periods as SKOS concepts. These are grouped into concept schemes sharing the same bibliographic source. Temporal extent is expressed via a direct textual quotation from the source, as well as via a structured approximation of this expression modeled using the OWL-Time ontology. Similarly spatial extent is represented both by a textual quote (where one was given) and a set of identifiers referring to spatial entities in external resources such as DBpedia.

Examples:

http://n2t.net/ark:/99152/p05krdxmkzt identifies the “Dark Age” as defined by Davis and Alcock on page 97 of Sandy Pylos: an archaeological history from Nestor to Navarino (see the JSON representation). This is an example of a period.
http://n2t.net/ark:/99152/p06v8w4 identifies all the periods from the FastiOnline database of archaeological excavations (see the JSON representation). This is an example of an authority.

Periods

The PeriodO dataset is essentially a collection of periods.

Labels and documentation

A period is a skos:Concept, “an idea or notion,” as the SKOS Reference puts it. We use a number of SKOS properties to describe periods:

skos:prefLabel is used for the name of the period exactly as given in the original source. The value of this property is a simple literal xsd:string, with no language tag.
skos:altLabel is used for language-specific names of the period, assigned by PeriodO curators. The value of this property is a language-tagged string. The language tag consists of (at least)
1. a three-character primary language subtag, as defined in ISO 639-2 or ISO 639-3, and
2. a four-character script subtag, as defined in ISO 15924.
There will always be at least one skos:altLabel, with the language tag eng-latn. If the source of the period was not written in English, there will always be another skos:altLabel with a language tag indicating the language and script of the source of the period. The language of the source of the period is indicated via a dc:language property, the value of which is a language tag as described above.
skos:note is used for notes about the period taken from or attributed to the original source. For example, the original Pleiades definition of “Ottoman Rise (AD 1300-1453)” includes the note “ends with the conquest of Constantinople.” The value of this property is a simple literal xsd:string, with no language tag.
skos:editorialNote is used for administrative or editorial notes added by the PeriodO curators; these do not appear in the original source. The value of this property is a simple literal xsd:string, with no language tag.
skos:inScheme is used to link a period to the authority of which it is a part.

Source

Usually the bibliographic information about the source of a period is provided through properties of the authority to which it belongs. However, in some cases there may be additional bibliographic information that is specific to an individual period. In these cases, we use a dcterms:source to provide this additional information, and dcterms:isPartOf to link the source of the period to the source of the authority to which it belongs. For example, we might use the following to indicate that the specific book page from which a period was sourced:

<p0tns5v4kdf>
    dcterms:source [
        dcterms:isPartOf <http://www.worldcat.org/oclc/63807908> ;
        bibo:locator "page 25"
    ] .

(bibo is the Bibliographic Ontology.)

Temporal extent

We use properties from the Time Ontology to describe the temporal extent of periods. A period is a time:ProperInterval, an interval of time with different beginning and end points. We assume that these (instantaneous) beginning and end points can never be precisely identified, hence our descriptions focus on describing the intervals that start and finish the period:

time:intervalStartedBy links the period to an (anonymous) time interval that has the same (unknown) beginning point as the period, and an (unknown) end point that comes before the end point of the period. We call this the start interval for the period.
time:intervalFinishedBy links the period to an (anonymous) time interval that has the same (unknown) end point as the period, and an (unknown) beginning point that comes after the beginning point of the period. We call this the stop interval for the period.

Diagram showing the relation between a period’s temporal extent and its start and stop intervals — The relation between a period’s temporal extent and its start and stop intervals

We describe the start and stop intervals in two ways. Both ways of describing the interval are required; these are complementary descriptions, not alternatives:

skos:prefLabel is used to textually describe the interval exactly as given in the original source, for example “end of the first century BC”. The value of this property is a simple literal xsd:string, with no language tag.
time:hasDateTimeDescription is used to describe the interval in a more structured fashion. This property links the interval to an (anonymous) time:DateTimeDescription. These structured descriptions are created by PeriodO curators.

Currently we use the following properties in our datetime descriptions:

time:year for descriptions of intervals that can be represented with a single year. For example, an interval with the textual description “600 BC” can be described with a datetime description having a time:year value of -0599.
periodo:earliestYear and periodo:latestYear for descriptions of intervals that need to be represented as ranges. For example, an interval with the textual description “eight century BC” can be described with a datetime description having a periodo:earliestYear value of -0799 and a periodo:latestYear value of -0700.

The datatype for values of time:year, periodo:earliestYear, and periodo:latestYear is xsd:gYear. Note that:

xsd:gYear values can have any number of digits.
xsd:gYear values may be zero. The value 0000 is interpreted as 1 BCE.
xsd:gYear values represent Gregorian calendar years and “are not, in general, convertible to simple values corresponding to years in other calendars.” We are comfortable with this limitation because we use these values only for the purposes of ordering and visualizing temporal extents of intervals. The skos:prefLabel of an interval should be considered the authoritative description.

We may use additional properties in our datetime descriptions in the future, for example to describe intervals at a finer temporal granularity than a year.

Spatial extent

We use the following properties to describe the spatial extent of periods:

periodo:spatialCoverageDescription is used to textually describe the spatial extent exactly as given in the original source, for example “Near East and Greece”. The value of this property is a simple literal xsd:string, with no language tag.
dcterms:spatial is used to link a period to descriptions of locations in gazetteers such as DBpedia/Wikidata, GeoNames, or Pleiades.

Authorities

An authority is simply a set of periods that share a source. We use dcterms:source to link authorities to bibliographic descriptions of their sources. Where possible we rely on external bibliographic databases such as WorldCat and CrossRef for bibliographic metadata.

An authority is a skos:ConceptScheme, “an aggregation of one or more SKOS concepts”. Belonging to the same authority does not imply any semantic relationship between periods, other than sharing a source. In particular, the periods belonging to an authority do not constitute a periodization, meaning a single coherent, continuous division of historical time. In the future we plan to add additional properties for indicating when a set of periods constitute a periodization.

The root resource of the PeriodO dataset is an rdf:Bag (unordered collection) of authorities.

Identifiers

Each period is given its own Web-based, resolvable Uniform Resource Identifier (URI) in the form of a Archival Resource Key (ARK), minted through the EZID system of the California Digital Library. These ARKs are resolvable to structured, machine-readable representations of individual periods.

Wherever possible bibliographic sources are identified with WorldCat URIs or CrossRef DOIs, and creators are identified with Virtual International Authority File URIs.

Data Serialization

Periods in the PeriodO gazetteer are published as JSON-LD, a serialized form of the Resource Description Format (RDF) used to describe Linked Data. The entire dataset is available for download as a single JSON file, so that it can be more easily reused by other projects.

User Interface

A browser-based client provides search, visualization, and data entry tools. It also provides an interface for managing distributed versioning and collaborative development of the PeriodO dataset. The entire editorial history of the PeriodO gazetteer is published as a separate dataset using the PROV ontology.