The KNOT Data Model - Overview

A guideline for the description of digital scholarly activity and objects as cultural heritage using a common vocabulary and facilitating the dissemination of good practices.

The process of modelling in the Digital Humanities is key to moving from theory to practice [1], turning critical thinking and interpretation into a potentially practical object such as a data model. Furthermore, modelling, both in the Cultural Heritage and Digital Humanities fields, is not just about the representation of the source but also about the process itself, in part because this process involves representing and capturing some of the inherent subjectivity and peculiarities of human creations and activities to express principles that are anchored in a specific context rather than a general law [2].

As such, the KNOT Data Model (KNOT-DM) was developed to capture some of the peculiarities of our argument for scholarly digital objects as unique and interesting examples of the digital cultural heritage (DCH) of Italian universities (see the Pilot page for more), and act as a guideline for the description of digital scholarly objects and activity as DCH using a common vocabulary and facilitating the dissemination of good practices.

The SAMOD methodology was used as a reference point in the design of the KNOT-DM to create a small, data-centric iterative workflow: we first used the results of our census (of Italian humanities projects and objects) to create an initial set of requirements such as being able to describe both scholarly digital objects and the activity that created them, different types of objects and access, and their cultural heritage dimension; we then evaluated various existing standards, models, ontologies, and vocabularies for their interoperability, ongoing development and support, and established usage in real-world applications; following this we narrowed our choice of existing standards and ontologies and evaluated them against a first set of competency questions (who, what, where, when, how) by modeling a small subset of data from the census (12 entries); following this further refinements were made to which elements from each existing knowledge organization system the model should use and how they should connect; insights from this step also led to the creation of the KNOT Controlled Vocabularies; lastly the first version of the ontology and documentation of the data model were created drawing from the insights acquired in each previous step.

The use of internationally recognised and complimentary standards stems from an early design decision that KNOT-DM should be focused on reusability rather than the creation of new ontological components in order to allow flexibility and ensure its eventual compatibility with the Digital Library infrastructure.

One of the key learnings that emerged during the development process was the understanding that the domain we were interested in modelling lay at the intersection of three existing domains that represent specific aspects of our argument for scholarly digital objects as examples of the DCH of Italian universities: academia, public data, and digital cultural heritage. This led to a model designed with three interconnected segments, reflecting this intersection, capable of representing entities, activities, agents, themes, relationships, and spatio-temporal information.

The Venn diagrams below summarize the domain intersection KNOT-DM is based on as well as the equivalent standards it uses. See the Structure section for more details on how the data model integrates each standard.

The goal of the KNOT-DM is to facilitate the description of digital scholarly activity and objects as examples of DCH (starting with the data produced by the project) as well as foster interconnection with existing systems that already document this activity and these objects, such as repositories (both institutional and public) and dedicated websites, to avoid the duplication of existing data and information. One area of interest with regard to this last point is in exploring how KNOT can integrate with the IRIS research management system offered to Italian universities by CINECA.

Metadata created with KNOT-DM forms the basis of the knowledge graph used in the KNOT catalogue.

FIG 1. Intersection of the domains and standards used in KNOT-DM.

KNOT-DM is intended to be flexible and reusable. It is freely available for universities, and other interested parties, to use to model their own data for independent usage or to allow its inclusion within the KNOT project.

KNOT-DM allows the following information to be recorded:

  • Research project activity — who, what, where, when, and how.
  • Research project outputs — the scholarly digital objects created by research project activities. (The data model is intended to be flexible enough to enable the description of various objects such as software, websites, web services, and data collections.)
  • Research project inputs — the entities used by a research project such as physical items, people, places, and themes.
  • Agents — involved in research projects and the creation of outputs as contributors and publishers, from the individual to the organization via organizational units (such as departments or laboratories) and their locations.
  • Relationships — between activities, agents, entities, locations, and concepts, such as the influence of external concepts or the reuse of digital scholarly objects between projects.
  • Spatial and temporal information — including that of the projects and their agents but also that recorded in scholarly digital objects, such as for example the places mentioned in a text or the time period covered by the data.
  • Concepts — such as the type of objects produced, the technologies used, the research activities, the technical specifications, the academic disciplines involved, and the subjects of research. These concepts are described using a selection of controlled vocabularies and authority controls.

The figure below summarizes the workflow of the KNOT-DM, in line with the goals of the first year of the project.

FIG 2. KNOT-DM workflow.

References

[1] Tomasi, Francesca. 2018. “Modelling in the Digital Humanities: Conceptual Data Models and Knowledge Organization in the Cultural Heritage Domain.” https://doi.org/10.12759/HSR.SUPPL.31.2018.170-179.
[2] Ciula, Arianna, Øyvind Eide, Cristina Marras, and Patrick Sahle. 2018. “Models and Modelling between Digital and Humanities. Remarks from a Multidisciplinary Perspective.” GESIS - Leibniz Institute for the Social Sciences. https://doi.org/10.12759/HSR.43.2018.4.343-361.

Structure

Learn more about the different segments that make up the KNOT-DM.

Modules

Learn more about how to use the different modules within KNOT-DM.

Ontology

Learn more about the KNOT Ontology, which expresses the data model in RDF.

Controlled Vocabularies

Learn more about the Controlled Vocabularies used in KNOT-DM, including those created specifically for the project.