The KNOT Pilot

KNOT is a three-year pilot (2022-2025) tasked with investigating ways to integrate the Digital Cultural Heritage (DCH) of Italian universities into I.PaC, the national infrastructure being built by the Central Institute for the Digitization of Cultural Heritage (ICDP) - Digital Library.

Considering the lack of consensus within universities and across them as to what constitutes their DCH, the KNOT project has chosen to focus its investigation on the scholarly digital objects created by academic research within the field of the Humanities, and in particular the Digital Humanities, as an interesting yet unexplored example of existing DCH held by Italian universities. Our approach makes reference to the definitions of DCH provided by both UNESCO [1] and the ICDP [2] as well as insights from the heritage sector to consider digital objects as a form that goes beyond just data and contains multiple agencies and spatio-temporal distributions [3]. As such the objects we are interested in comprise collections of information (such as datasets) as well as digital forms that enable interaction with information, from software and data services such as search interfaces or APIs to visualization and annotation tools. This allows us to move past the scientific context in which these objects are most commonly understood and look at what heritage value might be found in their “reservoir of meaning” [3]: such as the activity that produced them; the relationships to the information they encode; the new contexts they create for this information; and the ways in which they can foster the acquisition of new knowledge from this combination of context and information.

The expected outputs of the pilot include a data model to support the description and valorization of these scholarly digital objects as DCH, a web application showcasing the implementation of the data model and potential ways to integrate the data produced by the pilot into the national infrastructure, and a set of guidelines for the collection, management, enrichment, and reuse of these scholarly digital objects.

Below is a detailed overview of the activities and outputs within each phase of the project.

Year 1 (2023)

The first year was focused on data, with the following goals:

  • Collection, classification, and categorization of heterogeneous and representative data for inclusion and use in the pilot.
  • Elaboration of a conceptual and data model to describe the selected data and reflect the central argument of the project. This model is to be informed by relevant existing approaches and project requirements and should enable a network of semantic links, including through reconciliation with authority records and controlled vocabularies.
  • Definition of competency questions against which the conceptual model can be tested to ensure its ability to facilitate the discovery of latent knowledge.
  • Creation of a knowledge graph, available via a SPARQL endpoint, representing digital scholarly objects as examples of the DCH of Italian universities alongside a workflow for the collection, normalization, cleaning, and transformation of data into RDF.

Additionally the first year also included the following work which continued and fed into subsequent phases:

  • Collaboration with the DISI department as part of a working group with CINECA to explore potentials for integration of the pilot data into I.PaC and collaboration with the DBC department to explore issues around the classification of DCH faced by Italian universities.
  • Interviews with Italian academics to explore the challenges they face in the creation and management of scholarly digital objects born of Humanities research.

The outputs for the first year included:

  • A small census of scholarly digital objects and their related projects, used to develop the data model and knowledge graph.
  • A data model to describe scholarly digital objects as DCH, including an ontology and controlled vocabularies published in RDF.
  • Development of a small-scale catalogue application that uses the data model to describe the results of the census and generate a usable knowledge graph of scholarly digital objects and activity.

Year 2 (2024)

The second year was focused on the catalogue, with the following goals:

  • Evaluation of potential services to extract new knowledge from the data as well as reconcile information with existing institutional repositories.
  • Deployment and further development of the KNOT Catalogue.
  • Development of a repository to host and distribute the KNOT Controlled Vocabularies and ensure their FAIR-ness.
  • Development of a working document for guidelines expected at the end of the project, drawing from insights acquired during the first year around issues of visibility, categorization, and classification of scholarly digital objects.

The outputs for the second year included:

  • Deployment of the KNOT Catalogue, using the CLEF cataloguing application and featuring a knowledge graph of scholarly digital objects and activity from the Italian humanities that includes records for 100 research projects, 35 digital objects, and 96 web services.
  • Publication of visualizations of the KNOT Catalogue knowledge graph, showcasing some of the more interesting aspects of the dataset.
  • Extensions to the functionality of CLEF, such as the ability to generate pages for entities used for authority control and reconciliation.
  • Further development of both the KNOT Data Model and Controlled Vocabularies, including a simplified version of the data model tailored to the limitations of the CLEF functionalities.
  • Development of an application, based on Skosmos, to publish the KNOT Controlled Vocabularies and for potential expanded use as a repository for all semantic artefacts produced by /DH.ARC.

Year 3 (2025)

The third year is focused on integration with I.PaC and the production of guidelines, with the following goals:

  • Integration of the project's data and metadata into the I.PaC platform through continued collaboration with the DISI and CINECA Working Group.
  • Publication of the guidelines for the description and valorization of scholarly digital objects as cultural heritage.
  • Deployment of the /DH.ARC Vocabularies repository and publication of internal guidelines for its continued usage.

Journal Articles

Giacomini, Sebastiano, Marilena Daquino, Francesca Tomasi, and Laurent Antoine Fintoni. 2025. “CLEF 2.0. Solutions for Native Linked Data Cataloguing of Italian Digital Cultural Heritage.” JLIS.It 16 (1): 108–26. https://doi.org/10.36253/jlis.it-611.

Conference Proceedings

Sabba, Fiammetta, Francesca Tomasi, Silvia Tripodi, and Laurent Antoine Fintoni. 2023. “L’Università Di Bologna a Supporto Dell’ecosistema Digitale Della Cultura Presso Il Ministero Della Cultura.” In Visioni Future: Next Generation Library - Vol.1, 45–50. Editrice Bibliografica. https://hdl.handle.net/11585/947326.

Fintoni, Laurent Antoine. 2024. "Rethinking Scholarly Digital Objects as Cultural Heritage: The KNOT Project."" In Me.Te. Digitali. Mediterraneo in rete tra testi e contesti, Proceedings del XIII Convegno Annuale AIUCD2024, 582–587. https://dx.doi.org/10.6092/unibo/amsacta/7927.

Datasets

Fintoni, Laurent, Marilena Daquino, and Francesca Tomasi. “KNOT Controlled Vocabularies”. Digital Humanities Advanced Research Center, University of Bologna, March 25, 2025. https://doi.org/10.5281/zenodo.15082216.

Fintoni, Laurent, Francesca Tomasi, and Marilena Daquino. “KNOT Catalogue Knowledge Graph”. Digital Humanities Advanced Research Center, University of Bologna, March 17, 2025. https://doi.org/10.5281/zenodo.15037686.


References

[1] “Records of the General Conference, 32nd Session, Paris, 29 September to 17 October 2003, v. 1: Resolutions.” UNESCO, 2004. https://unesdoc.unesco.org/ark:/48223/pf0000133171.
[2] Docs Italia. “01_Piano nazionale di digitalizzazione del patrimonio culturale | Piano nazionale di digitalizzazione del patrimonio culturale.” Accessed January 28, 2024. https://docs.italia.it/italia/icdp/icdp-pnd-docs/it/v1.1-febbraio-2023/index.html.
[3] Cameron, Fiona. The Future of Digital Data, Heritage and Curation in a More-than Human World.London ; New York:Routledge/Taylor & Francis Group, 2021. ZA4080.4.