The starting point for the development of the ontology is the domain model for European Poetry. This model was created through a process of reverse engineering in order to retrieve the informational needs of the community of practice. The process involved the analysis of patterns and the establishment of functional and non-functional requirements. The resulting conceptual model is highly complex because of its exhaustiveness and the variety of conceptual domains that were explored. It covers the bibliographical and technical information of the works as well as metadata derived from literary and prosodic analysis. Moreover, it includes elements that complement the texts, such as images and musical notation.
In order to reduce the complexity of the model, we began by identifying knowledge areas and defining a series of subdomains.
Thus, we identified the following subdomains (see illustration below):
The graph below shows the subdomains.
Therefore, the development of an ontological model for European poetry has been carried out through a modular design that is derived, initially, from the subdomains identified in the domain model. The subdomains have well-defined semantics, but refinements are necessary to determine which of them correspond to a complete and independent ontology and which should be merged with others to create a larger ontology.
The modular design results in several ontologies which are connected in a network of ontologies for European poetry. We took into account the following criteria:
To measure the degree of cohesion, we applied a metric proposed in Oh, Yeom, & Ahn (2011), which measures the cohesion of the modules, i.e. the ontologies, independently of the cohesion of the ontological network. This metric takes into account a factor that ensures the quality of the module: the degree of internal relations in the module, that is, the cohesion within the module, which in turn influences its coupling with other modules. This also makes it possible to verify the logical consistency between the modules and the complete ontology.
Cohesion in an ontology module has to do with the degree of relationship of the classes in the module. Classes are related when they share properties or have connections with other classes. Therefore, the relations contemplated in this metric can be both hierarchical (the properties of the parent are shared) and non-hierarchical (the classes connect to each other).
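As a rough illustration of this idea, and not the metric of Oh, Yeom, & Ahn (2011) itself, the following Python sketch computes a simplified cohesion score as the share of class pairs in a module that are directly related, whether hierarchically or non-hierarchically. The function name `module_cohesion`, the pair-ratio formula and the example relations between classes are assumptions made for illustration only.

```python
from itertools import combinations


def module_cohesion(classes, relations):
    """Fraction of unordered class pairs in a module that are directly related.

    `relations` holds (class_a, class_b) pairs; hierarchical and
    non-hierarchical links are treated alike. This pair ratio is a simplified
    stand-in, not the metric defined by Oh, Yeom, & Ahn (2011).
    """
    members = set(classes)
    if len(members) < 2:
        return 1.0  # a single-class module is trivially cohesive
    # Keep only relations whose two ends both lie inside the module.
    related = {
        frozenset(pair)
        for pair in relations
        if len(set(pair)) == 2 and set(pair) <= members
    }
    possible = sum(1 for _ in combinations(members, 2))
    return len(related) / possible


# Hypothetical relations between three illustrative classes.
music_classes = ["Melody", "MusicalNotation", "Performance"]
music_relations = [("Melody", "MusicalNotation"), ("Melody", "Performance")]
score = module_cohesion(music_classes, music_relations)
```

Under these assumed relations, two of the three possible class pairs are linked. A module whose classes are only linked to classes outside the module would score zero, which is the kind of signal that could suggest merging it with another module.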
For every subdomain, we carried out different analyses to obtain:
From this process, we identified the following ontologies as part of the network of ontologies for the domain of European poetry:
This ontology covers aspects related to poetic works and their manifestations.
The classes PoeticWork, Redaction and Ensemble have been defined for this purpose.
Since it is the core or central ontology of the network, we incorporated classes that are not specific to poetry but that represent transversal knowledge. These classes complete the relevant information not only for the classes of the core ontology but also for other ontologies of a more specific domain.
The following entities have therefore been identified:
The core ontology is imported by the rest of the ontologies of the network. For this reason, besides containing the mentioned classes, it also provides a set of common properties that have the same semantics in all the classes in which they are defined. In this way, it is possible to express unambiguously the semantics of properties that represent conceptually the same thing.
The data properties related to the dating of the work, its manifestations and its transmission contain different features to capture the specifics of how dates are expressed in this domain. We needed to take into consideration the difficulty of dating works according to established formats. Dates cannot always be defined with accuracy, and this entails the need for additional elements to describe the dating issues. Therefore, we created a small ontology of dates to better represent dating issues when dealing with historical data and periods.
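One possible way to picture an imprecise date is as an interval of years with a certainty flag. The Python sketch below is only an assumption made for illustration: the names `UncertainDate`, `not_before`, `not_after` and `certain` are hypothetical and are not taken from the ontology of dates.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class UncertainDate:
    """A date known only within bounds (hypothetical structure)."""
    not_before: Optional[int] = None  # earliest plausible year, if known
    not_after: Optional[int] = None   # latest plausible year, if known
    certain: bool = False             # True when the source states the date

    def may_overlap(self, other: "UncertainDate") -> bool:
        """Return True unless the two intervals are provably disjoint."""
        if self.not_after is not None and other.not_before is not None:
            if self.not_after < other.not_before:
                return False
        if other.not_after is not None and self.not_before is not None:
            if other.not_after < self.not_before:
                return False
        return True


# "Composed between 1250 and 1300" versus "copied after 1320".
composition = UncertainDate(not_before=1250, not_after=1300)
copy = UncertainDate(not_before=1320)
```

Leaving a bound empty, as in `copy`, is what allows statements such as "after 1320" to be recorded without forcing a precise date onto the data.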
This module covers the classes and properties concerned with the transmission of the works. The classes of this module are PrimarySource, BibliographicSource, Witness, WitnessCollection, Repository, Facsimile, Reading, Apparatus and Location.
This module contains the classes and properties that are necessary to obtain information from literary analysis. The classes in this module are Acrostic, Intertextuality, RhetoricalDevice.
This module covers the classes and properties related to the textual structuring of the manifestations of the works. The classes that form it are Syllable, Line, Stanza, Word and Punctuation. The last two belong to a LexicalUnit hierarchy.
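The class names of this module can be pictured with the following Python sketch. Only the class names and the LexicalUnit hierarchy come from the text; the containment links between Stanza, Line, LexicalUnit and Syllable, the attribute names and the `syllable_count` helper are assumptions added for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Syllable:
    text: str


@dataclass
class LexicalUnit:
    """Common parent of Word and Punctuation."""
    text: str


@dataclass
class Word(LexicalUnit):
    syllables: List[Syllable] = field(default_factory=list)  # assumed link


@dataclass
class Punctuation(LexicalUnit):
    pass


@dataclass
class Line:
    units: List[LexicalUnit] = field(default_factory=list)  # assumed link

    def syllable_count(self) -> int:
        # Only words carry syllables; punctuation is skipped.
        return sum(len(u.syllables) for u in self.units if isinstance(u, Word))


@dataclass
class Stanza:
    lines: List[Line] = field(default_factory=list)  # assumed link


# A one-line stanza made of one two-syllable word and a comma.
line = Line([Word("rosa", [Syllable("ro"), Syllable("sa")]), Punctuation(",")])
stanza = Stanza([line])
```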
This module contains the classes and properties that model the information required for the prosodic analysis of a poetic text. As in other modules, we defined a hierarchy of classes that models patterns at different levels and describes the recurrence of the pattern followed by the stanzas, the lines and the poetic work itself. The classes that form this module are LinePattern, StanzaPattern, WorkPattern, MetricalEncoding, Symbol and RhymeMatch.
The POSTDATA ontology network also takes into account a feature present in many poetic works, namely musical accompaniment. In this ontology, we have not sought a detailed representation of the musical characteristics, only of those that can enrich the text and play an important role as complementary information. The classes are Melody, MusicalNotation and Performance.
In the manifestations of poetic works, elements that increase the expressiveness of the works or add context appear regularly. The aim of this ontology is to cover these aspects. The classes identified in the model are: Paratext, Illustration and Scene.
The image below shows the network of ontologies.