Posted:June 27, 2017

KBpedia Relations, Part IV: The Detailed Relations Hierarchy

Completing the Schema for the Relations Addition

In the previous part of this series I discussed the relations model that we are adding to the KBpedia knowledge structure for its upcoming version 1.50. This article continues that discussion by fleshing out the concept hierarchy for this relations addition. These changes are taking place within the upper structure of KBpedia, what we call the KBpedia Knowledge Ontology, or KKO. The addition of this relations model affects both the class hierarchy of KKO and its properties structure. Both aspects are discussed below.

The previous parts of this series introducing the new KBpedia v 150 provide the rationale for adding a relations or predications structure to KBpedia, as well as the rationale for grounding this structure in the universal categories and logic of relations developed by Charles Sanders Peirce. Let me expand on those arguments a bit further here.

Recall from the earlier parts in this discussion that we want to adopt a schema for relations because we are interested in assertions and propositions in our knowledge structures. As Peirce notes [1],”The unity to which the understanding reduces impressions is the unity of a proposition. This unity consists in the connection of the predicate with the subject; and, therefore, that which is implied in the copula, or the conception of being, is that which completes the work of conceptions of reducing the manifold to unity.” (CP 1.548). The previous part discussed the rationale for three main classes of relationship types and how they related to Peirce’s views on logic and the theory of signs (semiosis). It is now time to expand on those distinctions.

The Second Level of the Relations Hierarchy

At the top level, relations are defined in KBpedia as belonging to one of three main types: Attributes, which are how we describe and characterize individual things (using the shorthand of A:A); External Relations, which are how an individual thing may relate, interact or situate with regard to external things (using the shorthand of A:B); or Representations[3], which are how we may name or define, indicate or reference, or provide supporting information about an individual thing (using the shorthand of re:A). These three main categories capture every conceivable form of relation. Note that an individual thing may also include classes or types, as well as individuals. Through this means, we can talk about and refer to categories, concepts, classes or types when we need to consider them as things unto themselves.

These three main categories correspond to Peirce’s universal categories of Firstness, Secondness and Thirdness [2]. (Where appropriate, I also sometimes footnote some Peirce quotes relevant to the category at hand.) Using the same trichotomous approach to categorization [2], we can now expand these three main relational categories into a second level of nine categories:

For External Relations (A:B), the split is into these three categories:

Direct — a simple, direct relationship (no intermediaries) between two different objects (entities, events, or their types, considered as instances). Example concepts include is a, simple without parts, part of, members in types or classes, genealogical roles (parent, child, brother), identity, extensional

Associatives — a situational and contextual assertion of proximity, affiliation or adjacency of the subject with regard to any contiguity. Example concepts include see also, lists, links (incoming + outgoing), associations, likenesses, resemblances [5].

The Third Level of the Relations Hierarchy

For the next level, we can continue with our process of categorization based on the universal categories applied to the nine categories listed above. Here is the schema that results from this approach:

Third Level of the KBpedia Relations Hierarchy

For the mid-level category of Intrinsics, the first of three categories under Attributes, here are the three subsidiary categorizations (representing the Firstness, Secondness and Thirdness, respectively):

Qualities — an internal characteristic or aspect of an object; collectively these define intensionally what kind of thing to which the object belongs, though that relationship is not intrinsic

Elementals — a contributing part of or integral input or aspect that adds to the understanding about the subject (A)

Configurations (forms) — forms or arrangements that are of the nature or perceivable of the subject (A).

For the mid-level category of Adjunctual, here are the three subsidiary categorizations:

Quantities — a characteristic of a subject (A) that is expressed as a number quantity

For the mid-level category of Denotatives, the first of three categories under Representations, here are the three subsidiary categorizations:

Media — iconic images or sounds that invoke the identification with a given object or representation. Media in this sense draws attention to the object [6]

Labels — symbolic text strings that help to name or draw attention to a particular object

Descriptions — text strings that may be longer than labels and provide additional or contextual information or specify attributes about the object, beyond drawing attention.

For the mid-level category of Indexes, here are the three subsidiary categorizations:

Pointers — physical or symbolic indicators of a given thing and which draw attention to it

Identifiers — generally (unique) symbols or strings that provide a key to the given subject, often within some conventional scheme for generating and recognizing the token assigned

Codes — an assigned symbolic token or string that groups the object with similar items; the generation and interpretation of the token is (often) done in relation to an understood schema. Indexes are included in this category.

And, lastly, for the mid-level category of Associatives, here are the three subsidiary categorizations:

Lists — an aggregation, either ordered or unordered, of objects similar to one another with respect to given characters or types

Relateds (see also) — an indicator of some nature to other objects similar or related to the given object; the criteria and degree or strength of relationship between the items are indeterminate

Augments — an indicator to an external factor in relation to the object, which factor itself leads to still further explanations.

These categories, then, complete the three levels underneath the KBpedia Knowledge Ontology’s Predications branch (itself a Secondness under the basic Thirdness of Generals). We have established these as concepts within KKO such that we may reason over the ideas of these categories, and do other conceptual work with them. This conceptualization is helpful in order to consider the ideas of predicates and the relationships between them. However, for mapping properties from external sources, we need a parallel structure in terms of KKO properties.

An Analogous Structure for Properties

We thus created an analogous structure for properties within KBpedia. One useful addition that is forthcoming in the KBpedia v 1.50 release is a mapping of 2500 Wikidata properties to this property schema, which represents more than 95% of all property assignments to the 25 million+ entities within Wikidata. This is one of the first tangible expressions of the benefits of having a fully rational predicate structure within KBpedia.

Attributes, External Relations and Representations comprise OWL properties. In general, Attributes correspond to the OWL datatypes property; External Relations to the OWL object property; and Representations to the OWL annotation properties. These specific OWL terms are not used in our KKO grammar, however, because some attributes may be drawn from controlled vocabularies, such as colors or shapes, that can be represented as one of a list of attribute choices. In these cases, such attributes are defined as object properties. Nonetheless, the mappings of KKO’s grammar to existing OWL properties is quite close. In all cases, our mappings to external sources are done via the subPropertyOf relation in OWL.

Object and Data Properties

Because a given external non-annotation property (that is, all properties except Representations) may either refer to another object (IRI) or to a string or data value, we created parallel listings of object and datatype properties within KKO. Here is the listing for the object properties group as shown by a screen capture from Protégé:

A Matching KBpedia Object Properties Hierarchy

Note that Protégé lists its items alphabetically, rather than our preferred ordering of Firstness, Secondness and Thirdness. You also should note the parallelism to the concept hierarchy discussed above. Not shown is the similar datatype properties structure.

Annotation Properties

As noted, Representations correspond to annotation properties, and that listing is shown in this screen capture:

A Matching KBpedia Annotation Properties Hierarchy

In OWL2, annotations may be organized in a subPropertyOf manner, even though inferencing is not possible over this structure. The organization, though, is helpful to understand relationships (and they can independently be reasoned over with the matching concept structure) and may be used in search, SPARQL or external analytic scripts.

We also organize the SKOS and Dublin Core annotation properties used by KKO into the categorical structure of these Representations, as shown by the screen capture above. This organization helps elucidate the structure under the Representations branch using these commonly applied properties.

Some Caveats and Next Part

This completes the hierarchical specification of the Predications branch within KKO. These 39 categories (3 + 9 + 27), following the categorization approach suggested by Peirce’s universal categories, now complete our additions to this next version of KBpedia.

This addition has been under intense and active development for the past year, though it has been in the works for much longer than that. Over that time, the names of categories and, indeed, even the splits themselves, have changed frequently and evolved. Peirce’s Logic of Relatives[7], as one of the two main sources, is itself laid out in a dichotomous structure, though elsewhere in Peirce’s writings — as we showed in the previous parts — these are often expressed in Peirce’s more standard trichotomous structure. I am sure with use and more experience that we will see further refinements to the KBpedia relations schema as time goes on. Please do not be surprised if you see further changes.

We are now nearly complete with the advance material leading up the KBpedia v 1.50 release. We will have one next part in our series recapitulating the terminology and grammar for this new version. The announcement following that will finally release version 1.50.

This series on KBpedia relations covers topics from background, to grammar, to design, and then to implications from explicitly representing relations in accordance to the principals put forth through the universal categories by Charles Sanders Peirce. Relations are an essential complement to entities and concepts in order to extract the maximum information from knowledge bases. This series accompanies the next release of KBpedia (v 150), which includes the relations enhancements discussed.

[1] Peirce citation schemes tend to use an abbreviation for source, followed by volume number using Arabic numerals followed by section number (such as CP 1.208) or page, depending on the source. For CP, see the electronic edition of The Collected Papers of Charles Sanders Peirce, reproducing Vols. I-VI, Charles Hartshorne and Paul Weiss, eds., 1931-1935, Harvard University Press, Cambridge, Mass., and Arthur W. Burks, ed., 1958, Vols. VII-VIII, Harvard University Press, Cambridge, Mass.

[3] “I confine the word representation to the operation of a sign or its relation to the object for the interpreter of the representation.” (CP 1.540)

“A very broad and important class of triadic characters [consists of] representations. A representation is that character of a thing by virtue of which, for the production of a certain mental effect, it may stand in place of another thing.” (CP 1.564)

“as my analysis makes it to be, that a percept contains only two kinds of elements, those of firstness and those of secondness, then the great overshadowing point of difference is that the perceptual judgment professes to represent something, and thereby does represent something, whether truly or falsely. This is a very important difference, since the idea of representation is essentially what may be termed an element of “Thirdness,” that is, involves the idea of determining one thing to refer to another.” (CP 7.630)

[4] “But it would be difficult if not impossible, to instance an absolutely pure index, or to find any sign absolutely devoid of the indexical quality. Psychologically, the action of indices depends upon association by contiguity, and not upon association by resemblance or upon intellectual operations.” (CP 2.306)

“Indices may be distinguished from other signs, or representations, by three characteristic marks: first, that they have no significant resemblance to their objects; second, that they refer to individuals, single units, single collections of units, or single continua; third, that they direct the attention to their objects by blind compulsion. But it would be difficult if not impossible, to instance an absolutely pure index, or to find any sign absolutely devoid of the indexical quality. Psychologically, the action of indices depends upon association by contiguity, and not upon association by resemblance or upon intellectual operations.” (CP 2.306)

[5] “Association is the only force which exists within the intellect, and whatever power of controlling the thoughts there may be can be exercised only by utilizing these forces; indeed, the power, and even the wish, to control ourselves can come about only by the action of the same principles. Still, the force of association in its native strength and wildness is seen best in persons whose understandings are so little developed that they can hardly be said to reason at all. Believing one thing puts it into their heads to believe in another thing; but they know not how they come by their beliefs, and can exercise no control over the inferential process. These unconscious and uncontrolled reasonings hardly merit that name; although they are very often truer than if they were regulated by an imperfect logic, showing in this the usual superiority of instinct over reason, and of practice over theory.” (CP 7.453)

[6] “The only way of directly communicating an idea is by means of an icon; and every indirect method of communicating an idea must depend for its establishment upon the use of an icon. Hence, every assertion must contain an icon or set of icons, or else must contain signs whose meaning is only explicable by icons. The idea which the set of icons (or the equivalent of a set of icons)contained in an assertion signifies may be termed the predicate of the assertion.” (CP 2.278)

This article completes the hierarchical specification of the new Predications branch within KBpedia, used for reasoning over relations and mapping properties from external sources. These 39 categories (3 + 9 + 27), follow the categorization approach suggested by Peirce's universal categories, and now complete our additions to the next version 1.50 of KBpedia due shortly for release.