Abstract

The Ontology for Provenance and Plans (P-Plan) is an extension of the PROV-O ontology [PROV-O] created to represent the plans that guided the execution of scientific processes.
P-Plan describes how the plans are composed and their correspondence to provenance records that describe the execution itself.

P-Plan is an OWL2 ontology developed to describe abstract scientific workflows as plans and link them to their past executions.

P-Plan extends the W3C PROV-O Ontology [PROV-O], which encodes the W3C PROV
data model [PROV-DM]. PROV-DM describes the provenance
of objects (prov:Entities) as a record of assertions about the steps (prov:Activities) that generated them and the entities used in those steps.
Provenance describes past execution, but does not offer a vocabulary to express the plan that the execution was supposed to follow.

As an example, provenance vocabularies are appropriate for describing assays once they are executed, but are not designed to describe protocols.
Therefore, in addition to the provenance record, it is often desirable to publish the plan that was followed during the execution.
This would allow the provenance record to include what was envisioned would happen prior to the execution.

Publishing the plan has several benefits:

The plan can provide a higher-level, more abstract description of what was executed
which improves understandability and facilitates reuse in future situations

The plan can describe the expectations for the execution,
which can then be contrasted with the provenance to detect deviations and correct abnormalities.

Acknowledging this need, PROV includes the term “prov:Plan”. However, it does not elaborate any further how plans can be described or
related to other provenance elements of the execution, which is precisely the scope of the P-Plan ontology.

This document specifies the classes and properties of the P-Plan ontology.

P-Plan extends PROV to link the provenance traces to their plan descriptions.
The next tables summarize the classes and properties that have been used to extend or complement the
PROV-O Starting Point Terms to our domain.
No dataproperties are included, because P-Plan doesn't define any:

P-plan uses PROV to represent the provenance of the execution and extends it to link it to the different parts of the plan.
PROV describes the usage and generation of entities through two main properties: prov:wasGeneratedBy
(an Entity wasGeneratedBy an Activity) and prov:used (an Activity used an Entity for the execution).
The agents responsible for the execution are linked to the activity as prov:Agent with the property
prov:wasAssociatedWith. All properties can be qualified with prov:Roles. Provenance assertions can be
grouped in prov:Bundles, so that provenance can be asserted for the bundle.

In PROV plans are defined as entities associated with an agent and an activity. PROV does not specify anything
further about plans and how they correspond to parts of the execution, as it is considered out of the scope of
the model for provenance

Figure 1 shows an overview of P-PLAN and how it relates to PROV assertions,
showing plans at the top and plan executions at the bottom.
The provenance of the execution is entirely captured with PROV. Entity, activity and bundle concepts are subclasses
of PROV classes (p-plan:Bundle, p-plan:Entity and p-plan:Activity) to be able
to represent their relationship with the parts of the plan (p-plan:correspondsToStep property for activities,
p-plan:correspondsToVariable property for entities and prov:wasDerivedFrom to connect
the bundle representing the execution to the plan).

Figure 1. P-PLAN as an extension of PROV to describe plans.:
The execution trace is captured with PROV-O statements (prov:used, prov:wasGeneratedBy) and the plan is captured with p-plan.

p-plan:Plan is a subclass of prov:Plan. p-plan:Steps represent the planned execution
activities. Plan steps may be bound to a specific executable step or refer to a class of steps, providing an abstraction
layer over the execution. As a result, a plan step could be carried out in different ways in different executions
of the same plan. In addition, a step may not have a corresponding activity, for example if there is an execution failure. Dependencies
between p-plan:Steps are captured with the p-plan:isPreceededBy relationship.
p-plan:Variables represent the inputs of the steps and can have proper-ties (i.e., type, restrictions, metadata, etc.).
p-plan:Steps have p-plan:Variables as input and p-plan:Variables are output of
p-plan:Steps. Both of them are associated to
a p-plan:Plan. The relation of the plan with agents and activities is not specified P-PLAN, since it can be modeled with PROV
(with the prov:hadPlan property).
Note that prov:Plan is a prov:Entity, so its provenance can be modeled with PROV.

4. Cross reference for P-Plan classes and properties

This section provides details for each class and property defined by the P-Plan Ontology.