Chapter 10

Non-Classical Logics

In the following sections, we shall present some of these logics. The study can be partially based on the notions that were introduced in the previous chapters.

As usual, the presentation will contain motivations, historic landmarks, applications, similarities, and differences with what is already known, and finally indications on how to reason with these logics.

A common feature of these logics is their philosophical origin, which can be explained as a return to the roots: analysis of discourse in natural language, of what is true, what is false, what is neither true nor false, what is necessary, what is contingent, etc.

10.1. Many-valued logics

One of the first conventions that we adopted in the study of logic was to state (in accordance with mathematics) that a reasoning is correct if and only if it is impossible for the premises to be true and the conclusion to be false. But common sense shows that things are not that simple, when we want to begin the analysis before any simplification choice has been made.

Indeed, among the syntactically correct sentences, those we are interested in are declarative sentences and more precisely propositions (i.e. the sets of declarative sentences that are synonyms).

There are declarative sentences that are never assigned truth values, for example,

Honesty is beautiful

whose meaning can only be metaphorical. Others such as

The robot is in the room

can sometimes be true and sometimes be false...

The study of many-valued logics, which we shall also refer to as p-valued logics or n-valued logics (implicitly stating that p, n ≥ 3), originated, similar to many other subjects, with the work of Aristotle, who considered the future contingents (see section 10.3), i.e. what may or may not occur. The example considered by Aristotle was the proposition:

There will be a sea battle tomorrow.

which is neither true nor false (today) and suggests a third truth value. Łukasiewicz (following Aristotle) believed that if we do not accept that declarative sentences about the future are neither true nor false, then we have to accept fatalism (philosophical doctrine according to which all events are predetermined by fate).

Some historic landmarks:

– during the Middle Ages, little research on the topic;

– end of the 19th Century and beginning of the 20th Century: they are definitively accepted;

– some important names: MacColl, C.S. Peirce (around 1909), N.A. Vasil’ev, Łukasiewicz (starting in 1920), and Post (starting in 1921);

– three-valued logics have been particularly studied;

– there exist ∞-valued logics (infinitely-valued logics), in particular, the ieq logic (see section 10.2).

The introduction by Łukasiewicz of p-valued logics was preceded by some profound philosophical considerations.

According to historians and philosophers of logic, two principal origins of p-valued logics seem to have been:

1) the theory of contradictory objects, that states the existence of objects having contradictory properties. It seems like Łukasiewicz believed that non-contradictory objects did not exist;

2) (for three-valued logic) the notions of (non)determinism, causality, free will, etc.

Łukasiewicz studied the problems of induction and probabilities seriously. In his first papers, the logical values depended on the finite number of considered individuals, for example:

To x2 = 1, we assign value 2/3 in the universe {-1, 0, 1}

To x2 = 1, we assign value 2/5 in the universe {-2, -1, 0, 1, 2}

We reduce the algebraic formalizm to a minimum for the treatment of p-valued logics. The adopted formalization is sufficient for our needs and can easily be generalized.

Using logics with three truth values can cause the loss of powerful techniques such as the law of excluded middle. For example, the following reasoning:

equ

is not a correct reasoning in three-valued logic.

EXAMPLE 10.1.– A finer analysis of programs can be performed with a three-valued logic than with a two-valued one.

For example, if p and q are Boolean conditions and f and g are functions ((pf, qg, h) means: if p then f else (if q then g else h)):

equ

It seems obvious that the programs (pf, f) (meaning if p then f else f) and f are equivalent; however, whereas f always computes f, we can imagine that, for example, p: (1/x) ≤ 3, and as variable x is real, it can take the value 0, which yields:

equ

DEFINITION 10.1.– (truth functional (extensional) connective). A binary connective o_dot.gif is truth functional iff the truth value of ieq only depends on the truth values of ieq and ieq (i.e. a truth table can be provided for ieq). A logic is truth functional iff all its connectives are truth functional. Truth functional connectives are also called extensional connectives (see definition 2.3).

DEFINITION 10.2.– (n-valued logic). If ieq is a formal language generated by propositional variables and connectives, NN, N = 0, 1,…,n-1 is a finite set of truth values and DN is a set of distinguished values, then L =<ieq, N, D> is an n-valued logic.

We can take an infinite set for N (see definition 10.3).

REMARK 10.1.– One fundamental characteristic that is shared by classical and many- valued logics is truth functionality.

We shall present the most commonly used three-valued logics, that is, those of Łukasiewicz, Kleene, and Bochvar.

Łukasiewicz’s L3 logic (1920)

Basic principle: being able to handle contingent futures. To do so, he introduced the intermediate value 1/2.

The connectives are defined with the following tables:

equ

the other usual connectives are defined as follows:

equ

which yields:

equ

equ

DEFINITION 10.3.– (interpretation, valuation).

– An interpretation (valuation) is an application: set of wffs {0, 1/2, 1} that respects the truth tables above.

– A tautology or valid wff is a wff that takes the value denoted by 1 (or a value in D) for every interpretation.

For general p-valued logics, we must consider N instead of {0, 1/2, 1} and D instead of 1.

What changes and what does not:

equ

P ∨ ¬P: tautology in CPC (Classical Propositional Calculus), non-tautological in L3

¬(P ∧ ¬P): tautology in CPC, non-tautological in L3

P ⇔¬P: contradiction in CPC, satisfiable in L3

P d_arr.gif P: tautology in CPC, tautology in L3

A formal system for this logic:

ieq with:

ieq3 = ieq0 % i.e. it is the same as for S1

image = {MP}

ieq:

3L1) equ

3L2) equ

3L3) equ

3L4) equ

REMARK 10.2.– In L3, the deduction theorem (see remark 3.14) does not hold.

The truth tables could have been established the following way:

equ

or

equ

Sometimes, the latter is expressed as:

equ

These last definitions naturally lead to considering n-valued logics with n ≥ 4, in particular ∞-valued logics.

Kleene’s three-valued logics (1938 and 1952)

Basic principle: give a value that means undecidable (and not an intermediate value) to the mathematical assertions that are true or false, but cannot be proved or refuted.

REMARK 10.3.– This principle is not respected by the laws of excluded middle and of non-contradiction. Doing otherwise would lead to the non-respect of truth functionality. For example, we would assign true to ieq. Assume the value of p is 1/2 and that ieq is replaced by ieqq with the same truth value, then the value of ieq is...1/2, because ieq (table for ieq) and ieq (table for ∨).

In so-called strong logic (1938) we assign the value i when the values 0 (false) and 1 (true) are not sufficient to conclude (same principle as in Łukasiewicz’s L3).

equ

The other truth tables are the same as in L3.

In so-called weak logic (1952), the occurrence of value i in a sub-formula forces the formula to also have value i.

equ

Bochvar’s three-valued logic (1938)

Basic principle: try to eliminate paradoxes, such as

This sentence is wrong.

(It is true if it is false, false if it is true)

Similarly to Kleene’s weak logic, if a sub-formula has value i, then the formula also has value i.

For Bochvar, a proposition can be significant (it is true or false) or without signification (a paradox). He proposed two types of connectives: the internal connectives and the external connectives. The connectives we are interested in here are the former, whose truth tables coincide with Kleene’s weak connectives.

Paradoxes are not eliminated:

This sentence is false or undetermined.

(It is true if it is false or undetermined, it is false or undetermined if it is true.)

10.1.1. How to reason with p-valued logics?

We show in the following example how to extend the method of semantic tableaux for classical logic (see section 3.2), to handle p-valued logics (p: finite). The method is general of course, as will be evidenced by the example.

In classical logic (two-valued), the method of semantic tableaux enumerates all the models (partial models in FOL) of a finite set of formulas S. When it is not possible to construct any model, we conclude that S is unsatisfiable (see section 3.2).

10.1.1.1. Semantic tableaux for p-valued logics

We consider L =<ieq, N, D> and a set of symbols that do not belong to the vocabulary of ieq : a0, a1, a2,... an-1. These ai, 0 ≤ in − 1 will be used to indicate the n truth values of the (sub-)formulas in the tableau.

We discover the new method by analogy with the analogy for two-valued logic.

two-valued logic:

1) We are interested in the interpretations v such that v(X) = 1 (i.e. true).

2) Branch B is closed iff PB and ¬PB for P (in X): atomic.

3) X is valid iff tableau closed for ¬X.

p-valued logics (p ≥ 3):

1) We are interested in the interpretations image such that image(X) ∈ D.

2) Branch B is closed iff either ai(P) ∈ B, aj(P) ∈ B for ij and P atomic or B contains a formula on which no rule can be applied.

3) X valid iff the tableau is closed for all aj(X) with j ∈ (ND), meaning that X cannot be evaluated to a value that is not distinguished.

EXAMPLE 10.2.– (N. da Costa). In this example, we show how to adapt the method of semantic tableaux to p-valued logics. Although this is only an example, it is clear that the method can be applied for any finitely valued logic.

Given the following truth tables for connectives ¬,d_arr.gif ∧, and ∨ in a three-valued logic, with distinguished values D = {0, 1}.

equ

In each truth table, the leftmost column corresponds to all possible values of the argument on the left-hand side of the connective (named X below). The first line corresponds to all possible values for the argument on the right-hand side of the connective (named Y below).

We want to show that the following formula is valid:

equ

We translate the truth tables (and try to compress their representation).

imagei(form) (0 ≤ i ≤ 2); form ∈ {X, ¬X,X d_arr.gif Y,XY,XY} means form has value i.

(Although we do not need the connective ∧ in this example, we also provide the corresponding rules).

image

Method: show that it is impossible to evaluate the considered formula to a value that is not distinguished (in this example, value 2).

image

10.2. Inaccurate concepts: fuzzy logic

No man has ever or will ever know anything certain.

Xenophanes (6th Century BC)

Similar to the word “model” (see section 5.2), the word “concept” is very frequently used, for example, in AI (and sometimes, it is badly used). It is a notion that has been studied extensively in philosophy.

Among the technical definitions that can be found in the literature, we choose the following one:

Concept: an idea (of a mammal, a triangle, etc.) that is abstract, general, or can be subject to a generalization.

Concept is the rational way of knowing reality.

A distinction must be made between the concept and the object. The concept is the intensional counterpart1 of the object2.

There are a priori concepts, i.e. concepts that are not a result of experience, unity, plurality, etc. and a posteriori concepts, i.e. general notions that define classes of objects that are either given or constructed (concept of a mammal, a function, etc). Different philosophical schools claim that only one or the other of these concepts is really a concept.

Every concept possesses an extension, possibly empty. The objects are defined by the concepts that indicate the set of characters that belong (or are supposed to belong) to the objects of the class, and to no other object, for example, man: rational animal.

When creating concepts, similarity (analogy) plays a crucial role. We can also say that the notion of similarity (of which we could say that equality is a particular case3) also plays a major role in FL.

We can relate this notion to that of a model. In a model, we choose some characteristics that we wish to be representative and whose goal could be to define the class under consideration.

REMARK 10.4.– The activities of definition (specification) and modeling are at the core of a computer scientist’s activities. They can be viewed from the point of view of the philosophical classification of constructive definitions (specification) and explanatory definitions (modeling). Example of the former: append (see example 6.2) and of the latter small: someone of a size less than (say) 1.70 m.

Inaccurate concepts

A simple way of illustrating the ideas is to oppose mathematical concepts to empirical concepts. The former are exact because they do not accept limit or neutral cases (example: prime number, right triangle,…). The latter accept neutral cases and are inaccurate (example: big, old, etc.).

More generally, in natural languages, there are terms that are vague, i.e. terms (corresponding to concepts) such that if the extension of the concept is the class denoted by image, and if we consider an arbitrary object denoted by image, then there is no defined answer to the question: “is image an object in class image?”. For example, very tall, small, bald, almost old, very beautiful, etc.

The vagueness is different from ambiguity (the same term can denote different objects in different contexts).

Mathematics and logic gave little consideration to vague concepts, more precisely, they use formal languages to avoid these difficulties.

The theory of fuzzy sets and FL (introduced by L. Zadeh in 1965) are an attempt to formalize vague or inaccurate concepts. Fuzzy sets and FL have given rise to many controversies.

Nowadays, the literature on these topics is huge and we may say that the domain is rather (an example of a vague term!) accepted and respected.

image

Fuzziness is inaccuracy and imprecision. The truth of a fuzzy proposition is a matter of degree. It must be distinguished from the probable (i.e. uncertainty as a degree of belief).

The following example clarifies the differences.

EXAMPLE 10.3.– Consider the following propositions:

a) John is young.

b) John will be alive next week.

a) is a vague or imprecise proposition (because of the occurrence of the word “young”).

b) is a precise proposition that is true or false, but we do not know that it is. It can be assigned a probability of being true or false, but the probability is not a degree of truth.

From an observational point of view, the vague concept “young” is in principle concerned with only one observation; however, the probability of a young person being alive next week is concerned with several observations of young people.

Most FL are truth functional (see section 10.1), which is not the case for probabilities:

we know that:

1) ieq

2) ieq

3) ieq

4) ieq

if for example:

(*) prob(p) = 1/2, hence probp) = 1/2

then using (3) with q : ¬p and replacing ¬p by p, which has, according to (*), the same probability, we obtain:

5) ieq

As a conclusion: in the left-hand side of (4), we replaced an event (¬p) by another event with the same probability (p) (see (*)) and we obtain two different values.

Another example (that also shows the difference with accurate propositions, in particular, the non-validity of the law of non-contradiction). From an intuitive point of view, the sentence:

I am old and I am not old.

does not have the value false. This can be formalized, because the basic logic for FL is ieq (i.e. with values in r1.gif), which gives ieq, and which will in general will be ≠0.

Of course, ieq.

The notion of FL has two meanings: a narrow sense and a wide sense.

For the narrow sense, FL is a logical system whose goal is to formalize approximate reasoning (see below the characterization proposed by L. Zadeh). In this sense, it is an extension of p-valued logics.

For the wide sense, FL ≈ theory of fuzzy sets.

A very old paradox seen from a new perspective. A good example of the kinds of problems that are treated by FL is the analysis (of an equivalent version) of the paradox of the heap (see example 8.1).

X denotes the set of all men

SX the set of all small men.

We consider a real interval P = [0.5, 3] that contains all possible sizes.

The function size h : XP.

We assume:

1) ieq (meaning that there exist men who are small).

2) No man has a size less than 0.5 m or greater than 3 m (which yields interval P above).

3) There are men of all intermediate sizes - w.r.t. the minimal difference that can be detected by measuring instruments (this will entail no loss of generality in the reasoning).

4) If xS, yX, and 0 ≤ h(y) − h(x) ≤ 10−3, then yS (we could have chosen a smaller difference in size, e.g. 10-6.

A man that is 1 mm taller than a small man is also small.

5) If xSh(y) ≤ h(x), then yS. A man that is smaller than a small man is also small.

We want to prove that:

All men are small.

The proof is simple. Given any man y, we select a man, say x, who is without a doubt small. We choose x1 such that h(x1) − h(x) ≤ 10−3. By 4, x1S.

By iterating (at most 103(h(y) - h(x)) times), we reach the conclusion that y is small. (If y had been smaller than x, by (5) we would immediately have reached the conclusion).

By applying (4), we prove that all men who are taller than a small man are small.

Where is the problem?

equ

The idea to resolve this paradox is to represent the set S (i.e. the extension of the concept “small man” as a fuzzy set (see definition 10.4), i.e. a class of objects for which the transition from membership to non-membership is gradual rather than abrupt.

FL and fuzzy sets were introduced to take what is called practical reasoning into account, essentially to model those aspects that are hard to handle with classical logic. An example of practical reasoning is the frequent mix of precise and approximative reasoning that is performed when solving problems.

FL deals with fuzzy propositions such as: Patricia is extremely intelligent. Most dogs are nice. Michael is much taller than John, and so on.

One particular characteristic is that not only is the meaning of terms subjective, but it is also local, i.e. restricted to the considered domain of discourse (it is not really the same to be considered as tall by Pygmies or by Scandinavians). Thus, FL can be viewed as a logic whose propositions, connectives, truth values, etc. do not have a universal value. This implies that inference processes in FL are of a semantic nature rather than a syntactic one.

The creator of FL (L. Zadeh) described it as follows:

Perhaps the simplest way of characterising fuzzy logic is to say that it is a logic of approximate reasoning.

The term fuzzy logic is used to describe an imprecise logical system, FL, in which the truth-values are fuzzy subsets of the unit interval with linguistic labels such as true, false, not true, very true, quite true, not very true and not very false, etc.

Some examples are:

EXAMPLE 10.4.–

equ

equ

Fuzzy sets: definition and fundamental properties

DEFINITION 10.4.Let U denote the universe of discourse.

A fuzzy set A is defined by the generalized characteristic function:

equ

(the range of a standard characteristic function is {0, 1}, hence ordinary sets are considered as particular cases in the theory of fuzzy sets).

Interpretation: the values of ieqA(x) are interpreted as degrees of membership of x to A (0 denotes non-membership, 1 denotes membership, and 1/2 could represent semi-membership).

EXAMPLE 10.5.– Consider the property “to be much greater than 1”, defined on r1.gif+. It can be assigned to a fuzzy set A with ieqA : r1.gif+ → [0, 1], where ieqA is an arbitrary function that is continuous and non-decreasing such that, for example:

equ.gif

EXAMPLE 10.6.– The characteristic function of the set of people who can be professional football players could be:

image

The usual operations are (in general) defined as follows.

DEFINITION 10.5.– (set operations on fuzzy sets).

– A fuzzy set A is empty iff ieq.

Two fuzzy sets A and B are equal (denoted by A = B) iff ieq.

ieq

(if an element is somehow an element of A, then it must be an element of B in at least the same degree).

ieq (for all x, we shall thus write ieq.

ieq (for all x, we shall thus write ieq.

ieq (for all x, we shall thus write ieq.

EXAMPLE 10.7.–

image

EXERCISE 10.1.– (properties). Prove that if A, B, and C are fuzzy sets:

a) AB is the smallest set containing A and B.

b) AB is the biggest set contained in A and B.

c) A ∪ (BC) = (AB) ∪ C

d) A ∩ (BC) = (AB) ∩ C

e) ieq

f) ieq

g) C ∪ (AB) = (CA) ∩ (Cb)

h) C ∩ (AB) = (CA) ∪ (CB)

DEFINITION 10.6.– A fuzzy n-ary relation in E is a fuzzy set in En.

EXERCISE 10.2.– Give the generalized characteristic function for the fuzzy relation in ieq (i.e. (x, y) ∈ ieq2): x is much smaller than y.

EXERCISE 10.3.– (relations). The composition of two binary relations R and S, denoted by R o S, is defined as follows.

x(R o S)y iff ∃z such that xRz and zSy.

a) How would you define the composition of two fuzzy binary relations, i.e. what would the generalized characteristic function ieqRoS be (as a function of ieqR and ieqS)?

How would you define the generalised characteristic function of a fuzzy relation R that is:

b) reflexive?

c) symmetric?

d) antisymmetric?

e) transitive?

The most commonly used basic logic for FL is Łukasiewicz’s ieq (i.e. the set of truth values is the real interval [0, 1]). To the qualifiers true, very true, more or less true, neither true nor false, etc. fuzzy sets of the interval [0, 1] will be assigned. This choice is necessary to be able to manipulate the logic practically. In general, we are interested in a finite (small) number of truth values.

These ideas were formalized by L. Zadeh, with the concept of linguistic variables.

More precisely, a linguistic variable is a 5-tuple:

equ

where:

image is the name of the variable (i.e. age, size, etc. from a qualitative point of view, by opposition to a quantitative point of view, with values 32 years, 1.75 m, etc.).

image is the set of linguistic variables (young, not young, very young, not very young, etc.).

U is the universe of discourse (i.e. the set of values that the variable may take).

G is the set of production rules that generate image.

M is the semantic rule.

equ

where image : set of parts of U, i.e. the set of all fuzzy subsets of U.

The idea behind the concept of linguistic variables is to obtain the fuzzy value of different terms as a function of those terms that were chosen as basic terms.

EXAMPLE 10.8.– (L. Zadeh). Linguistic variable: size

Basic term: small

ieq = {small, not small, very small, very (not small), not very small, very very small, etc.}

is generated by the grammar (with axiom S):

SA

Snot A

AB

image

Bvery B

B → (S)

Bsmall

If we assume that the generalized characteristic function (see definition 10.4) of the extension A of a concept is ieqA, then in general, we take (other similar choices are of course possible):

equ

graphical meaning:

image

equ

If the grammar had authorized it, we could have let, for example:

equ.gif

EXAMPLE 10.9.– (syntax and semantics). The symbols occurring in the rules on the right-hand side correspond to the fuzzy subsets of [0, 1]. Of course, indices l and r correspond to left and right (of a rule).

EXERCISE 10.4.– Does the grammar of example 10.9 permit us to generate the linguistic variable not very true and not very false? If so, compute its truth value.

Fuzzy sets permit us to obtain an elegant solution to the paradox4.

All men are small.

The only remark to be made is that not all generalized characteristic functions correspond to the common notion of “small”. We define:

equ.gif

for some function f: X → P, which, to represent the extension of the concept “small man” must be5:

1) continuous (our appreciation changes progressively);

2) monotonically decreasing;

3) f(h(x)) = 1 (or f(h(x)) ≈ 1), for those men that are certainly small;

4) f(h(x)) = 0 (or f(h(x)) ≈ 0) for those men that are certainly not small.

For example:

image

We define the fuzzy relation “smallness of y with respect to x

equ

we assume there exists an xX such that ieqX(x) = 1 (there is a man who is certainly small), for example, h(x) ≈ 0 (i.e. his height is almost zero).

Consider yX such that h(y) ≥ h(x), we may construct a finite sequence x = x0, x1, x2,… xN-1, xN = y and:

equ

This formula translates the intuition that, as the process is iterated, the conviction that y is small disappears. Indeed, the product of the values in ]0, 1[ gets closer and closer to 0. The mathematical details are simple and are left to the reader.

REMARK 10.5.– We may see the origin of the paradox of small men (of the heap of grains, etc.) in the fact that each time a new man is considered, the memory that in general there were many men taller than someone small was lost. The treatment proposed above restores this memory.

10.2.1. Inference in FL

L. Zadeh proposed to treat the gradual membership of element x to a fuzzy set A by fixing values image, image; 0 < image < image < 1 and to use as a convention:

if ieqA(x) ≤ image, then x does not belong to A;

if ieq, then x belongs to A;

if ieq, then the membership of x to A is undetermined.

From the point of view of the corresponding logic, this would lead to a logic with three truth values: T, F, i (see section 10.1).

The method proposed above can be viewed as a generalization of this simple idea.

It is natural to try to apply inference rules from classical logic to non-classical logics. For example, the resolution method was extended (by C. Chang) more than 30 years ago from FOL to first-order FL. But first we must formally define the semantics of FL.

10.2.1.1. Syntax

The same as for classical logic.

10.2.1.2. Semantics

The only difference with FOL (that corresponds to intuition) is that every predicate P(n) is not interpreted as an n-ary relation (i.e. the set of n-tuples of the relation) in the universe of discourse, but as a fuzzy set, or more precisely the corresponding generalized characteristic set (in other words, to every n-tuple (x1,…, xn), we do not assign true or false to P(n) (x1, …, xn), but a value in [0, 1], which is given by the characteristic function).

It is thus possible to view classical logic as a particular case of FL.

The semantics (i.e. the value given to) composed wffs is given by the function v defined below.

DEFINITION 10.7.–

equ

or:

equ

An interpretation ieq satisfies (respectively, falsifies) a formula s_f.gif iff s_f.gif (respectively < 0.5).

REMARK 10.6.– In general,

equ.gif

EXAMPLE 10.10.– Consider the wff ∀xyP(x, y)

on domain D = {a1, a2}

equ.gif

The next step is to find a universe on which it is sufficient to focus to test whether a wff is contradictory.

10.2.2. Herbrand’s method in FL

The key property (see theorem 5.7) is that given a set of clauses S, we can associate to any interpretation ieq of S a Herbrand interpretation ieq such that if ieq then ieq.

We show this on an example.

EXAMPLE 10.11.– Consider the clause:

equ

and the interpretation image:

equ

This interpretation is a model of ieq. Indeed, ieq

The corresponding interpretation ieqH is:

equ.gif

Semantic trees are defined in the same way as for FOL, by replacing L with ieq and ieq by image(L) < 0.5.

10.2.2.1. Resolution and FL

In PL, the resolvent is a logical consequence of the parent clauses (i.e. every model of the parent clauses is a model of the resolvent).

We first analyze an example the way the resolution rule for PL transmits truth values when it is applied to FL.

EXAMPLE 10.12.– Consider the two following clauses:

equ

(whose resolvent is Q)

with:

equ

hence, there is a difference with classical PL:

equ.gif

This example is a particular case of the following theorem.

THEOREM 10.1.– Let C1 and C2 denote two clauses and ieq(C1, C2) denote a resolvent.

If max[image(C1), image(C2)] = b and min[image(C1), image(C2)] = a > 0.5

then:

equ.gif

PROOF.– C1: Pimage, C2: ¬Pimage (image and image: disjunctions of literals)

ieq (C1, C2): imageimage

Without loss of generality, we assume:

* ab and:

ieq

from (1) and (2) and by definition of max:

ieq

there are two cases to consider:

i) image(image) = image

equ

ii) image(image) < a, from (1) image(P) = a

By hypothesis a > 0.5, hence imageP) = 1 − image(P) < 0.5 < a from (2) and *image(image) = = ba

equ

This theorem can easily be generalized.

THEOREM 10.2.– Consider the set of clauses S = {C1,..., Ck} (k ≥ 2)

If ieq

then

equ.gif.

10.3. Modal logics

In a dictionary on language sciences (O. Ducrot and T. Todorov), it is written:

Logicians and linguists have often deemed it necessary to distinguish, in an enunciation act, a representative content, sometimes called dictum (relating a predicate to a subject), and an attitude expressed by the subject speaking of the content, that is the modus or modality.

The four following sentences have the same dictum, but different modalities:

– Peter will come.

– Let Peter come!

– It is possible that Peter will come.

– Peter must come.

Modal logic studies modalities, i.e. the logical operations that qualify the assertions (or more precisely that qualify their truth modes). We will say, for example, that a proposition is necessarily true, or possibly true, or that it has always been true, or that it will be true one day, or that we know that it is true, or that it should be true, etc.

Originally, modal logic was designed to characterize the possible and necessary truths. It was progressively extended to the study of statements on knowledge, beliefs, time, ethics, properties of programs, etc.

They are particularly relevant in AI, for example, in the so-called description logics for ontologies and in multi-agent systems.

The concepts of necessity and of possibility have been studied by philosophers, ever since Aristotle, at least.

It is generally acknowledged that C.I. Lewis (in the 1930s) first studied these modalities. His analysis was mainly concerned with strict implication, thus called to be distinguished from material implication and its paradoxes (see exercise 3.7).

A strictly implies B (written AB) iff A∧ ¬B is impossible, or, with modern notations,

¬image(A image ¬B) or, equivalently, image(A d_arr.gif B)

where ieq is interpreted as possible and image is interpreted as necessary.

Little by little, the study of the connectives ieq and image themselves became a field by itself.

For example, in computer science, when referring to a non-deterministic program, means image “in every execution that halts, A is T”, whereas ieq A means “there exists an execution that halts with A T”.

Gödel was interested in modalities from the point of view of provability.

For historical purposes, i.e. Aristotle’s philosophy and its huge influence on Western culture, those that were the most studied are:

– ontic modalities: it is necessary (possible, impossible,…) that…

– epistemic modalities: Peter knows that…

– temporal modalities: since, starting from, until, he always thinks that…

– deontic modalities: it is forbidden, it is permitted, it is illegal…

Other modalities:

– doxastic modalities: we believe that,…

– metalogic modalities: it is provable that, this is satisfiable,…

There exist propositional, first-order and higher-order modal logics. We restrict ourselves to propositional modal logics (we will see they have a great expressive power).

One essential characteristic of these logics is the fact that their connectives are not truth functional.

EXAMPLE 10.13.– (See example 2.6.)

Consider the conjecture (we could have chosen any other one)

P = NP

It could be T or (exclusive or) F, but the propositions:

ieq (P = NP): It is possible that P = NP

ieq(¬ (P = NP)): It is possible that P ≠ NP

could both be considered as true propositions. We thus have (no matter the value given to P = NP):

image

On the other hand:

1 + 1 = 3 is F and it is normal to consider image(1+1 = 3) as F. In other words, we do not imagine that it is possible for 1 + 1 = 3 to be T.

By replacing F in (*) by 1 + 1 = 3 (i.e. by F), we obtain:

equ

By considering propositions (*) and (*) (*), we verify that ieq (connective meaning possibly) is not truth functional.

10.3.1. Toward a semantics

There exist relational (possible worlds), algebraic, and topological semantics.

We will study the semantics that is best known: the semantics of possible worlds.

The notion of possible worlds seems to have been around at least since Leibniz. Wittgenstein used to talk of the possible state of things.

In a book that is considered as a reference in logic (and that ignores modal logic)6, it is written:

“…Therefore, a ‘possible world’, or model of [a language] ieq …”

This seems to show that the transition from one to the other (i.e. from classical logic to non-classical ones) can be made naturally via the notion of possible worlds.

To better see the relationship between the notion of a model in classical logic and that of a possible world, we consider the formula: [(AB) d_arr.gif C] ⇔ [(A d_arr.gif C) ∨ (B d_arr.gif C)]

and the interpretations:

equ

We can imagine that each Ii(1 ≤ i ≤ 4) corresponds to a situation, a moment, a state, a state of knowledge, a world, etc.

These interpretations can be represented as worlds (we only provide the elementary propositions that are evaluated to true).

image

We can also add a relation between these worlds (which leads to the so-called Kripke semantics or the possible worlds semantics).

image

And instead of saying:

(CL) Formula imageis satisfied (or not satisfied) by interpretation I

we will say:

equ

It will also be possible to construct formulas mentioning different worlds.

REMARK 10.7.– The notion of possible worlds is familiar (not necessarily with the same name) in the study of probability theory. For example, when analyzing the logical possibilities of the results of an arbitrary experiment (for example, throwing a die).

More formally.

10.3.1.1. Syntax (language of modal logic)

< formula >::=< atomic formula >| ¬ < formula >|< formula > ∧ < formula >|< formula > ∨ < formula >|< formula >d_arr.gif< formula >|< formula >⇔< formula >|image <formula > | box < formula >

< atomic formula >::= Pi (i ≥ 0) 7

or more concisely:

equ

One of the characteristics of modal logics is their “parametrized flexibility”. This sentence, whose meaning will be made accurate in what follows, is metaphorical and expresses the fact that the new connectives that are introduced by these logics can be used to express very different characteristics in very different domains. For example, consider the different meanings of formula imageimage:

imageimage: necessarily image

imageimage: in the future always image

imageimage: it must be the case that image

imageimage: we know that image

imageimage: we believe that image

imageimage: after any execution of the program has halted, we have image

imageimage: in Peano’s arithmetic we can prove image

10.3.1.2. Semantics

DEFINITION 10.8.– (Kripke semantics, Kripke models).

A modal frame or K-modal frame is a pair s_f.gif = (W, R)

where W is a non-empty set and R is a binary relation on W.

– A Kripke model8 is a triple:

equ

with:

W: non-empty set (of worlds);

– R: binary relation on W (accessibility relation);

image : ieq or image : ieq

(ieq: set of all atomic propositions (formulas). ieq and image, respectively, denote the sets of parts of ieq and image);

image is called the interpretation function or valuation

(image indistinctly provides the set of worlds in which a proposition is evaluated to T or the set ofpropositions that are evaluated to T in each world)

– we shall indistinctly say wW or wimage;

– the frame s_f.gif is called the base of the model.

– The domain of image is extended to the set of formulas (image is an arbitrary formula) ieq.

DIGRESSION 10.1.– Kripke defined propositions as functions from the set of worlds to the truth values and an n-ary predicate as a function from the set of worlds to the set of n-ary relations.

DEFINITION 10.9.– (satisfiability, validity). In what follows, image is a wff and Mod is a set of Kripke models ieq.

ieq means: the formula image is satisfied in the world w of the model ieq. The relation |= is defined as follows (Mod denotes a set of models):

equ

ieq iff for all w' such that R(w, w'): ieq

imageimage: necessarily image, or image is necessarily T 9;

ieq iff there exists w' such that R(w, w'): ieq

boximage: image is possible or p is T in a possible (accessible) world;

image is Mod — satisfiable iff there exists ieq Mod and ieq such that ieq, w true.gif image;

image is valid in model ieq (ieqMod), written ieq iff v(image) = W (i.e. iff for all wW ieq

image Mod — valid iff image is valid in all models in Mod;

image is valid in modal frame (W, R) iff image is valid in all models admitting (W, R) as a base;

– A model is finite iff the set of worlds of the modal frame is finite; it is infinite otherwise10.

REMARK 10.8.– The underlying notion of necessity in the definition is not always related to a necessity in the real world, as we see it. There could be, for example, several parallel worlds (or flows of time) that coexist (that flow at different speeds) and we only have access to one of them.

We obtain different logics by imposing conditions (in particular none) on the accessibility relation. The best-known logics are K, S5, S4 etc.

These logics can also be characterized by formal systems.

EXAMPLE 10.14.– Consider the Kripke model represented by the graph below. We indicate the corresponding interpretation in each world. The edges of the graph define the accessibility relation.

image

In this model ieq, we have:

equ

10.3.2. How to reason with modal logics?

Similar to classical logic, there are different approaches. We shall study the following ones:

– syntactic approach (formal systems);

– direct approach;

– translation approach.

In the first approach, we provide a formal system (see section 3.3) for the logic under consideration. In the direct approach, we give inference rules for the modal connectives, and in the translation approach, we translate formulas into FOL and reason in FOL11.

We present here the syntactic approach and the translation approach for some propositional modal logics. The direct approach is treated for temporal logics, which can be considered as particular cases of modal logics.

10.3.2.1. Formal systems approach

We specify those that are best known. The language is still the one of section 10.3.1.1.

DEFINITION 10.10.

The minimal logic K (K as Kripke)

ieq (axioms):

All tautologies of PL12 un.gif image(P d_arr.gif Q) d_arr.gif (imageP d_arr.gif imageQ).

ieq (inference rules):

MP: ieq necessitation :ieq substitution % see remark 3.20

Other logics

– Inference rules: the same

– Axioms:

T = K un.gif {imageP d_arr.gifP}

D = K un.gif{imageP d_arr.gif boxP}

S4 = K un.gif {imageP d_arr.gif P,imageP d_arr.gifimageimageP}

S5 = S4 un.gif {boximageP d_arr.gif P}

G = K un.gif {image(imageP d_arr.gif P) d_arr.gifimageP}

EXERCISE 10.5.– Give the proofs (deductions) that correspond to the following theorems:

a) image

b) image

c) image

d) image

e) image

f) image

g) image

h) image

10.3.2.2. Translation approach

The connectives that are difficult to translate are image and box. The translation function (Andréka, Németi, and Van Benthem) tr (see definition 10.9) is defined as follows:

image

P: propositional symbol.

X: free variable, denoting the world in which we are.

y: fresh variable not occurring in the formulas to be translated.

R: accessibility relation.

The idea behind this translation is of course the capture of the semantics imposed by the definition: P(y) (respectively, image(y) should be interpreted as P (respectively, image) is T in world y.

EXAMPLE 10.15.– The formula whose validity is to be proven is:

equ

This is the “specifically modal” axiom of logic K (see definition 10.10). Note that no particular property is imposed on the accessibility relation (this should be compared with exercise 10.7).

image

EXAMPLE 10.16.– Using the translation method and the method of semantic tableaux:

a) give a Kripke model ieq such that in a world w1: ieq,w1 |= image(P d_arr.gif Q) and ieq, w1 m_op1.gif (P image Q);

b) give a Kripke model ieq such that in a world w1: ieq,w1 true.gif P d_arr.gif imageQ and ieq,w1 m_op1.gif image(P d_arr.gif Q).

a) We consider the set {image(P d_arr.gif Q), ¬(P d_arr.gif imageQ)}

Translation:

equ

REMARK 10.9.– The constants that are introduced by variable instantiations (here a) correspond to accessible worlds.

The branch going through leaf 12 cannot be closed by instantiating the only formulas that can still be instantiated.

We have constructed the desired model:

image

b) We consider the set {P d_arr.gif imageQ, ¬image(P d_arr.gif Q)}

Translation:

equ

equ

We have constructed the desired model:

image

EXERCISE 10.6.– (sea battle argument). Express the following argument (given by Aristotle) in modal logic and verify its correctness using the translation approach.

(Similar to classical logic, given an argument expressed in a natural language, the classification of this argument as correct or incorrect will generally depend on the way it is translated.)

If I give the order to attack then necessarily there will be a sea battle tomorrow. Otherwise, necessarily, there will be none. I either give the order or I do not give it. Hence, necessarily, there will be a sea battle tomorrow, or necessarily there will be none.

Two classical translations of the sea battle argument are given below:

P: “I give the order to attack”;

Q: “There will be a sea battle tomorrow”.

equ

EXAMPLE 10.17.– (accessibility relation for T, S4, S5). We can use the translation method and the method of semantic tableaux to try to discover the sufficient properties that are required of the accessibility relation so that the following axioms are valid.

equ

equ

If R is reflexive, i.e. ∀xR(x, x), then we can graft to each branch of the tree:

equ

and the two branches can be closed.

equ

If R is symmetric, i.e. ∀xyR(x, y) d_arr.gif R(y, x), then we can graft to each branch of the tree:

equ

and the two branches can be closed.

equ

If R is transitive, then the branch on the left-hand side can also be closed, by grafting to node 11 the tree below:

equ

As an example, we give here a proof of this property that is representative of the properties that appear in textbooks on this topic.

imageimage imageimage image is valid in a modal frame iff the accessibility relation R is transitive.

if

We show that if R is transitive and we have

equ

then we also have:

equ

R is transitive, i.e.:

equ

hence, if formula imageimage is satisfied in world w1, then it is satisfied (see definitions 10.8 and 10.9) in all worlds that are accessible from (related to) w1; hence, it is satisfied in w3, so that image(imageimage) is also satisfied in w1.

only if

We prove the contrapositive, we assume that R is not transitive and we need to prove that:

equ

and that:

equ

to do so, we propose, for example, image(w2) = image; image(w3) = ¬image.

This valuation is possible as R is not transitive; hence, we may assume that we have ¬image in w3. This would not be possible if R had been transitive, as we assumed:

equ

and if R had been transitive, w3 would have been accessible from w1, and by definition of the semantics, imageimage is T in a world iff image is T in all worlds accessible from the world we are starting from (see definition 10.9).

With our approach, we could also have proved the only if part of the theorem (the if part has already been proved):

If R is not transitive, we graft to node 11. the tree:

image

By considering the branch going through nodes 7, 9, 10, 11, 13, and by applying the same method as in example 10.16, we obtain the Kripke model (interpretation):

image

which verifies:

equ

and:

equ

EXERCISE 10.7.– Use the translation method and the method of semantic tableaux to discover (sufficient) properties of the accessibility relations of the modal frames in which the following formulas are valid:

a) image

b) image

c) image

d) image

e) image (called Geach’s axioms)

A question that naturally arises is whether other axioms (formulas) correspond to other simple relations, or if we can always express these relations by FOL formulas.

REMARK 10.10.– The problem of knowing whether an arbitrary wff of modal logic is satisfied by Kripke frames that are characterized by FOL formulas is undecidable. But Sahlqvist’s theorem guarantees that a (syntactically characterized) class of formulas of modal logic corresponds to Kripke frames that are definable by FOL formulas. Sahlqvist’s class is decidable.

The two following formulas do not belong to Sahlqvist’s Class.

McKinsey’s axiom:

(MK) imageboximage image boximage

(S4.1 = K un.gif {MK})

and Löb’s axiom:

equ

cannot be defined by a set of wffs of FOL.

Furthermore, there exist formulas of modal logic that are valid in frames that are characterizable by FOL formulas, but are not equivalent to formulas in Sahlqvist’s class, for example:

equ

As a consequence, we can consider the expressive power of the PLs.

Propositional modal logic corresponds to a fragment of SOL.

As can be seen, for example, in exercise 10.6, Aristotle was interested in other arguments than those of his syllogistic (see section 2.2). A well-known argument that he proposed on the need to philosophize is part of deontic logic:

We must philosophise or we must not philosophise. If we must philosophise then we must do so. Otherwise, we must still philosophise (to justify this point of view). Therefore, in all cases, we must philosophise.

EXERCISE 10.8.– How would you define the semantics of the following operators from deontic logic, using the semantics of possible worlds?

Oimage: image is compulsory

Fimage: image is forbidden

Pimage: image is allowed

Eimage: image is optional

10.4. Some elements of temporal logic

When I am not asked, I know what time is; when I am asked, I no longer do.

Saint Augustin

The notion of time is essential in philosophy, in physics, in biology, and also in computer science (sequential and parallel algorithms, time complexity of algorithms, etc.).

In philosophy, it leads to different positions, it suffices to consider Parmenides for whom “nothing changes” and Heraclitus for whom “everything changes”. See also exercise 10.6. This notion is closely related to notions of space, matter, energy, evolution, etc. Its conceptualization poses huge problems and it is difficult to try avoiding them, as time is implicitly or explicitly present in all human activities (and all around us).

We often use metaphors to mention time (the flow of time, time’s arrow, etc.).

Similar to other topics, a logical study of time only aims at capturing some aspects that are relevant for the class of problems that are studied.

The notion of organization of instants:

image

seems to be the most fundamental one.

In mathematics (in the statement of definitions, theorems, etc.), we are not interested in time. This is not the case in natural languages. Linguists have studied the way time can be taken into account in logical structures.

For example (as is also common in physics): vt0,vt1,vt2, …(i.e. the speed of an object at times t0, t1, t2,…) or by explicitly introducing time as a parameter: interests(x, ieq), alive(x, t), etc.

We can also (and this possibility is of a particular interest to us) treat instants as possible worlds (now corresponds to the world we are currently in). The accessibility relation will be a total order (linear time) or a partial one (branching time13).

Reference to the past and to the future depends on languages and is not always that obvious. For example, in English, we say:

– the plane is scheduled to land at 6:30 pm (and it is currently 3:10 pm, we use the present tense to talk of the future);

–I now understand the rules of rugby (we are using the present tense to talk of an event that belongs in a large part to the past: the understanding of the rules).

Problems about the nature of time and temporal concepts have been a concern for philosophers at least since there has been a written philosophy.

Time is part of our sensory experience (at least indirectly), and our discourse relies heavily on time (in the sentences above, for example).

Classical mathematical logic abstracts time away. The propositions that are obtained from sentences from everyday life are sometimes very artificial.

The goal of temporal logic (also called tense or change logic) is to systematize reasoning with propositions involving temporal aspects.

The Megarian and Stoic Greeks studied theories for these logics.

Modern temporal logic was initiated by the work of logician and philosopher Arthur N. Prior in the 1950s. Prior systematically studied the links between temporal and modal logics.

The representation (picture) of time as dots that denote instants without duration as basic entities is frequently used. Examples are graphs representing functions of time (speed, distance travelled, GDP, unemployment rate, etc.) that are used to teach physics, economy, etc. where in each instant (point) on the abscissa is assigned an ordinate point.

We must distinguish between the graphs of functions that correspond to continuous phenomena and those that correspond to discrete phenomena.

Although common, the concept of time as a set of instants without duration is a very abstract model, compared to our perception in everyday life, and it is similar to the notion of a point in space (what exactly is a region of space, say, a sphere of radius 0?).

Another basic entity that is widely used is the period or interval.

Depending on the problem under consideration, one or the other notion will be better adapted.

In particular, there are problems that cannot be solved with one of these notions, but can be with the other (or not even be problems at all for the other). Here is an example dating at least from the Middle Ages: at the precise moment a man dies, is he alive or dead?, or at the precise moment afire is put out, is it burning or extinct?

This problem cannot be solved with the notion of instants, but with the notion of time interval, there is no problem at all: the man died between 2 and 3 am, if he was alive at 2 am and dead at 3 am.

If we use a discrete model for time, then there is no problem either.

Under some circumstances, it is more interesting to consider both notions at the same time. For example, if we say Alice and Bob took the 6:42 train and chatted throughout the entire trip, we are likely to be considering the departure time as an instant and the duration of the trip as an interval.

When the basic entities are instants, the relations before and after are used.

For periods or intervals, the following relations (among others) are used: overlap, non-overlap, meet, start and finish

We provide a graphical representation and logical translation of each relation, using as primitive relations the precedence relation (ieq) and the interval inclusion relation (ieq). Given two intervals A and B, they can be related the following ways (among others):

.Overlap:

image

(Note that x denotes an interval.)

.Non-overlap::

image

.Meet:

image

.Start

image

.Finish

image

10.4.1. Temporal operators and semantics

As was mentioned in section 10.4, Prior introduced modern temporal logic essentially for philosophical purposes and he naturally focussed on linguistic constructions.

Natural languages and the inferences that are performed on their sentences provide examples of the peculiarities that come up when time is taken into account.

For example, from The weather is nice and The weather is cold, we can (correctly) deduce that The weather is nice and cold. From The weather will be nice or The weather will be cold we can deduce The weather will be nice or cold. But starting with The weather will be nice and The weather will be cold, we cannot (correctly) deduce that The weather will be nice and cold.

We introduce the following operators:

Fimage: at some point in time, it will be the case that image (it will be the case at least once);

Pimage: it was the case at some point in time that image (it was the case at least once);

Gimage :def ¬F¬image: it will always be the case that p;

Himage :def ¬P¬image: image was always the case in the past.

For the Megarians (philosophers of the school of philosophy founded by Euclid of Megara)14:

– the real is what is realized (i.e. true) now;

– the possible is what is realized (i.e. true) at some arbitrary time;

– the necessary is what is true all the time.

With modern notations, we have the following respective translations for “possible” and “necessary”:

aaa

For Stoicians:

– the real is what is realized (i.e. true) now;

– the possible is what is realized (i.e. true) now or at a future time;

in other words:

a proposition is possible if it is true or will be true;

– the necessary is what is realized (i.e. true) or will be realized at all future times.

With modern notations, we have the following respective translations for “possible” and “necessary”:

aaa

Note that with one or the other definition, “possible” and “necessary” keep their usual relationship.

10.4.1.1. A famous argument

Some Greek philosophers credit the Greek philosopher of the Megarian school Diodorus Cronus (– 4th) with a famous argument (that was analyzed by Prior) called the Master argument, that seems to be the following:

D1) Every true proposition about the past is necessary;

D2) An impossible proposition cannot be a consequence of a possible proposition;

D3) There exists a proposition that is possible, but is not and will not be true.

Diodorus concludes (the reasoning to reach the conclusion is unknown) that the set {D1, D2, D3} is unsatisfiable, which allowed him to characterize:

– the possible as what is or will be true;

– the impossible as what, being already false, will not be true;

– the necessary as what, being true, will not be false;

(Some authors translate: the necessary as what is and will be always the case (true).);

– the unnecessary as what is already false, or will be false.

Logicians and modern philosophers tried to reconstruct and formalize Diodorus’s reasoning to conclude that {D1, D2, D3} is unsatisfiable.

One of the formalizations translates Diodorus’s assertions as follows:

image

and adds the following two assertions that were (according to historians) accepted by the Greeks.

image

The proof of the unsatisfiability of {D1', D2', D3', D4, D5}:

image

image

The logic that corresponds to Diodorean logic is:

image

meaning that we add Geach’s axiom to S4 (see exercise 10.7), which is valid in directed frames (directed relations correspond to the so-called (strongly) confluent in λ-calculus):

image

For temporal frames (T, ≤), it is often expressed as:

image

10.4.2. A temporal logic

Given the language of PL enriched with operators F, P, G, H and formally defined by the following syntactic rules:

image

a model for this language is a triple ieq

where:

T is a set of instants.

We restrict ourselves as follows:

ieq is a total order relation (see definition 3.25) % Other hypotheses (density, etc.) lead to other temporal logics

image (valuation) image →2T

(where ieq is the set of basic symbols and 2T is the set of subsets of T)

We will say that image is true at instant t in model ieq, which will be denoted by ieq and by ieq, where relation ieq is defined inductively as follows:

ieq iff tv(P) % P : propositional symbol

image

A wff image[t] is T iff (T,<,v) = image[t] for every valuation v; image is T, noted = image, iff image[t] is T for all tT.

10.4.3. How to reason with temporal logics?

Similar to modal logics, we can envisage formal systems or translation methods. For the temporal logic defined in section 10.4.2, we will propose a direct method: the method of semantic tableaux. We apply the same process as for classical and many-valued logics. We shall start by the rules that define the semantics to define the syntactic rules (that respect this semantics) that will allow us to construct the tableau.

As in its previous versions, the method of semantic tableaux enumerates the sets of (partial) models of a set of temporal formulas, in which time occurs. The considerations of section 3.2 about the usage of the method apply.

The following remarks are important:

– there is a new parameter: time. As mentioned, we restrict ourselves to totally ordered instants:..., t-n,t-(n-1),..., t-1,t0, t1,t2, …,tn,…;

– the contradictions that permit us to close a branch (and thus to conclude that it is impossible to construct the potential model that is represented by the branch) between a formula p and its negation ¬image must correspond to the same instant (which is rather obvious: there is no contradiction in the fact of being alive at instant ti and not alive at instant tj, where ij!);

– when applied to a formula image at instant ti, the rules that are concerned with connectives ¬, ∧, ∨, d_arr.gif, xand ⇔ will permit us to replace image by the formulas denoting all models of image at the same instant ti. As usual, we shall tick image as already analyzed (√);

– rules F, P, ¬G, and ¬H below apply only once and will therefore be ticked after being used (√). Rules ¬F, ¬P, G, and H below can be applied at any desired instant. We will not be interested in the problem of decidable subclasses.

The rules are therefore the following:

10.4.3.1. The method of semantic tableaux

equ

EXAMPLE 10.18.– Prove the validity (or non-validity) of the formula below, using the method of semantic tableaux:

equ

Of course, every ti is different from the other tj’s (i j).

The considered formula is not valid. Counter example (i.e. model of its negation):

{image|[t1] ¬image[t2]}.

EXERCISE 10.9.– Use the method of semantic tableaux to prove the validity (or nonvalidity) of the formulas below:

equ

EXERCISE 10.10.– Prove the validity of formula:

equ

using the method of semantic tableaux.

The two following operators are particularly important in computer science and confer a great expressive power to temporal logic. These are: since and until:

image: it was the case that image and since then and up till now (t0) it was the case that ψ.

image: it will be the case that image and from now (t0) until then it will be the case that ψ.

image

Their translation in FOL is:

equ

In computer science, temporal logics are used in program verification, AI, databases, etc. They are used for the analysis and proofs of properties verified by many systems, including those that are concurrent, non-deterministic, real time, that do not end, etc.

The following relations are often studied in temporal logic:

irreflexivity: ieq

asymmetry: ieq

transitivity: ieq

there exists a beginning: ieq

there exists an end: ieq

linearity: ieq

discrete: ieq ieq

(To be compared with the notion of density below.)

density: ieq

Warning: density ≠ continuity

DIGRESSION 10.2.– (on continuity). A total ordering (S, R) is continuous if it does not contain any gap.

A cut is a partition (X, Y) (i.e. SX un.gif Y and X uw.gif Y = ieq) such that:

ieq

A gap is a cut (X, Y) such that X has no last element and Y has no first element.

((q.gif, ) has gaps: take the cut image

REMARK 10.11.– The formula (Gp d_arr.gif PGp) d_arr.gif (Gp d_arr.gif Hp) is valid in all continuous frames.

10.4.4. An example of a PL for linear and discrete time: PTL (or PLTL)

10.4.4.1. Syntax

equ

o : next

box : eventually

image: always

U : until

10.4.4.2. Semantics

image= (T,S, image)

T: a finite or denumerable set of states (instants)

S: total function (successor function): T → T

s0: first element

(as usual, we shall note Si(t):

equ

An interpretation (model) satisfies a formula image iff ieq

10.4.4.3. Method of semantic tableaux for PLTL (direct method)

It is based on the same principle as the method used for PL and FOL, i.e. we enumerate all the models of a set of formulas. To do so, we propose the following rules that translate the semantics of the connectives that appear in the syntax (section 10.4.4.1). The rules that correspond to the classical connectives are the same as those for PL.

Rules:

equ

→: decomposition of the wffs (independent of time). % see below;

d_arr.gif: corresponds to moving forward one unit of time. % see below.

Once the principle mentioned above has been accepted, two new problems related to time arise:

1) the rules on box and U;

2) how to close a branch.

1) These rules can generate a potentially infinite tree, if we systematically choose the rightmost branches (they replace a formula by another formula containing the same connective).

To solve this problem, each time a formula is replaced, it is marked as already developed but still kept (we memorize that it has been developed). If at a given time (see below (*)), say tk, we obtain the same set of formulas as one occurring at a previous instant, say ti, we add an edge: tk → ti. This will lead to a directed graph instead of an infinite tree (see digression 3.1).

2) If at a given instant we have two contradictory literals Pi and ¬Pi, then the model we are trying to construct is not viable and we say the branch is unsatisfiable.

If there is a cycle in a branch, this means that an event (boxf, f U g) will never occur. We will therefore never be able to construct a model on the branch containing the cycle. The node from which the branch originates will be classified as unsatisfiable.

Of course, if all descendants of a node n are unsatisfiable, then n will be classified as unsatisfiable.

(*) …But we still have not mentioned time!

Here is how we proceed: we first decompose (i.e. we apply the rules on) the formulas that do not begin with o. When they have all been decomposed, we decompose those beginning with o (we move forward one unit of time).

EXERCISE 10.11.– Use the direct method for PLTL to prove the unsatisfiability of the following formula:

equ

You will need to propose a rule for imageP (use the definition from section 10.4.4.1).

EXAMPLE 10.19.-

equ

equ

equ

1) is satisfied by every interpretation such that every state in which P is true is followed by a state in which Q is true.

2) is satisfied by every interpretation such that if P is true in a state then starting from the next state, Q is always false until the first state in which R is true.

3) is satisfied by every interpretation in which P is true infinitely many times.

EXERCISE 10.12.– Can you justify the assertions (1), (2), and (3) of example 10.19, using the translation method and the direct method of semantic tableaux?


1 The intension is the set of characters that are (considered as) essential to a class.

2 What possesses an existence of its own, independently of the knowledge or the idea that cognitive beings can have about it.

3 For example, in practice, saying that two objects have the same color means that the potential differences are not detectable by the sense (or device) that detects colors.

4 The solution is given by J. Goguen.

5 The required properties on generalized characteristic functions are a key point in the formalization of FLs.

6 Chang and Keisler: Model Theory.

7 It is worth mentioning that Aristotle was the first to use propositional variables, i.e. letters that can be replaced by propositional symbols and not by terms (of a syllogism, see definition 2.8) in modal logic.

8 Note that, in accordance with usage, we talk about Kripke models and not Kripke interpretations. Thus, a formula may be evaluated to F in a Kripke model. We may also talk of the Kanger-Kripke semantics.

9 A necessary truth is a truth that is verified in all possible (and accessible) worlds.

10 As in the case of FOL, the property of admitting finite models (see definition 5.15) is important for the design of decision procedures for modal logics.

11 Previously, the founder of temporal logic, appears to be a pioneer of the translation method.

12 Note that we could have chosen any complete formal system, for example, Si (see section 3.4) for PL. The choice that was made permits us not to have to (re)prove all theorems (tautologies) of PL.

13 In branching time, we can have t1 Rt2t1 Rt3t2t3 ∧ — (t2Rt3t3Rt2).

14 The author of the famous Elements was Euclid of Alexandria.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.21.159.82