Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

2
Introduction to Semantics of Programming Languages

This chapter introduces intuitively the notions of name, environment, memory, etc., along with a first formal description of these notions. It allows readers to familiarize themselves with the semantic approach of programming that we share with a number of other authors [ACC 92, DOW 09, DOW 11, FRI 01, WIN 93].

Any high-level programming language uses names to denote the entities handled by programs. These names are generally known as identifiers, drawing attention to the fact that they are constructed in accordance with the syntactic rules of the chosen language. They may be used to denote program-specific values or values computed during execution. They may also denote locations (i.e. addresses in the memory), they are then called mutable variables. And identifiers can also denote operators, functions, procedures, modules, objects, etc., according to the constructs present in the language. For example, pi is often used to denote an approximate value of π; + is also an identifier, denoting an addition operator and often placed between the two operands, i.e. in infix position, as in 2 + 3. The expression 2 * x + 1 uses the identifier x and to compute its value, we need to know the value denoted by x. Retrieving the value associated with a given identifier is a mechanism at the center of any high-level language. The semantics of a language provides a model of this mechanism, presented – in a simplified form – in section 2.1.

All the formal definitions of languages, instructions, algorithms, etc., given in the following are coded in the programming languages OCaml and Python, trying to paraphrase these definitions and produce very similar versions of code in these two languages, even if developers in these languages may find the programming style used here rather unusual. For readers not introduced to these languages, some very brief explanations are given in the codes’ presentation. But almost all features of OCaml and Python will be considered either in this first volume or in the second, where object-oriented programming is considered. We hope that these two encodings of formal notions can help readers who are not truly familiar with mathematical formalism.

2.1. Environment, memory and state

2.1.1. Evaluation environment

Let X be a set of identifiers and V a set of values. The association of an identifier x ∈ X with a value v ∈ V is called a binding (of the identifier to its value), and a set Env of bindings is called an execution environment or evaluation environment. Env(x) denotes the value associated with the identifier x in Env. The set of environments is denoted as E.

In practice, the set of identifiers X that are actually used is finite: usually, we only consider those identifiers that appear in a program. An environment may thus be represented by a list of bindings, also called Env:

where {x₁ ,x₂, . . ., x_n} denotes a finite subset of X, known as the domain of the environment and denoted as dom(Env). By convention, Env(x) denotes the value v, which appears in the first (x, v) binding encountered when reading the list Env from the head (here, from left to right).

In this model, a binding can be added to an environment using the operator ⊕. By convention, bindings are added at (the left of) the head of the list representing the environment:

Suppose that a certain operation introduces a new binding of an identifier, which is already present in the environment, for example (x₂, v_new):

The so-obtained environment (x₂,v_new) ⊕ Env contains two bindings for x₂. Searching for a binding starts at the head of the environment, and, with our convention, new bindings are added at the head. So the most recent addition, (x₂,v_new), will be the first found. The binding (x₂, v₂) is not deleted, but it is said to be masked by the new binding (x₂, v_new). Several bindings for a single identifier x may therefore exist within the same environment, and the last binding added for x will be used to determine the associated value of x in the environment. Formally, the environment (x, v) ⊕ Env verifies the following property:

By convention, the notation (x₂,v₂) ⊕ (x₁ ,v₁) ⊕ Env is used to denote the environment (x₂,v₂) ⊕ ((x₁,v₁) ⊕ Env). For example, ((x,v₂) ⊕ (x,v1) ⊕Env)(x) = v2

When an environment is represented by a list of bindings, the value Env(x) is found as follows:

Python
def valeur_de(env,x):
    for (x1,v1) in env:
         if x==x1: return v1
    return None

OCaml
let rec valeur_de env x = match env with
  | [] -> None
  | (x1, v1) :: t -> if x = x1 then Some v1 else (valeur_de t x)
val valeur_de: (‘a * ‘b) list -> ‘a -> ‘b option

If no binding can be found in the environment for a given identifier, this function returns a special value indicating the absence of a binding. In Python, the constant None is used to express this absence of value, while in OCaml, the predefined sum type ’a option is used:

OCaml
type ’a option = Some of ’a | None

The values of the type ’a option are either those of the type ’a or the constant None. The transformation of a value v of type ’a into a value of type ’a option is done by applying the constructor Some to v (see Chapter 5). The value None serves to denote the absence of value of type ’a; more precisely, None is a constant that is not a value of type ’a. This type ’a option will be used further to denote some kind of meaningless or absent values but that are needed to fully complete some definitions.

The domain of an environment can be computed simply by traversing the list that represents it. A finite set is defined here as the list of all elements with no repetitions.

Python
def union_singleton(e,l):
    if e in l: return l
    else: return [e]+l

def dom_env(env):
    r=[]
    for (x,v) in env: r = union_singleton(x,r)
    return r

OCaml
let rec union_singleton e l = if (List.mem e l then l else e::l
val union_singleton : ’a -> ’a list -> ’a list
let rec dom_env env = match env with
  | [] -> [] I (x, v) :: t -> (union_singleton x (dom_env t))
val dom_env : (’a * ’b) list -> ’a list

Since the value returned by the function valeur_de is obtained by traversing the list from its head, adding a new binding (x, v) to an environment is done at the head of the list and the previous bindings of x (if any) are masked, but not deleted.

Python
def ajout_liaison_env(env,x,v): return [(x,v)]+env

OCaml
let ajout_liaison_env env x v = (x, v) :: env
val ajout_liaison_env : (’a * ’b) list -> ’a -> ’b -> (’a * ’b) list

2.1.2. Memory

The formal model of the memory presented below makes no distinction between the different varieties of physical memory described in Chapter 1. The state of the memory is described by associating a value with the location of the cell in which it is stored. The locations themselves are considered as values, called references. As we have seen, high-level languages allow us to name a location c, containing a value v, by an identifier x bound to the reference r of c.

Let R be a set of references and V a set of values. The association of a reference r ∈ R with a value v ∈ V is represented by a pair (r, v), and a set Mem of such pairs is called here a memory. Mem(r) denotes the value stored at the reference r in Mem. Let M be the set of memories. In practice, the set of references, which is actually used, is finite: once again, only those locations used by a program are generally considered. This means that the memory can be represented by a list, also called Mem:

The existence of a pair (r, v) in a memory records that an initialization or a writing operation has been carried out at this location. Every referenced memory cell may be consulted through reading and can be assigned a new value by writing. In this case, the value previously held in the cell is deleted, it has “crashed”.

Writing a value v at an address r transforms a memory Mem into a memory denoted as Mem[r := v]; if a value was stored at this location r in Mem, then this “old” value is replaced by v; otherwise, a new pair is added to Mem to take account of the writing operation. There is no masking, contrary to the case of the environments. Writing a new value v at a location r that already contains a value deletes the old value Mem(r):

The domain of a memory dom(Mem) depends on the current environment and represents the space of references, which are accessible (directly or indirectly) from bound and non-masked identifiers in the current execution environment. The addition of a binding (x, r) to an environment Env has a twofold effect, creating (x, r) ⊕ Env and extending Mem to Mem[r := v].

NOTE.– Depending on the language or program in question, the value v may be supplied just when x is introduced, or later, or never. If no value is provided prior to its first use, the result of the program is unpredictable, leading to errors called initialization errors. Indeed, a location always contains a value, which does not need to be suited to the current computation if it has not been explicitly determined by the program.

Note that the addition of a binding (x, r) in an environment Env, of which the domain contains x, may mask a previous binding of x in Env, but will not add a new pair (r, v) to Mem if r was already present in the domain of Mem. Thus, any list of pairs representing a memory cannot contain two different pairs for the same reference. The memory Mem[r := v] verifies the following property:

The memory (Mem[r₁ := v₁ ])[r₂ := v₂] is denoted as Mem[r₁ := v₁ ][r₁ := v₂].

For example, (Mem[r₁:= v1 ][r₂:= v₂])(r) = v₂.

The function valeur_ref computes the value stored at a given location. If nothing has previously been written at this location, the function returns a special value (None), indicating the absence of a known value (i.e. a value resulting from initialization or computation).

Python
def valeur_ref(mem,a):
    if len(mem) == 0: return None
    else:
        a1,v1 = mem[0]
        if a == al: return v1
        else: return valeur_ref(mem[1:],a)

OCaml
let rec valeur_ref mem a = match mem with
  | [] -> None
  | (a1, v1) :: t -> if a = a1 then Some v1 els (valeur_ref t a) 
val valeur_ref : (’a * ’b) list -> ’a -> ’b option

The following function writes a value into the memory:

Python
def write_mem(mem,a,v):
    if len(mem) == 0: return [(a,v)]
    else:
        a1,v1 = mem[0]
        if a == a1: return [(a1,v)] + mem[1:]
        else: return [(a1,v1)] + write_mem(mem[1:],a,v)

OCaml
let rec write_mem mem a v = match mem with
  | [] -> [(a, v)]
  | (a1, v1) :: t ->
     if a = a1 then (a1, v) :: t else (a1, v1) :: (write_mem t a v)
val write_mem : (’a * ’b) list -> ’a -> ’b -> (’a * ’b) list

2.1.3. State

A state is defined as a pair (Env, Mem) ∈ E × M such that any reference in the domain of Mem is accessible from a binding in Env. A reference is said to be accessible if its value can be read or written from an identifier contained in Env by a series of operations of reading, writing, or reference manipulation.

Given an environment Env, the set of identifiers X is partitioned into two subsets: which contains the identifiers bound to a reference, and , which contains the others:

The value associated with an identifier x in is a reference Env(x) = r where a value Mem(r) is stored, which can be modified by writing. Identifiers of are generally called mutable variables.

2.2. Evaluation of expressions

The value of an expression is computed according to an evaluation environment and a memory, i.e. in a given state. This computation is defined by the evaluation semantics of the expression.

2.2.1. Syntax

The language of expressions Exp₁ used here will be extended in Chapters 3 and 4. Its syntax is defined in Table 2.1.

Table 2.1. Language of expressions Exp₁

e ::= k	Integer constant	(k ∈ ℤ)
\| x	Identifier	(x ∈ X)
\| e₁ + e₂	Addition	(e₁, e₂ ∈ Exp₁)
\| !x	Dereferencing	(x ∈ X)

Thus, an expression e ∈ Exp₁ is either an integer constant k ∈ ℤ, an identifier x ∈ X, an expression obtained by applying an addition operator to two expressions in Exp₁ or an expression of the form !x denoting the value stored in the memory at the location bound to the mutable variable x. Thus, this is an inductive definition of the set Exp₁. Note that Exp₁ does not include an assignment construct. This is a deliberate choice. This point will be discussed in greater detail in section 2.3 by means of an extension of Exp₁.

NOTE. – The symbol + used in defining the syntax of expressions does not denote the integer addition operator. It could be replaced by any other symbol (for example ⊠). Its meaning will be assigned by the evaluation semantics. The same is true of the constant symbols: for example, the symbol 4 may be interpreted as a natural integer, a relative integer or a character.

EXAMPLE 2.1.– !x + y is an expression of Exp₁ in the same way as (x + 2) + 3. Parentheses are used here to structure the expression, they are part of the so-called concrete syntax and will disappear in the AST.

The set Exp₁ of well-formed expressions of the language is defined by induction and expressed directly by a recursive sum type. Types of this kind can be constructed in OCaml, but not in Python; in the latter case, they can be partially simulated by defining a class for each sum-type constructor. Each class must contain a method with arguments corresponding exactly to the arguments of the sum type constructors it implements. An implementation of this type in Python is naive, and users must ensure that these classes are used correctly. We know that there are possibilities of programming dynamic type verification mechanisms in Python, which simulate strongtyping (similar to that used in OCaml) and ensure that the code is used correctly; however, these techniques lie outside of the scope of this book. The objective of all implementations shown in this book is simply to illustrate and intuitively justify the correct handling of concepts. As we have already done, we choose this approach to implement sum types.

Using Python, we define the following classes to represent the constructors of the set Exp₁:

Python
class Cstel:
    def __init__(self,cste):
        self.cste = cste
class Var1:
    def __init__(self,symb):
        self.symb = symb
class Plusl:
    def __init__(self,exp1,exp2):
        self.exp1 = exp1
        self.exp2 = exp2
class Bang1:
    def __init__(self,symb):
        self.symb = symb

For example, the expression e₁ = !x + y defined in example 2.1 is written as:

Python
ex_expl = Plusl(Bangl(“x”),Varl(“y”))

Using OCaml, the type of arithmetic expressions is defined directly as:

OCaml
type ’a exp1 =
 Cstel of int | Var1 of ’a | Plusl of ’a expl * ’a expl | Bangl of ’a

Values of this type are thus obtained using either the Cste1 constructor applied to an integer value, in which case they correspond to a constant expression, or using the Var1 constructor applied to a value of type ’a, corresponding to the type used to represent identifiers (the type ’a exp1 is thus polymorphic, as it depends on another type), or by applying the Plus1 constructor to two values of the type ’a expl, or by applying the Bang1 constructor to a value of type ’a. For example, the expression e₁ = !x + y is written as:

OCaml
let ex_exp1 = Plus1 (Bang1 (“x”), Var1 (“y”))
val ex_exp1 : string exp1

2.2.2. Values

Given a state (Env, Mem), we determine the evaluation semantics of an expression e ∈ Exp₁ by computing the value of e in this state, i.e. by evaluating e in this state. Values may be relative integers or references, hence V = ℤ U R. An additional, specific value Err is added to the set V; this result is returned as the value of “meaningless” expressions. The result of the evaluation of an expression in Exp₁ will therefore be a value belonging to the set

Values in V are either relative integers or references. By defining a sum type, these two collections of values can be grouped into a single type.

Python
class CInt1:
    def __init__(self,cst_int):
        self.cst_int = cst_int 
class CRef1:
    def __init__(self,cst_adr):
        self.cst_adr = cst_adr

Each class possesses a (object) constructor with the same name as the class: the constant k obtained from integer n (or, respectively, from reference r) is thus written as CInt1(n) (respectively, CRef1(r)), and this integer (respectively, reference) can be accessed from (the object) k by writing k.cst_int (respectively k.cst_adr). With OCaml, the type of elements in V is defined directly, as follows:

OCaml
type ’a constl = CIntI of int | CRefl of ‘a

A value of this type is obtained either using the constructor CInt1 applied to an integer value or using the constructor CRef1 applied to a value of type ’a corresponding to the type used to represent references.

A type grouping the elements of is defined by applying the same method:

Python
class VCste1:
    def __init__(self,cste):
        self.cste = cste
class Erreur1:
     pass

An element v in is either a value in V obtained from a constant k and written as VCste1(k), or an object in the class Erreur1 (pass is used here to express the fact that the (object) constructor has no argument). With OCaml, the type of the elements in ? is defined directly as follows:

OCaml
type ’a valeursl = VCstel of ’a constl | Erreur1

2.2.3. Evaluation semantics

There are several formalisms that may be used to describe the evaluation of an expression. These will be introduced later. Let us construct an evaluation function:

The evaluation of the expression e in the environment Env and memory state Mem is denoted as with v ∈ V. Table 2.2 contains the recursive definition of the function

**Table 2.2.** *Evaluation of the expressions of* **Exp**₁

The value of an integer constant is the integer that it represents. The value of an identifier is that which is bound to it in the environment, or Err. The value of an expression constructed with an addition symbol and two expressions e₁ and e₂ is obtained by adding the relative integers resulting from the evaluations of e₁ and e₂; the result will be Err if e₁ or e₂ is not an integer. The value of !x is the value stored at the reference Env(x) when x is a mutable variable, and Err otherwise.

Thus, if e is evaluated as a reference, then e can only be an identifier. Furthermore, certain expressions in Exp₁ are syntactically correct, but meaningless: for example, the expression !x when x is not a mutable variable, i.e. when x does not bind a reference in the environment, or x₁ + x₂ when x₁ (or x₂) is a mutable variable. On the other hand, !x + y is a meaningful expression that denotes a value when y binds an integer and x binds a reference to an integer.

EXAMPLE 2.2.– Let us evaluate the expression !x + y

in the state and

The evaluation function is obtained directly as follows:

Python
def eval_exp1(env,mem,e):
    if isinstance(e,Cste1): return VCste1(CInt1(e.cste))
    if isinstance(e,Var1):
       x = valeur_de(env,e.symb)
       if isinstance(x,CInt1) or isinstance(x,CRef1): return VCste1(x)
       return Erreur1()
	 if isinstance(e,Plus1):
	     ev1 = eval_exp1(env,mem,e.exp1)
	     if isinstance(ev1,Erreur1): return Erreur1()
	     v1 = ev1.cste
	     ev2 = eval_exp1(env,mem,e.exp2)
	     if isinstance(ev2,Erreur1): return Erreur1()
	     v2 = ev2.cste
	     if isinstance(v1,CInt1) and isinstance(v2,CInt1):
		return VCste1(CInt1(v1.cst_int + v2.cst_int))
       return Erreur1()
   if isinstance(e,Bang1):
        x = valeur_de(env,e.symb)
        if isinstance(x,CRef1):
            y = valeur_ref(mem,x.cst_adr)
            if y is None: return Erreur1()
            return VCste1(y)
        return Erreur1()
   raise ValueError

OCaml
let rec eval_exp1 env mem e = match e with
  | Cste1 n -> VCste1 (CInt1 n)
  | Var1 x ->
     (match valeur_de env x with Some v -> VCste1 v | _ -> Erreur1)
  | Plus1 (e1, e2) -> (
    match ((eval_exp1 env mem e1), (eval_exp1 env mem e2)) with
    | (VCste1 (CInt1 n1), VCste1 (CInt1 n2)) -> VCste1 (CInt1 (n1 + n2))
  | _ -> Erreur1)
  | Bang1 x -> (match valeur_de env x with
    | Some (CRef1 a) ->
        (match valeur_ref mem a with Some v -> VCste1 v | _ -> Erreur1)
  | _ -> Erreur1)
val eval_exp1 : (’a * ’b const1) list -> (’b * ’b const1) list -> ’a exp1 -> ’b valeurs1

Considering example 2.2, we obtain:

Python
ex_env1 = [(“x”,CRef1(“rx”)),(“y”,CInt1(2))]
ex_mem1 = [(“rx”,CInt1(3))]
>>> (eval_exp1(ex_env1,ex_mem1,ex_exp1)).cste.cst_int
5

OCaml
let ex_env1 = [ (“x”, CRefl (“rx”)); (“y”, CIntl (2)) ]
val ex_env1 : (string * string constl) list
let ex_mem1 = [ (“rx”, CIntl (3)) ]
val ex_mem1 : (string * ‘a constl) list
# (eval_exp1 ex_env1 ex_mem1 ex_exp1) ;;
- : string valeursl = VCstel (CIntl 5)

2.3. Definition and assignment

2.3.1. Defining an identifier

The language Def ₁ extends Exp₁ by adding definitions of identifiers. There are two constructs that make it possible to introduce an identifier naming a mutable or non-mutable variable (as defined in section 2.1.3). Note that, in both cases, the initial value must be provided. This value corresponds to a constant or to the result of a computation specified by an expression e ∈ Exp₁. These constructs modify the current state of the system; after computing , the next step in evaluating let x = e; is to add the binding (x, ) to the environment, while the evaluation of var x = e; adds a binding (x, r_x) to the environment and writes the value to the reference r_x. In this case, we assume that the location denoted by the reference r_x is computed by an external mechanism responsible for memory allocation.

Table 2.3. Language Def 1 of definitions

d ::= let x = e;	Definition of a non-mutable variable	(x ∈ X, e ∈ Exp₁)
\| var x = e;	Definition of a mutable variable	(x ∈ X, e ∈ Exp₁)

The evaluation of a definition is expressed as follows:

(2.1)

This evaluation →_{Def 1} defines a relation between a state, a definition and a resulting state, or, in formal terms:

Starting with a finite sequence of definitions d = [d₁; . . . d_n] and an initial state (Env₀, Mem₀), this relation produces the state (Env_n, Mem_n):

This sequence of transitions may, more simply, be noted (Env_n, Mem_n).

EXAMPLE 2.3.– Starting with a memory with no accessible references and an “empty” environment, the sequence [var y = 2; let x = !y + 3;] builds the following state:

In the environment Env = [(x, 5), (y, r_y)], we obtain = {y} and = {x}.

NOTE.– In the definition of the two transitions in [2.1], we presume that the result of the evaluation of the expression e, denoted as , is not an error result. In the case of an error, no state will be produced and the evaluation stops.

The abstract syntax of language Def ₁ may be defined as follows:


Python
class Let_def1:
    def __init__(self,var,exp):
        self.var = var
        self.exp = exp
class Var_def1:
    def __init__(self,var,exp):
        self.var = var
        self.exp = exp

OCaml
type ’a defl = Let_def1 of ’a * ’a expl | Var_def1 of ’a * ’a expl

We choose to construct a value corresponding to a reference using a constructor applied to an identifier.

Python
class Ref_Var1:
    def __init__(self,idvar):
        self.idvar = idvar

OCaml
type ’a refer = Ref_Var1 of ’a

Hence, r_x will be represented by Ref_Var1(”x”). As the relation →Def₁ defines a function, it can be implemented directly as follows:

Python
def trans_def1(st,d):
    (env,mem) = st
    if isinstance(d,Let_def1):
        v = eval_exp1(env,mem,d.exp)
        if isinstance(v,VCste1):
            return (ajout_liaison_env(env,d.var,v.cste),mem)
        raise ValueError
    if isinstance(d,Var_def1):
        v = eval_exp1(env,mem,d.exp)
        if isinstance(v,VCste1):
            r = Ref_Var1(d.var)
            return (ajout_liaison_env(env,d.var,CRef1(r)),
                    write_mem(mem,r,v.cste))
    raise ValueError
raise ValueError

OCaml
let trans_def1 (env, mem) d = match d with
  | Let_def1 (x, e) -> (match eval_exp1 env mem e with
      | VCste1 v -> ((ajout_liaison_env env x v), mem)
      | Erreur1 -> failwith “Erreur”)
  | Var_def1 (x, e) -> (match eval_exp1 env mem e with
      | VCste1 v -> ((ajout_liaison_env env x (CRef1 (Ref_Var1 x))),
                  (write_mem mem (Ref_Var1 x) v))
      | Erreur1 -> failwith “Erreur”)
val trans_def1 :
(’a * ’a refer const1) list * (’a refer * ’a refer const1) list
-> ’a def1
-> (’a * ’a refer const1) list * (’a refer * ’a refer const1) list

By iterating this function, we obtain an implementation of

Python
def trans_def1_exec(st,ld):
    (env,mem) = st
    if len(ld) == 0: return (env,mem)
    else: return trans_def1_exec(trans_def1((env,mem),ld[0]),ld[1:])

OCaml
let trans_def1_exec (env, mem) ld = (List.fold_left trans_def1 (env, mem) ld)
val trans_def1_exec :
    (’a * ’a refer const1) list * (’a refer * ’a refer const1) list
->   ’a def1 list
->  (’a * ’a refer const1) list * (’a refer * ’a refer const1) list

Now, considering example 2.3, we obtain:

Python
ex_ld0 = [Var_def1(“y”,Cste1(2)), Let_def1(“x”,Plus1(Bang1(“y”),Cste1(3)))]
 (ex_e0,ex_m0) = trans_def1_exec(([],[]),ex_ld0)
>>>  eval_exp1(ex_e0,ex_m0,Var1(“x”)).cste.cst_int
5
>>> eval_exp1(ex_e0,ex_m0,Bang1(“y”)).cste.cst_int
2

OCaml
let ex_ld0 = [ Var_def1 (“y”, Cste1 2);
               Let_def1 (“x”, Plus1 (Bang1 “y”, Cste1 3)) ]
val ex_ld0 : string def1 list
# (trans_def1_exec ([], []) ex_ld0) ;;
- : (string * string refer const1) list *
    (string refer * string refer const1) list
= ([(“x”, CInt1 5); (“y”, CRef1 (Ref_Var1 “y”))], [(Ref_Var1 “y”, CInt1 2)])

2.3.2. Assignment

The language Lang₁ extends Def ₁ by adding assignment. The syntax of an assignment instruction is:

x : = e

where x ∈ X and e ∈ Exp₁. When the mutable variable x is already bound in the current environment, this instruction enables us to modify the value of !x. Formally, execution of the instruction x := e modifies the memory of the current state, and it is described by the following transition:

NOTE.– Once again, if the identifier x is not bound in the environment or if the evaluation of e results in an error, no state is generated and evaluation stops.

EXAMPLE 2.4.– Based on the state obtained in example 2.3, the following two assignments can be executed:

Representing the abstract syntax of the assignment x := e by the pair (x, e), the relation →_Lang₁ and the iteration of this relation from a sequence of assignments are implemented as follows:

Python
def trans_lang1(st,a):
    (env,mem) = st
    (x,e) = a
     v = valeur_de(env,x)
     if isinstance(v,CRef1):
         ve = eval_exp1(env,mem,e)
         if isinstance(ve,VCste1):
             return (env,write_mem(mem,v.cst_adr,ve.cste)) 
     raise ValueError

def trans_lang1_exec(st,la):
    (env,mem) = st
     if len(la) == 0:return (env,mem)
     else: return trans_lang1_exec(trans_lang1((env,mem),la[0]),la[1:])

OCaml 
let trans_lang1 (env,mem) (x, e) = match valeur_de env x with
  | Some (CRef1 (Ref_Var1 y)) -> (match eval_exp1 env mem e with
      | VCste1 v -> (env, (write_mem mem (Ref_Var1 y) v))
      | Erreur1 -> failwith “Eval error”)
  | _ -> failwith “Undefined var”
val trans_lang1 :
   (’a * ’b refer const1) list * (’b refer * ’b refer const1) list
-> ’a * ’a exp1
-> (’a * ’b refer const1) list * (’b refer * ’b refer const1) list
let trans_lang1_exec (env, mem) la =
    (List.fold_left trans_lang1 (env, mem) la)
val trans_lang1_exec :
    (’a * ’b refer const1) list * (’b refer * ’b refer const1) list
-> (’a * ’a exp1) list
-> (’a * ’b refer const1) list * (’b refer * ’b refer const1) list

Considering example 2.4, we obtain:

Python
ex_la0 = [(“y”,Plus1(Bang1(“y”),Var1(“x”))), (“y”,Cste1(8))]
 (ex_e1,ex_m1) = trans_lang1_exec((ex_e0,ex_m0),ex_la0)
>>> eval_exp1(ex_e1,ex_m1,Bang1(“y”)).cste.cst_int
8

OCaml
let ex_la0 = [ (“y”, Plus1 (Bang1 “y”,Var1 “x”)); (“y”, Cste1 8) ]
val ex_la0 : (string * string exp1) list
# (trans_lang1_exec (trans_def1_exec ([],[]) ex_ld0) ex_la0);;
- : (string * string refer const1) list *
    (string refer * string refer const1) list
=   ([(“x”, CInt1 5); (“y”, CRef1 (Ref_Var1 “y”))], [(Ref_Var1 “y”, CInt1 8)])

NOTE.– In the presentation above, assignment concerns only mutable variables. Similarly, the dereferencing operator ! can only be applied to a reference to obtain the stored value at this location. Thus, the assignment of a value to a mutable variable from a given reference requires here the use of the dedicated syntactic structures: x := ! x + 1. However, in many languages, assignment does not explicitly mention the dereferencing operator, and the syntactic use of variables is identical on both sides of the assignment: x = x + 1. Hence, an identifier x denotes two different notions: the value Env(x) on the left of the assignment symbol, and the value Mem(Env(x)) on the right of the assignment symbol.

These languages mask the different roles of a variable according to its position in the assignment. When a variable x is positioned to the left of the assignment, it is known as an l-value and it denotes a location where the value to be assigned should be stored. In this way, it acts as a pointer. The variable x on the right of the assignment is known as the r-value, and implicitly acts as a dereferenced pointer: the value to fetch is found at the location denoted by the variable. Thus, in the expression x = x + 1, even if x is used in the same way from a syntactic perspective, the instance on the right implicitly denotes ! x. In some languages, variable declaration implicitly involves the creation of a reference, unless otherwise stated; the name of the variable represents the location where the compiler will store the values assigned to it.

2.4. Exercises

Exercise 2.1

Consider the state etat₀ defined by:

1) compute and .
2) Give a sequence of definitions to obtain the state etat₀ from the empty state ([ ], [ ]). Are there other sequences that can be used to obtain this state?
3) compute
4) Give three examples of expressions that generate three different error types when evaluated in the state etat₀.
5) Consider the sequence

Determine the states etat1 = (Env₁, Mem₂) and etat₂ = (Env₂, Mem₂) and the sets , , .
6) Consider the sequence

Determine the states etat₃ = (Env₃, Mem₃) and etat₄ = (Env₄, Mem₄). Does

Exercise 2.2

The two constructs x + + and + + x:

are added to the language Exp₁. In these two new constructs, the identifier x must be a mutable variable. Intuitively, we see that the evaluation of the expression x + + produces the value stored in the location denoted by x and increments the value in the memory by 1. The expression + + x is evaluated differently: the value stored at the location denoted by x is incremented by 1, and this new value is the result of the evaluation.

1) Define an evaluation function:

for expressions in the language Exp₁, extended so that expresses the fact that evaluation of the expression e in the state (Env, Mem) transforms this state into a state (Env’, Mem’) and produces the value v.
2) Let Env = [(x, r_x)] and Mem = [(r_x, 6)]. Compute:
3) Show that, for any state (Env, Mem):

Now, we extend the language Lang₁ by considering the expressions of the extended Def ₁ language and adding the construction x+ := e. In informal terms, the execution of this instruction in a state (Env, Mem) consists of first finding the value v_x stored at the reference v in Mem, then evaluating the expression e in this state to obtain its value v_e and a new state (Env’, Mem’), and finally, assigning to v the result of the addition of v_e and v_x. If at least one of the two values v_e and v_x is not an integer, the execution fails.
4) Redefine the relation →Lang₁ for the extended Lang₁ language.
5) Let Env₀ = [(x, r_x)] and Mem₀ = [(r_x, 2)]. Determine the state etat₁ such that
6) Do the assignments x := x + +, x := + + x and x+ := 1 produce the same states when executed in the same state?