5 Balance equations from the second law of thermodynamics: the case of hardening plasticity

4.2 Cauchy's theorem for microstructural contact actions

We assume that τ is a function of the point x and the normal n to $\partial b$ at all points where the normal itself is uniquely defined and at every instant. In other words, we presume the validity of Cauchy's postulate for the microstructural contact actions. A question is then whether we can prove the Cauchy theorem for τ.

A proof is given in Capriz and Virga (1990), but there the microstructure is represented in a manifold embedded into a linear space, with undoubted advantages. Here we want to maintain the representation of the microstructure in a manifold as abstract as possible, avoiding even the embedding of $M$ into a linear space for it is not unique (a question already discussed in the introduction). We then follow another path as sketched below.

First, imagine freezing the macroscopic motion and allowing just the microstructure to vary in time. In other words, select $\dot{y} = 0$ . The requirement of invariance of $P_{b}^{ext} (0, \dot{ν})$ implies just the validity of the integral balance

$\int_{b} A^{*} β^{‡} d x + \int_{\partial b} A^{*} τ d H^{2} = 0 .$

(1.8)

This has two main advantages:

1. The linear operator $A^{*}$ projects both β^‡ and τ into $R^{3}$ from $T^{*} M$ , which is, in general, a nonlinear space.

2. $A$ does not depend on n. It is a function of $\tilde{ν} (x)$ alone at every instant t.

These two aspects allow us to use the integral balance (1.8) in the standard way leading to Cauchy's theorem.

We presume first that both $A^{*} (\tilde{ν} (\cdot)) β^{‡} (\cdot)$ and $A^{*} (\tilde{ν} (\cdot)) τ (\cdot, n)$ are essentially bounded.³⁷ By the standard technique leading to the action–reaction principle (see, e.g., Truesdell, 1991) we can then prove that

$A^{*} (\tilde{ν} (x)) τ (x, n) = - A^{*} (\tilde{ν} (x)) τ (x, - n),$

which is

$A^{*} (τ (x, n) + τ (x, - n)) = 0,$

i.e., the sum τ (x, n) + τ (x, −n) belongs to the kernel of the linear operator $A^{*}$ . That sum is well defined: since $\tilde{ν} (\cdot)$ is continuous and single-valued, as assumed from the beginning, we have at x a unique value ν such that both τ (x, n) and τ (x, −n) belong to the same cotangent space $T_{ν}^{*} M$ , which is a linear space.³⁸

Example 2

Consider ν as a three-dimensional real vector. This is, for example, the case of quasicrystals, where ν collects inner degrees of freedom exploited for the atomic rearrangements determining the quasiperiodic structure of the lattice. In this case we compute $A = - ν \times$ . Consequently, at a given ν the kernel of $A^{*}$ is the one-dimensional space parallel to ν. Hence, the action–reaction principle deduced above prescribes that the sum τ (x, n) + τ (x, −n) is a vector parallel to ν, which reduces to zero when it is projected on the physical space. Such a vector is powerless on any “rigid” rate of ν, i.e., a rate of the type $A_{q}$ , with q the rotation velocity vector in the physical space. Let now assume that $A^{*} (\tilde{ν} (\cdot)) τ (\cdot, n)$ is continuous.

We can exploit then the integral balance (1.8) reproducing the standard tetrahedron argument or exploit just two linearly independent vectors in space as in Truesdell (1991). What we find is that $A^{*} τ (x, n) = A^{*} S (x) n$ , i.e., at x there is a linear operator mapping n into the cotangent space $T_{\tilde{ν} (x)}^{*} M$ (we write in short $S (x) \in Hom (T_{x}^{*} B, T_{\tilde{ν} (x)}^{*} M)$ ) such that

$τ (x, n) = S (x) n,$

(1.9)

with $S (x) \in Hom (T_{x}^{*} B, T_{\tilde{ν} (x)}^{*} M)$ . Precisely, we get

$S (x) : = \sum_{K = 1}^{3} τ (x, e_{A}) \otimes e_{A},$

where, as above, e_A is the Ath vector of a basis in a neighborhood of x. In components we have

$τ_{α} (x, n (x)) = S_{α}^{A} (x) n_{A} (x),$

where Greek indices indicate components over $M$ . We call $S$ microstress to recall its role analogous to the standard stress and its microstructural origin.

Refinements seem possible:

• The construction of a framework determining the need for Cauchy's postulate for τ. (I refer to the appropriate generalization of the Hamel–Noll theorem (Truesdell, 1991).)

• The weakening of the continuity assumption along the lines indicated in Šilhavý (1991, 2005).

Difficulties arise when we consider $\tilde{ν} (\cdot)$ as a multivalued function over $M$ , with values determined modulo a permutation, as proposed for a refined description of material complexity in Focardi et al. (2014), where the conditions for the existence of the ground states for the relevant elastic energy are determined in this case.

4.3 The relative power: a definition

In the expressions of the power discussed above, the reference place $B$ is presumed fixed once and for all. In the presence of bulk mutations in the matter, we can resort to the idea of having multiple reference shapes interpreted in one of the ways described in the introduction. Here our attention is on the definition of the vector field $x \mapsto w (x) \in T_{x} B$ , which we presume to be differentiable, a virtual velocity mimicking the incoming rearrangements of material elements that determine the mutation. In this case we can think of writing the external power relative to w. Moreover, since the vector field w (·) represents material mutations, we have to consider that during these mutations we have

1. changes in the energetic landscape and

2. actions in $B$ power conjugated with the rupture of existing material bonds and/or the formation of new ones, and mutation-induced anisotropy.

In principle, in both cases we can have energy fluxes across boundaries inside the body and consequent emergence of anisotropies in the distribution of the energy itself. However, although we mention energy at this stage, we are not referring to specific constitutive classes. We need just to affirm that there is the free energy ψ and it changes in space and time when material mutations occur, nothing more. In particular, we write

$ψ = \tilde{ψ} (x, t, ς),$

with ς the list of state variables that we do not specify. They have to be rendered explicit in discussing constitutive issues, but not here. Moreover, in addition to standard and microstructural actions, we should include the ones not associated with deformation or microstructural events described by $\dot{ν}$ (see the second item in the list above). A way to maintain distinct these new actions is to represent them as covector fields over $B$ (vectors if we use the standard identification of $R^{3}$ with its dual) of forces f and couples μ developing power on w and its curl, respectively. Further assumptions apply:

1. f may have just dissipative nature. It vanishes when the mechanical process is conservative.

2. μ has dissipative and conservative components, the latter appearing when the material mutations produce anisotropy without breaking and/or reforming material bonds.

By taking into account the representation of the contact actions in terms of stress and microstress in the Lagrangian configuration, I define the relative power, writing $P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ for it, as the sum of the relative power of actions, indicated by $P_{b}^{rel - a} (\dot{y}, \dot{ν}, w)$ , and another functional that I call the power of disarrangements, $P_{b}^{dis} (w)$ , determined by the energy fluxes and the configurational forces f and μ listed above. Precisely, $P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ has the following form:

$P_{b}^{rel} (\dot{y}, \dot{ν}, w) : = P_{b}^{rel - a} (\dot{y}, \dot{ν}, w) + P_{b}^{dis} (w),$

with

$P_{b}^{rel - a} (\dot{y}, \dot{ν}, w) : = \int_{b} b^{‡} \cdot (\dot{y} - Fw) d x + \int_{\partial b} Pn \cdot (\dot{y} - Fw) d H^{2} + \int_{b} β^{‡} \cdot (\dot{ν} - Nw) d x + \int_{\partial b} S n \cdot (\dot{ν} - Nw) d H^{2}$

si572_e

and

$P_{b}^{dis} (w) : = \int_{\partial b} (n \cdot w) ψ d H^{2} - \int_{b} (\partial_{x} ψ + f) \cdot w d x + \int_{b} μ \cdot curl w d x .$

si573_e

In the previous expressions, ∂_xψ is the explicit derivative of $\tilde{ψ} (x, t, ς)$ with respect to x, holding fixed all the other entries of the energy. It is an indicator of the loss of homogeneity in the energy landscape, altered by the mutation. The term (n · w) ψ is the energy density flux across the boundary $\partial b$ , due to the mutation itself.

When w = 0 at every point (the body does not undergo bulk macroscopic mutations), $P_{b}^{dis} (w)$ vanishes and $P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ reduces to the external power $P_{b}^{ext} (\dot{y}, \dot{ν})$ .

$P_{b}^{dis} (w)$ accounts for macroscopic mutations. The microscopic ones pertain to the terms

$\int_{b} β^{‡} \cdot \dot{ν} d x + \int_{\partial b} S n \cdot \dot{ν} d H^{2}$

in $P_{b}^{rel - a} (\dot{y}, \dot{ν}, w)$ . There is micro-to-macro interaction. It appears in the pointwise balance equations and the constitutive issues.

When we do not consider a multifield and multiscale representation of material microstructures and we describe bodies in the standard format, the relative power obviously reduces to

$P_{b}^{rel} (\dot{y}, w) : = P_{b}^{rel - a} (\dot{y}, w) + P_{b}^{dis} (w),$

where $P_{b}^{rel - a} (\dot{y}, w)$ is derived from $P_{b}^{rel - a} (\dot{y}, \dot{ν}, w)$ by canceling microstructural actions and is given by

$P_{b}^{rel - a} (\dot{y}, w) : = \int_{b} b^{‡} \cdot (\dot{y} - Fw) d x + \int_{\partial b} Pn \cdot (\dot{y} - Fw) d H^{2},$

while $P_{b}^{dis} (w)$ remains the same.

Notice that I have written $P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ in terms of the first Piola–Kirchhoff stress P and the microstress $S$ taking advantage of the discussion about their existence in previous sections. However, we could write $P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ in its more primitive form including $t$ and τ in place of Pn and $S n$ . In this case the results below, emerging from a requirement of invariance of $P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ under changes in observers of class 2, would be enriched by the proof of the existence of P and $S$ , and their independence from n.

4.4 Kinetics

Microstructural inertia can appear, for example, in the case of bubbles migrating inside a liquid in motion, relative to it, and vibrating within it (see the remarks in Capriz & Giovine, 1997), or solids with an enormous number of cavities, each one containing a gyroscope (a case discussed in Milton & Willis, 2007).

In his book on continua with microstructure, Capriz (1989) writes the kinetic energy in a multifield and multiscale representation of bodies as the sum of the standard macroscopic kinetic energy and a microscopic component. By indicating with b in the apex position a covector corresponding to the vector decorated by the apex, we rewrite explicitly the sum as

$k ({\dot{y}}^{b}, ν, {\dot{ν}}^{b}) : = \frac{1}{2} ρ {\dot{y}}^{b} \cdot \dot{y} + κ (ν, {\dot{ν}}^{b}),$

where κ is such that κ (ν, 0) = 0 and it admits a second derivative with respect to ${\dot{ν}}^{♭}$ , which is positive definite, namely,

$\frac{\partial κ (ν, {\dot{ν}}^{b})}{\partial {\dot{ν}}^{b} \partial {\dot{ν}}^{b}} \cdot ({\dot{ν}}^{b} \otimes {\dot{ν}}^{b}) \geq 0 .$

The equality sign holds when ${\dot{ν}}^{♭} = 0$ .

In contrast with Capriz (1989), I presume that the dependence of $κ (ν, {\dot{ν}}^{♭})$ must be considered deprived of the effects of macroscopic rigid-body motion. Microstructural inertia appears, should it exist, as a local microscopic fluctuation with respect to the macroscopic motion. Hence, we may consider κ as a function:

$κ (ν, {\dot{ν}}^{b}) = h (ν, {\dot{ν}}^{b} - {(A_{q})}^{b}) .$

The choice prevents an incongruence that would occur, in contrast, when κ is quadratic with respect to ${\dot{ν}}^{♭}$ and we calculate the total kinetic energy of the body during a rigid-body motion—a physically questionable extra inertia moment would appear if we do not use a form like $h$ (details are given in Mariano, 2002).

A standard assumption used below is that both b^‡ and β^‡ admit additive decompositions into inertial (bⁱⁿ and βⁱⁿ) and noninertial (b and β) components:

$b^{‡} = b^{in} + b, β^{‡} = β^{in} + β .$

4.5 Invariance of the relative power under isometry-based changes in observers

Here I refer to changes in observers of class 2. The velocity w is defined on $B$ (precisely, the map $x \mapsto w (x) \in T_{x} B$ is a section of the tangent bundle to $B$ ), so we have to use the rule (1.4) in the changes in observers.

Along the path, I shall assume that some fields are piecewise differentiable over $B$ (in short, we say that they are of class PC¹) with bounded discontinuities over a surface Σ, oriented locally by the normal m, and not moving relative to $B$ itself. For any field x a (x) of this type, taking values in a linear space, the limits a^± (x) := lim_ε↓0 a (x ± εn), x ∈ Σ, define the jump [a] of a as the difference [a] := a⁺ − a−, and the average 〈a〉 as $〈 a 〉 : = \frac{1}{2} (a^{+} + a^{-})$ . Given two fields x a₁ (x) and x a₂ (x) taking values in a linear space and such that a product a₁a₂ between them, distributive with respect to the sum, can be defined, we get the identity [a₁a₂] := [a₁] 〈a₂〉 + 〈a₁〉 [a₂].

The definition of [a] underlines the need of having a field taking values in a linear space. If it were not so, the difference would possibly not be defined. For this reason, in what follows, I shall consider the field x ν continuous across Σ. In fact, since $M$ is here, in general, nonlinear, the jump [ν] of ν could not make sense. In contrast, the jump of $\dot{ν}$ and the jump of $S$ are always defined, both being in linear spaces at every $x \in B$ .

I presume also that the derivatives of the map x ν suffer bounded discontinuities across Σ. Also, Σ is here unstructured: this means that it cannot sustain its own surface standard and microstructural tractions; in other words it is not endowed with its own surface energy.

Axiom 1

$P_{b}^{rel} (\dot{y}, \dot{ν}, w)$ is invariant under isometry-based changes in observers in class 2 for any choice of $b$ and the rates involved.

Axiom 2

The bulk actions admit additive decompositions into inertial and noninertial parts and the inertial components are determined by the integral balance

$\frac{d}{d t} \int_{b} k (\dot{y}, ν, \dot{ν}) d x + \int_{b} (b^{in} \cdot \dot{y} + β^{in} \cdot \dot{ν}) d x = 0,$

which holds for any choice of the part $b$ and the velocity field, with the kinetic energy satisfying the structure assumptions presented above.

Theorem 1

The following assertions hold true:

1. The integral balances below hold, provided that the fields involved are integrable:

$\int_{b} b^{‡} d x + \int_{\partial b} Pn d H^{2} = 0,$

(1.10)

$\int_{b} ((y - y_{0}) \times b^{‡} + A^{*} β^{‡}) d x + \int_{\partial b} ((y - y_{0}) \times Pn + A^{*} S n) d H^{2} = 0,$

si618_e (1.11)

$\int_{\partial b} P n d H^{2} - \int_{b} (F^{*} b^{‡} + N^{*} β^{‡}) d x - \int_{b} (\partial_{x} ψ + f) d x = 0,$

si619_e (1.12)

$\int_{\partial b} (x - x_{0}) \times P n d H^{2} - \int_{b} (x - x_{0}) \times (F^{*} b^{‡} + N^{*} β^{‡}) d x - \int_{b} (x - x_{0}) \times (\partial_{x} ψ + f) d x + \int_{b} 2 μ d x = 0 .$

(1.13)

where P := ψ I − F*P − N*S, with I the second-rank unit tensor.

2. If the fields x P and x P are of class $P C^{1} (B) \cap C^{0} (\bar{B})$ with discontinuity set the surface Σ described above, and the fields x b, x F*b, x f, and x ∂_x ψ are continuous over B, we get in B

$Div P + b^{‡} = 0,$

(1.14)

and a field x z (x) ∈ T*_ν_(x)M, with z = z₁ + z₂, z₂ ∈ KerA*, exists and is such that

$Div S + β^{‡} - z = 0$

(1.15)

and

$Skw P F^{*} = \frac{1}{2} e (A^{*} z + (D A^{*}) S);$

si624_e (1.16)

moreover,

$Div P - F^{*} b^{‡} - N^{*} β^{‡} + \partial_{x} ψ = f,$

(1.17)

$Skw (g^{- 1} P) = - 2 \bar{e} μ,$

(1.18)

with $\bar{e}$ Ricci's symbol with all contravariant components, namely, ${\bar{e}}^{ABC}$ . Across Σ we get

$[P] m = 0,$

(1.19)

$[S] m = 0,$

(1.20)

$[P] m = 0 .$

(1.21)

3. The inertial components of the body actions are given by

$b^{in} = - ρ \ddot{y}$

and

$β^{in} = \frac{d}{d t} \frac{\partial_{χ} (ν, \dot{ν})}{\partial \dot{ν}} - \frac{\partial_{χ} (ν, \dot{ν})}{\partial ν},$

si633_e

with

$χ : T M \to R^{+},$

a C¹ function such that

$κ (ν, {\dot{ν}}^{b}) : = \frac{\partial_{χ} (ν, \dot{ν})}{\partial \dot{ν}} \cdot \dot{ν} - χ (ν, \dot{ν}),$

si635_e

at any $\dot{ν}$ .

4. If the material is homogeneous, no driving force is present, and μ = 0, then P is symmetric and, in the absence of body forces,

$\int_{\partial b} P n d H^{2} = 0,$

for any part $b$ .

5. An extended version of the virtual power principle holds. It reads

$P_{b}^{rel} (\dot{y}, \dot{ν}, w) = P_{b}^{rel - int} (\dot{y}, \dot{ν}, w),$

where

$P_{b}^{rel - int} (\dot{y}, \dot{ν}, w) : = \int_{b} (P \cdot \dot{F} + z \cdot \dot{ν} + S \cdot \dot{N} + P \cdot D w + μ \cdot curl w) d x + \int_{b \cap \sum} (〈 P 〉 m \cdot [\dot{y}] + 〈 S 〉 m \cdot [\dot{ν}] + 〈 P 〉 m \cdot [w]) d H^{2},$

with the obvious simplification when w is continuous across Σ and/or $\dot{y}$ and $\dot{ν}$ are continuous too.³⁹

Invariance of the relative power with respect to translations in ${\tilde{E}}^{3}$ furnishes the integral balance of forces. We do not have the integral balance of microactions (or microforces if you want to use the term force in an extended sense) because translations are not available over $M$ unless it is a priori selected as a linear space. Even in that case, however, if we accept changes in observers in class 2 (or class 1 in the absence of material mutations), a translation over $M$ is not accounted for.

The integral balance of couples (1.11) includes the microactions. However, it does not mean that $S$ and β^‡ are couples for they appear multiplied by the adjoint of the linear operator $A$ , which projects over the reference space their component over $M$ .

In principle, we could abandon the procedure based on the invariance of the relative power, or the external power alone, deciding to postulate the integral form of the balance equations. We would be then pushed to postulate an integral balance of microstructural actions, declaring it as “our first principle.” This way we would face the basic difficulty that in this case such a balance would be not defined in general for we take $M$ as a manifold not necessarily coincident with a linear space. In fact, when $M$ is a nonlinear manifold, the integrals in that balance would not be defined because the fields x β^‡ and $x \mapsto S n$ take values in the cotangent bundle of $M$ , a nonlinear target space. A balance of microstructural actions could be formally defined only when $M$ is a linear space. However, in any case its choice would introduce an assumption, namely, the structure of that integral balance, which is not necessary, as shown by the previous theorem (its proof can be developed by direct calculation). Moreover, if we presume such an integral balance a priori when $M$ is linear, we should postulate the existence of the self-action z, which has been, in contrast, deduced with the procedure used in the previous theorem.

Another option could be a virtual power approach. We could assume the identity

$P_{B}^{rel} (\dot{y}, \dot{ν}, w) = P_{B}^{rel - int} (\dot{y}, \dot{ν}, w),$

as a first principle, presuming its validity for any choice of (compactly supported) rate fields.⁴⁰ Such an assumption, however, is a way to affirm that we are postulating a priori the weak form of the pointwise balances of actions. We should then presume the existence of all ingredients appearing in the balance equations, having already in mind their structure. The difference between a procedure requiring the invariance of external power and the virtual power approach is not particularly appreciable in the standard setting for the elements appearing in the inner power are already present in the external one. In contrast, in the enriched setting discussed here, in postulating the inner power we should introduce a priori the self-action z without showing the need for its existence.

There is an indeterminacy in the pointwise balance of microactions (1.15). In fact, the addition to z of any z′ belonging to $Ker A^{*}$ satisfies Eq. (1.16). Hence, it would appear in Eq. (1.15).⁴¹ The indeterminacy can be eliminated by covariance techniques (de Fabritiis & Mariano, 2005), i.e., by requiring at least invariance with respect to the generalized class 1. If we require such an invariance for the external power or the relative one, however, we do not obtain an appreciable result. Covariance requirements need the use of the balance of energy or the second law of thermodynamics.⁴² In this case, however, energy is involved, and the specification of the list of state variables is required. This way we would pay for the use of a more stringent invariance requirement by losing the hierarchical distinction between the derivation of the balance equations and the discussion of constitutive issues, the former determined without the need for the latter.

Invariance of the relative power with respect to translations and rotations in the reference space $E^{3}$ determines integral balances of configurational forces and couples, the ones governing the bulk mutation. Hence, we are not forced to introduce a priori a stress $P$ and bulk configurational forces and then to identify them with $ψ I - F^{*} P - N^{*} S$ and −F*b^‡ − N*β^‡ by means of an additional procedure, the one described in Gurtin (1995, 2000a).

The assumption that f is solely dissipative reduces to the inequality f · w ≥ 0, the equality sign being valid only when w = 0, which implies that f is a linear function of w, with a positive coefficient. The result changes Eq. (1.17) into an evolution equation.

4.6 And if we disregard $M$ during changes in the observers?

In principle we could consider ν to be observer-independent. In this case the invariance of the external power or the relative one with respect to isometry-based changes in the observer would not lead (under appropriate regularity) to the pointwise balances of microstructural actions (1.14) and (1.15), as is obvious from the procedure sketched above. Hence, ν would play a parametric role at equilibrium and its evolution should be prescribed a part, with the sole proviso of satisfying the second law of thermodynamics. This way we would enter the scheme of internal variables, intended just as entities describing the removal from thermodynamical equilibrium (see de Groot & Mazur, 1962 for a standard treatise on the matter from the point of view of nonequilibrium thermodynamics, above all with reference to chemical processes). The approach has been coupled with deformations in Coleman and Gurtin (1967), Halphen and Nguyon (1975), with a subsequent rich literature, in the majority of cases related to plasticity and/or damage (see, e.g., Krajcinovic, 1996). The balance of microstructural actions can be reduced to the evolution equation that appears in internal variable schemes in the absence of external body actions (including even possible rotational microstructural inertia), microstress, and when the self-action is the sum of conservative and dissipative components (see Mariano, 2002 for details). However, the relation is just formal: the difference in the use of the notion of an observer continues to distinguish the two approaches.

When there is no link between changes of frames in ${\tilde{E}}^{3}$ and changes of atlas on $M$ , i.e., when {λ} is empty, the invariance procedure leading to the previous theorem would lead to a splitting of Eq. (1.11) into two integral balances:

$\int_{b} (y - y_{0}) \times b^{‡} d x + \int_{\partial b} (y - y_{0}) \times Pn d H^{2} = 0$

and

$\int_{b} A^{*} β^{‡} d x + \int_{\partial b} A^{*} S n d H^{2} = 0 .$

si675_e

The first one is the standard balance of couples leading to the symmetry of PF* under the regularity condition mentioned in the theorem above. The second balance would produce once again (1.14) and

$A^{*} z + (D A^{*}) S = 0 .$

This circumstance stresses the role played in this setting by the notion of an observer and its changes.

4.7 Perspectives: low-dimensional defects, strain-gradient materials, covariance of the second law

In the presence of structured discontinuity surfaces, those endowed with their own surface energy for they are able to sustain surface standard and microstructural actions (it is a reasonable mathematical scheme for thin transition layers between phases, for example), the expression of the relative power has to be extended with the addition of two contributions: (1) the relative power of surface actions, and (2) the surface power of disarrangements containing fluxes of the surface energy and the surface counterparts of f and μ. The list of surface actions includes the standard surface stress and surface microstress and self-actions—the existence of the last actions is proven in Mariano (2002). The definition of the relative power in this case and the results emerging from the requirement of its invariance are given in Mariano (2014). However, a special case of that extended expression of the relative power in the conservative case emerges from the extension of Nöther's theorem, presented in de Fabritiis and Mariano (2005), to the elasticity of complex materials endowed with structured discontinuity surfaces. Different approaches can be followed to analyze the mechanics of structured discontinuity surfaces, with other assumptions and different procedures (Gurtin, 2000a; Gurtin & Struthers, 1990; Maugin & Trimarco, 1995; Simha & Bhattacharya, 2000). The reader will be able to distinguish the procedure requiring the smallest number of assumptions, a peculiarity allowing it to be a flexible tool to tackle nonstandard situations.

Analogous generalizations can be obtained in the presence of line defects endowed with their own line energy. This one is a scheme that we can adopt, for example, for the description of the dislocation core in metals. In this case, an expression of relative power in a setting where dissipation is essentially attributed to the counterparts of f and μ and to the self-action is given in Mariano (2012b).

We can define the relative power even for strain-gradient materials, including the hyperstress (a third-rank tensor). When we focus attention on the actions in the bulk alone, in the conservative setting of strain-gradient elasticity an expression for the relative power can be derived from the Nöther theorem (for it, see Kouranbaeva & Shkoller, 2000). The extension of the Nöther theorem to the case in which structured discontinuity surfaces appear in strain-gradient elasticity (e.g., think of two strain-gradient elastic materials glued to each other) is proven in Mariano (2007), where the surface hyperstress was introduced first. In the dissipative setting, the appropriate expression for the relative power including bulk and surface hyperstress is given in Mariano (2014).

Besides issues concerning low-dimensional defects endowed with their own energy in single-gradient or second-gradient field theories, another perspective deals with covariance in a dissipative setting. In fact, we can write a version of the second law of thermodynamics including the relative power and impose invariance under diffeomorphism-based changes in observers. The procedure requires (1) the specification of the list of state variables (it includes the metric in the reference place) and (2) a rule satisfied by the rate of change of the free energy under changes in observers (it affirms essentially that the energy changes tensorially, as assumed in elasticity in Marsden & Hughes, 1983). This way we deduce (1) the existence of the stresses, (2) the pointwise balances in the previous theorem, (3) the constitutive restrictions (among them we find that the conservative part of the Eshelby stress is the derivative of the free energy with respect to the material metric), and (4) the structure of the dissipation. Details in the case of elastic–plastic hardening materials are provided in the next section.

5 Balance equations from the second law of thermodynamics: the case of hardening plasticity

In Section 4 we saw the link between isometric changes in observers and balance equations, established by the invariance of the external power or the relative power—the latter case determines even configurational balances. As anticipated above, an analogous link exists among diffeomorphism-based changes in observers, the existence of the standard stress, constitutive restrictions, and even dissipation (for it the use of the second law of thermodynamics is necessary). In this sense we can affirm that the structure of pointwise balances is covariant.

The concept can be specified in different settings:

• When the environment is purely conservative, horizontal and vertical first variations—the latter involving the actual shape $B_{a}$ of the body— of the total energy or a Lagrangian determine balance equations in weak or pointwise form, depending on the regularity of the fields involved (for nonlinear elasticity of simple bodies, see Giaquinta et al., 1989). In this setting, if we take into account the way in which they are defined, horizontal and vertical variations play the role of (can be interpreted as) diffeomorphism-based changes in observers.

• Another setting is established by the Marsden–Hughes theorem (1983), which enlarges the purely conservative case to include nonconservative body forces. The theorem deals with the standard format of continuum mechanics (Cauchy bodies). It is based on a requirement of invariance of the first law of thermodynamics, written with respect to the actual place $B_{a}$ , under changes in observers governed by the action of diffeomorphisms altering the physical ambient space ${\tilde{E}}^{3}$ where we evaluate the actual places $B_{a}$ . Ancillary but not less fundamental assumptions are (1) the dependence of the internal energy on the metric in ${\tilde{E}}^{3}$ , and (2) that the energy density behaves tensorially, as the density of a volume form, under diffeomorphism-based changes in observers. The results are (1) the derivation of the existence of the Cauchy stress tensor, (2) the pointwise balance equations of forces and couples, and (3) the constitutive restriction linking the Cauchy stress to the derivative of the energy with respect to the spatial metric (Doyle–Ericksen formula). The basic limitation of the theorem and that of its possible generalizations involving the description of microstructures and/or the relative power is that the use of the first law of thermodynamics excludes the possible presence of dissipative stresses, like the nonconservative part of the Piola–Kirchhoff stress in viscoelasticity. Hence, it does not furnish the expression for the dissipation in the presence of plastic effects.

• To go beyond the point of view of the Marsden–Hughes theorem, with the aim of including dissipation, we need to impose covariance to the second law of thermodynamics. This idea appeared first in Mariano (2013) with reference to the description of elastic-perfectly plastic bodies.⁴³ Here, I refer once again to plastic behavior for the discussion allows me to put together diffeomorphism-based changes in observers with the notion of independent tangent maps, mentioned in the introduction as one of the possible approaches to the description of the mutations in solids. I present a mild generalization of the result in Mariano (2013) to the case of the traditional representation of hardening, without adding proofs, for they are exactly like the ones in Mariano (2013) to within an addendum that needs just a little care to be managed.

• The setting, also, allows the reader to think once again of analogies and differences between the framework discussed in previous sections for describing micro-to-macro interactions in solids and the scheme of internal variables that appears useful at times when we describe phenomena far from thermodynamic equilibrium.

To express clearly the result, recalling some notions can be expedient.

5.1 Multiplicative decomposition of F

Plasticity is the macroscopic emergence of the cooperation of microscopic structural changes in the matter. In this sense, the phenomenon is a mutation.

There are various manners of interpreting plastic phenomena. In this sense we can speak of theories of plasticity instead of a unique format.

A traditional approach is based on a multiplicative decomposition of the deformation gradient, F, into elastic, F^e, and plastic, F^p, factors:

$F = F^{e} F^{p} .$

For F^p we presume that

• det F^p > 0 at all x in $B$ , where F^p is defined,

• the field x F^p (x) is differentiable, and

• curl F^p ≠ 0.

With the last assumption we affirm that F^p is not intended as the spatial derivative of any deformation. F^p is only a linear operator that maps the tangent space to $B$ at x onto another linear space that we imagine (and in this setting we cannot do more than imagine) as the tangent space to what is commonly called an intermediate configuration, determined by the structural changes in the material. This way we are factorizing the elastic-plastic process. The choice appears fictitious for elastic and plastic changes in the matter cooperate. However, the factorization of elastic and plastic phenomena seems to have microscopic justification at least in the case of crystals even without calling upon explicitly a notion of an intermediate configuration. Parry (2004) has shown that, for crystal lattices, a view based on the dislocation tensor and other elastic invariants⁴⁴ leads to a decomposition of the type F = F^e₁F^pF^e₂, which obviously reduces to the traditional one when F^e₂ coincides with the identity (see also Parry, 2001). Another justification constructed by considering deformations as $SBV (B)$ maps⁴⁵ appears in Reina and Conti (2014).

When we accept the multiplicative decomposition F = F^eF^p, the right Cauchy–Green tensor in its version with both covariant components is

$C = F^{p *} F^{e *} \tilde{g} F^{e} F^{p} = F^{p *} C^{e} F^{p},$

with $C^{e} = F^{e*} \tilde{g} F^{e}$ . The second-rank tensor C^e, endowed with a positive determinant, is the so-called elastic right Cauchy–Green tensor. Its components are covariant. Precisely, C^e is the pullback of the spatial metric $\tilde{g}$ through F^e.

The 1-contravariant, 1-covariant versions of C^e and C are, respectively, ${\tilde{C}}^{e} = F^{eT} F^{e}$ and $\tilde{C} = F^{pT} {\tilde{C}}^{e} F^{p}$ . Also, the push-forward by F^p of the material metric, namely, the second-rank tensor $\bar{g} : = F^{p - *} g F^{p - 1}$ , is independent of any change of frame on $B$ , induced by diffeomorphisms of the reference space onto itself. The proof is elementary and can be found in Mariano (2013). Notice that ${\tilde{C}}^{e} = {\tilde{g}}^{- 1} C^{e}$ and $\tilde{C} = {\tilde{g}}^{- 1} C$ .

The plastic factor of the deformation gradient maps the tangent spaces to $B$ at different points onto distinct linear spaces. We do not have any information assuring us that we can glue together all the linear spaces obtained by means of F^p, varying x in $B$ , to construct the tangent bundle of a manifold that could be a fit region of the type $B$ , identified with what we call an intermediate configuration.

This is another way to interpret the assumption curl F^p ≠ 0, which excludes the possibility to individualize such a configuration that remains fictitious. In principle, it is even not necessary to imagine an intermediate (global) configuration, as I have already affirmed, although, point by point, the factorization F^eF^p implies a mapping from $T_{x}^{*} B$ onto an unknown linear space determined by F^p. The linear spaces determined by F^p varying x in $B$ could be interpreted as the tangent spaces of different configurations. Such an interpretation brings us back to the notes in the introduction dealing with virtual tangent spaces and multiple reference configurations.

5.2 Factorization of changes in observers

Consider changes in observers in the generalized class 2 presented previously, excluding what is pertinent to $M$ , which does not appear in this section. The velocity field $\bar{v}$ in the relation

$\dot{y} \to {\dot{y}}^{#} : = \dot{y} + \bar{v}$

is a function of x through y := u(x), so we write $D \bar{v} = D_{y} \bar{v} F$ . For $D \bar{v}$ , according to Mariano (2013), I assume a multiplicative decomposition of the type

$D \bar{v} = \overset{\cdot}{\bar{H^{e} F^{p}}} .$

(1.22)

The presence of F^p does not reduce the generality of $D \bar{v}$ , owing to the arbitrariness of H^e, a linear operator from the linear space individuated by F^p to the translation space over ${\tilde{E}}^{3}$ . Although it has no effects on the generality of changes in observers, the previous factorization of $D \bar{v}$ is crucial for the result on hardening plasticity presented here.

A relation given below is useful. To get it, we define

$L_{H}^{e} : = {\dot{H}}^{e} F^{e - 1}, L^{p} : = {\dot{F}}^{p} F^{p - 1},$

and use the identity $D \bar{v} = D_{y} \bar{v} F$ and Eq. (1.22). By computing $D_{y} \bar{v}$ , we get⁴⁶

$D_{y} \bar{v} = \overset{\cdot}{\bar{H^{e} F^{p}}} F^{- 1} = Sym L_{H}^{e} + Skw L_{H}^{e} + H^{e} L^{p} F^{p - 1} .$

5.3 A version of the second law of thermodynamics involving the relative power

In accordance with Mariano (2013), I use here an isothermal version of the second law of thermodynamics (a mechanical dissipation inequality), which, for any part $b$ of $B$ and any choice of the velocity fields, is given by

$E (\dot{y}, w; ψ, b) : = \frac{d}{d t} \int_{b} ψ d x - P_{b}^{rel} (\dot{y}, w) \leq 0,$

(1.23)

where, we recall, ψ is the free energy and in the relative power we write $t$ instead of Pn in the surface integral including the contact actions.

The common expression of the mechanical dissipation inequality involves the external power alone. Here, we include the relative power, extending in this sense the standard inequality to account for the remodeling of the material structure induced by the plastic phenomena.

When this is the case—not here, however—the inequality can be further generalized by including the expression of the relative power that involves microstructural actions or hyperstresses in the presence of strain-gradient effects.

A direct description of the microstructures does not appear in this section for we are restricting the treatment to the standard framework of hardening plasticity just to exemplify how some general ideas discussed here work on a well-known ground.

5.4 Specific constitutive assumptions

The assumptions listed below apply:

H1 The state variables pertaining to the generic material element are F, F^p, and the hardening parameter α, a second-rank tensor taking into account hardening anisotropy and measuring how much during the plastic process the material goes far from thermodynamic equilibrium, where α plays only a parametric role, being considered an observer-independeni iniernal variable. Hence, the free energy is of the form

$ψ = \hat{ψ} (x, F, F^{p}, α) .$

It satisfies further assumptions:

H1.a For any linear operator $G \in Hom (R^{3}, R^{3})$ with det G = 1,

$\hat{ψ} (x, F, F^{p}) = \hat{ψ} (x, FG, F^{p} G, α) .$

H1.b The free energy is objeciive. The requirement emerges naturally from the isotropy of the three-dimensional Euclidean space: if we rotate rigidly a frame in the physical space, the free energy of a material should not change. Elements of the orthogonal group SO (3) describe rotations, we recall.

Assumption H1.a is completely standard (see, e.g., Mielke, 2004; Ortiz & Repetto, 1999). This implies that

$\hat{ψ} (x, F, F^{p}, α) = \bar{ψ} (x, F^{e}, \bar{g}, α),$

where $\bar{g} : = F^{p - *} g F^{p - 1}$ , and we adopt the multiplicative decomposition of F. The previous restriction on the structure of the energy enlarges the common use of the invariance under the action of G, as described above, interpreted as a constraint leading to a structure of the free energy of the type $\hat{ψ} (F, F^{p}) = \bar{ψ} (F^{e})$ alone, not considering $\bar{g}$ , which plays, in contrast, a role here, as we shall see below.
For any Q ∈ SO (3) that rotates frames in the space where the actual places are determined, since $\bar{g}$ has no components in that space and α is considered here to be observer-independent, just to follow the standard view on hardening, objectivity is written formally as

$\bar{ψ} (x, F^{e}, \bar{g}, α) = \bar{ψ} (x, Q F^{e}, \bar{g}, α),$

which implies

$\bar{ψ} (x, F^{e}, \bar{g}, α) = \overset{˘}{ψ} (x, {\tilde{C}}^{e}, \bar{g}, α) .$

Since ${\tilde{C}}^{e} = {\bar{g}}^{- 1} C^{e}$ , we can write

$\overset{˘}{ψ} (x, {\tilde{C}}^{e}, \bar{g}, α) = \tilde{ψ} (x, C^{e}, \bar{g}, α) .$

H2 Under diffeomorphism-based changes in observers acting on both the ambient space and the reference space (it is the generalized class 2 in which we do not consider $M$ ), we get

$\frac{d ψ^{#}}{d t} = \frac{d ψ}{d t} + {\frac{\partial ψ}{\partial C^{e}} |}_{↓ #} \cdot 2 Sym L_{H}^{e} + {\frac{\partial ψ}{\partial \bar{g}} |}_{*} \cdot L_{\bar{w} g} + \frac{\partial ψ}{\partial α} \cdot \dot{α},$

where (·) |_* indicates the pullback in the reference place of (·), while (·) |_↓# is the push-forward of (·) in the current configuration with the additional lowering of the first index.⁴⁷
The push-forward of (·) in the current configuration, with the additional lowering of the first index, is given explicitly by

${\frac{\partial ψ}{\partial C^{e}} |}_{↓ #} = \tilde{g} F^{e} \frac{\partial ψ}{\partial C^{e}} F^{e *},$

which is, in components,

${({\frac{\partial ψ}{\partial C^{e}} |}_{↓ #})}_{i}^{j} = {\tilde{g}}_{ik} {(F^{e})}_{α}^{k} {(\frac{\partial ψ}{\partial C^{e}})}^{α β} {(F^{e *})}_{β}^{j} .$

si754_e

${\frac{\partial ψ}{\partial C^{e}} |}_{↓ #}$ si755_e is then a 1-contravariant, 1-covariant tensor in the current place, an element of the dual space of Sym L^e_H.
The pullback in the reference place of (·) is given by

${\frac{\partial ψ}{\partial \bar{g}} |}_{*} = F^{p^{T}} \frac{\partial ψ}{\partial \bar{g}} F^{p - *},$

which is, in components,

${({\frac{\partial ψ}{\partial \bar{g}} |}_{*})}^{ij} = {(F^{p^{T}})}_{α}^{i} {(\frac{\partial ψ}{\partial \bar{g}})}^{α β} {(F^{p - *})}_{β}^{j} .$

si757_e

The term $L_{\bar{w}} g$ indicates that the (virtual) velocity $\bar{w}$ alters the material metric g, dragging it. Indirectly, then, $\bar{w}$ induces changes in the metric $\bar{g}$ on the intermediate configuration, since $\bar{g}$ is the push-forward of g induced by F^p. Hence, instead of writing ${(\partial ψ / \partial \bar{g}) |}_{*} \cdot L_{\bar{w}} g$ , we could consider the push-forward of $L_{\bar{w}} g$ through F^p, multiplying it by $\partial ψ / \partial \bar{g}$ , which would conceptually be the same thing.
The spatial metric $\tilde{g}$ can suffer alterations when the physical space is altered by a time-dependent family of diffeomorphisms having as an infinitesimal generator the (virtual) velocity $\bar{v}$ . There is then an effect on ψ through C^e, the pullback through F^e of $\tilde{g}$ into the linear space determined by F^p and coinciding with $F^{p} T_{x} B$ . This is the reason justifying the introduction of the term (∂ψ/∂C^e) |_↓# in assumption H2.
In assumption H2 the factor $L_{\bar{w}} g$ has no counterpart $L_{\bar{v}} \tilde{g}$ in the current place because H^e is not the spatial derivative of any vector field. This aspect justifies the presence of the factor SymL^e_H.

H3 Contact actions depend on the same state variables entering the energy.⁴⁸

5.5 The covariance principle in a dissipative setting

Independently of the use of external, relative, or internal power, the isothermal version of the second law of thermodynamics is a certain expression lesser than or equal to zero, say, B ≤ 0. Another observer $O^{'}$ always evaluates an inequality, say, B′ ≤ 0, with B ≠ B′, in general.

Thanks to assumption H2 and the linearity of the relative power with respect to the velocities $\dot{y}$ and w, the pullback of B′ into $O$ gives rise to an inequality of the type B* ≤ 0, with B* = B + B^†. The addendum B^† involves the velocity fields $\bar{v}$ and $\bar{w}$ , entering the rules of changes in observers. Conversely, if we push forward B to the frames defining the observer $O^{'}$ , we find an inequality of the type B′ + B^‡ ≤ 0 because now the change in observer $O^{'} \to O$ is governed by the inverse of the previous maps, namely, h⁻¹ and ${\hat{h}}^{- 1}$ . Hence, B^‡ is, in principle, different from B^†.

Previous remarks suggest a principle.

Proposition 2 (Covariance principle in a dissipative setting (Mariano, 2013))

In any change in observer in the generalized class 2 (defined previously), when we project the mechanical dissipation inequality evaluated by an observer into the frame defining the other observer, the additional term arising in the process is always nonpositive.

Essentially, the principle affirms that the dissipative nature of a process is indifferent to changes in observers.

5.6 The covariance result for standard hardening plasticity

The covariance principle in a dissipative setting is the key ingredient for proving the following theorem. It shows the covariant structure of the equations governing the standard description of hardening plasticity in the finite-strain regime.

Theorem 2

If we adopt for the inequality (1.23) the covariance principle in a dissipative setting, under assumptions H1, H2, and H3, the expression for the contact actions in terms of stress follows and if the fields x P and $x \mapsto P : = ψ I - F^{*} P$ , with I the identity the space of second-rank 1-contravariant, 1-covariant tensors,⁴⁹ are continuous and differentiable everywhere in $B$ , except for a (fixed and free of its own energy) smooth surface Σ, oriented by the normal m, where they suffer bounded jumps, and the fields x b, x F*b, and x ∂_xψ are integrable over $B$ , the local balance equations

$Div P + b^{‡} = 0,$

(1.24)

$Skw (P F^{*}) = 0,$

(1.25)

$Div P - F^{*} b^{‡} - \partial_{x} ψ = f,$

(1.26)

$Skw (g^{- 1} P) = - 2 \bar{e} μ,$

(1.27)

with $\bar{e}$ Ricci's symbol with all contravariant components, namely, ${\bar{e}}^{ABC}$ , hold in the bulk, while

$[P] m = 0, [P] m = 0$

(1.28)

are valid along Σ. Moreover, we get

$P = 2 \tilde{g} F^{e} \frac{\partial \tilde{ψ} (x, C^{e}, \tilde{g}, α)}{\partial C^{e}} F^{p - *},$

si791_e (1.29)

$\bar{P} = 2 F^{P^{T}} \frac{\partial \tilde{ψ} (x, C^{e}, \tilde{g}, α)}{\partial \bar{g}} \bar{g} F^{p - T},$

si792_e (1.30)

with $\bar{P} : = g^{- 1} P_{g}$ , and the local mechanical dissipation inequality

$P \cdot F^{e} {\dot{F}}^{p} + π \cdot \dot{α} \geq 0,$

(1.31)

where π is the thermodynamic flux $π : = - \frac{\partial \tilde{ψ}}{\partial α}$ si795_e conjugated with $\dot{α}$ in terms of dissipation production.

The proof of this theorem can be developed by reproducing the analogous proof in Mariano (2013), with the minor variations required by the presence of α. It is left to the reader. Notice that with respect to what is presented in Mariano (2013), assumption H2 is varied by the insertion of the factor 2, which appears then in Eq. (1.29).

Evolution equations for ${\dot{F}}^{p}$ and $\dot{α}$ can be derived by accepting the maximum dissipation principle. This is a standard view determining associate plasticity (Simo & Hughes, 1998), while the previous theorem is completely nonstandard.

To state the maximum dissipation principle, we need first to introduce an admissibility criterion. It can be expressed in terms of stress or strain. Here we adopt a standard representation in terms of stress P and thermodynamic flux π and write f(P, π) ≤ 0 for such a criterion, considering admissible the pairs (P, π) for which f satisfies the previous inequality.

The principle of maximum dissipation⁵⁰ prescribes that among all possible pairs (P, π) satisfying the admissibility criterion, the one that is physically realized maximizes the dissipation.

The expression for the dissipation here is the inequality (1.31). Maximizing it among admissible pairs (P, π) is tantamount to minimizing the Lagrangian

$L : = - (P \cdot F^{e} {\dot{F}}^{p} + π \cdot \dot{α}) + λ f (P, π)$

with respect to P and π. When f is differentiable, we get

${\dot{F}}^{p} = λ F^{e - 1} \frac{\partial f (P, π)}{\partial P} = λ F^{p} F^{- 1} \frac{\partial f (P, π)}{\partial P}$

and

$\dot{α} = λ \frac{\partial f (P, π)}{\partial π},$

si801_e

with λ ≥ 0, so we can identify the Lagrange multiplier λ with the rate of the plastic shift (Marsden & Hughes, 1983), and we have first λf(P, π) = 0, and then we can prove the consistency condition $λ \dot{f} (P, π) = 0$ (see Simo & Hughes, 1998 for the proof).

Another way of determining evolution laws for ${\dot{F}}^{p}$ and $\dot{α}$ is, obviously, to prescribe them. The choice depends on the specific case that we are handling.

5.7 Doyle–ericksen formula in hardening plasticity

Relation (1.29) allows us to show that the Doyle–Ericksen formula, commonly derived and discussed in finite-strain elasticity (Doyle & Ericksen, 1956), holds also for elastic–plastic materials with hardening.

First, consider that Eq. (1.29) can be written as

$P = 2 ρ \tilde{g} F^{e} \frac{\partial {\tilde{ψ}}^{◊} (x, C^{e}, \bar{g}, α)}{\partial C^{e}} F^{p - *},$

si805_e

after defining ${\tilde{ψ}}^{◊} (x, C^{e}, \bar{g})$ as the free energy per unit mass, namely,

$\tilde{ψ} (x, C^{e}, \bar{g}, α) = 2 ρ {\tilde{ψ}}^{◊} (x, C^{e}, \bar{g}, α),$

ρ being the density of mass in the reference configuration.

We write ρ_a for the density of mass in the actual configuration. When mass is preserved, we have ρ = ρ_a det F.

Proposition 3

In finite-strain (traditional) hardening plasticity, in the assumptions satisfying the covariance theorem above, if the mass is conserved, the Doyle–Ericksen formula

$σ = 2 ρ_{a} \tilde{g} \frac{\partial ψ}{\partial \tilde{g}}$

si808_e

holds true.

Proof. Since by definition the right elastic Cauchy–Green tensor with all covariant components is defined by $C^{e} : = F^{e *} \tilde{g} F^{e}$ , namely, $C_{α β}^{e} = {(F^{e*})}_{α}^{i} {\tilde{g}}_{ij} {(F^{e})}_{β}^{j}$ , we can consider C^e as a function of F^e and $\tilde{g}$ , namely, $C^{e} = {\tilde{C}}^{e} (F^{e}, \tilde{g})$ . Since $ψ = \tilde{ψ} (x, C^{e}, \bar{g})$ , as a result of G-invariance and objectivity requirements, we then have

$\frac{\partial ψ}{\partial \tilde{g}} = F^{e} \frac{\partial \tilde{ψ} (x, {\tilde{C}}^{e} (F^{e}, \tilde{g}), \bar{g}, α)}{\partial C^{e}} F^{e *},$

$\frac{\partial \tilde{ψ} (x, C^{e}, \bar{g}, α)}{\partial C^{e}} = F^{e - 1} \frac{\partial ψ}{\partial \tilde{g}} F^{e - *} .$

si815_e

From the definition of the first Piola–Kirchhoff stress, it then follows that

$\begin{array}{l} σ & = \frac{1}{det F} P F^{*} = \frac{2 ρ}{det F} \tilde{g} F^{e} \frac{\partial {\tilde{ψ}}^{◊} (x, C^{e}, \bar{g}, α)}{\partial C^{e}} F^{p - *} F^{*} \\ = 2 ρ_{a} \tilde{g} \frac{\partial ψ}{\partial \tilde{g}} F^{e - *} F^{p - *} F^{*} = 2 ρ_{a} \tilde{g} \frac{\partial ψ}{\partial \tilde{g}} F^{- *} F^{*}, \end{array}$

si816_e

which completes the proof.

5.8 Remarks and perspectives

• π does not appear in the power as an action conjugated with $\dot{α}$ , with an eventual identification with $- \frac{\partial \tilde{ψ}}{\partial α}$ . For this reason π does not contribute to any balance equation. Hence, α is an internal variable in the sense of nonequilibrium thermodynamics (see, e.g., Capriz & Giovine, 1997; de Groot & Mazur, 1962).

• The previous remark inspires naturally (at least for me) another question: Can plasticity be described in the sense of the framework discussed in previous sections? In other words, can we associate with the plastic phenomena microstructural interactions satisfying their own balance equations? The approach would be in contrast with the choice of α, which is an unknown parameter useful just to measure the trend far from the neighborhood of thermodynamical equilibrium, where the mechanical behavior ceases to be elastic. In principle there is no obstacle to obtaining an affirmative answer. Of course it is matter of modeling because many choices of the nature of ν can be made, depending on the specific mechanism that we want to describe. An approach that connects the evolution of microdefects leading to plasticity with microactions satisfying their own balance is given in Dłużewsky (1996). The micromorphic scheme, the case when ν is a second-rank tensor (see Mindlin, 1964, for the linear case and also Green & Rivlin, 1964; Toupin, 1962, for the nonlinear setting), was adapted to plasticity in Fleck and Hutchinson (1997). Another example pertaining specifically to plasticity is given in Gurtin (2000b), where ν is identified with the slip velocity in single crystals. The subsequent pertinent literature is rather wide (see, e.g., Gudmundson, 2004; Gurtin & Anand, 2009; J. W Hutchinson, 2012; Reddy, Ebobisse, & McBride, 2008), and a specific essay could be dedicated to review it. The proposed models seem adequate (even particularly in certain cases) to capture various aspects of plastic phenomena and are often evidently useful to develop computations that may solve practical problems. And essentially the balance equations involving both macroscopic and microscopic actions (the latter associated with the mechanisms that the authors believe are essential to the description of specific aspects of the plastic flows) are in the most general cases deduced by resorting to the principle of virtual power as a basic source. To me this choice involves a foundational problem. In fact, when we start from the virtual power principle to find balance equations for a certain model, morally we have already in mind the exact structure of these balances. I have already stressed the point speaking in general about microstructures in previous sections. Proposing a virtual power principle is tantamount to assigning a priori the weak form of balance equations. Essentially, it could be the same to declare candidly in pointwise form the balances that one believes to be necessary for the analysis at hand. The question pertains to the internal actions appearing in these balances. When we assign the expression of the virtual power, we are postulating the existence of such inner actions (see, e.g., Gurtin & Anand, 2009). It could perhaps be useful to find that these actions are necessary, by means of some invariance procedure, for example, from the external power alone, as it appears in previous sections. Hence, to me a rather interesting question is the following: For what available models of plastic phenomena, based on balances of microactions, it is possible to prove the need for the existence of the inner actions that they involve (when they are involved) by means of some invariance procedure accepted as a first principle without postulating such actions? An answer could probably help in discriminating among models of the same phenomena.

• Beyond the question of the emergence of self-actions in the balance equations involving microactions, another issue to be discussed is the choice of ν to represent adequately plastic mechanisms. The issue could appear volatile when considered in full generality, for an answer depends on the specific material or phenomenon under analysis. However, in the case of crystalline materials, some details can be provided. For crystal lattices, Parry and Šilhavý (2000) have determined a basis for elastic invariants (see also Davini, 1996, for their definition). In a discussion we had in September 2001 at Taormina, Parry and I were in agreement that generic functions of the elastic invariants could be an adequate candidate for ν, but we neither followed up on our discussion nor wrote something about it. If we accept our remark and want to follow it, however, we have to handle it with care. In fact, a minimalistic choice for ν could be the dislocation density tensor, even without considering its gradient. The choice could be also appealing for it can be associated with geometric properties of the body manifold (its torsion). However, the same choice could be criticized. At the end of a 2001 paper entitled “Benefits and shortcomings of the continuous theory of dislocations,” Kröner (2001, p. 1132) wrote: “The greatest shortcoming is that the dislocation density tensor α, no matter whether introduced through differential geometry or in the conventional way, measures the average dislocation density only and, therefore, regards the internal mechanical state utmost incompletely. In principle, this shortcoming could be overcome by reorientation of dislocation theory towards a statistical theory, but only with highest expenditure of computations.” The remark suggests at least prudence in selecting a candidate for ν.

• Viewing plasticity in terms of the general framework of multiscale and multifield representations of material complexities, as introduced in previous sections, opens the way to models of strain-gradient effects in plastic phenomena. The necessity of the extension has been pointed out by crucial experiments (Fleck, Muller, Ashby, & Hutchinson, 1994), which have evidenced the effects due to the grain size in the torsion of thin metallic wires. These effects can be interpreted in terms of strain gradients. As a consequence, a number of related models of strain-gradient plasticity have been developed. Some of them have been quoted in the previous items, and at all times we have discussed the possibility of interpreting plastic phenomena in terms of a framework involving the balance of microactions due to microstructural events. A question is then the origin of the link between the analysis of the strain-gradient effects and the multifield setting. To give an adequate answer, we have to refer to a basic 1985 paper by Capriz (1985), with a preamble concerning a contemporary work by Dunn and Serrin (1985), who showed that the presence of the spatial derivatives of strain in the list of constitutive variables, defining the state of a material point, is compatible with the second law of thermodynamics when the standard inner power density $P \cdot \dot{F}$ is augmented by an addendum that depends on the same spatial derivatives of strain appearing in the list of state variables, decided from the beginning (such a description is rough, however it is sufficient to explain our argument here). Dunn and Serrin called such an addendum interstitial working, to remind us of the pioneering use of gradients of density made by Korteweg to describe capillary effects. However, notwithstanding the clear indication of the way to be followed to consider correctly strain-gradient effects, they did not go into the nature of the interstitial working with the aim of linking it with microstructural events. The link was established explicitly by Capriz (1985) for second-grade elasticity, i.e., in the case in which we consider the first derivative of F in the list of state variables. His remark is simple but has deep consequences. Let us consider the multifield and multiscale model-building framework discussed in previous sections. Imagine also that external bulk actions on the microstructure are absent, i.e., β = 0. If there is some physical reason to imagine an internal constraint of the type $ν = \hat{ν} (F)$ , in a conservative setting, the multifield and multiscale scheme accounting for material complexity reduces to second-grade elasticity and the necessary interstitial working is no more than the power of the microactions. Ofcourse, without considering the multifield framework used in Capriz (1985), for second-grade elasticity, we could introduce directly a hyperstress, a third-rank stress performing inner power in the spatial derivative of $\dot{F}$ , developing then the relevant mechanical structures. Such a stress emerges naturally in a conservative setting when we consider an elastic energy depending on F and DF, and we evaluate the first variation of it around minimizers, after proving their existence. In the nonconservative case, the existence of a hyperstress should be established, e.g., by a Cauchy-type theorem—an issue tackled, but not yet closed (the reader can find basic remarks in Fosdick & Virga, 1989; Nečas & Šilhavý, 1991; Noll & Virga, 1990). With respect to the actual state of the art, the interpretation in Capriz (1985) establishes a direct link between higher-order stresses and microstructural events, whatever they may be.

• The general multifield and multiscale framework for the mechanics of microstructures appears clearly useful when we turn our attention to strain-gradient plasticity and identify ν with F^p (the literature in this sense is rather wide—see the remarks in Gurtin & Anand, 2009) or even with the plastic part of the small-strain tensor (Fleck & Hutchinson, 1997). In accepting the identification of ν with F^p and following the guidelines proposed in previous sections, we would face the problem of interpreting changes in observers over $M$ , now the set including F^p, because F^p is a factor of the macroscopic deformation gradient F. The question deserves further investigations.

6 Parameterized families of reference shapes: a tool for describing crack nucleation

Besides the notion of relative power and the use of virtual tangent maps appearing in the description of plasticity when we adopt the multiplicative decomposition, another possible way to account for multiple reference shapes is to consider a large class of them, all covering the set $B$ and differing from one another by possible defect patterns. I have already sketched this point of view in the introduction. Here, before describing the formal structure of the approach, I find it expedient to recall some disparate notions that delineate the scenario.

6.1 A remark on standard finite-strain elasticity

Consider the energy of an elastic simple body in the large-strain regime, disregarding body forces for the sake of simplicity:

$E (u, B) : = \int_{B} e (x, D u (x)) d x .$

A result obtained by Coleman and Noll (1959) formalizes the physical incompatibility between the objectivity of the elastic energy density e (x, F) and its convexity with respect to F. In essence, it implies loss of uniqueness of equilibrium configurations under prescribed boundary conditions.

A requirement of polyconvexity of e with respect to F (the dependence suggested in Ball (1976/77)) reconciles analytical and physical instances. Polyconvexity means that we have to consider the elastic energy density as a convex function of the triple constituted by F, cof F, and det F. Also, when we take a polyconvex elastic energy and try to determine its minimizers, the minimizing sequences of F, cof F, and det F are independent of each other. The procedure assures that the strain compatibility of the elements in the sequence is preserved in the limit.

In discussing strain measures, we have already pointed out that det F and the entries of F and cof F can be put together in a unique geometric entity, the three-vector $M (F) \in Λ_{3} (R^{3} \times {\tilde{R}}^{3})$ , with components the ones in the list (1, F,cof F,det F). Hence, we can consider the energy density as a convex function of M (F), as indicated in Giaquinta et al. (1989).

I have already remarked that M (F) does not always coincide with M (Du). The identity is ensured only when the strain is compatible. Considering e as a convex function of M (F) would then correspond to extending it even to incompatible strain states, a circumstance in agreement with the previous remark on minimizing sequences. Even in this case compatibility is recovered at the end of the minimizing procedure.

There is something more, however. The map F M (F) is not convex, nor is the subset of $Λ_{3} (R^{3} \times {\tilde{R}}^{3})$ containing elements of the type M (F)—write Σ_1,+,F for it. Hence, if we want to define a convex function of M (F), we must consider the convex hull of Σ_1,+,F, namely,

$\sum_{1, +} : = {M \in Λ_{3} (R^{3} \times {\tilde{R}}^{3}) | M = (1, H, A, a), a > 0},$

with H and A the tensors defined in Section 2.5, and $a$ the scalar coinciding with det F when M = M (F). Once we have defined the energy density as a convex function over Σ_1,+, we add to it further conditions dictated by physics: the energy density increases to infinity when det F goes to zero or |M (F)| tends to infinity—infinite energy has to be paid for by shrinking to a point a volume or by stretching to infinity a string. These requirements imply an analytical property: the energy is coercive. It is crucial in determining the existence of equilibrium states (the ones reached by a requirement of minimality for the energy). Previous conditions imply also that the energy is coercive even when evaluated over the inverse map, i.e., when it is referred to the actual shape of the body.

When we accept

$e (x, F) = \tilde{e} (x, M (F)),$

the first Piola–Kirchhoff stress

$P = \frac{\partial e (x, F)}{\partial F}$

becomes

$P = \frac{\partial \tilde{e} (x, M (F))}{\partial M (F)} \frac{d M (F)}{d F} .$

si831_e

Since, by definition, M (F) is a third-rank, skew-symmetric tensor with all contravariant components (see Section 2.5), the components of the third-rank, skew-symmetric tensor

$ω : = \frac{\partial \tilde{e} (x, M (F))}{\partial M (F)}$

si832_e

are all covariant, so ω is dual to M (F) and the product ω · M (F) is well defined in terms of duality pairing (see Section 2). Formally, we write $M (F) \in Λ_{3} (R^{3} \times {\tilde{R}}^{3})$ and $ω \in Λ^{3} (R^{3} \times {\tilde{R}}^{3})$ . The map x ω (x) is then a three-form over $B$ .

6.2 The current of a map and the inner work of elastic simple bodies

In the Lagrangian representation, consider the inner power of an elastic simple body undergoing large strains, namely,

$\int_{B} P \cdot \dot{F} d x .$

By taking into account the expressions in the previous section, we can write

$\begin{array}{l} \int_{B} P \cdot \dot{F} d x & = \int_{B} ω \frac{d M (F)}{d F} \cdot \dot{F} d x = \int_{B} ω \cdot \frac{d M (F)}{d F} \dot{F} d x \\ = \int_{B} ω \cdot \frac{d M (F)}{d F} \dot{F} d x = \int_{B} ω \cdot \dot{M} (F) d x, \end{array}$

si837_e

and, in case of strain compatibility,

$\int_{B} P \cdot \dot{F} d x = \int_{B} ω \cdot \dot{M} (D u) d x .$

si838_e

Since $ω \cdot \dot{M} (D u)$ is an inner power density, the integral

$\int_{B} ω \cdot M (D u) d x$

has the meaning of inner work.

Once we fix u, we can allow ω to vary arbitrarily. The physical significance of such a choice is that of a virtual inner work obtained by testing virtual stresses over a given deformation. The remark clarifies the physical meaning of the functional G_u defined on smooth forms compactly supported over $B \times {\tilde{R}}^{3}$ by

$G_{u} (ω) : = \int_{B} ω (x, u (x)) \cdot M (D u (x)) d x$

and commonly called the current of u in geometric functional analysis (see the treatise Giaquinta et al., 1998). For any second-rank skew-symmetric tensor-valued map $x \mapsto \bar{ω} (x)$ , we define another functional, ∂ G_u, by

$\partial G_{u} (\bar{ω}) = G_{u} (d \bar{ω}),$

with d the exterior derivative. ∂ G_u is commonly called the boundary of G_u.

Summable maps u over $B$ with a summable first distributional derivative, specifically elements of $W^{1, 1} (B, {\tilde{R}}^{3})$ , such that

• det Du (x) > 0 for almost every $x \in B$ ,

• the map assigning to every x the modulus (intended in the standard sense of modulus of tensors) |M (Du (x))| is summable too,

• ∂ G_u = 0 on smooth, compactly supported 2-forms over $B \times {\tilde{R}}^{3}$ , and

• for any $\tilde{f} \in C_{c}^{\infty} (B \times {\hat{R}}^{d})$

$\int_{B} \tilde{f} (x, u (x)) det D u (x) d x \leq \int_{{\tilde{R}}^{3}} sup_{x \in B} \tilde{f} (x, r) d r,$

(1.32)

are called weak diffeomorphisms (Giaquinta et al., 1989).

Under the conditions ensuring coercivity, minimizers of the elastic energy in the finite-strain regime are found in a subclass of the space of weak diffeomorphisms with summability p > 1, as shown in Giaquinta et al. (1989).

The integral inequality (1.32) allows self-contact of the body boundary along the deformation and prevents self-penetration. The constraint $\partial G_{u} (\bar{ω}) = 0$ is stable when we superpose on u any other smooth deformation. It excludes the formation of holes and/or fractures. When such a condition is satisfied, u cannot be multivalued in any part of its domain, as occurs, for example, when a crack is nucleated and, eventually, opens and/or closes along a deformation. In other words, the graph of u is free of vertical components—verticality refers to the reference place $B$ in the six-dimensional space $R^{3} \times {\tilde{R}}^{3}$ , the first factor referred to the three-dimensional point space containing $B$ .

In this framework, if we want to model elastic–brittle behavior, we need to enlarge the functional setting at least weakening the constraint $\partial G_{u} (\bar{ω}) = 0$ .

6.3 The griffith energy

When a fracture occurs in a body, energy is dissipated but energy is also localized along the crack margins, to ensure the stability of the matter. It was Josiah Willard Gibbs who insisted on the assignment of positive surface energy to interfaces to ensure stability of condensed matter structure. For fractures, in his pioneer work, Griffith (1920) presumed that the surface energy is proportional to the area of the crack margins. Hence, for an elastic–brittle solid undergoing bulk deformations, when a fracture occurs, the Griffith's energy, $E (u, B, C)$ , is

$E (u, B, C) : = \int_{B} e (x, D u (x)) d x + \int_{C} ɸ d H^{2},$

si857_e

where $C$ is the image in the reference place of the crack occurring in the actual place, and ɸ is the constant surface energy.

Such an expression was used in Francfort and Marigo (1998) to propose a variational approach to fracture processes. In their view, at each instant $t \in [0, \bar{t}]$ of a cracking process, the pair ( $C$ , u) should realize a minimum of the global energy $E$ , with $C$ an admissible crack, i.e., a rectifiable set (the image of a countable number of Lipschitz maps) with zero volume measure.

Formally, instead of considering continuous time variation, the interval of time is discretized and minimality is required at time steps. Various analytical problems appear even so. The essential difficulty is the control in three dimensions of minimizing sequences of surfaces leading to the image $C$ in the reference place of the possible actual crack.

By taking into account that $C$ coincides with the jump set of the deformation u when the entire crack is open, a convenient simplification of the model is the identification of cracks with such a set. In accepting this point of view, bounded variation (BV) or special bounded variation (SBV) functions can be involved as candidates to be minimizers of the elastic energy. This way, and thinking always of elastic–brittle bodies, the energy that we can consider is that of an elastic simple body (I have written it previously), and minimizers are sought in a space of maps including candidates to be reasonable descriptors of the elastic–brittle behavior. The approach stresses once again that the choice of function spaces where we search for minimizers of some energy has a constitutive nature. Along this path, essential results have been proven (Dal Maso, Francfort, & Toader, 2005; Dal Maso & Toader, 2004; Francfort & Mielke, 2006).

Further difficulties emerge, however. Theorems allowing the selection of fields with discontinuity sets describing reasonable (physically significant) crack patterns do not seem to be available yet (see Bourdin, Francfort, & Marigo, 2008 for a review of the current literature). Also, the identification of the crack with the discontinuity set of the deformation does not account for partially open cracks. In the time-discretized procedure mentioned above, during a loading program described by time-dependent boundary conditions, it could happen that a crack nucleated at the ith instant might close even partially, and then reopen at subsequent time steps. Along the closed margins the deformation is continuous, but the material bonds are broken in the actual place.

Once the minimizing problem has been successfully tackled, when a crack is identified with the discontinuity set of the deformation, stronger regularity assumptions on the geometry of the crack pattern are necessary to obtain balance equations (Bourdin et al., 2008; Dal Maso et al., 2005; Dal Maso & Toader, 2004; Francfort & Mielke, 2006).

6.4 Aspects of a geometric view leading to an extension of the griffith energy

We can have another view on the description of cracks. It was proposed in Giaquinta, Mariano, Modica, and Mucci (2010) (see also Mariano, 2010) and extended in Giaquinta, Mariano, and Modica (2010). The items below contain its peculiar features:

• We distinguish between a crack pattern and the jump set of u, as in Francfort and Marigo (1998), by considering the latter set constrained to be contained in the crack pattern. This way we can describe circumstances in which parts of the crack margins are in contact but the material bonds are broken there.

• In contrast with all previous proposals, we describe the crack pattern through measures giving information about points in $B$ when a crack can occur and the directions that the fracture can have in passing through those points. Such measures are called curvature varifolds for a generalized notion of curvature can be associated with them and is an indicator, in a precise sense, of how much a crack pattern is curved at a point, or better in a neighborhood of it.

• The energy resulting in the description of cracks in terms of varifolds differs from the Griffith energy by the presence of the generalized curvature in the surface energy and the curvature along the tip in three dimensions. In this sense the model is an evolution of Griffith's scheme.

• We require then minimality of the energy in terms of pairs of deformations and curvature varifolds. The curvature dependence of the surface energy has analytical advantages and permits the control of minimizing sequences. The proof of the existence of minimizers for the extended Griffith energy in appropriate measure and function spaces is given in Giaquinta, Mariano, Modica, and Mucci (2010).

• In the existence result, the emerging crack pattern is a rectifiable set with zero volume measure. Although it can be very irregular, it has the features that our intuition assigns to a fracture.

• In contrast with previous proposals already mentioned, the balance equations can be derived in weak form from the first variation of the extended Griffith's energy, even for a crack that is a generic rectifiable set. Details clarifying these items follow below.

Table of Contents for 5 Balance equations from the second law of thermodynamics: the case of hardening plasticity

Create new playlist

Sign In

Sign Up

4.2 Cauchy's theorem for microstructural contact actions

4.3 The relative power: a definition

4.4 Kinetics

4.5 Invariance of the relative power under isometry-based changes in observers

4.6 And if we disregard M<math altimg="si671.gif"><mrow><mi mathvariant="script">M</mi></mrow></math> during changes in the observers?

4.7 Perspectives: low-dimensional defects, strain-gradient materials, covariance of the second law

5 Balance equations from the second law of thermodynamics: the case of hardening plasticity

5.1 Multiplicative decomposition of F

5.2 Factorization of changes in observers

5.3 A version of the second law of thermodynamics involving the relative power

5.4 Specific constitutive assumptions

5.5 The covariance principle in a dissipative setting

5.6 The covariance result for standard hardening plasticity

5.7 Doyle–ericksen formula in hardening plasticity

5.8 Remarks and perspectives

6 Parameterized families of reference shapes: a tool for describing crack nucleation

6.1 A remark on standard finite-strain elasticity

6.2 The current of a map and the inner work of elastic simple bodies

6.3 The griffith energy

6.4 Aspects of a geometric view leading to an extension of the griffith energy

Table of Contents for
5 Balance equations from the second law of thermodynamics: the case of hardening plasticity

4.6 And if we disregard $M$ during changes in the observers?