2 First language acquisition and early cognitive development

When starting to learn a second language, most adult learners do so, intentionally or not, with reference to their previously internalized linguistic and cultural patterns for construal, as well as social norms and conventions of the L1 community. Since these have been acquired early in life in tandem with the process of developing mental functions, feelings, and plausibility structures, they have a deeply formative influence on all activities carried out by the subject, including learning another language and its use in sociocultural contexts. In this chapter, the acquisition of these influential L1 patterns and structures will be analyzed.

Development, change, and transformation are central notions, not only for the complex constructs of culture and society, but also for the no less complex cognitive and emotional capacities of the individual; this is particularly evident in the constantly ongoing activities of learning. Throughout biological life, the mental functions of the individual never cease to develop (and this can include regressive development). The largest leap in individual development, be it physical or psychological, takes place in the first fifteen years of life, during infancy, childhood, and early adolescence. And within this time-span, the first months and years are a crucial period, during which children acquire the basic structures of knowledge through participatory, intersubjective engagement with others and Other. These basic structures are mediated by culturally-constructed symbolic sign systems, among which language is by far the most diverse and complex mediator. Language is a highly sensitive instrument for linking cognitive, semiotic, and social activities. Mediation is a hugely important process by which culturally constructed artifacts (first and foremost language), concepts, values, norms, beliefs, and frames are increasingly used to regulate intra- and intersubjective mental activities (i.e., thought and interaction). The structures and patterns of the first language, society, and culture have repercussions for the process of second language learning because of their normative influence on thought, emotion, and behavior; the L2 is approached from the linguistic, cultural, and psychological bases of the L1-mediated constructs. They are also relevant since the deeply ingrained tacit linguistic, conceptual, and cultural knowledge is usually taken to be universally valid by the person – an assumption which will be fundamentally unsettled by the experience of learning another language. Therefore, this chapter will analyze the acquisition of fundamental concepts and categories through the medium of the first language; a process that constitutes and transforms the subjective mind by immersion in linguistic, social, and cultural practices of the immediate environment of family, peers, playground, and school.

For children in the pre-linguistic stage of development, knowledge is grounded in their direct and unmediated empirical experiences of their environment. This experiential knowledge is initially non-reflective, and infants cannot intentionally make use of this knowledge. However, the acquisition of language changes this situation: “Once they begin to think conceptually, children are able to reflect upon and therefore gain conscious control over their mental activity. In this way, memory, attention, planning, learning, and rational thought become voluntary” (Lantolf and Thorne 2006: 17). For the mental functions of the infant, the acquisition of language has both liberating and constricting consequences. On the one hand, the child loses his or her natural unmediated, unconscious, direct, and responsive relationship with the material and social environments; on the other, the child gains the freedom of decontextualized, voluntary, and reflective thinking. By acquiring language, the child vastly increases his or her cognitive resources because he or she can structure and control his or her thinking, and transcend the immediate moment by thought and action. Only through the medium of language (including, of course, Braille and natural gesticulated sign languages) and, to a lesser extent, physical activities, can the individual make sense of himself or herself, Other, and others.

Language, however, is not just a neutral medium (or, as Andersen [1996] phrased it, “language is not innocent”); it is a tool providing the subject with the basic mental concepts for construing his or her environment (including concepts of self). Language shapes ideas, concepts, experiences, memories, and has a formative influence on emotions and activities. Language also facilitates thought because it has a shaping influence on the externalization of thought. When we want to express our thoughts, we have to fit them into the corset of language and its immanent conceptualizations. There is a continual and complex interplay at work between language and thinking: “The structure of speech does not simply mirror the structure of thought; that is why words cannot be put on by thought like a ready-made garment. Thought undergoes many changes as it turns into speech. It does not merely find expression in speech; it finds its reality and form” (Vygotsky 1986: 219). Language, therefore, serves not just as a medium, but also as a tool for transforming, directing, and controlling voluntary thought. This is due to the fact that congenital in language, derived from social and cultural activities in a historical dimension, are certain ways of interpreting and acting within social and physical surroundings.

For example, Choi and Bowerman (1991) analyzed the spatial differences between the acquisition of prepositions in Korean and English by the respective actions of putting an apple in a bowl, a cup on a table, a cassette in a box and a lid on a container, carried out by 20-month old children. In English, the language suggests the perceived similarity of on the table and on the container, using the locative preposition on. In Korean, however, the putting of a cup on a table and a lid on a container are not perceived as being in the same spatial relationship. Hence, the children learning Korean have acquired a different concept of spatial relationship from those encoded in English. Bowerman (1996: 169) suggests that the developing minds of children seem to have a high degree of plasticity and are susceptible to “language specific principles of semantic categorization.” This shows that meaning is not a perceptible feature of reality, nor is it inherent in words. Notions of “reality” are generated by assigning meaning to objects, actions, experiences, memories, and circumstances. Meaning, once assigned, can be differentiated or even reversed and is therefore never stable, neither for the individual nor for the cultural community. Rather, it is constantly checked for viability through processes of socialization and individuation.

Language is the central symbolic system of any given human culture and society, and thus is very closely woven into the cultural and social fabric of life; it is not a neutral system that exists independently of people and their activities. Therefore, it would not make sense to learn a second language purely as a closed linguistic system. In fact, sociocultural patterns, structures, values, attitudes, and beliefs are inscribed in the language of a given community in a historical dimension and are partly reflected in the specific usage of elements of language within a certain culture and society.21 For example, the German term Heimat implies a certain affective attachment for the subject to the geo-social environment he or she considers home; the English translations of home town, country, home, native country, or homeland can only express certain aspects of the complex semantic ensemble that is implied in the term Heimat.

This means that the individual, too, is not a self-enclosed atomistic entity possessing an inner sovereignty, living his or her separate life in isolation from others. The individual can only position himself or herself and construe notions of selfhood and worldhood in dialogic interaction and in relation with others. Thus, a comprehensive theoretical framework for analyzing people’s use of language, be it the L1 or L2, must be “based on the idea that our mastery and use of language is crucially dependent on the fact that we are beings which are embodied as well as situated within a culture of shared practices” (Zlatev 1997: 1-2; emphasis in the original). The embodied aspect of our existence lends subjectivity to our actions, interactions, and feelings, in that they are being construed, be it consciously or subconsciously, on the basis of our subjective interpretation of our memories, experiences, intentions, and desires. The situated aspect relates to the fundamental intersubjectivity of our lives, because we are what we are only through (symbolic) interaction with others.

By almost completely relying on socioculturally constructed semiotic tools, the psyche of the individual is placed on the borderline between the self and the outside world: “Because internalization is the turning inward of some process that formerly had a social function, the internalization of social speech equals the socialization of the intellect. In going through the stages of natural psychology, mediated activity and internalization, the child’s mind is transformed” (Watson 1995: 62). Sociocultural constructs, once internalized, are a crucial part of the human mind; they provide the most important framework for mental representation and reflective thought. There is considerable debate about the mind’s existence (cf. Schlinger 2005) because it cannot be directly accessed. However, the fact that every human is endowed with a personal awareness seems to provide evidence for the reality of the mind. Cognitive neuroscientists are of the opinion that the human brain facilitates the workings of the mind: “[M]ind is neither nothing more nor nothing less than a function of the material brain” (Uttal 2011: 3) which “arises out of billions of component parts” (Uttal 2011: 6). They assume that the mind’s functions can be measured in terms of neural activity because “any mental or cognitive activities and processes as well as those that control behavior are the functions, the outcomes, or the results of the activities of the nervous system” (Uttal 2011: 5). Mind, therefore, can be defined as a profoundly intrapersonal phenomenon which can be examined by neuroscientific methods of analyzing the brain’s neural activities, but also by means of introspective autobiographical reports or by observing a person’s behavior (although these are considered to be deeply flawed methodologies by neuroscientists, due to the variability, subjectivity, insufficiency and inadequacy of these methods of verbalization or observation). The mind of the person is hence located in the brain.

However, this position can be seen as reductionist because the workings of the mind also include emotions and feelings which are related to the body.22 According to neuroscientist Damasio (1994: 225), “mind derives from the entire organism as ensemble.” Without being embodied, our mind would be different in structure and function because “the body contributes more than life support and modulatory effects to the brain. It contributes a content that is part and parcel of the workings of the normal mind” (Damasio 1994: 226; emphasis in the original). The embodied mind thus cannot be reduced to rational cognition, agency, judgment and moral value but must include bodily aspects such as feelings and emotions which guide us in our selection of relevant information and subsequent action in order to ensure the survival of our organism.23 However, the self-reflective mind can only come into existence and function through interaction with others which means that it is interpersonally constituted. Symbolic means such as language are a necessary precondition for the reflective mind’s operations in terms of categorization, memory, experience and intention; at the same time, language influences the workings of the mind, as, for example, Bowerman and Choi (1974; 1996; 2003; cf. above and Section 5.3.1) demonstrated with the different nature of spatial distinctions made by children with Korean or English as the L1: “As semantic categories are formed [in the mind of the child], the speaker becomes increasingly skilled at making the rapid automatic judgments they require” (Bowerman and Choi 2003: 417) in terms of culture-specific language use and behavior. Thus, grammatical patterning with semantic correlates clearly have an effect on implicit categorization in the mind; as soon as symbols, including linguistic signs, become involved, the way infants think and behave changes. However, this link between categorization and mind transcends the tool of language because “we think in line with how we speak, [so] that the clues are not all in the language but are distributed throughout the context of language learning” (Levinson 2003: 43). Hence, the mind provides the subject with cognitive perspectives which are very personal to the subject, although the mind is not necessarily conscious of these influences. Therefore, the concepts of mind and consciousness are not identical phenomena; consciousness relates to the more passive aspect of having perceptions, thoughts, and feelings, whereas mind refers to the more active and introspective aspect of applying (consciously or not) perceptions, thoughts, and feeling to subjective activities. For example, when we are ‘out of our mind’ the environment no longer has a personalized meaning, but we are still the passive recipients of incoming sensory information.24 The subject integrates the cognitive perspectives into a notion of subjectivity which is embodied and thus also includes the domains of affect, emotion, and bodily feelings. Subjectivity is not given from birth but it is mediated by the subject (consciously or subconsciously) through symbolic forms; it is constantly constructed by the subject and positioned by social forces such as the discourse, but the subject also positions himself or herself in the discourse.

Even before language and its use fundamentally transform the workings of the mind, the infant must have acquired basic skills of perception, categorization, relational understanding, problem solving, memory, and an understanding of other individuals as intentional agents like the self. The existence of these pre-linguistic forms of consciousness and awareness can be deduced from symbolic means which infants use to manipulate their environment, such as pointing gestures and imitation. Infants follow the gaze or pointing of adults, as well as urging adults to gaze towards a certain object or making grasping movement of the hand while looking at an object (hence clearly indicating that they want the adult to bring the object to them; cf. Section 2.1). This means that nonlinguistic joint attentional behavior emerges in individuals as a group, just as subsequent language acquisition does (and there seems to be a link between the two; cf. Tomasello 2003: 49). However, grammar and terminology, including the inherent rules, codes, and conventions of language use, precede the individual’s existence, and the subject has to adapt these given structures and meanings to his or her particular needs in any given situation in order to think, or make herself or himself understood. At the same time, the mind of the infant has to adapt according to the requirements of the language in terms of its grammar, conceptualizations, and lexical system. Language acquisition also implies the acquisition of the culturally-specific, perspective nature of linguistic symbols, including plausibility structures. So the subject’s mind is, to a large extent, structured by external sociocultural categories. In this sense, mediality is a crucial precondition for mentality.

Mediality also plays an important role in emotion, feeling, and, to a lesser degree, in affect which are represented as a very subjective reality to the individual. Whereas the term emotion relates to the physiological and behavioral aspects, affect tends to relate to the purely psychological state that carries positive or negative valence. Emotion refers to a (passing) state of body that has mental, bodily, or behavioral indicators, of which the subject can be aware and which can be verbalized. Since emotions can be consciously experienced or even managed (e.g., by feigning an emotional state), development of cognitive processes and physiological changes have an impact on the range of construal and (self-) representation of emotions. Emotions are experienced and expressed in the body and in the mind, and both the mind and body enable and constrain that experience (cf. Theodosius 2012: 64). Emotions are very private experiences which are as elusive as the mind in terms of valid assessment and measurability; they “can be only weakly inferred from introspection or behavior but cannot be directly measured” (Uttal 2011: 145). However, like cognition, emotions can also be co-constructed by persons, not only in terms of empathy and antipathy, but also with regard to shared emotional reactions to utterances, memories, experiences, objects, etc. The representation of emotions to the self through inner speech (cf. Section 2.2.3) socializes the experience of private emotions to some extent because the sociocultural semiotic of language gets involved. Hence, one strand of emotion research focuses on the organismic approach (e.g., research in psychology and neuroscience), whereas another approach focuses on intersubjective and interactional elements (e.g., research in sociology, anthropology, applied linguistics). In this book, the latter approach is adopted because emotions are crucial to the way we relate to other people, shaping psychological and bodily boundaries between the “I,” the “we,” and the “other,” but also producing the effect of collectivities (cf. Ahmed 2012). Affect, by contrast, “is understood as a broader concept than emotion, covering phenomena ranging from larger emotional episodes to sudden pleasure and pains, to momentary likes and dislikes” (Ruusuvuori 2013: 331). Unlike emotions, affects do not have a strong social dimension, although affects can spread between persons in terms of igniting rage, inciting shame, evoking tenderness, or exciting fear. They constitute a moment of unformed and unstructured potential; affects are pre-personal and pre-conscious, and therefore they cannot be fully verbalized.

Feelings represent a subcategory of emotions. They are sensations that are checked against the subject’s previous experiences; therefore, they tend to be personal and biographical, and the person is consciously aware of them. Ludwig Wittgenstein (1953) vividly illustrates the transformative influence of language on the feeling of bodily sensations with the example of pain. Sensations such as pain can only be expressed, even to the inner self, by using linguistic categories for the description of the sensation of pain. Pain is a direct, linguistically and conceptually unmediated sensation. It can lead to spontaneous expressions of pain (“Ow!”) which provide the subject with basic evidence for the ascription of pain. But the translation of the directly-felt sensation into the external symbolic system of language transforms the sensation of pain itself. The subject’s grasp of the concept of pain is dependent on the ability to make judgments about oneself, and about others to whom the pain is verbalized. This basic ability of translating directly-felt sensations and feelings into the semiotic system of language is already explicitly taught to children: “How do words refer to sensation? (...) A child has hurt himself and he cries; and then adults talk to him and teach him exclamations and, later, sentences [to express the feeling of pain: its degree, location, sensation]. They teach the child new pain-behaviour” (Wittgenstein 1953: §244; emphasis in the original).

The verbalization of pain socializes, and hence transforms, the intimately felt sensation of pain. “So you are saying the word ‘pain’ really means ‘crying’? – On the contrary: the verbal expression of pain replaces crying and does not describe it” (Wittgenstein 1953: § 244; emphasis added). Therefore, the sensation of pain is not completely private because one only knows what pain is in relation to the experiences of others. This example illuminates the transformative impact of the sociocultural instrument of language on the cognitive and psychological structures of the subject, including on his or her most intimate feelings. Once acquired as the central semiotic system, language structures and influences the mind, feelings, emotions, understandings, and constructions of the subject to a large degree (cf. Section 3.5).

Gergen (1999) further underlines this mechanism with an experiment that expands on Wittgenstein’s notion of translating the sensation of pain into linguistic expression of same. Gergen starts his argument by criticizing the essentializing observation of Elaine Scarry (1985: 4) that, “Physical pain does not simply resist language but actively destroys it, bringing about an immediate reversion to a state anterior to language, to the sounds and cries a human being makes before language is learned.” Scarry insinuates that pain is a condition which humans experience universally in the same manner, regardless of history and culture. However, in a diachronic perspective, one can observe that prior to the development of modern anesthetics, people were able to endure painful surgical procedures, and pain was, in medieval Christendom, often welcomed as an expression of religious devotion. In a synchronic perspective, there is, according to Gergen (1999: 104) evidence of “significant variations in the constitution of pain across different ethnic groups; Italian and Jewish patients, for example, often evidence far more pain than the more stoic New England Yankee culture.” This would suggest that pain is not just naturally experienced and linguistically represented, but it is also culturally constructed. Thus, the directly-felt sensation of pain requires interpretation which inevitably involves language and culture.

While psychologists such as Jean Piaget view language as a semiotic system that can be used to express knowledge acquired through interaction with the physical world, Vygotsky sees language as socially constituted and subjectively internalized speech. Vygotsky could show in his experiments that speech grows into children from their social environment: “The primary function of speech, in both children and adults, is communication, social contact. The earliest speech of the child is therefore essentially social” (Vygotsky 1986: 34-35). Hence, the infant is not externalizing internal thought, as suggested by Piaget (1970b: 59), but rather internalizing external concepts and patterns that he or she has experienced. This, however, is not done by simply duplicating the externally experienced constructs; rather, this process is transformative because the child adapts the external configurations to his or her particular needs and circumstances. “Thought is not merely expressed in words; it comes into existence through them” (Vygotsky 1986: 218). Only after accomplishing the internalization process of particular linguistic and conceptual items can the subject use these tools in a particular subjective way to externalize reflective thought. However, one cannot assume that the internal and external processes are distinct and unconnected; both are interrelated in a dialectical manner. “The bi-directional process of internalization and externalization, mediated through semiotic artifacts, both idealizes the objective and objectifies the ideal” (Lantolf and Thorne 2006: 155). Thus, according to a Vygotskian view, language and consciousness are not part of the inner self but are lodged within patterns of social activity. Consequently, it is the social activity system and underlying cultural patterns, rather than the individual, that should be the primary focus of linguistic and psychological studies, including L2 research and classroom practice. In this chapter, the sociocultural acquisition process of language and concepts will be analyzed with emphasis on the massive impact on the cognitive, behavioral, and emotional development of the subject.

2.1 First language acquisition

The processes of basic transformation of bodily-produced feelings into linguistically structured, and hence intra- and interpersonally comprehensible utterances, dominate the early socialization of the infant in the pre-linguistic stages. This includes the incidental acquisition of linguistically mediated concepts, categories, and schemata which exert an increasing influence on mental development. In these processes that contribute to the formation of the mind, the child is not just a passive recipient of language, but an active participant. Even before the infant has acquired any language, he or she has developed bodily sensations and emotions which will influence his or her behavior. Throughout the life-span of the individual psyche, mind and body influence one another so that the Cartesian body-mind dualism breaks down. Socialization is only made possible by some form of bodily, affective, and preconscious differentiation from others, for instance, parents and other significant individuals, such as older siblings, playmates, or relatives. “The very first of all cultural objects, and the one by which all the rest exist, is the body of the other person as the vehicle of a form of behavior” (Merleau-Ponty 1945: 348, cited in Gallagher 2008: 287). Zlatev (2008: 301; emphasis in the original) claims that through physical interaction with others, “the infant develops mimetic schemas, which constitute body-based, pre-linguistic, consciously accessible representations that serve as the child’s first concepts.”

This is an important claim for cognitive linguists who increasingly work with the concept of embodiment. Zlatev (2008: 301) goes on to suggest that “mimetic schemas possess a basic intersubjectivity which can serve as the foundation for developing a conventional normative semiotic system, i.e. language.” Hence, the use of language cannot be separated from the mind and subjectivity of the user who fills language with his or her subjective feelings, intentions, desires, memories, and fantasies. Kramsch (2009a: 18) comments: “One could say that becoming a subject means becoming aware of the gap between the words that people utter and the many meanings these words could have.” In intersubjective interaction, the gap has to be mentally filled by the subject on the spot which positions his or her self in a certain manner (cf. Section 4.5).

Both mimetic schemata and the child’s increasing ability to differentiate between self and other trigger a developmental cascade in several dimensions. The infant acquires motor skills (mastering of bodily movements, such as lifting the head, sitting up, conscious grasping, and, at a later stage, standing and walking), cognitive skills (formation of categories and concepts such as father, mother, people, teddy bear, etc., and using these to regulate thought and activities), linguistic skills (understanding communication directed at him or her, and producing his or her first own utterances), emotional competences (expressing and regulating his or her own feelings), and social competences (ability to make contact with others, to interact and cooperate with others) (cf. Winkel, Petermann, and Petermann 2006: 237). Most of this acquisition process is based on incidental and implicit perceptual learning, repeating, and copying the actions of others. One aspect of the acquisition of motor skills consists of the infant increasingly insisting on initiating his or her own activities, for example, when he or she begins to master the skill of walking. At this stage, at about nine months of age, the infant clearly has some awareness of the social dimension of his or her existence. He or she tries to communicate not only in non-linguistic ways, but also expresses interpersonal functions in simple utterances, such as pleasure or displeasure, greeting people, demanding attention, or getting people to do things. Usually, each utterance stands for a specific function; however, the infant’s protolanguage is characterized by the absence of grammar. The infant also begins to develop knowledge about the functional properties of objects which allows prediction about how the object will respond and behave in the future, as Madole and Oakes (2005: 279-280) explain: “If an infant notices that her rubber ducky has squeaked when it was squeezed the last five times she took a bath, then she can predict that her rubber ducky should squeak when squeezed in the future. However, before such predictions can be made, infants must be able to recognize the correlation between an object’s appearance and its function.” This example illustrates that children begin to develop knowledge of categories and plausibility structures through direct personal experiences. Children at this stage are also engaged in building complex blends between the concepts of which they are trying to make sense (cf. Section 4.7).

These pre-linguistic developments are followed by a great stream of word learning, of mastering both expressive speech and denotative labeling (cf. Messer 1994). The first stage of this complex developmental process of language acquisition has already started long before the baby utters his or her first word. It covers the period from birth, when the baby produces his or her first sound, the birth cry, to approximately twelve months. By the age of four days, babies can already differentiate their native language from another language on the basis of language-specific rhythm and prosody, even if they are not familiar with the speaker’s voice, or his/her accent (cf. Guasti 2002: 24). However, between six and twelve months of age, this ability to discriminate foreign sounds declines as infants have to focus on the sounds of the language they hear most. It has been shown that “infants listen longer to infant-directed speech sequences that reflect natural prosodic units” (Vihman 2009: 16) of the main language the child is exposed to.

The first weeks and months are characterized by implicit perceptual acquisition of the sounds of the L1. In the first eight weeks of life, infants do not produce a wide range of sounds; the principal sounds are cries. However, crying is used by infants as a means of communication, and parents usually develop astonishingly exact skills at determining different types of cry, for example, for hunger, pain, seeking attention, or other. This initial phase is followed at about the age of three months by cooing, i.e., the production of vowel sounds such as oooo which vary in tone and volume and which are normally associated with pleasure (cf. Catell 2007: 3-4). In these first two stages of vocalization, the infant produces random sounds which are generally believed to reflect sounds of any human language, not just the specific language of the child’s speech community.

At about six months of age, the cooing stage develops into a phase characterized by babbling, consisting of syllable sequences such as bababa. It has been suggested that the immaturity of the infant’s speech apparatus, which is not yet capable of producing speech-like sounds, may be responsible for the delay in speech production. However, deaf infants engage in “manual babbling” at the same stage of development. This observation may suggest that it is “the maturation of the neural substrate supporting language that is responsible for babbling” (Guasti 2002: 47). This leads Guasti to conclude that, “Humans are born with special sensitivity not to sounds, per se, but to the particular units, structures, and regularities found in natural languages, regardless of the modality of expression” (Guasti 2002: 47).

Babbling accounts for a large part of the infant’s early vocalization up to the stage when infants produce their first words, at approximately the age of twelve months. The babbling stage can be differentiated into the first stage of repeated sequences of babbling (known as canonical babbling) and the later stage of producing more varied sequences of sound (known as variegated babbling) (cf. Foster-Cohen 1999: 22). Babbling seems to be a universal feature of infants at this stage of development because: “The onset of babbling and the types of sounds produced seem to be universal and are similar regardless of culture or of whether the child can hear or not. This suggests that babbling is the result of maturational processes rather than learning” (Lund 2003: 41).

In the activity of babbling, infants subconsciously test their articulatory capacities; they also discover and practice the sounds and sound combinations of the language they are most exposed to, eventually leading to the production of phonemes and words. Children need adequate scaffolding by parents or other adults in terms of developing their utterances (if they can be called that) in the direction of appropriate sounds, morphemes, or even protowords of their first language. There is, however, much difference in the provision and quality of scaffolding by adults, largely influenced by their sociocultural background, as, for example, Heath (1983; 2009) has analyzed. Whereas American middle class parents spend a lot of time providing structured phonetic scaffolding for their young children, African-American working class parents seem to have a different approach: As children make cooing or babbling sounds, adults refer to them as “noise,” and no attempt is made to interpret these sounds as words or communicative attempts on the part of the baby. Adults believe they should not have to depend on their babies to tell them what they need or when they are uncomfortable; adults know, children only “come to know.” (Heath 2009: 354). This is a very crude and generalizing statement with regard to rather monolithically construed social (and racial) differences in the background of parents of young children. There certainly have to be exceptions in both directions in socio-pedagogical studies like Heath’s. However, if such differences do exist as a consequence of these different attitudes and behaviors with regard to the first cooing and babbling sounds of children, the onset of word formation and speech can vary. In fact, there is evidence that infants understand many words before they are actually able to produce any. Before the end of their first year of life, most babies understand a few frequently repeated words; for example, they wave their hands when someone says “bye-bye” (cf. Bee 2000: 231). Thus, infants’ lingualization, i.e., the process of acquiring the linguistic system in the wider sense, including the socially appropriate use of language, can be said to begin with the babbling stage when the infant produces sounds that begin to approximate the sounds of his or her native language.

When infants are about nine months old, they extend their range of communicative tools from producing sounds to using gestures. An infant who points at something and makes grasping movements of the hand is clearly “asking” for the object in order to elicit a certain response from others. Adults around the child will interpret the grasping gesture as an act of pointing, indicating the child’s desire to have that object. The child then internalizes the significance accorded to the gesture by adults and now performs the grasping gesture for them more consciously and intentionally. In doing so, the child demonstrates that he or she can consciously send out signals to others, and obviously anticipates a certain reaction. Although pointing does not constitute a sign system, it is a sign form that the child begins to master. It clearly shows that the child has begun to develop an “internal plane” of consciousness (cf. Wertsch 1985: 65). It also indicates that there is no direct reciprocal connection between thought and speech because there is “speech” without thought (infants in the crying, cooing, and babbling stages), and there is also thought without speech. But those two areas of thought and speech overlap in verbal thought, and this is coincident with language. Since we can share only what we articulate and communicate, it is this linguistic dimension alone that has historical validity.

The one-word stage for infants typically starts at about ten to twelve months of age. When learning to produce words, children have to perform four separate tasks, namely segmenting the speech stream into units that represent recognizable words, learning the sound-image of words, learning the syntactic properties of words, and associating meaning with forms of words. The early words produced by infants are often not like the adult versions of the word but are approximations of them. The rate of learning words is initially very slow (3-4 months for ten words) but at some point between 18 and 21 months a “vocabulary explosion” (Winkel, Petermann, and Petermann 2006: 237) or “language explosion” (Siegler 1998: 152) occurs. Normally, a two-year-old child adds an average of ten words per day to the 500-600 words of vocabulary he or she has already acquired. An average six-year-old has acquired a vocabulary of approximately 14,000 words to which he or she adds another 3,000 words annually from school-going age (cf. Kuczaj II 1999: 133). This amounts to knowledge of a huge range of vocabulary when compared to the much slower rate of learning vocabulary in the L2. At this rate of L1 learning, children frequently make a quick guess about a word’s denotation on the basis of experience and of knowledge of known words. This phenomenon has come to be called fast mapping. If, for instance, “a child is first exposed to the word beige in the context of the instruction ‘bring me the beige one, not the blue one’, the child might conclude that beige is a colour term and that it is a colour other than blue (assuming the child knows the denotation of the word blue)” (Kuczaj II 1999: 141; emphasis in the original).

The child therefore increasingly gains additional experience with a word and its conventional uses by comparing new words to the meaning of familiar words, thus constantly refining meanings. The child’s increasing awareness that words can denote things, people, and actions fosters more proactive word-learning with a greater degree of intent and effort. In this process, the knowledge of morphosyntax, lexicon, and the stock of experience also expands and refines rapidly, leading to higher cognitive abilities in the child, in that the representational nature of thought is influenced by the manner in which language organizes the construction of self, Other, and world.

The exact mechanism of how children acquire a lexicon is still being researched. When segmenting the flow of speech into words, the child first has to identify useful chunks of language, such as whole utterances, beginnings and ends of utterances, rhythmic stretches, or certain intonational packages (cf. Peters 2009: 44-45), and then associate meaning with the corresponding lexical items. One explanation for how children acquire the meaning of words and phrases focuses on the theory that infants learn words by forming hypotheses about their meanings and then testing them. When children notice a co-occurrence between a word and the object it refers to, or between a word and an object being pointed at, they will form hypotheses about what could be meant by that word. Subsequently, they will test them in new contexts and either verify or falsify their hypotheses (cf. Bruner 1975; Bruner, Olver, and Greenfield 1966). This approach, however, cannot explain the child’s construal of meaning for abstract or mass nouns referring to non-perceivable constructs. It also does not explain a situation where children can form multiple hypotheses about the meaning of a word they hear in a given context. For example, the word can refer to an object as a whole (e.g., a table), to its parts (e.g., its legs), or the material that it is made out of (e.g., wood, PVC, metal, etc.).

It seems to be a cross-culturally observable feature that children’s early productive vocabulary consists mainly of nouns. This can be explained by the “word-to-world” procedure whereby the word is mapped onto the object it refers to (cf. Guasti 2002: 81). Abstract lexical items such as verbs, adjectives, conjunctions, etc. are actively used only at a later stage since the concrete object-relationship is missing. Thus, children must rely on other evidence, and that is provided by the syntax of a sentence. As soon as children begin to combine words, their lexicon starts to grow massively (cf. Gilette et al. 1999: 139). It seems likely that the emergence of syntax accelerates the acquisition of the lexicon. Children start to use verbs meaningfully, based on the syntactic context and the meaning of nouns they already know. This suggests that children bootstrap the meaning of verbs onto syntax, that is, they can infer the category (and hence meaning) of a lexical item from its position in a sentence, provided they understand some of the nouns used in that sentence. For example, in the sentence Frank hits Oliver, it is obvious from the placement and syntactic marking of Frank and Oliver who the guilty party is.

The term “syntactic bootstrapping” was coined by Lila Gleitman (1990). In direct opposition to Bowerman (1974) and Pinker (1984), who suggest that children develop an understanding of syntax on the basis of semantic information, Gleitman (1990) suggests that children attend to the structural regularities of the language they are exposed to from a young age. Thus, they develop sensitivity to syntactic and semantic correspondences that are inherent in the linguistic structure. The key for the process of bootstrapping seems to be the initial word-to-world procedure. With this procedure, children will have acquired basic scaffolding around nouns on which they then can construct partial sentential representation by inferring the meaning of other words used in a sentence, such as verbs or adjectives, from the syntactic cues. This basis can be expanded by associating words and phrases with other words and phrases or morphemes that occur with them. For instance, in learning a language where nouns have different genders (for example, German, French, or Russian), the child must not only learn the correct articles (which infer gender, number, and case) but also the corresponding pronouns and adjectival endings for each of the different cases, both in singular and plural. This is possible because children have the general ability to develop associations between articles, words, and pronouns that frequently occur together, whereas second language learners, especially in adulthood, have to explicitly learn these complex and potentially confusing constellations. The differences in syntax between languages also implies that the mechanism of syntactic bootstrapping is always specific to a language.

In the process of acquiring a fair amount of lexical items, children start to expand their single word stage and start producing two-word phrases at about the age of 18-24 months. These two-word utterances are called “telegraphic speech” (Haslett 1989: 23) and usually consist of content words (verbs, nouns); function words (articles, conjunctions) are generally not yet produced. This two-word or multiple-word stage marks the starting point for the development of syntax. Once children start to combine words, they learn grammar very quickly and “pass through a number of phases that are characterised by increasing complexity of grammar and sentence length” (Lund 2003: 43). At about 30 months, children start to use more complex grammar and enter a phase of “grammar explosion” (Bee 2000: 237-239). There is evidence that children in this phase do not simply imitate adult speech but actually learn, by trial and error of forming assumptions, the rules of grammar so that they are able to independently produce increasingly complex sentences in a grammatically and syntactically correct manner, including the use of conjunctions (cf. Pinker 2003). In addition to acquiring the structure of the language in terms of grammar and syntax, children also “acquire entire systems of mappings, blends, and framing, along with their concomitant language manifestations” (Fauconnier 1997: 189; cf. Section 4.7). The success of these processes is even more astonishing when we consider that language directed to small children by adults is frequently deficient (e.g., baby talk, motherese, parentese).

However, during the process of acquiring the standard language, the child’s protolanguage is, at any given time, a legitimate system in its own right (cf. Catell 2007). The child’s linguistic development is not a linear process of producing fewer and fewer incorrect structures. Rather, the child’s language is systematic in that the child constantly forms hypotheses on the basis of the input he or she receives. These hypotheses are then actively tested in utterances; based on the feedback received, these hypotheses are continuously revised, reshaped, or abandoned.

In the famous “wug” experiment of 1958, conducted at a time before generative linguistics became dominant, Jean Berko Gleason (2004) showed that children learn language not as a series of isolated, discrete items but as an integrated system. In this experiment, children aged six or seven years were shown drawings of imaginary creatures with novel names or people performing mysterious activities. For example, they were told “This is a wug. Now there is another one. There are two of them. There are two ” (Berko Gleason 2004: 256), or “This is a dog with quirks on him. He is all covered with quirks. What kind of dog is he? He is a dog” (Berko Gleason 2004: 159), or “This is a man who knows how to bod. He is bodding. Yesterday he did the same thing. What did he do yesterday? Yesterday, he ” (Berko Gleason 2004: 159). The correct answers are “wugs,” “quirky,” and “bodded,” and by completing the sentences correctly, the children demonstrated that they knew the rules for the formation of plural, derived adjectives, and the simple past tense in English. In her experiments, Berko Gleason discovered that English-speaking children as young as four years of age correctly applied rules to the formation of plurals, present progressive, third person singular, and possessives. By being able to apply these patterns to expressions they had never before encountered, the children demonstrated that their language was not based solely on a list of memorized words, but that they were able to creatively apply rules to new contexts.

In the process of language acquisition, the child grows, so to speak, out of the relatively general (or universal) realm of protolanguage and into the relatively specific structure of his or her native tongue of which he or she becomes an increasingly competent user because of the ability to successfully project certain parameters. For example, the English language has a very strict positioning of subject, verb, and object in sentences. This is so to avoid ambiguities in the roles of agent (doer) and patient (sufferer) of actions expressed. In other languages this is not the case. For example, German has a more flexible word order because the cases of nouns and pronouns, including corresponding inflection of verbs, are more marked than in English. Therefore, the agent is always in the nominative case whereas the patient is expressed in the accusative (or dative/genitive) case. Thus, a rule of projection would be that the morphological endings of words can be neglected for English, whereas a child acquiring German would stress exactly the opposite features (cf. Edmondson 1999: 26—27). For example, the English clause “The student donates his books to the library” could be translated into German as Der Student schenkt seine Bücher der Bibliothek. However, in German one can put semantic emphasis by positioning an item at the beginning; if one wants to emphasize the receiver of the books it is possible to say Der Bibliothek schenkt der Student seine Bücher (‘*To the library the student donates his books’), or if one wants to emphasize the objects in question, one could say Seine Bücher schenkt der Student der Bibliothek (‘*His books the student donates to the library’). The flexibility in word order is facilitated here by the case marking in German, so that, unlike in English, the grammatical subject does not have to be positioned in the first place in the sentence but in fact any expression can occupy the first position, including prepositional phrases or subordinate clause, as long as the conjugated form of the verb remains in second position in the German main clause.

Growing into any language implies that the infant and child are not just regurgitating what he or she hears; children are highly creative in what they construct linguistically, as Pinker explains:

[C]hildren’s language errors such as braked and holded, which could not have been parroted from their parents’ speech, have served as a vivid reminder that the mind of the child is not a sponge, but actively assembles words and concepts into new combinations guided by rules and regularities. (...) The ingredients of language are words and rules. Words in the sense of memorized links between sound and meaning; rules in the sense of operations that assemble the words into combinations whose meaning can be computed from the meanings of the words and the way they are arranged. (Pinker 2003: xi, 300; emphasis in the original)

These rules are productive in the sense that they specify a string of kinds of words and not just a string of actual words which allows for the spontaneous production of new sentences and texts. The words contained by the rules are, of course, symbolic and hence abstract, which allows for talking about all kinds of things; events, feelings, thoughts, etc. Furthermore, according to Pinker (2003: 7), the rules are combinatorial, meaning that every position in the sentence offers a choice of possible words, thus facilitating an immense number of syntactically correct and semantically meaningful utterances. Pinker (2003: 7) calculates that these rules allow for 6.4 trillion possible five-word sentences in the English language.

By the age of five years, most children can give commands, ask questions, report events, and create stories about imaginary events, using correct syntax and grammatical markers. It is generally accepted that by this age children have mastered the basic structures of the language they have acquired (cf. Schumann 1997: 34). While continuing to learn vocabulary at the rate of approximately ten items a day, their acquisition of linguistic structures slows down. However, this is mainly because they have already internalized most of the basic structures of the L1 so that their learning now focuses on more complex linguistic structures, such as the passive voice, the subjunctive, and subordinate clauses which take more time to acquire than the simpler foundational morpho-syntactical and grammatical structures. This increasing complexity of grammar acquisition is complemented by the child’s ability to use language in a widening social environment. Using language in a greater variety of situations implies that children begin to develop sociolinguistic competence, i.e., they are learning how to use language appropriately in different contexts. Children at the age of five are typically able to defend their place in the playground in an aggressive voice, while being polite when talking to their grandparents.

In the developmental phase of the first four or five years of life, the child experiences his or her self to be at the center of gravity for all (inter-)action and understanding. This can be seen by the child’s ability to deal with increasingly complex conceptual-grammatical relations and deixis. The main types of text the child is exposed to and engages in are dialogues and narratives. Narratives are complex forms of text and are used by children even younger than three years of age as a means of problem-framing, which the child expresses by asking questions such as “How come?”. Narratives, such as fairy tales, serve as an instrument for invoking morals and ethics, but also as a tool for developing the skills of abstract reflection.25

Within the first five years of development, children learn an enormous number of words, and master the complex skills of using language grammatically, phonetically, and syntactically to a high degree of competence so that they can understand and produce a vast array of novel sentences. Parallel to this process of learning words and working out the basic grammatical and syntactical rules, young children also acquire knowledge about the socio-pragmatic appropriateness of using linguistic and communicative patterns (cf. Winkel, Petermann, and Petermann 2006: 237) so that they are by now fully interactive members of their speech community who are able to explain their opinion, talk back to others, verbalize experiences, memories and dreams, reason, and critique the actions and opinions of others. The term speech community, as introduced by Hymes (1974), foregrounds the social entity of languages users (Hymes 1974: 47), thus departing from the traditional approach of linguistics. In Hymes’ view, language has the function to create meanings in a particular context of use. This requires the subject to have acquired pragmatic skills in order to be able to recognize contextual cues which can sometimes overlay the meaning of what is actually said. Children at the age of five years have acquired these skills to a good degree, although they still have to refine them further in terms of understanding satire and irony.

2.2 The interplay of early linguistic and cognitive development

As we have seen, language gradually develops in tandem with the subjective mental capacities during early socialization into the primary means of communication and social activities by means of interaction with more knowledgeable others. Hence, language acquisition (or lingualization) is an integral part of the process of socialization. Therefore, the acquisition of language is neither an innate mechanism nor a purely cognitive process but a social skill, driven by the subjective desire to communicate with others. It is incidentally acquired by interacting with others. Consequently, the analytical focus of the process of language acquisition should be neither on the syntax of language (as Chomsky would suggest), nor on the cognitive stages the child develops on the basis of biological mechanisms (as Piaget would suggest), but on social pragmatics.

Messer (2000: 140) observes that “children make little or no progress with the language they hear” when the main source of language is a technological apparatus such as television which does not allow for social interaction nor facilitate the active testing of hypotheses about language because of the absence of constructive feedback. This observation was verified by Jacqueline Sachs and her colleagues (1981), who studied the linguistic development of a boy they called Jim. Jim was the hearing child of deaf parents who did not interact with him in sign language, nor obviously in verbal communication. Hence, Jim was not exposed to any interpersonal interaction, although he frequently watched television. An assessment of his linguistic abilities at three years and nine months of age showed that he was well below age standards in all aspects of language. Although he tried to express ideas typical for his age, he did not have the appropriate lexical, grammatical, and syntactical means to do so. Once he began a sequence of sustained interactive learning sessions with an adult, his linguistic abilities began to improve, and after five months his deficient language structures had been replaced by structures more typical of his age. The fact that Jim rapidly acquired the rules of English grammar once he began interacting with an adult demonstrates the relevance of dialogical human interaction and sociocultural embeddedness for subjective language acquisition and development. This is highlighted by the fact that Jim failed to acquire language normally when he was exposed to impersonal media such as television.

Technical media do not allow for meaningful interaction, as they are one-dimensional and unresponsive, and are thus not suitable for language learning – good news for the profession of L2 teachers. In addition, the creative testing, verifying, and falsifying of hypotheses in actual interaction with its implicit and explicit feedback is lacking, denying the child active engagement with linguistic features. When children are exposed to explicit or implicit language teaching through technical media, no discernible effect on vocabulary or grammatical development can be observed (cf. Saxton 2009: 69-71).

In contrast to behaviorist, nativist, and cognitivist approaches to explaining the rapid and smooth process of language acquisition, the sociocultural and interactionist approaches emphasize contextual social factors, rather than internal biological mechanisms or simple stimulus-response patterns. If language acquisition can be explained by the gradual internalization of external factors, then these external factors must also assume a central role for theories of second language acquisition in many respects. This recognition gave rise to the communicative method in the 1980s. Language learning cannot be separated from the general developmental process of learning, which in turn is heavily influenced by social and cultural factors. This observation makes, of course, the whole enterprise of analyzing language acquisition much more complex, as relevant and broad extralinguistic concepts such as psychological, cultural, social, pragmatic, interactive, biological, cognitive, and affective factors have to be considered with regard to their influence on the dynamic and multi-layered language acquisition process.

2.2.1 Vygotskian accounts of early sociocultural development

One of the very early and most influential proponents of the sociocultural approach is the Russian psychologist Lev Vygotsky (1896-1934) who analyzed in detail the complex interplay of the processes of speaking and thinking. He studied the use of tools, both as a model for language usage and as a psychological process in itself. Vygotsky distinguishes between lower (or natural), biologically endowed mental functions, such as elementary perception, memory, attention, and volition, and the higher (or cultural), mental functions, which are specific to the human race. The higher mental functions are a product of mediated activity in which psychological tools, such as gestures, language, and other sign systems, and interpersonal communication are the prime mediators. Tools in this sense are significant for Vygotsky because he found that, when human subjects begin to use them, language and thought start to coalesce. Referring to the work of Slobin (1996), Lantolf (2005: 347) explains Vygotsky’s notion of language as a tool for processes of thought by drawing parallels between the actions that physical and symbolic tools exert on regulating our actual and mental activities. For example, a shovel is a tool that was invented to allow people to dig better and faster than using their bare hands; it compels them to make particular movements that are specific to this tool – and markedly different from movements imposed on them by other tools, e.g., hammering or sawing. Transferred to the semiotic level, the tool of language also imposes certain patterns on mental activity since inherent in each human language is a specific “subject orientation to the world of human experience, and this orientation affects the ways in which we think while we are speaking” (Slobin 1996: 91, cited in Lantolf 2005: 347; emphasis in the original). It is important to note that the activities triggered by the use of tools are being referred to here. The tool alone cannot infer its function; only through participation or observing the tool in action do we understand the function of the tool, be it a hammer, a saw, a shovel – or language.

Just as physical tools compel us to move in certain ways, categories inherent in language also direct our mental activities in certain ways, and not in others, as Vygotsky (1981) points out: “By being included in the process of behavior, the psychological tool [of language] alters the entire flow and structure of mental functions. It does this by determining the structure of a new instrumental act just as a technical tool alters the process of a natural adaptation by determining the form of labour operations” (Vygotsky 1981: 137). The psychological tool of language has a mediational function, for example, organizing rational thought and gaining voluntary control over the biologically endowed mental functions. However, the linguistically mediated higher functions of the mental apparatus, while biological in origin, gain a socio-historical dimension by using language as a tool. This is a result of language having been developed socio-historically and ontogenetically from participation in socioculturally-organized activities (such as daily rituals of having breakfast, lunch, and dinner, or birthday parties) and from experiences with culturally constructed artifacts (such as ordinary objects like cars, water taps, or ovens). Although language also provides us with metalinguistic categories, allowing us to critically reflect upon linguistic relativity, and thus modify the dependence on linguistically mediated mental activities, this only takes place in exceptional circumstances of reflective practice. Just like the material tool of a hammer or a saw, language cannot be understood simply by analyzing its structure. Rather, it is its use that uncovers its function. Lantolf and Thorne (2006: 69) point out: “For the individual, mediational means [of language] are at first mere objects. However, through repeated use and – particularly in the case of children – under the regulatory agency of others in social practice, the objects are organized into conceptual categories, or types.” These conceptual categories, then, contribute to structuring voluntary thought and agency.

Although the comparison of language as a metaphorical tool with concrete tools is useful to highlight the impact language has on our thinking and behaving, there are, however, some important differences between the impact of concrete tools on our actions, and of language as a tool with structuring effects on our thinking. Language as a social and semiotic tool is much more powerful than any physical tool, such as a hammer or a saw, in that it is capable of being reflexive. It constructs “reality,” structures and organizes communicable human experience, and defies time. Unlike material tools, language as a social product also requires the participation of conscious others and it is characterized by an element of ambiguity, as there cannot be a simple one-to-one transmission of thoughts, ideas, or experiences from one person to another; the other person will always add or take away some of the meaning that I intended to convey to him or her.

Vygotsky suggests a unity of learning and development. He suggests that social interaction is foundational in cognitive development and hence rejects the notion of biologically predetermined stages of mental development, as analyzed by Piaget (cf. Inhelder and Piaget 1958; Inhelder and Piaget 1964). Vygotsky places great emphasis on the early acquisition of language by children, as language is the tool of the most effective transformation of the subjective mental functions by a social instrument. Thus, Vygotsky focuses on process rather than on product, in that he tries to reveal the dynamic relations at work in the development of higher mental functions which are mediated by culturally constructed semiotic artifacts, primarily language and sociocultural practices. Language is by no means the only sign system informing this process of transformation from the social to the subjective plane of the child (cf. Section 2.1 for pre-linguistic sounds and pointing) but it is the most consequential, since the two areas of speaking and thinking overlap in verbal thought, and this is coincident with language. Since the first language has such a normative impact on ways of thinking, it must serve as cornerstone for theories and practices of second language acquisition.

2.2.2 Egocentric speech

The fundamental transformative impact of language on subjective thought was analyzed in detail by Vygotsky. He suggests that initially language and thought exist independent of one another and that they have separate origins. During the pre-linguistic phase of the child’s development, thoughts are primarily based on images, sounds, colors, and smell, whereas linguistic noises produced by the child are pre-intellectual, and thus not linked to influencing thought. Vygotsky suggests a principal distinction between lower mental functions, such as elementary perception, memory, attention, and will, and higher cultural functions, such as language use, abstract thought, and conscious interpersonal relations, which are specifically human. The development of higher functions has a structuring and organizing influence on the lower functions, which do not disappear but rather are superseded by the cultural functions. Only at approximately the age of two years do children start to use language as a tool for thinking, and their speech increasingly relates to their thoughts. Hence, language and thought become progressively interdependent, and develop together under reciprocal influence: “The relation between thought and word is a living process; thought is born through words. A word devoid of thought is a dead thing. (...) But a thought that fails to realize itself in words also remains a ‘Stygian shadow’” (Vygotsky 1986: 255).

Vague, unmediated infant thought precedes the acquisition of the first words of a language. The transition from unconscious thought to the first words is characterized by the formation of “complexes” in the mind of the child: “The principal function of complexes is to establish bonds and relations. Complex thinking begins the unification of scattered impressions; by organizing discrete elements of experience into groups, it creates a basis for later generalizations” (Vygotsky 1986: 135).26 The development of complexes leads to the understanding and production of first words, and these are the words of others. Gradually these words, but also the grammar and syntax, as used by significant others, begin to exert a structuring and generating influence on thought, in that language increasingly becomes a tool for thinking to an extent that it allows for abstract thought, based solely on concepts mediated by language.27 For instance, we can think about abstract concepts such as love or intelligence in very precise terms, even differentiating many apparently distinct forms of love or intelligence.

For Vygotsky, this complex relationship between language and thought is the focus of his investigations. His basic hypothesis is this: “If perceptive consciousness and intellectual consciousness reflect reality differently, then we have two different forms of consciousness. Thought and speech turn out to be the key to the nature of human consciousness” (Vygotsky 1986: 256; emphasis in the original). Thought and language reflect “reality” in a way that differs from perception; one could also say that thought and language construct a subjective “reality” for each individual which he or she takes for objective reality, which in turn is checked for validity in a myriad of intersubjective communicative acts on both subjective and collective levels within a speech community. In this process, the subject’s constructs of “reality” for a given situation are influenced by internalized concepts, and thus his or her construals of “reality” are not completely subjective, but the social tool of language ensures that they are basically compatible and accessible to members of the same speech community. This means that language plays a central part, not only in the subjective development of thought, but also in the collective growth of human consciousness as a whole: “If language is as old as consciousness itself, and if language is a practical consciousness-for-others and, consequently, consciousness-for-myself, then not only one particular thought but all consciousness is connected with the development of the word” (Vygotsky 1986: 256).

The process of increasing interdependence of language and thought marks the beginning of higher stages of development for the child; these become evident in the development of egocentric speech at about the age of two or three years. In contrast to Piaget (1959: 43, cited in Vygotsky 1986: 16) who suggests that egocentric speech is the earliest form of speech, originating from deep within the psychic structure of the child, Vygotsky proposes that “egocentric speech is a phenomenon of the transition from interpsychic to intrapsychic functioning, i.e., from the social, collective activity of the child to his more individualized activity” (Vygotsky 1986: 228). According to this view, the occurrence of egocentric speech marks the transition from external to inner speech, or verbal thought. The psychological function of speech does not arise suddenly from social speech, but passes through an egocentric phase characterized by the social appearance of this speech, though its function is increasingly psychological. Thus, the formation of egocentric speech is not the result of autonomous innate cognitive mechanisms, as Piaget hypothesized, but it is initiated in the direct social environment of the child before being internalized and used for intrapersonal communication and self-guidance. The process of internalization or appropriation of language for the self initiates a fundamental transformation of the child’s cognitive processes in terms of fusing pre-intellectual language and pre-linguistic cognition which gives rise to verbally mediated thought.

However, Vygotsky’s concepts of egocentric speech, inner speech, and internalization has been criticized for promoting a segregationist agenda in linguistics in which “language is conceived and studied as a self-contained system whose constituent units have an autonomous existence above and beyond the actual concrete practices of communication in real life and, therefore, of the actual individuals who are responsible for these practices” (Jones 2009: 168). Critiques like these have a point in making us aware that Vygotsky, who was not a linguist, could have defined his vision of language and linguistics in much more detail. However, they tend to overlook that fact that Vygotsky’s main interest was psychological. Furthermore, the allegation that Vygotsky’s view of linguistics is segregationist tends to overlook his emphatic observation that language, as experienced by the child, comes from other peoples’ mouths, with all its imperfections, in particular during the early stages (e.g., motherese or parentese; cf. Section 2.1). This view of linguistics is not segregationist in the sense of a strict opposition of the language as a system and language as use; Vygotsky perceived language as a tool for communication and thought, not as an abstract system.

In a similar manner, Vygotsky’s concept of internalization has been criticized as a “rigid transmission model of linguistic and cultural development” (Jones 2009: 168). This critique seems to operate with “rigid” oppositions between external and internal spheres of the human mind. Drawing on Stetsenko (1999), Lantolf (2005: 314) points out that internalization does not literally mean “within the individual” or “in the brain.” Rather, the term should be taken as a metaphor for subjective mental activity which is psychologically independent of the thoughts and understandings of others, although remaining, of course, socioculturally organized in a historical dimension. Lantolf and Thorne (2006) cite Valsiner (1997) to define the process of internalization as a complex phenomenon which is situated on the borderline of the subject and the collective: “Internalization is a negotiated process of development that is co-constructed through forward-oriented construction of signs that bring over from the extrapersonal (social) world of the person to the intrapersonal subjective world semiotically encoded experiences, which, as personal sense systems, guide the person’s process of further reorganization of person-environment relationships” (Valsiner 1997: 246, cited in Lantolf and Thorne 2006: 160). The process of internalization does not happen in an instant but it is a long process of negotiation and re-negotiation in which socioculturally shaped and linguistically mediated concepts and plausibility structures are tested for viability and, if deemed viable, appropriated and internalized. Hence, the process of internalization presumes the incorporation of the subject into the activities of communities of practice,28 beginning at the periphery of the activity, moving through a series of actual and possible roles, thus becoming a full participant of the respective community.

Therefore, language and cognition are not completely internal functions but are situated on the borderline between internal and external realities, enabling the subject to abstract mentally from the immediate environment. The function of egocentric speech can be seen as structuring, regulating, and representing the child’s early thoughts to his or her self. Its form is largely identical to the social language the child experiences and acquires. Thus, egocentric speech has its origins in the social speech of a language community in general. For the cognition of the child, it originates in directly experienced social speech of others and is acquired during the process of appropriating, and thus transforming, social forms of action to his or her subjective cognitive and emotional needs (which are also socially and culturally influenced). The child speaks to his or her self through commenting on ongoing activities or verbalizing thoughts and ideas; language has in this context the function of monitoring and, to a lesser extent, structuring the process of thinking. In doing so, children do actually use the social language they have internalized, and not a language of their own invention: “The child does not choose the meaning of his words. (...) The meaning of the words is given to him by his conversations with adults. (...) In a word, he does not create his own speech, but acquires the speech of others” (Vygotsky 1986: 122), which has a transformative influence on his mental functions.

However, children do not master social speech used by adults in a completely identical fashion because they have not yet acquired the same modes of thought and conceptualization. Children are neither aware of the rules and regulations inherent in the system of language nor of the concepts adults associate with certain words and expressions; hence, they operate with pseudoconcepts. These are similar to the concepts used by adults but they are just “‘shadow[s]’ of the concept, its contour” (Vygotsky 1986: 122) which resemble more the structure of complexes. In using these pseudoconcepts, the child operates with them in an instrumental fashion but is not aware of their meaning. For instance, when I caught my three-year old son taking sweets from the refrigerator without having asked for permission, I explained to him why this was wrong. He listened carefully, put the sweets back into the fridge, looked at me and replied by saying: “I rest my case.” Clearly, he had heard this phrase somewhere and vaguely knew that it could be used to end a part of conversation, but had not fully understood its meaning and could not use it in a manner appropriate to the situation.

In the development of children’s thinking from unconscious to linguistically mediated and conceptually structured thought, pseudoconcepts have an important function because they provide the bridge to introducing the child’s thought to the concepts of others. “The transition from thinking in complexes to thinking in concepts passes unnoticed by the child because his pseudoconcepts already coincide in content with adult concepts” (Vygotsky 1986: 123-24). This means that the child uses the adult concepts without understanding their meaning. While the child operates in thought and speech with meanings derived from the words and concepts of others, he or she tries to fill the words and (pseudo-)concepts with sense, and to appropriate concepts and internal speech for self-directed thought. Egocentric speech allows children to distance themselves from their immediate environment and stimuli and have their behavior and attention guided by their own verbal plans; this procedure can be frequently observed “in preschool and early elementary years as an overt step in the eventual formation of inner speech” (Winsler 2009: 4).

2.2.3 Inner speech

Since the elaborate use of external social language is too complex for spontaneous subjective mental functions and thus ineffective for the purpose of immediately regulating mental operations of the child, egocentric speech is gradually superseded, roughly at the age of seven (cf. Vygotsky 1986: 33), by a more effective form of inner linguistic representation, namely inner speech (although egocentric speech and thinking in complexes do not completely vanish; cf. Section 2.2.2). The child has now reached a stage in his or her mental development that allows him or her to increasingly move from reliance on others’ speech to reliance on his or her appropriated form of inner speech. In contrast to egocentric speech, inner speech is inaudible. It originates from linguistic exchanges with others and passes through an intermediate phase of self-directed speech before becoming fully internalized.

Inner speech, according to Vygotsky, is neither identical with subjective thought nor with social language. Rather, it consists of a very personal language with minimal, abbreviated syntax and phonetics which operates in an automated manner in the mind of the subject. It focuses wholly on sense which is decontextualized, highly abstract, and only meaningful to the subject:

The sense of a word (...) is the sum of all the psychological events aroused in our consciousness by the word. It is a dynamic, fluid, complex whole, which has several zones of unequal stability. Meaning is only one of the zones of sense, the more stable and precise zone. A word acquires its sense from the context in which it appears; in different contexts, it changes its sense. (Vygotsky 1986: 244-45)

In making meaning for the individual, “the senses of different words flow into one another – literally ‘influence’ one another – so that the earlier ones are contained in, and modify, the later ones” (Vygotsky 1986: 246-47). Hence, the senses of words in inner speech – mainly verbs (Vygotsky 1986: 248) – are fused with one another (Vygotsky [1986: 246] calls this process “agglutination”), with the principal function of providing a point of reference for the subject, evoking other associations, other senses, that can quickly be identified by the speaker and provide a guide for his or her actual verbal utterances. The meanings of words become interconnected, fused senses of chunks of words and expressions that are comprehensible only to the subject. “A single word is so saturated with sense that (...) it becomes a concentrate of sense. To unfold it into overt speech, one would need a multitude of words” (Vygotsky 1986: 247). The sense generated by inner speech thus underlies in concentrated form the actual verbal utterances which tend to be much more elaborate and adherent to the systemic use of social language.

Inner speech cannot be translated fully into external speech, since aspects of sense remain unique to the subject due to his or her former experiences, social environment, motivations, memories, objectives of speaking, and degree of access to elaborated linguistic codes. “Inner sense turns out to be incommensurable with the external meaning of the same word” (Vygotsky 1986: 248). In contrast to sense, meaning is, according to Vygotsky, a much narrower concept: “Meaning remains stable throughout the changes of sense. The dictionary meaning of a word is no more than a stone in the edifice of sense” (Vygotsky 1986: 245). Inner speech, then, is characterized by the predominance of subjectively induced sense over socially defined meaning. This implies that inner (subjective) and outer (socially comprehensible) speech is not identical; they are different and they develop differently. Whereas inner speech is highly abstract, minimalist, and subvocally verbalized in a very subjective dimension, outer speech must adhere to the rules and regulations of the language, that is, its grammar, syntax, lexis, phonology, semantics, pragmatics, and morphology. But external language provides the basis for inner speech, too, as it must have been internalized before it can be used for that inner function. Vygotsky suggests that the linguistic structures acquired by the child through the internalization of social language into inner speech subsequently become “the basic structures of thinking” (Vygotsky 1986: 94).

Vygotsky’s concept of inner speech has been criticized from an integrationist approach to linguistics (cf. Harris 1996). If inner speech is an abbreviated form of social speech, it can be described as deficient, compared to the intersubjective use of speech. However, what is efficient use of language? Does it always operate with complete and linguistically correct units? Certainly not, as everyday speech is frequently deficient, yet highly efficient in its communicative purposes, and it is often supported by extra- and paralinguistic forms of communication; every text or utterance, no matter how abbreviated, makes sense, even if it is nonsensical. Therefore, the notion of abbreviated inner speech may be misconceived, as it tacitly presumes the necessity of using linguistically complete expressions, phrases, sentences, and texts (or utterances). Opponents of Vygotsky’s approach even state “that there is no such thing as ‘social speech’ or ‘private speech’ in the Vygotskian sense” (Jones 2009: 175). However, critiques like these tend to overlook the psychological context in which Vygotsky has developed these concepts, that is, the correlation between the development of language and mental faculties of thought. Over the past thirty years, a huge amount of research in developmental psychology has confirmed the existence of egocentric, private, and inner speech (for a comprehensive overview, cf. Winsler, Fernyhough, and Montero 2009).

As analyzed in Chapter 2, the child is by no means a passive pawn in the adult’s game, but has agency from the moment of birth (and even before). The child makes moves to regulate others’ behavior even before he or she has developed egocentric speech (e.g., making grasping movements with the hand while looking at an object; cf. Section 2.1). These activities are clearly conscious and creative but not necessarily reflective. In order to gain voluntary control over otherwise chaotic and messy thought processes, the child has to operate with semiotic tools, such as images, colors, smells, and language. Language is for him or her clearly not the elaborated linguistic system which the ideal speaker in Chomsky’s sense would effortlessly use; it is a form of language appropriated by him or her from what he or she hears from others, i.e. egocentric speech and inner speech. Language, as experienced by the child, contains a lot of deficient inputs which parents deliberately use to adapt to the children’s presumed deficient linguistic competences (e.g., parentese). The child does not internalize these utterances in an identical manner but crafts them anew and adopts them into his or her ongoing processes of thinking and (inter-)acting. From the age of 2 years, children actively test their linguistic constructs and creations and receive feedback from more knowledgeable others, thus continuing to construct the forms and rules of their L1.

Lantolf and Thorne (2006: 75) suggest that there is a third form of speech, namely “private speech” which “refers to that form of externalized speech deployed by adults to regulate their own mental (and possibly physical) activity.” Whereas inner speech is characterized by the predominance of pure sense and the absence of formal properties, private speech may be subvocal, or whispered; it is clearly based on linguistic structure and can therefore easily become externalized. The notion of private speech, which comes close to the notion of inner speech encoded in linguistic form, is observable mainly in adults rather than in children, and its main purpose is to maintain the individual’s focus on a particular reflective (or other) activity. This form of private speech seems to be closely related to egocentric speech (cf. Section 2.2.2) of children with the difference that adults use a more elaborate form of language for that purpose. Research undertaken in this field seems to suggest that private speech does not disappear when inner speech develops but it is regularly used by adults to regulate their own behavior (cf. Medina et al. 2009).

Private speech also plays an interesting role in self-regulation of bilingual or multilingual children and in early L2 acquisition. Studies carried out in this field of research seem to be consistent in showing that there is very little, if any, code-switching between L1 and L2 in bilingual children (Diaz et al. 1991; Pease-Alvarez and Winsler 1994; Glaessner 1995; Dolitsky 2000). It would appear that bilingual children tend to pick one language for thought processes and private speech, and then basically stick with this choice (cf. Winsler 2009: 22), although this may vary according to the linguistic and cultural context in which activities are performed.29 The second language is typically learned with intense scaffolding from the L1, not only by monolingual learners, but also by the majority of bilingual learners (cf. Diaz et al. 1991; Ushakowa 1994). However, when certain tasks require a varied and rich use of the L2 (e.g. narrating picture stories, solving problems, or recalling stories), considerable self-regulatory L2 private speech was observed (in addition to L1 private speech) among early L2 learners (cf. Appel and Lantolf 1994; Ahmed 1994; Centeno-Cortés and Jiménez-Jiménez 2004; Lantolf and Thorne 2006). Centeno-Cortés and Jiménez-Jiménez 2004 conducted a study comparing the use of overt private speech by intermediate-level learners of Spanish to both advanced L2 Spanish learners and native L1 speakers of Spanish while they were engaged in trying to solve logic and math word problems. They found out that, while L1 speakers unsurprisingly used almost exclusively the L1 for their private speech, 35% of intermediate learners used the L2 (Spanish) for engaging with the task at hand, followed by 52% of advanced L2 learners. This suggests that the duration and intensity of L2 learning has an impact on the use of the L2 as the medium of private speech (in the sense of self-directed talk). The longer one learns the L2, and the more proficient one becomes in using the L2 appropriately in its communicative and cultural context, the more likely it seems that the L2 is used for mediating one’s continued learning of the language.30 These observations might be transferred to the role of the L2 in inner speech, although there has not yet been any conclusive research carried out in this area.

In the process of acquiring an ever-increasing amount of words and rules of the L1, the child progressively develops in parallel mental functions. Having already started to develop in the pre-linguistic developmental stage from unorganized heaps to complexes, and then to concepts which Vygotsky observed on the basis of block-sorting and classification tasks with young children (cf. Vygotsky 1986: 110-124), conscious use of language as a tool for mental operations brings the ability to think to a completely different level. This observation refers mainly to the conceptual level of language, as Wertsch exemplifies:

With the development of scientific concepts, a child not only can use words such as “table”, “chair” and “furniture” appropriately in connection with the objects to which they refer, but the child can also operate on statements of logical equivalence, nonequivalence, entailment, and the like, such as “All tables are furniture.” Hence the emphasis has shifted away from those aspects of linguistic organization that involve contextualization to the capacity of linguistic signs to enter into decontextualized relationships, that is, relationships which are constant across contexts of use. (Wertsch 1985: 103)

Vygotsky considers the ability to use words as semiotic symbols for phenomena that are not present in the immediate interaction a crucial condition for the development of higher stages of mental development:

[One has to] view concept formation as a function in the adolescent’s total social and cultural growth, which affects not only the contents but also the methods of his thinking. The new significance of the word, its use as a means of concept formation is the immediate psychological cause of the radical change in the intellectual process that occurs on the threshold of adolescence. (Vygotsky 1986: 108; emphasis in the original)

Inner speech cannot take place without previous internalization of socioculturally produced language and concepts. At the same time, it contributes to the further development of concepts and plausibility structures on both subjective and collective planes. Inner speech, therefore, has a special status in the workings of the human mind. It regulates and guides unconscious processes of construction, and provides the space for conscious reflection. In both instances, private speech has to be taken into account for second language teaching and learning. If the learning process in its advanced stages is to be successful, the learner will develop a form of private and inner speech that integrates elements of the second language and its plausibility structures with the hitherto dominant L1 constructs. Therefore, in parallel to the development of the intercultural third place (cf. Sections 8.2 & 9.5), it can be assumed that the learner will also develop an interlingual form of private speech, based on the subjective interplay, or blending, of L1 and L2 linguistic, conceptual, and cultural elements. This, however, does not imply that the learner develops a completely new form of language; he or she rather is able to switch between languages according to context and, to some extent, blends and fuses underlying concepts and patterns.

2.3 Development of everyday concepts and scientific concepts

The decontextualization of meaning from the referent is a precondition for the subjective development of what Vygotsky calls scientific concepts, mainly mediated through language in formalized contexts, for instance, highly structured and specialized classroom activities in school. Vygotsky differentiates between scientific and everyday (or spontaneous) concepts. Whereas the former are usually learned in a formal school setting, the latter emerge on the basis of children’s experience of (inter-)acting in everyday real life. However, if the term concept relates to abstracted and subsequently synthesized features, scientific concepts must precede the formation of everyday concepts. This apparent contradiction in the sequence of the development of concepts can be explained by Vygotsky’s notion of pseudoconcepts: as long as the child has not yet reached the ability to think in a systematic, organized, conscious, and hierarchical manner, proper concepts cannot be formed, even though the child clearly and consciously experiences his or her everyday environment: “The child becomes conscious of his spontaneous concepts relatively late; the ability to define them in words, to operate with them at will, appears long after he has acquired the concepts. (...) The development of a scientific concept, on the other hand, usually begins with its verbal definition and its use in nonspontaneous operations” (Vygotsky 1986: 192; emphasis in the original).

The child tries to make sense of his or her environment on the basis of acquired linguistic and sociocultural knowledge. However, this kind of sense is construed with pre-abstract categories, or pseudoconcepts, and is not, therefore, compatible with sense derived from proper concepts. Hence, everyday concepts are closely linked to particular contexts, and lack an overall system. Once everyday concepts have been developed by the child, they enrich existing scientific concepts which, in turn, have a structuring influence on everyday concepts:

Scientific concepts are formed on the basis of systematic, organized and hierarchical thinking as distinct from everyday concepts which were seen to be tightly linked to particular contexts and lacking in an overall system. The latter are seen to bring the embedded richness and detailed patterns of signification of everyday thinking into the system and organized structure of scientific concepts. (Daniels 1996: 11)

Vygotsky argues that one of the fundamental advantages of scientific concepts (as opposed to everyday concepts) is that they form part of a system of interconnected concepts that allows the subject to link up with existing knowledge and to draw conclusions based on this system of decontextualized links. Consequently, Vygotsky (1986: 193; emphasis in the original) observes that “the development of the child’s spontaneous concepts proceeds upward, and the development of his scientific concepts downward, to a more elementary and concrete level.” Whereas spontaneous everyday concepts have the tendency to work their way toward a greater abstractedness, scientific concepts develop toward greater consciousness. Valsiner and van der Veer (2000: 376) illustrate the mutual enrichment of scientific (or academic) and everyday concepts with the example of the concept farmer, which may evoke romantic meanings in the child but more capitalist connotations in the adult:

The everyday concept of farmer is enriched because the child now learns the nonapparent fact that farmers form part of an economic market in which they try to realize certain goals. The academic concept of farmer is enriched because the abstract notion of entrepreneurs dealing with cattle and crop is filled with concrete facts of daily farmer life; it gets body and flesh. Thus, academic concepts presuppose everyday concepts, build upon them, but once acquired they alter the everyday concepts in fundamental ways. (Valsiner and van der Veer 2000: 376)

Scientific concepts are the gateway through which consciousness enters the realm of childish concepts, or pseudoconcepts. In other words, scientific concepts are a necessary precondition for the transformation of pseudoconcepts into proper everyday concepts which structure adult thought.

2.4 Learning the written form of language

If concepts are based on linguistic categories, then concepts must be systematic, communicable, and sharable. The concepts internalized by individuals within the same speech community have to be essentially the same, or at least compatible, in order to facilitate meaningful communication; individuals can also pool their conceptual resources so as to construct and co-construct meanings in order to solve certain problems. This is sometimes done explicitly and intentionally, but for the most part subjects are unaware of the concepts they are using in interaction; thus, conceptual knowledge is usually tacit knowledge, distributed to all members of a speech community. For example, in perceiving a certain situation, we attribute meaning to this situation as a result of the conceptual patterns of knowledge and experiences that we have acquired and internalized: “Perceptual recognition is subject to ‘semantic priming’ whereby previous exposure to a stimulus related in meaning enhances perceptual performance on various tasks” (Harré and Gillet 1994: 169). On that basis, knowledge of self, Other, and world can be co-constructed between individuals, and successful strategies for coping with situations can be developed (cf. Harré and Gillet 1994: 43). This is the core of the symbolic order that constitutes culture (cf. Chapter 7).

Vygotsky’s associate Luria carried out field research of literate and illiterate subjects in Uzbekistan in Central Asia during the 1930s and 1940s with the objective of examining the impact of scientific concepts on the ability to understand selfhood and worldhood in decontextualized structures of knowledge. For example, subjects were given pictures of a hammer, a saw, a log, and a hatchet and asked to group them together. Whereas schooled, literate Uzbeks tended to do so on the basis of abstract word meanings (hammer, saw, and hatchet are hyponyms of tools), non-schooled peasants did not make these connections at all; rather, they based their reasoning on experiential and situational thinking, for instance by stating that, “They [hammer, saw, log and hatchet] are all alike. The saw will saw the log and the hatchet will chop it into small pieces. If one of these things has to go, I’d throw out the hatchet. It doesn’t do as good a job as a saw” (Luria 1976: 60-61).

Michael Cole and Sylvia Scribner (1974) conducted similar experiments with illiterate members of the ethnic group of Kpelle in Liberia (West Africa). Kpelle adults were given simple syllogisms, such as the following:

Experimenter: At one time spider went to a feast. He was told to answer this question before he could eat any of the food. The question is: Spider and black deer always eat together. Spider is eating. Is black deer eating?

Subject: Were they in the bush?

Experimenter: Yes.

Subject: Were they eating together?

Experimenter: Spider and black deer always eat together. Spider is eating. Is black deer eating?

Subject: But I was not there. How can I answer such a question?’

(Cole and Scribner 1974: 162; emphasis in the original)

Just like the illiterate subjects in Uzbekistan, the Kpelle could not solve simple syllogisms by forming analogues since this requires a level of abstraction they obviously had not acquired; instead, they referred to conventional situations they had themselves experienced. The illiterate subjects invoked practical, non-linguistically represented experiences in their reasoning when confronted with decontextualized problems. They were not in a position to grasp the gist and meaning of the decontextualized narrative, and the researchers were surprised by the way the illiterate Kpelle subjects invoked empirical and situational frames of contextualization. By contrast, literate Kpelle demonstrated an ability to effortlessly operate with abstract, linguistically invoked objects and a linguistically created reality (cf. Cole and Scribner 1974: 164). Thus, “The arbitrariness of the hypothetical problem defeats them, whilst Kpelle boys even with limited schooling are able to cope with it” (Jahoda 1976: 183).

However, Scribner and Cole (1981) could show with their experiments among the Kpelle and Vai people of Liberia that the differences between schooling and literacy are not the same as Luria (and Vygotsky) had assumed. Schooling has a much bigger impact than literacy on the successful execution of the task because schooling also influences the ways, manners, and methodologies applied for solving the syllogisms. Hence, the ability to read and write is in itself not as decisive as the manner in which it is used. Schooled children were able to provide verbal explanations to solve the syllogisms, whereas the non-schooled interviewees relied on empirical reasoning, frequently adding further aspects which the original problem did not contain. But the biggest push in the direction of acquiring the potential for abstract thought is also not inherent in schooling itself, as Scribner and Cole (1981) concluded; rather, it is mediated by the written form of language. In this form, the instrument of language as such is objectified, and thus potentially deinstrumentalized. These anthropological field experiments graphically illustrate that empirically-structured illiterate thinking does not know analytical thought; it does not recognize any truth outside its own terms (cf. Demele 1988: 116). By contrast, schooled Kpelle literates were able to use decontextualized categories and assertions quite effortlessly.

The empirical field studies carried out by Luria in Uzbekistan and by Scribner and Cole in Liberia have their flaws. For example, they do not stipulate whether and for how long subjects kept in touch with school-related occupational activities requiring abstract thought, nor do they mention whether the subjects continued to sustain their improved task performance in everyday life. However, both studies seem to support Vygotsky’s general argument that transformation in material circumstances has an effect on mental function.

When children begin to learn to write in school, they have already acquired a vast amount of orally represented lexical and grammatical knowledge of their L1, typically by means of incidental learning. However, they are not usually aware of this knowledge and normally have no reason to make it explicit: “The child does have a command of the grammar of his native tongue long before he enters school, but it is unconscious, acquired in a purely structural way, like the phonetic composition of words” (Vygotsky 1986 :184). Therefore, written words are only objects to small children. Although children may feel that they represent something, they do not know how this mechanism of representation works. In an experiment conducted by Bruce Homer in Canada in the late 1990s (cf. Olson 2002: 158), it transpired that, when preliterate four-year olds were asked to pretend to write the word cat, they produced just one scribbled mark insisting it said cat. The children obviously tried to reproduce what they had seen their parents or other more knowledgeable others do when they were writing.31 When asked to write two cats, one child produced two scribbled marks on the paper, and three scribbles for three cats. However, when asked to write no cats the child waved the pencil in the air, saying: “There’s no cats so I didn’t write anything” (Olson 2002: 159). Obviously, the child hypothesizes that a word is something to be read and that it stands for something tangible, so abstract ideas, such as no cats, cannot be written down in words. Homer and Olson conclude that “the child knows something about writing - hence the scribbles – but does not know that writing maps on to utterances, not on to events or objects. (... ) They do not realize that writing represents not events, but language about events” (Olson 2002: 159-160). Preliterate children do not know about lexical or phonological properties of language, therefore they lack the capacity to analyze the constituents of written words which is a precondition for developing the ability to read and write.

In learning to write and in explicitly learning the grammar of the mother tongue, the child reaches a stage in the development of his or her language that is characterized by the conscious use of language. This is a major step in language acquisition, as the child is no longer limited to the unconscious and unreflected usage of language (tacit linguistic knowledge), but can now deliberately operate with language, and consequently operate consciously with his or her own cognitive abilities (cf. Vygotsky 1986:183-184). Whereas language usage in the ordinary world is characterized by spontaneity and context-specific restrictions, schooling puts emphasis on theoretical knowledge, and decontextualized and taxonomic thinking. In learning how to read and in particular how to write words, phrases, and texts, children have to learn to reconstitute their meaning in a new, abstract mode. “In learning to write, the child must disengage himself from the sensory aspect of speech and replace words by images of words” (Vygotksy 1986: 181). Unlike in face-to-face conversation, where responsibility for mutual understanding is shared between the interlocutors, the solitary writing process requires children to take on the sole responsibility for making accessible the meaning of what they want to express. In expressing their thoughts in a written text, children become aware that they must make their thinking fully explicit if it is to be understood by the readers. Consequently, children become aware of the linguistic system in some detail, and they come to understand that, by consciously deploying linguistic means, they can effectively meet their specific communicative intentions.

According to Vygotsky, the beginning of the process of learning to write is not dependent on the developmental readiness of school children because “the child has little motivation to learn writing when we begin to teach it. He feels no need and has only a vague idea of its usefulness” (Vygotsky 1986: 181). In addition, “the psychological functions on which written speech is based have not begun to develop in the proper sense when instruction in writing starts” (Vygotsky 1986: 183). In this important stage of learning, instruction is clearly leading development, a notion that Vygotsky developed in more detail with his concept of the “zone of proximal development” (cf. Section 9.5). Individual development follows its own internal logic which makes it difficult to work out a path of instruction to follow, but as Vygotsky has shown, development can be guided by instruction.

While speaking, the child “is hardly conscious of the sounds he pronounces and quite unconscious of the mental operations he performs. In writing, he must take cognizance of the sound structure of each word, dissect it, and reproduce it in alphabetical symbols, which he must have studied and memorized before” (Vygotsky 1986: 182). Although every text written in English can be decomposed into the 26 letters of the alphabet, children have to put the letters and words together according to complex linguistic rules and discursive requirements. An additional difficulty is the relation between elements of speech (phonemes) and their corresponding elements in written language (graphemes), for instance, due to different ways of pronunciation (e.g., varieties such as Scottish English, Hiberno English, Estuary English, Pidgin English, etc.), or words that sound the same but have different meanings (for example, homophones such as plain and plane, there and their, etc.). Using written symbols, such as full-stops, commas, and colons can also influence the meaning of a text and provide some guidance to the readers as to how the author intends his or her text to be read.32

Learning to acquire literacy is an arduous process covering many years, starting (at least in most Western societies) with the preparation phase, when children (of about 6 years of age) learn to form letters, handle a pencil, and copy words for themselves. In the consolidation phase, children (aged about 7 years) can write independently, using only those constructions they have already used in speech. The differentiation phase marks the separation of speech patterns from writing. Children (of 9 or 10 years of age) use increasingly complex literary constructions and genres in their writing. And finally, the integration phase marks the completion of differentiating the control of speaking and writing (cf. Kroll 1981). The child can now deliberately manipulate the differences between the oral and written forms of language, and mix them for intended effect.

During the initial two phases, “young children’s writing occurs at the moment. There is little preplanning or revising” (Rowe 2008: 409). It is only with gaining experience in the variation of texts and their underlying purposes in the subsequent two phases that children “begin to engage in both rehearsal and revision, with an eye toward pleasing their audience” (Rowe 2008: 409). By now, children have experienced language not only as a communicative tool for expressing their thoughts, but have learned to understand it in its objectified and systematic form. This prepares the ground for developing another form of linguistic knowledge, namely metalinguistic awareness, which occurs in the last two phases when children encounter more complex written language requiring conscious planning of the line of argument, the form of expression, and the choice of genre for the text to be written: “Written text conventions promote metalinguistic thinking in various linguistic domains such as sound/letter correspondence, word and sentence boundaries, and appropriate grammatical constructions (e.g., past perfect in English, passé simple in French, or optional bound morphology in Hebrew)” (Ravid and Tolchinsky 2002: 430).

In reaching the stage of metalinguistic awareness, the young learner has now acquired all aspects of language relevant for use in adult life. Language in its written form fosters elaborated decontextualized reflection, for instance, of philosophical concepts of truth or nothingness, and it allows access to archives of texts for the formation of disciplined knowledge. The individual is now a fully competent member of the speech community. For the purposes of L2 acquisition, the achievement of metalinguistic L1 awareness through mastering the written form of language has two sides. On the one side, is eases subjective access to the L2 because it can be approached as a linguistic system in terms of grammar, morphology, syntax, phonology, etc. which is different from the L1. However, the L1 system will, at least in the initial and intermediate stages of L2 learning, provide the basic framework for the subjective metalinguistic approach to the other language. This will lead to interferences at all linguistic levels which can be very difficult to overcome in the long term when using the L2.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.145.179.35