language evolution – Page 2

Why Disagree? Some Critical Remarks on the Integration Hypothesis of Human Language Evolution

Shigeru Miyagawa, Shiro Ojima, Robert Berwick and Kazuo Okanoya have recently published a new paper in Frontiers in Psychology, which can be seen as a follow-up to the 2013 Frontiers paper by Miyagawa, Berwick and Okanoya (see Hannah’s post on this paper). While the earlier paper introduced what they call the “Integration Hypothesis of Human Language Evolution”, the follow-up paper seeks to provide empirical evidence for this theory and discusses potential challenges to the Integration Hypothesis.

The basic idea of the Integration Hypothesis, in a nutshell, is this: “All human language sentences are composed of two meaning layers” (Miyagawa et al. 2013: 2), namely “E” (for “expressive”) and “L” (for “lexical”). For example, sentences like “John eats a pizza”, “John ate a pizza”, and “Did John eat a pizza?” are supposed to have the same lexical meaning, but they vary in their expressive meaning. Miyagawa et al. point to some parallels between expressive structure and birdsong on the one hand and lexical structure and the alarm calls of non-human primates on the other. More specifically, “birdsongs have syntax without meaning” (Miyagawa et al. 2014: 2), whereas alarm calls consist of “isolated uttered units that correlate with real-world references” (ibid.). Importantly, however, even in human language, the Expression Structure (ES) only admits one layer of hierarchical structure, while the Lexical Structure (LS) does not admit any hierarchical structure at all (Miyagawa et al. 2013: 4). The unbounded hierarchical structure of human language (“discrete infinity”) comes about through recursive combination of both types of structure.

This is an interesting hypothesis (“interesting” being a convenient euphemism for “well, perhaps not that interesting after all”). Let’s have a closer look at the evidence brought forward for this theory.

Miyagawa et al. “focus on the structures found in human language” (Miyagawa et al. 2014: 1), particularly emphasizing the syntactic structure of sentences and the internal structure of words. In a sentence like “Did John eat pasta?”, the lexical items John, eat, and pasta constitute the LS, while the auxiliary do, being a functional element, is seen as belonging to the expressive layer. In a more complex sentence like “John read the book that Mary wrote”, the VP and NP notes are allocated to the lexical layer, while the DP and CP nodes are allocated to the expressive layer.

Fig. 9 from Miyagawa et al. (2014), illustrating how unbounded hierarchical structure emerges from recursive combination of E- and L-level structures

As pointed out above, LS elements cannot directly combine with each other according to Miyagawa et al. (the ungrammaticality of e.g. John book and want eat pizza is taken as evidence for this), while ES is restricted to one layer of hierarchical structure. Discrete infinity then arises through recursive application of two rules:

(i) EP → E LP
(ii) LP → L EP
Rule (i) states that the E category can combine with LP to form an E-level structure. Rule (ii) states that the L category can combine with an E-level structure to form an L-level structure. Together, these two rules suffice to yield arbitrarily deep hierarchical structures.

The alternation between lexical and expressive elements, as exemplified in Figure (3) from the 2014 paper (= Figure 9 from the 2013 paper, reproduced above), is thus essential to their theory since they argue that “inside E and L we only find finite-state processes” (Miyagawa et al. 2014: 3). Several phenomena, most notably Agreement and Movement, are explained as “linking elements” between lexical and functional heads (cf. also Miyagawa 2010). A large proportion of the 2014 paper is therefore dedicated to phenomena that seem to argue against this hypothesis.

For example, word-formation patterns that can be applied recursively seem to provide a challenge for the theory, cf. example (4) in the 2014 paper:

(4) a. [anti-missile]
b. [anti-[anti-missile]missile] missile

The ostensible point is that this formation can involve center embedding, which would constitute a non-finite state construction.

However, they propose a different explanation:

When anti– combines with a noun such as missile, the sequence anti-missile is a modifier that would modify a noun with this property, thus, [anti-missile]-missile, [anti-missile]-defense. Each successive expansion forms via strict adjacency, (…) without the need to posit a center embedding, non-regular grammar.

Similarly, reduplication is re-interpreted as a finite state process. Furthermore, they discuss N+N compounds, which seems to violate “the assumption that L items cannot combine directly — any combination requires intervention from E.” However, they argue that the existence of linking elements in some languages provides evidence “that some E element does occur between the two L’s”. Their example is German Blume-n-wiese ‘flower meadow’, others include Freundeskreis ‘circle of friends’ or Schweinshaxe ‘pork knuckle’. It is commonly assumed that linking elements arose from grammatical markers such as genitive -s, e.g. Königswürde ‘royal dignity’ (from des Königs Würde ‘the king’s dignity’). In this example, the origin of the linking element is still transparent. The -es- in Freundeskreis, by contrast, is an example of a so-called unparadigmatic linking element since it literally translates to ‘circle of a friend’. In this case as well as in many others, the linking element cannot be traced back directly to a grammatical affix. Instead, it seems plausible to assume that the former inflectional suffix was reanalyzed as a linking element from the paradigmatic cases and subsequently used in other compounds as well.

To be sure, the historical genesis of German linking elements doesn’t shed much light on their function in present-day German, which is subject to considerable debate. Keeping in mind that these items evolved gradually however raises the question how the E and L layers of compounds were linked in earlier stages of German (or any other language that has linking elements). In addition, there are many German compounds without a linking element, and in other languages such as English, “linked” compounds like craft-s-man are the exception rather than the rule. Miyagawa et al.’s solution seems a bit too easy to me: “In the case of teacup, where there is no overt linker, we surmise that a phonologically null element occurs in that position.”

As an empiricist, I am of course very skeptical towards any kind of null element. One could possibly rescue their argument by adopting concepts from Construction Grammar and assigning E status to the morphological schema [N+N], regardless of the presence or absence of a linking element, but then again, from a Construction Grammar point of view, assuming a fundamental dichotomy between E and L structures doesn’t make much sense in the first place. That said, I must concede that the E vs. L distinction reflects basic properties of language that play a role in any linguistic theory, but especially in Construction Grammar and in Cognitive Linguistics. On the one hand, it reflects the rough distinction between “open-class” and “closed-class” items, which plays a key role in Talmy’s (2000) Cognitive Semantics and in the grammaticalization literature (cf. e.g. Hopper & Traugott 2003). As many grammaticalization studies have shown, most if not all closed-class items are “fossils” of open-class items. The abstract concepts they encode (e.g. tense or modality) are highly relevant to our everyday experience and, consequently, to our communication, which is why they got grammaticized in the first place. As Rose (1973: 516) put it, there is no need for a word-formation affix deriving denominal verbs meaning “grasp NOUN in the left hand and shake vigorously while standing on the right foot in a 2 ½ gallon galvanized pail of corn-meal-mush”. But again, being aware of the historical emergence of these elements begs the question if a principled distinction between the meanings of open-class vs. closed-class elements is warranted.

On the other hand, the E vs. L distinction captures the fundamental insight that languages pair form with meaning. Although they are explicitly talking about the “duality of semantics“, Miyagawa et al. frequently allude to formal properties of language, e.g. by linking up syntactic strutures with the E layer:

The expression layer is similar to birdsongs; birdsongs have specific patterns, but they do not contain words, so that birdsongs have syntax without meaning (Berwick et al., 2012), thus it is of the E type.

While the “expression” layer thus seems to account for syntactic and morphological structures, which are traditionally regarded as purely “formal” and meaningless, the “lexical” layer captures the referential function of linguistic units, i.e. their “meaning”. But what is meaning, actually? The LS as conceptualized by Miyagawa et al. only covers the truth-conditional meaning of sentences, or their “conceptual content”, as Langacker (2008) calls it. From a usage-based perspective, however, “an expression’s meaning consists of more than conceptual content – equally important to linguistic semantics is how that content is shaped and construed.” (Langacker 2002: xv) According to the Integration Hypothesis, this “construal” aspect is taken care of by closed-class items belonging to the E layer. However, the division of labor envisaged here seems highly idealized. For example, tense and modality can be expressed using open-class (lexical) items and/or relying on contextual inference, e.g. German Ich gehe morgen ins Kino ‘I go to the cinema tomorrow’.

It is a truism that languages are inherently dynamic, exhibiting a great deal of synchronic variation and diachronic change. Given this dynamicity, it seems hard to defend the hypothesis that a fundamental distinction between E and L structures which cannot combine directly can be found universally in the languages of the world (which is what Miyagawa et al. presuppose). We have already seen that in the case of compounds, Miyagawa et al. have to resort to null elements in order to uphold their hypothesis. Furthermore, it seems highly likely that some of the “impossible lexical structures” mentioned as evidence for the non-combinability hypothesis are grammatical at least in some creole languages (e.g. John book, want eat pizza).

In addition, it seems somewhat odd that E- and L-level structures as “relics” of evolutionarily earlier forms of communication are sought (and expected to be found) in present-day languages, which have been subject to millennia of development. This wouldn’t be a problem if the authors were not dealing with meaning, which is not only particularly prone to change and variation, but also highly flexible and context-dependent. But even if we assume that the existence of E-layer elements such as affixes and other closed-class items draws on innate dispositions, it seems highly speculative to link the E layer with birdsong and the L layer with primate calls on semantic grounds.

The idea that human language combines features of birdsong with features of primate alarm calls is certainly not too far-fetched, but the way this hypothesis is defended in the two papers discussed here seems strangely halfhearted and, all in all, quite unconvincing. What is announced as “providing empirical evidence” turns out to be a mostly introspective discussion of made-up English example sentences, and if the English examples aren’t convincing enough, the next best language (e.g. German) is consulted. (To be fair, in his monograph, Miyagawa (2010) takes a broader variety of languages into account.) In addition, much of the discussion is purely theory-internal and thus reminiscent of what James has so appropriately called “Procrustean Linguistics“.

To their credit, Miyagawa et al. do not rely exclusively on theory-driven analyses of made-up sentences but also take some comparative and neurological studies into account. Thus, the Integration Hypothesis – quite unlike the “Mystery” paper (Hauser et al. 2014) co-authored by Berwick and published in, you guessed it, Frontiers in Psychology (and insightfully discussed by Sean) – might be seen as a tentative step towards bridging the gap pointed out by Sverker Johansson in his contribution to the “Perspectives on Evolang” section in this year’s Evolang proceedings:

A deeper divide has been lurking for some years, and surfaced in earnest in Kyoto 2012: that between Chomskyan biolinguistics and everybody else. For many years, Chomsky totally dismissed evolutionary linguistics. But in the past decade, Chomsky and his friends have built a parallel effort at elucidating the origins of language under the label ‘biolinguistics’, without really connecting with mainstream Evolang, either intellectually or culturally. We have here a Kuhnian incommensurability problem, with contradictory views of the nature of language.

On the other hand, one could also see the Integration Hypothesis as deepening the gap since it entirely draws on generative (or “biolinguistic”) preassumptions about the nature of language which are not backed by independent empirical evidence. Therefore, to conclusively support the Integration Hypothesis, much more evidence from many different fields would be necessary, and the theoretical preassumptions it draws on would have to be scrutinized on empirical grounds, as well.

References

Hauser, Marc D.; Yang, Charles; Berwick, Robert C.; Tattersall, Ian; Ryan, Michael J.; Watumull, Jeffrey; Chomsky, Noam; Lewontin, Richard C. (2014): The Mystery of Language Evolution. In: Frontiers in Psychology 4. doi: 10.3389/fpsyg.2014.00401

Hopper, Paul J.; Traugott, Elizabeth Closs (2003): Grammaticalization. 2nd ed. Cambridge: Cambridge University Press.

Johansson, Sverker: Perspectives on Evolang. In: Cartmill, Erica A.; Roberts, Séan; Lyn, Heidi; Cornish, Hannah (eds.) (2014): The Evolution of Language. Proceedings of the 10th International Conference. Singapore: World Scientific, 14.

Langacker, Ronald W. (2002): Concept, Image, and Symbol. The Cognitive Basis of Grammar. 2nd ed. Berlin, New York: De Gruyter (Cognitive Linguistics Research, 1).

Langacker, Ronald W. (2008): Cognitive Grammar. A Basic Introduction. Oxford: Oxford University Press.

Miyagawa, Shigeru (2010): Why Agree? Why Move? Unifying Agreement-Based and Discourse-Configurational Languages. Cambridge: MIT Press (Linguistic Inquiry, Monographs, 54).

Miyagawa, Shigeru; Berwick, Robert C.; Okanoya, Kazuo (2013): The Emergence of Hierarchical Structure in Human Language. In: Frontiers in Psychology 4. doi 10.3389/fpsyg.2013.00071

Miyagawa, Shigeru; Ojima, Shiro; Berwick, Robert C.; Okanoya, Kazuo (2014): The Integration Hypothesis of Human Language Evolution and the Nature of Contemporary Languages. In: Frontiers in Psychology 5. doi 10.3389/fpsyg.2014.00564

Rose, James H. (1973): Principled Limitations on Productivity in Denominal Verbs. In: Foundations of Language 10, 509–526.

Talmy, Leonard (2000): Toward a Cognitive Semantics. 2 vol. Cambridge, Mass: MIT Press.

P.S.: After writing three posts in a row in which I critizised all kinds of studies and papers, I herby promise that in my next post, I will thoroughly recommend a book and return to a question raised only in passing in this post. [*suspenseful cliffhanger music*]

Defining iconicity and its repercussions in language evolution

There was an awful lot of talk about iconicity at this year’s EvoLang conference (as well as in previous years), and its ability to bootstrap communication systems and solve symbol grounding problems, and this has lead to talk on its possible role in the emergence of human language. Some work has been more sceptical than other’s about the role of iconicity, and so I thought it would be useful to do a wee overview of some of the talks I saw in relation to how different presenters define iconicity (though this is by no stretch a comprehensive overview).

As with almost everything, how people define iconicity differs across studies. In a recent paper, Monaghan, Shillcock, Christiansen & Kirby (2014) identify two forms of iconicity in language; absolute iconicity and relative iconicity. Absolute iconicity is where some linguistic feature imitates a referent, e.g. onomatopoeia or gestural pantomime. Relative iconicity is where there is a signal-meaning mapping or there is a correlation between similar signals and similar meanings. Relative iconicity is usually only clear when the whole meaning and signal spaces can be observed together and systematic relations can be observed between them.

Liz Irvine gave a talk on the core assumption that iconicity played a big role in in bootstrapping language. She teases apart the distinction above by calling absolute iconicity, “diagrammatic iconicity” and relative iconicity, “imagic iconicity”. “Imagic iconicity” can be broken down even further and can be measured on a continuum either in terms of how signals are used and interpreted by language users, or simply by objectively looking at meaning-signal mappings where signs can be non-arbitrary, but not necessarily treated as iconic by language users. Irvine claims that this distinction is important in accessing the role of iconicity in the emergence of language. She argues that diagrammatic or absolute iconicity may aid adults in understanding new signs, but it doesn’t necessarily aid early language learning in infants. Whereas imagic, or relative iconicity, is a better candidate to aid language acquisition and language emergence, where language users do not interpret the signal-meaning mappings explicitly as being iconic, even though they are non-arbitrary.

Irvine briefly discusses that ape gestures are not iconic from the perspective of their users. Marcus Perlman, Nathaniel Clark and Joanne A. Tanner presented work on whether iconicity exists in ape gesture. They define iconicity as being gestures which in any way resemble or depict their meanings but break down these gestures into pantomimed actions, directive touches and visible directives, which are all arguably examples of absolute iconicity. Following from Irvine’s arguments, this broad definition of iconicity may not be so useful when drawing up scenarios for language evolution, and the authors try to provide more detailed and nuanced analysis drawing from the interpretation of signs from the ape’s perspective. Theories which currently exist on iconicity in ape gesture maintain that any iconicity is an artefact of the gesture’s development through inheritance and ritualisation. However, the authors argue that these theories do not currently account for the variability and creativity seen in iconic ape gestures which may help frame iconicity from the perspective of its user.

It’s difficult to analyse iconicity from an ape’s perspective, however, it should be much easier to get at how human’s perceive and interpret different types of iconicity via experiments. I think that experimental design can help get at this, but also analysis from a user perspective from post-experimental questionnaires or even post-experimental experiments (where naive participants are asked to rate to what degree a sign represents a meaning).

Gareth Roberts and Bruno Galantucci presented a study where their hypothesis was that a modality’s capacity for iconicity may inhibit the emergence of combinatorial structure (phonological patterning) in a system. This hypothesis may explain why emerging sign languages, which have more capacity for iconicity than spoken languages, can have fully expressive systems without a level of combinatorial structure (see here). They used the now famous paradigm from Galantucci’s 2005 experiment here. They asked participants to communicate a variety of meanings which were either lines, which could be represented through absolute iconicity with the modality provided, or circles which were various shades of green, which could not be iconically represented. The experiment showed that indeed, the signals used for circles were made up from combinatorial elements where the lines retained iconicity throughout the experiment. This is a great experiment and I really like it, however, I worry that it is only looking at two extreme ends of the iconicity continuum, and has not considered the effects of relative iconicity, or nuances of signal-meaning relations. In de Boer and Verhoef (2012), a mathematical model shows that shared topology between signal and meaning spaces will generate an iconic system with signal-meaning mapping, but mismatched topologies will generate systems with conventionalised structure. I think it is important that experimental work now looks into more slight differences between signal and meaning spaces and the effects these differences will have on structure in emerging linguistic systems in the lab, and also how participant’s interpretation of any iconicity or structure in a system effects the nature of that iconicity or structure. I’m currently running some experiments exploring this myself, so watch this space!

References

Where possible, I’ve linked to studies as I’ve cited them.

All other studies cited are included in Erica A. Cartmill, Seán Roberts, Heidi Lyn & Hannah Cornish, ed., The Evolution of Language: Proceedings of the 10th international conference (EvoLang 10). It’s only £87.67 on Amazon, (but it may be wiser to email the authors if you don’t have a friend with a copy).

The Evolution of Language: The Webcomic

Remi van Trijp, a researcher at the Sony Computer Science Laboratory in Paris has started a weekly webcomic on the Evolution of Language here. There are seven entries so far, Remi adds some explanatory commentary to each one. Whilst somewhat crudely draw, they’re definitely worth a look. Here’s my favourite so far:

The Myth of Language Universals at Birth

[This is a guest post by Stefan Hartmann]

“Chomsky still rocks!” This comment on Twitter refers to a recent paper in PNAS by David M. Gómez et al. entitled “Language Universals at Birth”. Indeed, the question Gómez et al. address is one of the most hotly debated questions in linguistics: Does children’s language learning draw on innate capacities that evolved specifically for linguistic purposes – or rather on domain-general skills and capabilities?

Lbifs, Blifs, and Brains

Gómez and his colleagues investigate these questions by studying how children respond to different syllable structures:

It is well known that across languages, certain structures are preferred to others. For example, syllables like blif are preferred to syllables like bdif and lbif. But whether such regularities reflect strictly historical processes, production pressures, or universal linguistic principles is a matter of much debate. To address this question, we examined whether some precursors of these preferences are already present early in life. The brain responses of newborns show that, despite having little to no linguistic experience, they reacted to syllables like blif, bdif, and lbif in a manner consistent with adults’ patterns of preferences. We conjecture that this early, possibly universal, bias helps shaping language acquisition.

More specifically, they assume a restriction on syllable structure known as the Sonority Sequencing Principle (SSP), which has been proposed as “a putatively universal constraint” (p. 5837). According to this principle, “syllables maximize the sonority distance from their margins to their nucleus”. For example, in /blif/, /b/ is less sonorous than /l/, which is in turn less sonorous than the vowel /i/, which constitues the syllable’s nucleus. In /lbif/, by contrast, there is a sonority fall, which is why this syllable is extremely ill-formed according to the SSP.

A simplified version of the sonority scale. — A simplified version of the sonority scale

In a first experiment, Gómez et al. investigated “whether the brains of newborns react differentially to syllables that are well- or extremely ill-formed, as defined by the SSP” (p. 5838). They had 24 newborns listen to /blif/- and /lbif/-type syllables while measuring the infant’s brain activities. In the left temporal and right frontoparietal brain areas, “well-formed syllables elicited lower oxyhemoglobin concentrations than ill-formed syllables.” In a second experiment, they presented another group of 24 newborns with syllables either exhibiting a sonority rise (/blif/) or two consonants of the same sonority (e.g. /bdif/) in their onset. The latter option is dispreferred across languages, and previous behavioral experiments with adult speakers have also shown a strong preference for the former pattern. “Results revealed that oxyhemoglobin concentrations elicited by well-formed syllables are significantly lower than concentrations elicited by plateaus in the left temporal cortex” (p. 5839). However, in contrast to the first experiment, there is no significant effect in the right frontoparietal region, “which has been linked to the processing of suprasegmental properties of speech” (p. 5838).

In a follow-up experiment, Gómez et al. investigated the role of the position of the CC-patterns within the word: Do infants react differently to /lbif/ than to, say, /olbif/? Indeed, they do: “Because the sonority fall now spans across two syllables (ol.bif), rather than a syllable onset (e.g., lbif), such words should be perfectly well-formed. In line with this prediction, our results show that newborns’ brain responses to disyllables like oblif and olbif do not differ.”

How much linguistic experience do newborns have?

Taken together, these results indicate that newborn infants are already sensitive for syllabification (as the follow-up experiment suggests) as well as for certain preferences in syllable structure. This leads Gómez et al. to the conclusion “that humans possess early, experience-independent linguistic biases concerning syllable structure that shape language perception and acquisition” (p. 5840). This conjecture, however, is a very bold one. First of all, seeing these preferences as experience-independent presupposes the assumption that newborn infants do not have linguistic experience at all. However, there is evidence that “babies’ language learning starts from the womb”. In their classic 1986 paper, Anthony DeCasper and Melanie Spence showed that “third-trimester fetuses experience their mothers’ speech sounds and that prenatal auditory experience can influence postnatal auditory preferences.” Pregnant women were instructed to read aloud a story to their unborn children when they felt that the fetus was awake. In the postnatal phase, the infants’ reactions to the same or a different story read by their mother’s or another woman’s voice were studied by monitoring the newborns’ sucking behavior. Apart from the “experienced” infants who had been read the story, a group of “untrained” newborns were used as control subjects. They found that for experienced subjects, the target story was more reinforcing than a novel story, no matter if it was recited by their mother’s or a different voice. For the control subjects, by contrast, no difference between the stories could be found. “The only experimental variable that can systematically account for these findings is whether the infants’ mothers had recited the target story while pregnant” (DeCasper & Spence 1986: 143).

Continue reading “The Myth of Language Universals at Birth”

UFO Events, a Thought Experiment about the Evolution of Language

The problem of human origins, of which language origins is one aspect, is deep and important. It is also somewhat mysterious. If we could travel back in time at least some of those mysteries could be cleared up. One that interests me, for example, is whether or not the emergence of language was preceded by the emergence of music, or more likely, proto-music. Others are interested in the involvement of gesture in language origins.

Some of the attendant questions could be resolved by traveling back in time and making direct observations. Still, once we’d observed what happened and when it happened, questions would remain. We still wouldn’t know the neural and cognitive mechanisms, for they are not apparent from behavior alone. But our observations of just what happened would certainly constrain the space of models we’d have to investigate.

Unfortunately, we can’t travel back in time to make those observations. That difficulty has the peculiar effect of reversing the inferential logic of the previous paragraph. We find ourselves in the situation of using our knowledge of neural and cognitive mechanisms to constrain the space of possible historical sequences.

Except, of course, that our knowledge of neural and cognitive mechanisms is not very secure. And large swaths of linguistics are mechanism free. To be sure, there may be an elaborate apparatus of abstract formal mechanism, but just how that mechanism is realized in step-by-step cognitive and neural processes, that remains uninvestigated, except among computational linguists.

The upshot of all this is that we must approach these questions indirectly. We have to gather evidence from a wide variety of disciplines – archeology, physical and cultural anthropology, cognitive psychology, developmental psychology, and the neurosciences – and piece it together. Such work entails a level of speculation that makes well-trained academicians queasy.

❖ ❖ ❖

What follows is an out-take from Beethoven’s Anvil, my book on music. It’s about a thought experiment that first occurred to me while in graduate school in the mid-1970s. Consider the often astounding and sometimes absurd things that trainers can get animals to do, things the don’t do naturally. Those acts are, in some sense, inherent in their neuro-muscular endowment, but not evoked by their natural habitat. But place them in an environment ruled by humans who take pleasure in watching dancing horses, and . . . Except that I’m not talking about horses.

It seems to me that what is so very remarkable about the evolution of our own species is that the behavioral differences between us and our nearest biological relatives are disproportionate to the physical and physiological differences. The physical and physiological differences are relatively small, but the behavioral differences are large.

In thinking about this problem I have found it useful to think about how at least some chimpanzees came to acquire a modicum of language. All of them ended in failure. In the most intense of these efforts, Keith and Cathy Hayes raised a baby chimp in their household from 1947 to 1954. But that close and sustained interaction with Vicki, the young chimp in question, was not sufficient. Then in the late 1960s Allen and Beatrice Gardner began training a chimp, Washoe, in Ameslan, a sign language used among the deaf. This effort was far more successful. Within three years Washoe had a vocabulary of Ameslan 85 signs and she sometimes created signs of her own. Continue reading “UFO Events, a Thought Experiment about the Evolution of Language”

Bootstrapping Recursion into the Mind without the Genes

Recursion is one of the most important mechanisms that has been introduced into linguistics in the past six decades or so. It is also one of the most problematic and controversial. These days significant controversy centers on question of the emergence of recursion in the evolution of language. These informal remarks bear on that issue.

Recursion is generally regarded as an aspect of language syntax. My teacher, the late David Hays, had a somewhat different view. He regarded recursion as mechanism of the mind as a whole and so did not specifically focus on recursion in syntax. By the time I began studying with him his interest had shifted to semantics.

He had the idea that abstract concepts could be defined over stories. Thus: charity is when someone does something nice for someone without thought of a reward. We can represent that with the following diagram:

The charity node to the left is being defined by the structure of episodes at the right (the speech balloons are just dummies for a network structure). The head of the episodic structure is linked to the charity node with a metalingual arc (MTL), named after Jakobson’s metalingual function, which is language about language. So, one bit of language is defined by s complex pattern of language. Charity, of course, can appear in episodes defining other abstract stories, and so on, thus making the semantic system recursive.

Now let’s develop things a bit more carefully, but still informally. Nor do we need to get so far as the metalingual definition of abstract concepts. But we do need the metalingual mechanism. Continue reading “Bootstrapping Recursion into the Mind without the Genes”

Happy Darwin Day!

I had hoped to celebrate Darwin day with a longer post discussing how language is often viewed as a challenging puzzle to natural selection. My main worry is that the formal design metaphor used in much of linguistics has been used, incorrectly IMHO, to divert attention away from studying language as a biological system based on organic logic. If this doesn’t make much sense, then you can do some background reading with Terrence Deacon’s paper, Language as an emergent function: Some radical neurological and evolutionary implications. Alas, that’s all I have to say on the matter for now, but if you’re looking for something related to Darwin, evolution and the origin of language, then I strongly suggest you head over to the excellent Darwin Correspondence project and read their blog post on the subject:

Darwin started thinking about the origin of language in the late 1830s. The subject formed part of his wide-ranging speculations about the transmutation of species. In his private notebooks, he reflected on the communicative powers of animals, their ability to learn new sounds and even to associate them with words. “The distinction of language in man is very great from all animals”, he wrote, “but do not overrate—animals communicate to each other” (Barrett ed. 1987, p. 542-3). Darwin observed the similarities between animal sounds and various natural cries and gestures that humans make when expressing strong emotions such as fear, surprise, or joy. He noted the physical connections between words and sounds, exhibited in words like “roar”, “crack”, and “scrape” that seemed imitative of the things signified. He drew parallels between language and music, and asked: “did our language commence with singing—is this the origin of our pleasure in music—do monkeys howl in harmony”? (Barrett ed. 1987, p. 568).

Retiring Procrustean Linguistics

Many of you are probably already aware of the Edge 2014 question: what scientific ideas are ready for retirement? The question was derived from the Kuhnian-esque, and somewhat tongue-in-cheek, quote by theoretical physicist Max Planck:

A new scientific theory does not triumph by convincing its opponents and making them see the light, but rather because its opponents die, and a new generation grows up that is familiar with it.

Some of the big themes that jumped out at me were bashing the scientific method, bemoaning our enthusiasm for big data and showing us how we don’t understand and routinely misapply statistics. Other relevant candidates that popped up for retirement were culture, learning, human nature, innateness, and brain plasticity. Lastly, on the language front, we had Benjamin Bergen and Nick Enfield weighing in against universal grammar and linguistic competency, whilst John McWhorter rallied against strong linguistic relativity and Dan Sperber challenged our conventional understanding of meaning.

And just so you’re aware: I’m not necessarily in agreement with all of the perspectives I’ve linked to above, but I do think a lot of them are interesting and definitely worth a read (if only to clarify your own position on the matters). On this note, you should probably go over and read Norbert Hornstein’s post about the flaws of Bergen’s argument, which basically boil down to a conflation between I-languages and E-languages (and where we should expect to observe universal properties).

If I had to offer my own candidate for retirement, then it would be what Anne Buchanan over at the excellent blog, The Mermaid’s Tale, termed Procrustean Science:

In classical Greek mythology, Procrustes was a criminal who produced an iron bed and made his victims fit the bed…by cutting off any parts of their bodies that didn’t fit. The metaphorical use of the word means “enforcing uniformity or conformity without regard to natural variation or individuality.” It is in this spirit that Woese characterized much of modern biology as procrustean, because rather than adapt its explanations to the facts, the facts are forced to lie in a bed of theory that is taken for granted–and thus, the facts must fit!

Continue reading “Retiring Procrustean Linguistics”

Self-Organization and Developmental Mechanisms in the Origins of Speech and Action Systems

Pierre-Yves Oudeyer just popped this up on YouTube and it’s worth a watch for those interested in the evolution of speech and language.

What is combinatorial structure?

Languages have structure on two levels. The level on which small meaningless building blocks (phonemes) make up bigger meaningful building blocks (morphemes), and the level of structure at which these meaningful building blocks make up even bigger meaningful structures (words, sentences, utterances). This was identified way back in the 1960s as one of Hockett’s design features for language know as “duality of patterning”, and in most of linguistics people refer to these different levels of structure as “phonology” and “(morpho)syntax”.

However, in recent years these contrasting levels of structure have started to be talked about in the context of language evolution, either in reference to artificial language learning experiments or experimental semiotics, where a proxy for language is used so it doesn’t make sense to talk about phonological or morphosyntactic structure, or when talking about animal communication where it also doesn’t make sense to talk about terms which pertain to human language. Instead, terms such as “combinatorial” and “compositional” structure are used, occasionally contrastively, or sometimes they get conflated to mean the same thing.

In the introduction to a recent special issue in Language and Cognition on new perspectives on duality of patterning, Bart de Boer, Wendy Sandler and Simon Kirby helpfully outline their preferred use of terminology:

Duality of patterning (Hockett, 1960) is the property of human language that enables combinatorial structure on two distinct levels: meaningless sounds can be combined into meaningful morphemes and words, which themselves could be combined further. We will refer to recombination at the first level as combinatorial structure, while recombination at the second level will be called compositional structure.

You will notice that they initially call both levels of structure “combinatorial”, and they both arguably are, and my point in this blog post isn’t necessarily that only structure on the first level should be called combinatorial, but that work talking about combinatorial structure should establish what their terminology means.

A recent paper by Scott-Philips and Blythe (2013), which is entitled “Why is combinatorial communication rare in the natural world, and why is language an exception to this trend?” presents an agent based model to show how limited the conditions are from which combinatorial communication can emerge. Obviously, in order to do this they need to define what they mean by combinatorial communication and present this figure by way of explanation:

They explain:

In a combinatorial communication system, two (or more) holistic signals (A and B in this figure) are combined to form a third, composite signal (A + B), which has a different effect (Z) to the sum of the two individual signals (X + Y). This figure illustrates the simplest combinatorial communication system possible. Applied to the putty-nosed monkey system, the symbols in this figure are: a, presence of eagles; b, presence of leopards; c, absence of food; A, ‘pyow’; B, ‘hack’ call; C = A + B ‘pyow–hack’; X, climb down; Y, climb up; Z ≠ X + Y, move to a new location. Combinatorial communication is rare in nature: many systems have a signal C = A + B with an effect Z = X + Y; very few have a signal C = A + B with an effect Z ≠ X + Y.

In this example, the building blocks which make C , A and B, are arguably meaningful because they act as signals in their own right, therefore, if C had a meaning which was a combination of the meanings of A and B, this system (using de Boer, Sandler and Kirby’s definition) would be compositional (this isn’t represented in the figure above). However, if the meaning of C is not a combination of the meanings of A and B, then A and B are arguably meaningless building blocks (and their individual expressions just happen to have meaning, for example the individual phoneme /a/ being an indefinite determiner in English, but not having this meaning when it is used in the word “cat”). In this case, the system would be combinatorial (as defined by the figure above, as well as under the definition of de Boer, Sadler and Kirby). So far so good, it looks like we are in agreement.

However, later in their paper Scott-Philips and Blythe go on to argue:

Coded ‘combinatorial’ signals are in a sense not really combinatorial at all. After all, there is no ‘combining’ going on. There is really just a third holistic signal, which happens to be comprised of the same pieces as other existing holistic signals. Indeed, the most recent experimental results suggest that the putty-nosed monkeys interpret the ‘combinatorial’ pyow–hack calls in exactly this idiomatic way, rather than as the product of two component parts of meaning. By contrast, the ostensive creation of new composite signals is clearly combinatorial: the meaning of the new, composite signal is in part (but only in part) a function of the meanings of the component pieces.

The argument they are giving here is that unless the meaning of C is a combination of A and B (or compositional as defined above), then it is not really a combinatorial signal.

Scott-Philips and Blythe definitely know and demonstrate that there is a difference between the two levels of structure, but they conflate them both under one term, “combinatorial”, which makes it harder to understand that there is a very clear difference. Also, changing the definition of what they mean by “combinatorial” between the introduction of their paper and their discussion confuses their argument.

Perhaps we should all agree to adopt the terminology proposed by de Boer, Sandler and Kirby, but given the absence of a consensus on the matter, at the very least I think outlining exactly what is meant by combinatorial (or compositional) needs to be established at the beginning of every paper using these terms.

References

de Boer, B., Sandler, W., & Kirby, S. (2012). New perspectives on duality of patterning: Introduction to the special issue. Language and Cognition, 4(4).

Hockett, C. 1960. The origin of speech. Scientific American 203. 88–111.

Scott-Phillips, T. C., & Blythe, R. A. (2013). Why is combinatorial communication rare in the natural world, and why is language an exception to this trend?. Journal of The Royal Society Interface, 10(88), 20130520.