Abstract: This contribution is concerned with the reconstruction of the vowel qualities of Proto-Tupian, the ancestral language of the Tupian language family. The study is grounded in a bottom-up application of the comparative method and seeks to offer a more balanced reconstruction that avoids an overreliance on the Tupí-Guaraní branch. It is first shown that the height opposition traditionally reconstructed for the rounded vowel series (*o vs. *u) is best interpreted as an opposition between an unrounded vowel and a rounded one (*ə vs. *o). It is also argued that multiple instances of *e in the traditional reconstruction should be rather attributed to *ə. Finally, it is shown that two vowels (symbolized as *ɨ and *ɯ) must be reconstructed in lieu of the traditional *ɨ. The resulting proposal has consequences for the subgrouping of the Tupian family.
Keywords: Tupian languages, Comparative reconstruction, Vowels, Sound change.
Resumo: O foco do presente trabalho é a reconstrução do sistema vocálico do Proto-Tupi, a língua ancestral da família linguística Tupi. A investigação baseia-se em uma aplicação bottom-up do método comparativo e busca oferecer uma reconstrução mais equilibrada, evitando uma influência desproporcional do ramo Tupi-Guarani. Mostraremos primeiro que a diferença de altura tradicionalmente reconstruída para a série de vogais arredondadas (*o vs. *u) pode ser melhor compreendida como uma oposição entre uma vogal não arredondada e uma vogal arredondada (*ə vs. *o). Argumentamos também que múltiplas instâncias de *e na reconstrução tradicional devem ser atribuídas a *ə. Por fim, duas vogais (simbolizadas *ɨ e *ɯ) devem ser reconstruídas em lugar do segmento *ɨ da reconstrução tradicional. Estas propostas têm implicação para a classificação interna da família Tupi.
Palavras-chave: Línguas Tupi, Reconstrução comparativa, Vogais, Mudança sonora.
Research Articles
A revised reconstruction of the Proto-Tupian vowel system
O sistema vocálico do Proto-Tupi: uma nova proposta reconstrutiva
Received: 15 March 2021
Accepted: 22 December 2021
This article addresses the reconstruction of the vowels of Proto-Tupian (PT). Tupian is a major language family of South America, which comprises as many as ten universally recognized close-knit branches: Tupí-Guaraní, Awetí, Sateré-Mawé, Mundurukú, Juruna, Mondé, Tuparian, Arikém, Karo (= Ramarama), and Puruborá. Of these, four consist of only one language (Awetí, Sateré-Mawé, Karo, Puruborá), while Mundurukú, Juruna, and Arikém are similar in that each of them comprises one living language (Mundurukú, Yudjá, and Karitiana, respectively) and one extinct (or dormant), less well-attested language (Kuruaya, Xipaya, and Arikém, respectively); the Juruna branch also includes a third extinct language, Manitsawá, of which very little is known.
It is now universally accepted that Tupí–Guaraní, Awetí, and Sateré-Mawé are particularly closely related to each other, constituting thus a node within Tupian; furthermore, Tupí-Guaraní and Awetí share some innovations that did not affect Sateré-Mawé, suggesting an early split of the latter language off Proto-Mawé-Awetí-Tupí-Guaraní (Rodrigues, 1984-1955, p. 35; Rodrigues & Dietrich, 1997; Corrêa da Silva, 2010; Meira & Drude, 2015). In the remainder of this paper, we will label the node that comprises Tupí-Guaraní and Awetí ‘Awetí-Guaraní’, and the node that comprises Awetí-Guaraní and Sateré-Mawé will be accordingly called ‘Mawé-Guaraní’. Another proposal related to the internal classification of Tupian advances a hypothesis according to which Puruborá and Karo would form a subgroup (‘Ramarama–Puruborá’, see Rodrigues, 2007, p. 168, fn. 2; Galucio & Gabas Jr., 2002).
In this article, we critically assess the only existing reconstructive proposal that has ever been put forward for the PT segmental inventory (cf. Rodrigues, 1999, 2005, 2007), focusing on the vowel system as reconstructed in Rodrigues (2005). Rodrigues (2005) proposal is presented as established common knowledge in later reference works on the Tupian family (see e.g. Rodrigues & Cabral, 2012). However, we will argue that although the author’s cognacy judgments are, for the most part, precise, his analysis of the attested correspondence sets is problematic and, for this reason, in need of revision. Specific problems which we attempt to resolve in this contribution include the sound correspondence in the pan-Tupian word for ‘cultivated field’ (cf. Sateré-Mawé ŋo/ko, Awetí ko, Mundurukú kə, Yudjá kúá, but Makurap and Wayoró ŋge, Karitiana ŋga), which is considered irregular by Rodrigues (2005, p. 40), as well as the reason why the vowel *o in Rodrigues’ (1999) reconstruction occurs predominantly after labial consonants (cf. Rodrigues, 1999, pp. 110–111).
After a brief presentation of the relevant aspects of Rodrigues (2005) PT reconstruction (“Earlier scholarship”), we discuss some vowel correspondences that are especially troublesome in Rodrigues’ (2005) framework, and put forward an alternative account of the evolution of the vowel system of PT (sections “PT *ə” and “PT *ɯ vs. *ɨ”). The scenario we propose requires positing some non-trivial innovations shared by specific groups of Tupian, as discussed in “Implications for the subgrouping of Tupian”. After the “Conclusions”, the “List of abbreviations”, and the “References”, we include two appendices. Appendix 1 contains all the cognate sets discussed in this contribution, whereas Appendix 2 summarizes our reconstruction of Proto-Tupian consonants.
Before proceeding to the bulk of the discussion, we make explicit our conventions regarding the representation of linguistic data. In order to warrant the comparability of the data, which are extracted from a variety of sources on multiple Tupian languages, we have unified the representation of all cited forms according to the following principles.
First of all, the flap ɾ is represented as r throughout this paper, and the open-mid vowels ɔ and ɛ as o and e, respectively, as is in fact commonly done in Tupian linguistics. An exception is made for Kuruaya, where /ɔ/ and /o/ contrast and are thus represented as ɔ and o. The vowels of the extinct language Arikém, as well as of its direct ancestor Proto-Arikém, are represented as /i e æ ɒ ʉ/. The choice of the less common characters /æ ɒ ʉ/ is suggested by the variable representation of these vowels as ‹a› ~ ‹e›, ‹o› ~ ‹ḁ› ~ ‹a›, ‹u› ~ ‹u̥› ~ ‹i› in our only sources on the language (Rondon & Faria, 1948; Nimuendajú, 1932).
In most Tupian languages, consonants in the coda position do not contrast for features other than the place of articulation. This is true for Sateré-Mawé, Awetí, Proto-Tupí-Guaraní (and most contemporary Tupí-Guaraní languages), Proto-Tuparian (and all contemporary Tuparian languages), Proto-Arikém, and Puruborá. We write such codas in small caps: p (labial), t (dental/alveolar), c (palatal), k (velar). Their precise surface realizations vary depending on the language, on the point of articulation, and on the phonological environment, and include [p̚ m β] for p, [t̚ n ɾ] for t, [(ʲ)c̚ ɲ j] for c, [k̚ ŋ ɰ g] for k.
The phonetic inventory of many Tupian languages has two-phase stops that start out with a lower velum which raises during the occlusion. These are variably analyzed as postoralized allophones of underlying nasals or prenasalized allophones of underlying voiced stops (cf. Wetzels & Nevins, 2018), or (more rarely) as underlying prenasalized stops (cf. González, 2008). In this paper, such segments are always spelt as mb, nd, nʤ, ŋg, regardless of their phonological status in a given language.
Nasal vowels are always explicitly marked with a tilde, even if our sources leave nasality unmarked in some environments (usually following or preceding nasal consonants).
In languages where [s] and [ts] are free, idiolectal, dialectal, or chronological variants of one phoneme, we have normalized the data from our sources in order to warrant consistency across examples (e.g., only s is used in Tuparí, and only ts in Sakurabiat), following the current transcription practices in the most recent expert source in each case.
Proto-Mawé-Awetí-Tupí-Guaraní (= Proto-Mawetí-Guaraní = PMG) reconstructions are mostly those by Meira and Drude (2015), with the following modifications. Meira and Drude (2015)*tʲ is rewritten as *c, except when it follows an *i or a *j in their reconstruction, in which case we employ an ad hoc character *ć in order to capture the fact that the reflexes of *ć in all daughter languages are different from those of *c (this also allows us to give up the reconstruction of ‘phantom’ instances of PMG *i and *j, which are in Meira and Drude (2015) proposal hypothesized to have been lost in all daughter languages). For example, Meira and Drude’s (2015) reconstructions such as *itʲet ‘his/her name’ and *tʲajtʲu ‘armadillo’ are replaced with *ćet and *caću, given that no daughter language preserves any segmental trace of the alleged PMG segments *i and *j (Sateré-Mawé het, sahu; Awetí tet, tatu[pep]; PTG *tset, *tatu). We also posit PMG *tʲ alongside *c and *ć in order to account for the sound correspondence involving Sateré-Mawé t (in unstressed syllables) / ɾ(j) (in stressed syllables), Awetí ʐ, and PTG *t, found in the stem for ‘fire’ (reconstructed as *atia, *atja in Meira & Drude, 2015; *(c)atʲa in our proposal) as well as before *i (in complementary distribution with *t)1. We also diverge from Meira and Drude (2015) in reconstructing *w instead of their *kʷ, whereas the segment they reconstruct as *w is considered to have been independently epenthesized in the environment *u_V after the split of PMG (that way, Meira & Drude’s (2015) reconstructions such as *tʲuwaj ‘tail’ and *tʲuwɨ(k) ‘blood’ are replaced here with *cuac, *cuɨ > Sateré-Mawé suwac-po, suː; Awetí -uwac, -uwɨ[k]; PTG *tuwac/*-ruwac, *tuwɨ/*-ruwɨ)2. We accept Schleicher’s (1998, pp. 18–24) suggestion, reinforced by Meira and Drude (2015, pp. 278–279), whereby only one PTG affricate is reconstructed instead of the traditional *c (*ts) and *č (*ʧ); we symbolize it as *ts3.
Proto-Mundurukú reconstructions are taken from Picanço (2019). For relational stems, whose leftmost consonant often alternates depending on the left context, Picanço (2019) lists all possible allomorphs. In this article, only the allomorphs with *p-, *ð-, and *ʧ- (rather than *b-, *t-, *ɟ-) are given for such stems. We also omit the hyphen, used by Picanço (2019) in order to indicate that the relational stems are bound.
For Proto-Juruna and Proto-Tuparian, a variety of proposals exist; the reconstructions in this article are mostly extracted from the more recent ones (Carvalho, 2019 for Proto-Juruna; Nikulin & Andrade, 2020 for Proto-Tuparian) or adapted from earlier proposals (Fargetti & Rodrigues, 2008 for Proto-Juruna; Moore & Galucio, 1994; Nogueira et al., 2019 for Proto-Tuparian) so as to match the phonological reconstruction of the most recent works.
The reconstruction of the Proto-Tupian consonants adopted here differs from previous proposals and is based on the correspondence sets summarized in Appendix 2. For reasons of space, it is impossible to discuss this problem in detail in this contribution. Note, however, that nothing in our reconstruction of the Proto-Tupian vowels hinges on our interpretation of the PT consonants, and the validity of our proposal would remain intact if one adopted a different interpretation (such as that of Rodrigues, 2007).
The acute accent symbolizes the high tone in tonal languages (including the languages of the Mundurukú, Juruna, and Mondé branches, Karo, and maybe also Makurap); the low tone is left unmarked. Tones other than high and low are found only in the Mondé languages, and the transcription of our sources is retained in such cases. In the Tuparian languages Tuparí and Akuntsú, contrastive stress has been described, which is also symbolized by means of an acute accent (when its position is known).
Data quoted from premodern sources, which are not expected to faithfully represent all the relevant phonological oppositions, are given ‘verbatim’ enclosed in chevrons. Subscript letters after such forms indicate the ultimate source of the data: ‹›S and ‹›L refer to Emilie Snethlage’s and Lopes’ data on Kuruaya (Snethlage, 1932); ‹›B and ‹›N refer to Barbosa’s and Nimuendajú’s data on Arikém (Rondon & Faria, 1948; Nimuendajú, 1932). Forms followed by ‹›kg come from Koch-Grünberg (1932) on Puruborá, and those with ‹›es from Snethlage (1934), again on Puruborá. Forms in chevrons without subscript letters are from Steinen (1886) for Manitsawá, Nimuendajú (1923–1924, 1928, 1929) for Xipaya, and Sekelj (1948) for Aruá and Makurap.
Much of the discussion in this paper is based on analyzing cognate sets. In some cases, a given form is not synchronically segmentable, but only a part of it is cognate with the material of other languages. The part which is deemed non-cognate is then given in brackets. In premodern attestations (enclosed in chevrons), the cognate part is given in boldface.
The vocalic system of Proto-Tupian has been reconstructed by Rodrigues and Dietrich (1997, p. 268) and Rodrigues (1999, p. 110, 2005) as comprising six vowel qualities (*a, *ɨ, *o, *u, *e, *i), each of which would also have a nasal counterpart (*ã, *ɨ̃,*õ, *ũ̃, *ẽ, *ĩ; see Rodrigues & Cabral, 2012, p. 502). In Table 1, we list the reflexes of the Proto-Tupian oral vowels according to the proposal by Rodrigues (2005).
It can be easily seen from the table above that, according to Rodrigues (2005), the evolution of Proto-Tupian *e and *ɨ in the constituent families involved some phoneme splits. In the case of PT *e, Rodrigues (2005) posits a split in the so called ‘Eastern’ Tupian languages (Mawé-Awetí-Tupí-Guaraní, Mundurukú, and Juruna) allegedly conditioned by an adjacent consonant. The examples 1-2 illustrate the default development of PT *e, whereas 3 instantiates the development of PT *e affected, according to Rodrigues’ (2005, p. 40) proposal, by a following labialized stop (the labialization is reconstructed here exclusively in order to account for what are thought to be the divergent reflexes of PT *e)4.
For *ɨ, Rodrigues (2005, pp. 40–41) posits a split in Juruna and in the so called ‘Western’ Tupian languages of the Arikém, Tuparian, Karo (Ramarama), and Puruborá groups. This time, however, he does not identify a phonological environment which could have conditioned the alleged split (beyond a generic reference to the ‘immediate consonantal context’), nor is he explicit about whether the alleged split proceeded in the same way in all the aforementioned languages. The examples 4-6 illustrate.
In the subsequent sections, we will argue against the proposal by Rodrigues (2005), suggesting instead that the observed sound correspondences are best accounted for by reconstructing a phonemic inventory of seven (rather than six) vowel qualities for Proto-Tupian and positing a number of mergers in the daughter languages, in addition to one conditioned split. That way, the examples in 1-6 are reconstructed in our proposal as *kʲet ‘to sleep’, *mẽt ‘husband’, *jəp ‘leaf’, *ḳɯp ‘tree; stick-like’, *mbɨ/*pɨ ‘foot’, *pətɨc ‘heavy’. Tables 2 and 3 show the oral vowel inventories of Proto-Tupian in Rodrigues’ (2005) and our proposals, respectively.
This section deals with the reconstruction of a vowel we chose to represent as *ə. We start by stating its proposed reflexes in the daughter branches and listing the relevant cognate sets. In subsequent sections, we discuss how our findings relate to Rodrigues’ (2005) reconstruction of the PT vowels and consonants. We conclude that the recognition of *ə as a contrastive unit allows reducing the phonological inventory of Proto-Tupian by three phonemes (*pʷ, *kʷ, *kˀʷ), to account for the sound correspondences in a number of cognate sets which are unexplainable in Rodrigues’ (2005) proposal, and to account for the limited distribution of the sound correspondence which underlies Rodrigues’ (2005) reconstruction of PT *o (which occurs exclusively following labial consonants). Our proposal also entails that the vowels traditionally reconstructed as *o and *u should be reinterpreted as PT *ə, *o.
The vowel we reconstruct as PT *ə has evolved in the following way in the daughter languages5. In the Mawé-Guaraní branch, it has acquired rounding and changed to PMG *o (in fact, in our proposal PT *ə is the only source of PMG *o). In addition, it has been raised to *u before a vowel (as in ‘blood’, ‘sun’) or before a glottal stop and a vowel (as in ‘arrow’). In the Mundurukú branch, PT *ə is regularly reflected as Mu ə, Ku ɨ (< PMu *ɨ in Picanço’s (2019) reconstruction) and is the only source of this PMu vowel. Before a pre-PMu vowel, however, PT *ə is reflected as PMu *o (‘blood’, ‘arrow’, ‘sun’, provided that PMu *ðoj and *op/*ðop go back to pre-PMu *ðoi, *oip/*ðoip). The sequence *mə̃ is reflected as Mu mə̃, Ku mã (< PMu *mã in Picanço’s (2019) reconstruction). In the Juruna branch, one finds PJu *a except next to a labial consonant in the final syllable of the PT stem (‘leaf’, ‘snake’, ‘hand’, ‘wing’; exception: ‘what’) or before a vowel (‘sun’), where the regular reflex is *u. In the Tuparian branch, PT *ə is fronted to PTpr *e except after labial consonants, in which case the regular reflex is PTpr *o (‘to return’, ‘hand’, ‘wing/feather’, ‘heavy’, ‘causative’). Likewise, in the Arikém branch PT *ə is reflected as a low vowel (Kt a, Ari ) except after labial consonants, in which case one finds Kt ɨ, Ari ʉ (‘hand’, ‘vine’, ‘wing/feather’, ‘heavy’). Puruborá retains PT *ə as ə (or ə̃, if nasality is present). In the Mondé languages, PT *ə is reflected as a except after coronal consonants and word-initially, in which case e is found (‘leaf’, ‘larva’, ‘house/village’). Only in Karo do the reflexes seem chaotic at present: one finds ɨ in two examples (‘cylindrical and small’, ‘to hold’), i in two examples (‘heavy’, ‘to go’, in both cases next to a *t), o in two examples (‘leaf’, ‘third person coreferential’), a in two prefixes (‘causative’, ‘sociative causative’), as well as the following vowels in one example each: ə (‘to go, to walk’), ə̃ (‘snake’), and u (‘blood’, a likely result of contraction of the unique vowel sequence *əɯ).
In what follows, we discuss four correspondence sets derived by Rodrigues (2005) from three Proto-Tupian vowels. Two of them, identified by Rodrigues (2005) with PT *e and *u, show no overlap at all; these correspondence sets appear as (a) and (d), respectively, in Table 4 below. The remaining two correspondences, given as (b) and (c) in Table 4, are attributed in our account to PT *ə, yet an entirely different account is proposed by Rodrigues (2005). The correspondence (c) has the same reflexes as (d) in Karitiana and Tuparí, but other languages show distinct reflexes, which are typically lower than those of (d); it is associated by Rodrigues (2005) with PT *o. The correspondence (b) shows significant overlaps with (a) and (c): in Tupí–Guaraní, Awetí, Sateré-Mawé, Mundurukú, and Yudjá, the observed reflexes are identical to those of (c), whereas in all other branches the correspondence set in question — in Rodrigues’ (2005) account — coincides completely with (a).
According to Rodrigues (2005, pp. 40, 42, 2007, pp. 175–176, 181–182, 186), the overlapping pattern that involves the correspondence (b) — which coincides with (c) in Tupí–Guaraní, Awetí, Sateré-Mawé, Mundurukú, and Yudjá, but with (a) in the remaining branches — can be explained by reconstructing a secondary rounding feature for the consonant that immediately follows the vowel (the available options in Rodrigues’s, 2005 reconstruction are *pʷ, *kʷ, *kʷˀ). In the proto-language of Mawé-Guaraní, Mundurukú, and Yudjá (‘Eastern’ Tupian languages in Rodrigues’, 2005 terms), this contextual factor would have induced the merger of the correspondence set in (b) with the *o series, Yudjá being later subject to *o > a and Mundurukú undergoing *o > ə. The remaining branches — that is, Arikém, Tuparian, Mondé, Ramarama, and Puruborá — would have not been subject to any contextual coloring and show reflexes identical to those of PT *e, which leads Rodrigues (2005, 2007) to reconstruct *e for the correspondence set in question and to posit a conditioned split in his ‘Eastern’ languages. In synthesis, then, Rodrigues’ (2005) proposal for the PT segments whose reflexes appear in the correspondences in Table 4 above is as follows: *e for the correspondences (a) and (b), *o for (c), and *u for (d). Moreover, the context-dependent merger of the series (b) and (c) in Mawé-Guaraní, Mundurukú, and Juruna is attributed to the influence of a secondary rounding feature hosted on the following consonant.
Under closer scrutiny, however, it appears that the available evidence does not support either the identification of the correspondence (b) as a context-dependent offshoot of (a) or the reconstruction of a labialized stop series for Proto-Tupian. Although we concur with Rodrigues (2005) in reconstructing PT *e for the correspondence (a), we disagree with his diachronic interpretation of the remaining three correspondences in that:
we consider (b) to be the default development of PT *ə (rather than a positional development of *e);
we consider (c) to be a positional development of PT *ə (rather than the default reflex of a PT phoneme of its own, symbolized as *o by Rodrigues (2005);
we derive (d) from PT *o, which in our account is the only rounded vowel of PT (as opposed to Rodrigues’ (2005) reconstruction, whereby PT had both *u and *o).
Rodrigues’ (2005) account is seriously undermined by the following facts.
Each of these five points is discussed in the subsequent sections.
Let us consider the reflexes of the PT segments that Rodrigues (2007) reconstructs as labialized consonants. As will become clear, their reflexes do not differ from those of their plain (non-labialized) counterparts, and the only reason for positing such phonemes in Rodrigues (2007) reconstruction is to account for the correspondence set (b). Once it is recognized that the (b) series does not result from a conditioned split of *e, it is no longer necessary to reconstruct labialized consonants for Proto-Tupian.
We will start by examining the occurrences of *pʷ that are supposed to account for the alleged rounding of PT *e in ‘Eastern’ Tupian. Rodrigues (2007) reconstructs it for two roots, *epʷ ‘leaf’ and *epʷa ‘face’ (as well as in its derivative *epʷa-pokˀ ‘to appear’). In the former case, all Tupian branches have a reflex with a plain labial stopp (in some languages, which lack an opposition between oral and nasal codas, it is symbolized as P). In most Tupí-Guaraní languages, as well as in the Arikém language before the suffix -ɒ, the stop is lenited to β or a similar sound (cf. Schleicher, 1998, pp. 29–32; Storto & Baldi, 1994); this development regularly targets word-final stops of any origin in these languages8. In our reconstruction, the vowel correspondence between PMG *o, PMu *ɨ, PTpr *e, Kt a, Ari æ, Pu ə, and Mo e is derived from PT *ə, whereas the correspondence between the word-final consonants straightforwardly continues PT *-p. That way, there is no need to reconstruct a labialized stop for ‘leaf’ in our proposal.
An identical rhyme is found in the word for ‘bitter’. Rodrigues (2007, p. 196) reconstructs its PT etymon as *rʲop and lists its reflexes in Sateré-Mawé, Awetí, PTG, and Mundurukú. Had he considered the Tuparí and Karitiana cognates, he would have likely reconstructed *rʲepʷ.
In Rodrigues’ (2007) account, *pʷ is claimed to have a divergent reflex between vowels in the languages of the Mawé-Guaraní (PT *pʷ > PMG *β, as opposed to PT *p > PMG *p) and Mundurukú (PT *pʷ > PMG *p, as opposed to PT *p > PMG *b) branches. Rodrigues (2007) gives only two cognate sets that instantiate the sequence *epʷ: PT *epʷa ‘face’ and *epʷapokˀ ‘to appear’; Corrêa da Silva (2010, p. 128) claims the latter to be a derivative of the former. Rodrigues (2007, p. 186) lists the following reconstructions and reflexes (quoted verbatim).
Note, however, that even within Rodrigues’ own framework the proposed etymology for ‘face’ presents serious irregularities. In his account, PT *e before a labialized consonant would be expected to yield Mw o (rather than e), Mu ə (rather than o), Ku ɨ (rather than o — note that u in the datum cited by Rodrigues (2007) is a phonetic variant of /o/), and Kt a (rather than ɨ). We surmise that in this case Rodrigues (2007) has failed to distinguish between two unrelated cognate sets, which we derive from PT *jəβa ‘forehead’ and *jopʔa ‘face’ (in addition, Mw -ewa, or sewa in our notation, appears to be unrelated to either etymon).
PT *β in *jəβa ‘forehead’ is reconstructed based on the correspondence PMG *β ~ PTpr *β, otherwise found in the cognate set for ‘wind’, PMG *ɨβɨću ~ PTpr *ɨβijo (cf.Nikulin & Andrade, 2020, p. 292). In *jopʔa, the consonant cluster is reconstructed in order to account for the voiceless intervocalic stop *p in PMu, also found in etymologies such as PMu *óropo ~ PTpr *oropʔo ~ PMG *uruβu ‘vulture’9. In our account, the correspondence between PMG *o and PTpr *e need not be conditioned by any feature hosted on the following consonant.
As for the cognate set for ‘to appear’, we believe that PTG *oβapo should be excluded from it (no other examples are known where a velar coda in Mundurukú or Tuparian would correspond to zero in PTG)10. Moreover, the Mundurukú cognate does not actually contain an initial vowel (the root is papə́k ‘to be visible’, with the allomorph bapə́k occuring after vowels; cf, Picanço, 2005, p. 17). That way, this cognate set does not instantiate the vowel correspondence which interests us in this section, nor does it back up the reconstruction of *pʷ.
Now we turn to the purported PT *kʷ. According to Rodrigues (2007), this phoneme induces contextual coloring in the preceding vowel in the following cognate sets.
To the best of our knowledge, no PT reconstructions for the following cognate sets are proposed in the works by Rodrigues (2007). He would probably reconstruct *tˀakˀekʷ ‘army ant’ and *kekʷ ‘to hold’.
As can be seen in these examples, the putative PT *kʷ does not have labialized reflexes in any single Tupian branch. In fact, in all cases it is reflected precisely in the same way as PT *k (represented as *k in the coda position). The only example which, according to Rodrigues (2007, p. 182; 2008, p. 6), instantiates a labialized reflex of PT *kʷ in a Tupian language is PT *ekʷ-at ‘plaza’ > Xi koað-á, Sk ekʷat, Tu ekoat-pe ‘area around the house’11. We reconstruct the etymon in question as PT *ək-at and reject the appurtenance of the cited words to this cognate set. First of all, the ultimate sources of the Xipaya and Sakurabiat words provide glosses which are quite distant from ‘(village) plaza’: ‹ku̥aẓá› ‘village of foreigners’ (Nimuendajú, 1928, p. 827), ‹hekʷat› ‘field’ (Hanke et al., 1958, p. 205). Second, Sakurabiat kʷ does not regularly correspond to Tuparí ko (cf. Nikulin & Andrade, 2020).
The third purported labialized stop of Proto-Tupian is reconstructed by Rodrigues (2007) as *kʷˀ for one single etymon.
Rodrigues (2007, p. 186) states explicitly that *kʷˀ is reflected in the daughter languages precisely in the same way as *kˀ (we prefer to symbolize the segment in question with the ad hoc character *ḳ), and the labialization in the etymology for ‘arrow’ is reconstructed only in order to account for the correspondence between a front vowel in the Tuparian languages and rounded vowels in the ‘Eastern’ Tupian languages. Also note that the PT stem for ‘arrow’ almost certainly contains the formative for tree- or stick-like objects *-ḳɯp (*-kˀɨp in Rodrigues’, 2007 reconstruction), with reflexes in all Tupian languages, none of which shows any trace of labialization. In this sense, our proposal is superior to Rodrigues’ (2007) in that no need arises to reconstruct an extra consonant found in only one stem.
In this subsection, we examine the instances of the correspondence set (b) in environments in which no consonant follows the vowel in question. These receive no explanation in the framework of Rodrigues (2007): his proposal attributes the emergence of the series (b) to the rounding conditioned by a following labialized consonant, whereas in the etyma under consideration there is no consonant which could have triggered the alleged conditional rounding. The relevant data are provided below.
Regarding the etymology for ‘field’, Rodrigues (2005, p. 40) explicitly notes that the vowel of the Karitiana reflex (Kt ŋga/ŋa/) corresponds to that of the Makurap (Tuparian) reflex (Ma ŋge /ŋe/), but is not the regular outcome of PT *o, reconstructed by Rodrigues (2005) based on the reflexes in other branches of Tupian12. Rodrigues (2005, pp. 39-40) also notes that his proposal fails to account for the Yudjá reflex (Yu kú-á), as PT *o in his proposal is supposed to be reflected as a in the Juruna branch; he tentatively suggests that the Yudjá word is a loan from Língua Geral Amazônca, a Tupí-Guaraní language. In our account, none of these problems arises, as we hypothesize that PT *ə regularly yields Kt a, Ma e, and (before vowels) Yu u.
Rodrigues (1999, pp. 110-111) notes that PT *o — in his own reconstruction — occurs almost exclusively after labial consonants, and tentatively suggests that this vowel arose as a positional variant of PT *u. We argue that the correspondence set which Rodrigues (1999, 2005) equates with PT *o is indeed secondary: it may be indeed explained away as an offshoot of another PT vowel in the environment C[+labial](ʔ)_. However, our account differs from Rodrigues’ (1999) suggestion in that the correspondence set in question is derived from PT *ə (rather than from another rounded vowel) and in that the proposed conditioned split is attributed to a shallower level than Proto-Tupian: its effect is clearly visible in only two Tupian branches, Tuparian and Arikém. Instead of yielding PTpr *e, Kt a, Ari æ, one finds PTpr *o, Kt ɨ, Ari ʉ in the aforementioned context. The following examples illustrate.
After non-labial consonants or word-initially, the reflexes are PTpr *e, Kt a, Ari æ.
Not a single exception has been identified.
According to Rodrigues (2005), the correspondence sets (b) and (c) have different reflexes not only in Tuparí and Karitiana (as we have shown in the previous section, they are in fact in a complementary distribution in these languages), but also in Karo, Puruborá, and Mondé (represented by Suruí-Paiter in Rodrigues’, 2005 study). The (b) series (derived from PT *e before a labialized consonant in Rodrigues’, 2005 proposal) is supposed to be reflected as e in Karo, Puruborá, and Suruí-Paiter, whereas the correspondence set (c) (< PT *o according to Rodrigues, 2005) is expected to yield a in the three languages.
In reality, however, the Karo and Puruborá reflexes listed by Rodrigues (2005, pp. 39-40) for the correspondence sets (b) and (c) turn out to be nonexistent, and the Mondé reflexes may be equally accounted for if one accepts our reconstruction of PT *ə. Consider the following etymologies (R = Rodrigues, 2007):
The reflex e, listed by Rodrigues (2005) for Karo and Puruborá, simply does not occur in the available data. In Karo, one finds o, ɨ, i, ə, ə̃, a, and u (with no obvious distribution), and in Puruborá, ə is found in all cases. Currently we have no explanation for the reflexes in Karo (but note that Rodrigues’ proposal also fails to account for them).
In the Mondé languages, one does indeed find a and e in accordance with Rodrigues’ (2005) predictions (PT *e > e; PT *o > a). However, it is also possible to account for the Mondé reflexes if one recognizes that the etyma of all the aforementioned cognate sets contained one and the same vowel, PT *ə. Note that all instances of e occur following a coronal consonant (‘leaf’, ‘larva’) or word-initially (‘house/village’), whereas all instances of a (‘hand’, ‘snake’, ‘cultivated field’, ‘sun’, ‘heavy’) occur following a peripheral (labial or velar) consonant. We propose, therefore, that PT *ə was fronted to Proto-Mondé *e following coronal consonants or word-initially and yielded Proto-Mondé *a elsewhere, and that the Mondé languages lend no support to the Proto-Tupian age of the distinction between the correspondence sets (b) and (c). We parenthetically note that the fronting of the type *ə > e following coronal consonants is also known from the history of Djeoromitxí, a Macro-Jê language of the Jabutian branch (Voort, 2007, p. 147), which is, like the Mondé languages, spoken in the Rondonian East. Typologically, the functioning of labial and velar consonants as a natural class in processes triggering vowel backing (as in *ə > a) is amply documented (Hyman, 1973; Vago, 1976)13.
The development PT *eCʷ > *oC, posited by Rodrigues (2005), inter alia for his ‘Eastern’ Tupian languages, conjoined with the reconstruction of such consonants with labial off-glides — the factor that accounts for these environmentally restricted vocalic outcomes — raise two issues of phonetic plausibility of historical reconstructions: one related to directionality considerations of the presumed coloring effect of the PT consonants, the other related to the distribution of the labialized consonants. First of all, a baffling aspect of the labializing effect exerted by these consonants is that it always affects the preceding, not the following vowel: a pre-vocalic labialized stop has no effect on a following vowel. From a phonetic point of view this is extremely counterintuitive. If labialization, or, more precisely, a labial release feature, is to play the role of contrastive feature distinguishing between these consonants (i.e. *pʷ and *kʷ) and their plain counterparts (i.e. *p and *k), one would expect its ‘coloring’ influence upon adjacent vowels to be realized more strongly (if not exclusively) on a following rather than a preceding vowel (that is, in a Cw-to-V transition, as opposed to the V-to-Cw boundary).
As to their distribution, Rodrigues’ (2007) PT labialized stops tend to occur, or are found quite frequently, in word-final position. In fact, the most significant phonotactic gap in their distribution in Rodrigues’ (2007) proposal is the absence of *pʷ from word-initial position. The expectation, commented on above, that a consonant with a secondary articulation will exert a stronger coarticulatory effect on a following rather than a preceding vowel derives from the fact that such segments depend, for their realization, on a following resonant element. As a consequence, we also expect such consonants to be less-optimally realized (qua contrastive segments) in word-final or pre-consonantal position, with no vocoid to work as a base for its contrastive release features to be imposed. As a matter of fact, plenty of evidence suggests that this is the case (see, e.g., Blevins, 2004, p. 116). In the words of Ladefoged and Maddieson (1996, p. 357):
Thus we can say that labialization is typically concentrated on the release phase of the primary articulation it accompanies. This observation has both phonetic and phonological significance. Many more languages have a restriction between the presence of labialization and the choice of the following vowel, than between its presence and the choice of the preceding vowel, and in many languages with labialized consonants the set of syllable-final consonants, if any, does not include labialized ones.
Aside from general considerations stemming from principles of acoustics and perceptual phonetics, it is not hard to find cross-linguistic evidence supporting the contention that such secondary articulations found in stop consonants behave phonologically as if ‘looking for’ a supporting vowel. Thus, in Khwarshi, an Eastern Caucasian language, labialization is found as a secondary articulation feature, mostly in velar and uvular consonants (Khalilova, 2009, pp. 17-18). The contrast is restricted basically to word-initial and word-medial position preceding a vowel, as in etʷa ‘fly’ vs. eta ‘touch’, lakʷa ‘see’ vs. laka ‘lick’. The dynamic phonology of the language also demonstrates a preference for such labialized release consonants to occur preceding a vowel. Labialization is either lost (89a) or transferred to another consonant, one that precedes a vowel (89b), whenever a -C(V) suffix is added to a root containing a final labialized stop.
The joint effect of these generalizations, both static phonotactic patterns and processes in the dynamic phonology of Khwarshi, is to suggest that having consonants with a labial release preceding something other than a vowel is a highly undesirable or marked configuration in this language. Similar regularities are found in the phonologies of many unrelated languages, and can be understood more broadly in terms of the acoustic and phonetic constraints mentioned above, the same that make Rodrigues’ (2007) proposal of stop consonants with secondary offglides that are almost always realized in contexts other than that of a following vowel very implausible.
Above we have presented evidence against positing a sound change whereby PT *e would have acquired rounding (> *o) preceding labialized consonants as well as against reconstructing these labialized consonants for PT. Instead, we have proposed that the suspect correspondence should be derived from PT *ə. Moreover, in our reconstruction, PT *ə accounts for some sound correspondences deemed irregular in Rodrigues’ (2005) proposal as well as for the correspondences which underlie his reconstruction of *o. Our proposal is summarized in Table 5, where we list the reflexes of PT *ə as well of two other vowels which do not present split reflexes: PT *e and *o (in Rodrigues’, 2005, 2007 interpretation, *e and *u).
The notational change in the reconstruction of the sole rounded vowel of Proto-Tupian (*o) as opposed to (*u) does not affect the correctness of the sound correspondences identified by Rodrigues (2005) in any way. It is suggested by the fact that the typical realization of its reflex is a mid vowel in most Tupian branches (Mundurukú, Tuparian, Karo, Puruborá, and Mondé). Since Rodrigues’ (2005)*o is reinterpreted as an unrounded vowel *ə in our account, it is now unproblematic to reconstruct *o in PT stems such as *amẽko ‘jaguar’, *jacjo ‘armadillo’, *jaḳo ‘lizard’, *jeko ‘monkey’, *jõk ‘flea’, *jopi(-ʔa) ‘egg’, *ḳo ‘to ingest’, *ndo ‘hill, rock’, *ndok ‘to eat (intr.)’, *õp ‘to give’, *õt ‘I’ (and the first person prefix *o-), *toḳo ‘to bite’, *top ‘to see’, *waco ‘alligator’, among many others (we do not list these well-established etymologies in Appendix 1 for reasons of space).
In this section, we will argue that it is necessary to reconstruct two distinct vowel phonemes in place of Rodrigues’ (2005) Proto-Tupian *ɨ. According to Rodrigues (2005), Proto-Tupian *ɨ would have undergone a number of splits in the daughter languages depending on the immediate consonantal environment, yielding ɨ/i in Yudjá, e/i in Karitiana, i/ʉ in Tuparí, i/ɨ in Karo, and ɨ/i in Puruborá. Unfortunately, Rodrigues (2005) does not specify the consonantal environments which would have triggered the putative fronting of *ɨ in the daughter languages, nor is he explicit on whether these environments were identical for each constituent branch (Juruna, Arikém, Tuparian, Karo, and Puruborá). In what follows, we show that Rodrigues’ (2005) reconstruction collapsed two correspondence sets into one and that two distinct vowels must therefore be reconstructed for Proto-Tupian. We symbolize them as PT *ɯ and *ɨ15, as in the minimal pair PT *jɯ ‘liquid’ vs. PT *jɨ ‘urine’, still retained in Karitiana as se ‘liquid’ and si ‘urine’. Their reflexes are identical in some branches (PMG *ɨ, PMu *i, PJu *ɨ, Mo i); for this reason, in what follows we are concerned only with the remaining branches (that is, Tuparian, Arikém, Karo, and Puruborá). At the end of the section, however, we will see that the distinction between the PT vowels in question is indirectly preserved in the Mundurukú branch as well.
PT *ɯ is reconstructed for the correspondence set which involves the following reflexes: PTpr *ɨ (> Ma/Sk/Ak ɨ, Wy/Tu ʉ), Kt/Ari e, Pu ɨ. In Karo, one usually finds i in the word-initial position (i-cɨ ‘water’, itɨ ‘deer’), ə̃ if the syllable is nasal (nə̃p ‘louse’, wakə̃ja ‘agouti’), and ɨ elsewhere (i-cɨ ‘water’, ma-ʔɨp ‘tree’, tɨt ‘to cook’, itɨ ‘deer’, jaɨ ‘howler monkey’); we are as of now unable to account for the apparently aberrant reflex pək ‘to burn’. In Karo ju ‘blood’, the vowel u continues the PT sequence *əɯ and is thus not necessarily irregular. Below we list the PT etyma where evidence from multiple branches converges to the reconstruction of PT *ɯ as opposed to PT *ɨ (reflexes in branches which do not distinguish between them are omitted).
The following example can be considered regular if it turns out that Lemos Barbosa (1951) transcription ‹mixon›B of the Arikém cognate (see Rondon & Faria, 1948, p. 199) stands for mʉ̃sɒ̃ (for the development PT *w > Kt/Ari m in nasal environments, compare Kt mĩɲõ ‘Brazil nut’).
In the following examples, the PT etymon is preserved in Tuparian, but not in other branches which distinguish between *ɯ and *ɨ. Based on the Tuparian evidence, we reconstruct *ɯ.
There are also two PT etyma which have reflexes in Karitiana, but not in other languages which distinguish between *ɯ and *ɨ: PT *mbVʔɯt ~ *mbVḳɯt ‘necklace’,18*tɨʔɯt ~ *tɨḳɯt ‘maternal aunt’ > Kt mboʔet ~ mõet, teʔet. Based on the Karitiana evidence, we reconstruct PT *ɯ (we assume that the latter stem is derived from PT *tɨ ‘mother’ and thus reconstruct *ɨ in the first syllable, translaryngeal assimilation of vowels is not unheard of in Karitiana; alternatively, the Karitiana word could be related to Ari ‹utaíră›N ‘id.’).
PT *ɨ is reconstructed for the correspondence set which involves the following reflexes: PTpr *i (preserved as i in all daughter languages), Kt/Ari i, Pu i. Below we list the PT etyma where evidence from multiple branches converges to the reconstruction of PT *ɨ as opposed to PT *ɯ (reflexes in branches which do not distinguish between them are omitted).
To these one may add the verbs Tu [e]pik-, Kt (a-)mbik and possibly Pu [t]api-a ‘to sit’, if these are related to PT *apɨk ‘to sit’.
In the following examples the apparent mismatch between the reflexes in Tuparian and Arikém is explained by vowel fusion: PT *ewɨ > *æhi > Kt eː (instead of *ai), Ari e (instead of *æi); PT *ɨ(p)ḳɯ or *ɨ(p)ʔɯ > *iʔe > Kt eː (instead of *ie).
In the following examples, the reconstruction of PT *ɨ as opposed to PT *ɯ is based on non-converging evidence from only one branch, as no cognates in other diagnostic branches have been identified.
Finally, one etymology has been found with a mismatch between Tuparian and Karo regarding the reconstruction of *ɨ or *ɯ. It is possible that the Karo word is ultimately unrelated to PT *mẽpɨt, as the vowel correspondence in the first syllable and the nasal reflex of PT *p appear to be irregular.
We now turn to the Juruna branch, for which Rodrigues (2005) also posited a split of PT *ɨ into Yu ɨ, i. However, this alleged split does not have anything to do with the reconstruction of PT *ɯ and *ɨ proposed in this section: in our account, both vowels yield PJu *ɨ, usually preserved in the daughter languages (PT *kʲɯt ‘green’, *ŋgɯp ‘louse’, *nẽcɯk ‘horsefly’, *mbɨʔa/*pɨʔa ‘liver’, *(j)atɨ ‘pain, sour’, *ewɨt ‘bee, honey’, *tɨk ‘resin’, *pɨtɨk ‘to take, to grab’, *mẽpɨt ‘son (female ego)’, *apɨk ‘to sit’, *wɨp ‘cooked’ > PJu *[a]kɨ́ɮ-ú, *kɨpá, *nãtɨ́k-á, *bɨʔá, *ʃadɨ́ ‘sour’, *awɨɮ-á, *dɨ́́k-á, *pɨd–í́k-ú, (?) *mãbɨ-a ‘daughter’, *ab–í́k-ú, *[u]wɨp-u ‘to cook’). A complication arises from the fact that PJu *ɨ has a special reflex in the word-initial position in Yudjá (e-) and Xipaya (i-): PT *ḳɯc ‘earth’ > PJu *ɨt-á > Yu etá ‘sand, beach’; PT *ḳɯc-pɨ ‘earth’ > PJu *ɨpɨ́-á > Yu epɨ́á, Xi ipɨa; PT *ḳɯp ‘tree; stick-like’ > PJu *ɨpá ‘stick’ > Yu epá, Xi ipa. Note that the extinct Manitsawá appears to have preserved a non-front vowel in this case, as in ‹upá› ‘Holz’ (Steinen, 1886, p. 361). In the following example, Yudjá appears to have raised the vowel in the prevocalic position, feeding glide epenthesis: PT *ʔɯ ‘water’ > PJu *ɨ-á > *e-á > *i-á > Yu ijá (cf. Xi ija). Finally, in the following two etymologies PT *ɨ is reflected in unexpected ways in the Juruna languages: PT *pətɨc ‘heavy’, *tɨ ‘mother’ > PJu *padét-ú, *di-á. At present, we are unable to account for the development of these words.
We conclude this section with presenting a piece of indirect evidence for the opposition between PT *ɯ and *ɨ from the Mundurukú branch. In Proto-Mundurukú, PT *p is lost before front vowels, including *i < *ɨ, as in PT *pe ‘path’, *peo ‘pus’, *pe ‘tobacco’, *pɨ ‘foot’, *pɨcja ‘heel’, *pɨtɨk ‘to take, to grab’, *jepɨ ‘payment’, *w-epɨk ‘to revenge’ > PMu *e, *eɨ ‘to swell’, *e, *i, *(ʔ)iða, *iʧik, *ðéi, *w-eik (cf. Rodrigues, 2007, p. 173). However, in all available examples *p is retained before a PT *ɯ: PT *pɯk ‘to burn’, *kɨp-ḳɯt ~ *kɨp-ʔɯt ‘brother’ > PMu *pik, *kipit, suggesting that PT *ɨ and *ɯ were still distinct at the time when PT *p was lost before front vowels (i.e., PT *ɨ had already become front, but PT *ɯ had not). This argument is unfortunately not as strong as it could have been in light of the existence of cognate sets in which PT *p was not lost even before reflexes of PT *ɨ or *i (PT *jupi-ʔa ‘egg’, *pɨcjo ‘breath’, *kɯpɨ(-ḳɯt ~ -ʔɯt) ‘younger sister (female ego)’, *pɨ ‘inner part’ > PMu *ðopia̰, *piðo, *kibḭt, *pi),20 but the fact that there is not a single example of loss of PT *p preceding a *i < PT *ɯ is hardly due to chance.
Rodrigues and Cabral (2002, 2012) and Rodrigues (2007, p. 170) put forward a hypothesis whereby the Tupian family is considered to be split, in a binary manner, into two large branches: ‘Eastern’ (comprising Sateré-Mawé, Awetí, Tupí-Guaraní, Mundurukú, and Juruna) and ‘Western’ (comprising Mondé, Tuparian, Arikém, Ramarama, and Puruborá). These two proposals, however, have remained insufficiently demonstrated in the sense that they have not yet been supported with bunches of identified shared innovations.
Figure 1 summarizes the distribution of the innovations identified in this paper. Only phonologically significant innovations shared by more than one low-level group are shown, namely: (i) the merger of PT *ɨ and *i (Puruborá, Arikém, Tuparí, Karo, Mondé, Mundurukú), (ii) the merger of PT *ə and *e in the default environment (Arikém, Tuparí), (iii) the merger of PT *ə and *o after labials (Arikém, Tuparí), (iv) the merger of PT *ɯ and *ɨ (Mondé, Mundurukú, Juruna, Sateré-Mawé, Awetí, Tupí-Guaraní), and (v) the merger of PT *ə and *o before vowels (Mundurukú, Juruna, Sateré-Mawé, Awetí, Tupí-Guaraní).
The following groups share more than one innovation related to the evolution of PT *ə and *ɨ: Arikém and Tuparí (3 innovations), Mundurukú, Juruna, Sateré-Mawé, Awetí, and Tupí-Guaraní (2 innovations), and Mundurukú and Mondé (2 innovations). Of these, the former two sets are strong candidates for valid clades: both include non-trivial, positionally conditioned innovations (merger of *ə and *o following labials in Tuparí and Arikém, preceding vowels in Mundurukú, Juruna, Sateré-Mawé, Awetí, and Tupí-Guaraní). In contrast, the set comprising Mondé and Mundurukú is in all likelihood spurious (or paraphyletic): in addition to being incompatible with the proposal which links Mundurukú to Juruna and Mawé-Guaraní, there is indirect evidence which suggests that the fronting of PT *ɯ in Mundurukú counterfed the loss of *p before front vowels, an innovation specific to that branch. Therefore, the triple merger of PT *i, *ɨ, and *ɯ as *i has probably occurred independently in the phonological history of Mondé and Mundurukú.
That way, evidence from the development of the PT vowels supports the identification of two mid-level clades within Tupian. The node comprising Tuparí and Arikém is defined by the sound change *ə > *e (default) / *o (after labials); we suggest the label Tuparikém for this subgrouping hypothesis.21 The vowel inventory of Proto-Tuparikém (*/i ɨ e a o/) is preserved without changes in Proto-Tuparian, whereas in Proto-Arikém these vowels yielded /i e æ ɒ ʉ/ (> Karitiana /i e a o ɨ/) by means of a vowel shift identified by Storto and Baldi (1994). The second clade includes Mundurukú, Juruna, and Mawé-Guaraní (that way, our findings partially corroborate Rodrigues’ (2005) hypothesis regarding the validity of his Eastern branch) and has the merger of PT *ə and *o before vowels as its defining innovation. It may have proceeded in two stages: first, PT *ə and *o may have changed into *o and *u in a chain shift (this is precisely the state reconstructed by Rodrigues, 2005 for Proto-Tupian); in turn, the vowel *o (from PT *ə) may have been raised to *u in prevocalic contexts (thus merging with *u from PT *o). After that, Proto-Eastern Tupian *o, *u yielded PMG *o, *u; PMu *ɨ, *o; PJu *a (*u next to labials in stem-final syllables), *u. As the Eastern Tupian languages reach their greatest diversity between the Lower Madeira and the Lower Iriri, the Proto-Eastern Tupian Urheimat has to be sought in that region.
This paper has presented a reconstruction of the Proto-Tupi (PT) inventory of oral vowels alternative to that advanced by Rodrigues (2005), this being clearly the accepted view on the PT vocalism since its adoption in reference works on the family such as Rodrigues (1999) and Rodrigues and Cabral (2012). Our proposal is summarized in Table 6.
We have argued that this new proposal is superior to the Rodrigues (2005) reconstruction in that it avoids the postulation of unexplained bifurcations of reflexes and the proposal of exception-ridden splits that, moreover, lack phonetic plausibility. Rodrigues’ (2007) proposal of a series of labialized consonants to PT is rejected too, as the segments in question lack reflexes different from those of their plain counterparts and because the positional developments of contextual vowels were shown to be spurious.
1: First person
3: Third person
3CRF: Coreferential third person
IV: Class IV
abs.: Absolute
Ak: Akuntsu
Ar: Aruá
Ari: Arikém
Aw: Awetí
CAUS: Causative
Cl: Cinta-larga
excl.: Exclusive
Gv: Gavião
incl.: Inclusive
INF: Infinitive
Kp: Kepkiriwat
Kr: Karo
Kt: Karitiana
Ku: Kuruaya
Ma: Makurap
Mn: Manitsawá
Mo: Mondé
Mu: Mundurukú
Mw: Sateré-Mawé
Pa: Suruí-Paiter
PJu: Proto-Juruna
PL: Plural
PMG: Proto-Mawetí-Guaraní
PMu: Proto-Mundurukú
PRS: Present
PT: Proto-Tupian
(P)TG: (Proto-)Tupí-Guaraní
PTpr: Proto-Tuparian
Pu: Puruborá
rel.: Relational
SG: Singular
Sk: Sakurabiat (Mekéns)
Sl: Salamãy
Tu: Tuparí
Wy: Wayoró
Xi: Xipaya
Yu: Yudjá
Zo: Zoró
This appendix includes all etymologies that contain the relevant vowels in at least two branches of the family, as well as several other etymologies that were mentioned in the body of the text. In what follows, cognates which cannot be regularly derived from the reconstructed etyma are marked with (!), and the irregular reflexes of specific segments are underlined (except in cases of irregular deletion of segments).
We are grateful to two anonymous reviewers for their comments on the presentation and the substance of the paper. These have certainly improved the quality of our submission, and the authors are fully responsible for any remaining errors or shortcomings. We are especially grateful to the editors and technical staff of the Boletim for their swift and high-quality work in preparing the proofs and dealing with our observations on necessary adjustments and revisions.
Autor para correspondência: Fernando Carvalho. Museu Nacional/Universidade Federal do Rio de Janeiro. Quinta da Boa Vista, São Cristóvão, Rio de Janeiro, RJ, Brasil. CEP 20940-040 (fernaoorphao@gmail.com).