This paper discusses word-initial (sibilant + consonant) sequences in Brazilian Portuguese (BP) which may or may not be preceded by a vowel, i.e., sC-clusters. A set of words with sC-clusters in BP are native words, as in escola [isˈkɔlə] ~ [ˈskɔlə] ‘school’. Another set of words comprise loan words, as in Skype [isˈkajpi] ~ [ˈskajpi]. These examples show that, in both native and loan words, the word-initial vowel may or may not be pronounced. Cases involving native words are assumed to have vowel deletion, whereas loan words are subject to epenthesis. In either case, the word-initial vowel-zero alternation would be transcribed by phonetic symbols as: [isC] ~ [sC].
Several works have pointed out that phonetic symbols may not adequately capture the content of segmental strings (Derwing 1992; Coleman 2002; Port 2007; 2011; Munson et al. 2010). Experimental investigation based on Laboratory Phonology principles provides a much more accurate description of segmental strings than discrete phonetic symbols, since it provides detailed information about segments (Scobbie et al. 2000; Albano 2017).
This paper posits the hypothesis that the segmental string in sC-clusters would differ in native and loan words, and that such a difference could be captured by the experimental analysis. It is suggested that the durational properties of high vowels and sibilants characterize phonetic detail, which plays an important role in the implementation of the vowel-zero alternation in sC-clusters in BP. Furthermore, this paper intends to contribute to the debate on the representation of word-initial sC-clusters (Kaye 1992; Harris 1994; Pierrehumbert 1994; Sanoudaki 2010; Goad 2011; Hermes, Mücke & Grice 2013; Polgárdi 2017; Freitas 2019).
This paper is organized as follows: section two discusses the representational nature of sC-clusters and motivates the experimental investigation. The third section presents the methodology. In the fourth section, the experimental results are presented in order to contribute to the debate on the role of phonetic detail in phonological representations. It also addresses the relevance of experimental investigation in Phonology.
sC-clusters posit an interesting problem to phonological theory regarding their syllabification. This is because they could be syllabified either as a tautosyllabic cluster, i.e., [sC], or as a heterosyllabic cluster: [s.C]. As noted by Zec (1995), Topintzi (2010) and Goad (2011), the representation of sC as a tautosyllabic cluster violates sonority constraints in branching onsets, thus being not appropriate. In most languages, word-internal sC-clusters are said to be heterosyllabic (Goad 2011). However, word-initially, the syllabification of sC-clusters as heterosyllabic has proven to be controversial.
Kaye (1992) is a seminal work on this topic. He claims that sC-clusters represent a heterosyllabic cluster, word-internally as well as word-initially. Word-internally, the nucleus which precedes the sibilant in sC-clusters may or may not be filled. Word-initially, an empty nucleus precedes the sC-clusters. He initially argues against the branching onset syllabification by showing that in Southern British English [j] does not occur after a tautosyllabic cluster – as in plumage or glue. So, if sC-clusters were branching onsets, then [j] would not be expected to occur after sC-clusters as in stupid or skew. However, [j] does occur after word-initial sC-clusters in Southern British English: st[j]upid and sk[j]ew.
As it is consensual that word-internal sC-clusters are heterosyllabic (Goad 2011), Kaye (1992) focuses on the interpretation of word-initial sC-clusters. He provides substantial evidence from Italian, European Portuguese and Ancient Greek for word-initial sC-clusters to be syllabified as heterosyllabic clusters. Further evidence for this claim is also provided for Italian (Hermes, Mücke & Grice 2013) and for French (Prince 2014).
European Portuguese (henceforth EP) provides evidence for word-initial sC-clusters to be heterosyllabic, as discussed by Kaye (1992). The negative prefix in EP is [in] when the word begins with a vowel: inacabado [inakaˈbadu] ‘unfinished’. When the word begins with a single consonant, or with a branching onset, the negative prefix is the nasal vowel [ĩ]: incapaz [ĩkaˈpaʃ] ‘incapable’ or intratável [ĩtɾaˈtavel] ‘unsociable’. Words that begin with sC-clusters in EP behave in the same way as those that begin with a vowel by receiving the negative prefix [in]: inesperado [inʃpeˈɾadu] ‘unexpected’. Notice that a word like esperado, formerly had an initial vowel in EP, which is not pronounced any longer: [ʃpeˈɾadu]. Although the initial vowel in esperado is not pronounced, it plays a role in the phonological representation of inesperado [inʃpeˈɾadu] ‘unexpected’, as it receives the prefix [in] which accompanies vowels.
The representation of sC-clusters in EP has been an interesting point of debate as the sibilant may be either alveopalatal or alveolar, depending on whether it is a native or a loan word (D’Andrade & Rodrigues 1999; Henriques 2012). Native words begin with an alveopalatal sibilant, as in esperado [ʃpeˈɾadu] ‘expected’, whereas loans begin with an alveolar sibilant, as in stress [ˈstɾɛs] or snob [ˈsnɔb] (D’Andrade & Rodrigues 1999). This indicates that word-initial sC-clusters, in native and loan words, differ in EP with regard to the initial sibilant.
In BP, word-initial sC-clusters also occur in native and loan words. Native words, such as in esperado [ispeˈɾadu] ‘expected’, are subject to weakening and loss of the word-initial high front vowel: [speˈɾadu]. Loan words, which present sC-clusters in BP, are said to have an epenthetic vowel word-initially to prevent an illicit cluster formed by a (sibilant + consonant). The epenthetic vowel would provide an acceptable syllable in BP: Skype as [isk]ype (Viana 1904; Said Ali 1931; Câmara Jr. 1970; Bisol 1999; Collischonn 2000; Collischonn & Schwindt 2005). Notice that both native and loan words may be pronounced with or without a word-initial vowel. A question to be posited is whether native and loan words are phonologically represented in like manner or differently in BP. This is one of the contributions of this paper.
Kaye’s (1992) proposal offers a categorical view as to whether a vowel occurs or not in word-initial sC-clusters. When the vowel occurs, the nucleus has phonetic content. When the vowel does not occur, an empty nucleus occurs, preceding the sibilant. Another theoretical perspective that also accounts for the expected outcome of vowel-zero alternation in sC-clusters in BP is that of Exemplar Models (Johnson 1997; Pierrehumbert 2001; 2002; Bybee 2001; 2002; 2006; 2013; Foulkes & Docherty 2006; Wedel 2006; Hanique, Aalders & Ernestus 2013; Ernestus 2014; Cristófaro Silva & Gomes 2017). From an Exemplar Model perspective, vowel weakening and loss is gradient, so that there might be temporal reorganization of segmental material. Exemplar Models also suggest that native and loan words have different representations as they are stored as different sets of exemplars, as will be discussed later. This paper considers how these two models account for the word-initial vowel-zero alternation in sC-clusters in BP. Following current tendencies for experimental investigation in Phonology, it was posited that there would be differences in the production of sC-clusters in native and loan words in BP, and that such differences might be experimentally captured. The next section presents the methodology adopted in this work.
A picture-naming experiment was designed to examine the segmental string sC-clusters. Experimental stimuli consisted of 30 words. Sibilants in sC-clusters were followed by a voiceless stop: [p, t, k].1 A set of fifteen items were selected from native and loan words, totalling 30 experimental stimuli, which were all nouns. Five stimuli were equally distributed for each of the stops [p, t, k] in native and loan words.
As loan words reflect an innovative pattern, which comprises few words, a set of 15 words were initially selected for this pattern based either on Houaiss (2009) or on the name trademarks or products that are widely available in Brazil. The other 15 words were native and chosen by their phonological similarity to the loans. The aim was to balance the words by the number of syllables and syllable structure as much as possible. The median value of the number of syllables for all of the 30 stimuli is 4 syllables. For native words, the mean value is 3.8 syllables (sd = 1.22; cv = 31.1) and, for loan words, the mean value is 3.93 syllables (sd = 1.74; cv = 45.8). Stimuli are presented in Table 1.
|Loan words||Native words|
The thirty experimental stimuli listed in Table 1 were presented along with thirty-five filler items, which aimed to disguise that the purpose of the experiment was to investigate sC-clusters. A set of five fillers was presented to all participants as a training phase to familiarize them with the experiment. The remaining 30 fillers consisted of BP native and loan words in equal measure. Stimuli were randomized for each participant using the sort_rand macro for Microsoft PowerPoint 2016.
Data was collected from twenty-four participants, 12 female and 12 male, Brazilian Portuguese native speakers from the Belo Horizonte metropolitan region, in the state of Minas Gerais, Brazil.2 Participants’ ages ranged from 18 to 28 years old (mean age: 22.5 yrs.). All participants were graduate or undergraduate students from the Federal University of Minas Gerais, and claimed not to have any hearing or speech disorders.
Participants were instructed to say the word that corresponded to the picture they were shown on a laptop screen. Audio recordings of speakers’ productions were obtained with a Zoom H4N recorder (sampling rate: 44,1 KHz; quantization: 16 bits; file format: WAV). Each slide was automatically presented on the screen for 2.5 seconds using Microsoft PowerPoint. Written stimuli were presented together with the picture in Arial font, size 88 pt. Picture stimuli consisted of figures of approximately 10 cm in size. Each stimulus was presented twice during the task, one immediately after the other. Initially, the slides displayed the picture and its corresponding written form. The second stage of the presentation consisted of pictures being presented without the written input. In this second stage, the stimuli contained the following instruction: “Say the word you just said again, out loud”. Consider Figure 1.
Data was collected in an acoustically treated room at CEFALA3 in July 2018. Every participant completed the task in less than 15 minutes. Participants signed a consent form authorising their data to be used for scientific purposes and stating that their participation in the experiment was voluntary. A total of 1,440 audio files were collected (24 participants × 30 stimuli × 2 repetitions). The samples were edited and annotated manually using Praat TextGrids (Boersma & Weenick 2019).
Results were extracted from audio files by a script adapted from Lennes (2002). Whether or not a vowel was present in sC-clusters was determined automatically by the identification of any periodic interval in the sound wave at the beginning of a word. When a vowel was attested, its durational value was registered. The duration of the sibilant was also automatically extracted by the script when an energy distribution in frequency bands ranging from 5 Hz to 8 Hz was attested. The end of the sibilant was determined by the beginning of the stop in the sC-cluster, which is characterized by lack of energy in the acoustic signal.
Graphical and statistical analyses were carried out using R software (R Core Team 2017), by employing mixed-effects models (Baayen, Davidson & Bates 2008) using the lme4 package (Bates et al. 2015). The statistical analysis is aligned with proposals presented by Oushiro (2017) and Godoy, Weissheimer & Mafra (2018). The best fitting models were selected to test the hypothesis that there would be differences in the production of sC-clusters in native and loan words in BP. In order to test vowel absence rates, a mixed-effect logistic regression model was fit, including the presence of the initial vowel as dependent variable, word origin (loan or native) as a fixed effect and the intercepts of lexical item and speaker as random effects. To test the duration of the initial vowel and of the sibilant, mixed-effect linear models were fit, including the initial vowel or the sibilant as a dependent variable, the word origin, the presence of the initial vowel (for sibilant analysis only) and the duration of the word as fixed effects, and the intercepts of lexical item and speaker as random effects. Nested models were compared using a likelihood-ratio test that respected a chi-squared distribution. For all statistical tests, the confidence interval was 95%, or p < 0.05.
Results showed that an initial vowel was not present in 57.4% of all sC-clusters, indicating that, although word-initial sC-clusters were formerly considered to be illicit in BP, they are now being manifested in more than half of the data examined. A word-initial high vowel was absent in 75.8% of cases in loan words, whereas a vowel did not occur in 39% of native words (χ2 = 56.835, p < 0.001).
Kaye’s (1992) representational model makes no prediction as to the different rates of vowels in native and loan words. This is because, in his model, the vowel would be either present or absent, categorically. Exemplar Models, on the other hand, argue that clusters of exemplars reflect experience with language. The general rate of exemplars without a word-initial vowel (57.4%) indicates a tendency of BP to favour word-initial sC-clusters. Exemplars without an initial vowel are more robust in loan words (75.8%) than in native ones (39%). This result suggests that loan words are more advanced than native words in accommodating word-initial sC-clusters without a vowel. Results also suggest that native and loan words may present different representations. This is because native words are represented in a set of exemplars distinct from loan words. This issue will be addressed later.
An important argument in Kaye’s (1992) proposal, is that although a vowel is not overtly present in sC-clusters in EP, it is as if it were, as the negative prefix in sC-clusters is [in], which occurs when a word begins with a vowel: inacabado [inakaˈbadu] ‘unfinished’ and inesperado [inʃpeˈɾadu] ‘unexpected’. If sC-clusters began with a consonant, the expected negative prefix would be [ĩ], as in incapaz [ĩkaˈpas] ‘incapable’ or intratável [ĩtɾaˈtavel] ‘unsociable’, which is not the case (cf. *[ĩʃpeˈɾadu]). Exemplar Models account for these facts by assuming that each word is lexically stored in clusters or sets of exemplars. Thus, in EP, words like esperado [ʃpeˈɾadu] ‘expected’ and inesperado [inʃpeˈɾadu] ‘unexpected’ are both lexically listed, each one associated with its set of exemplars. Generalizations about affixes emerge from relations formed from the semantic and phonetic similarity of sets of exemplars (Bybee 1995; 2001; 2010). A word like inesperado formerly had an initial vowel, and the negative prefix was [in]. As the initial vowel was not manifested any longer, the prefix [in] remained, occurring as it was, already part of the cluster of exemplars for inesperado. There would be no explanation for the suffix to be altered, so it remains as it formerly was.
BP offers further evidence for assuming that the prefix [in] is lexically listed. The word inesperado is pronounced as [inespeˈɾadu] in BP. As expected, the negative prefix [in] occurs for a word beginning with a vowel. Notice that, in this case, a vowel [e] follows the negative prefix, which is then followed by a sC-cluster. It is interesting to observe that, without the negative prefix, a pronunciation like *[espeˈɾadu], with an initial [e], does not occur in BP (Bisol 1981; Dubiela 2016), but rather a high vowel-zero alternation is observed: [ispeˈɾadu] ~ [speˈɾadu]. These facts show that the word esperado has had its initial vowel raised [e]sperado > [i]sperado and, more recently, the initial high vowel has undergone reductive processes: [is]perado > [s]perado. On the other hand, in the word in[e]sperado, the vowel which follows the prefix [in] remains as [e], without being raised to [i], or being cancelled. This indicates that, although esperado and inesperado are semantically and morphologically related, they are categorized in different sets of exemplars. Exemplar Models also account for the claim made by Kaye (1992) that word-initial sC-clusters behave as if they are preceded by a vowel. However, rather than positing an abstract word-initial empty nucleus, Exemplar Models posit that the distribution of the negative prefix follows from the organization of different sets of exemplars.
Considering the rates for vowel-zero alternation presented above, the next step was to test the hypothesis that there would be differences in the production of sC-clusters in native and loan words in BP, and that such a difference might be captured experimentally. In order to test this hypothesis, unstressed high vowel weakening and loss in BP has to be addressed (Lemle 1966). It is known in the literature that, in several languages, high vowels may devoice when adjacent to voiceless consonants (Dauer 1980; Tsuchida 1994; Fagyal & Moisset 1999; Kondo 2005; Andreeva & Koreman 2007; Chitoran & Iskarous 2008). This is also the case in BP (Bisol & Hora 1993; Leite 2006; Dias & Seara 2013; Nascimento 2017). A number of case studies have shown that unstressed high vowels undergo vowel weakening and loss in BP when adjacent to a voiceless sibilant: word-finally as in passe [ˈpasi] ~ [ˈpas] ‘pass’ (Meneses & Albano 2015); word-medially as in similar [simiˈlar] ~ [smiˈlar] ‘similar’ (Souza 2014) and word-initially as in estado [isˈtadu] ~ [ˈstadu] ‘state’ (Freitas 2019). Vowel weakening and loss are said to reflect gesture magnitude reduction followed by gestural overlap involving the high vowel and the sibilant, which is understood as gestural reconfiguration (Souza 2014; Meneses & Albano 2015). This view differs from previous ones which assumed that vowel deletion would take place, i.e., the vowel would be either present or absent. The experimental investigation will contrast the categorical and the gradient proposals for vowel deletion. If vowel deletion is categorical, it is expected that it has no impact on adjacent segments. The gradient view assumes that vowel deletion is gradual and may be experimentally observed. It is also known that segmental deletion may alter the segmental representation of adjacent sounds (Meneses & Albano 2015; Cristófaro Silva et al. 2017). Thus, in an Exemplar Model approach, it is expected that, when the high vowel is deleted, adjacent segments may be affected.
The experimental investigation considered two aspects: the duration of high vowels as well as of sibilants. It has been reported that epenthetic vowels – which are high vowels – are shorter than regular vowels in BP (Cristófaro Silva & Almeida 2008; Cantoni 2015). Thus, it is expected that the duration of high vowels will differ in native and loan words in sC-clusters. It is also expected that vowel weakening will be gradual, so that the duration of high vowels will be variable.
The second aspect to be experimentally examined was the duration of sibilants when a vowel occurs or not, within a single category, either in native or in loan words. It is expected that sibilants will be longer when the vowel is not present in sC-clusters. This is intended to address whether the segmental loss of the vowel has an impact on adjacent segments, i.e., the duration of the sibilant.
In order to test these hypotheses, the durational values of the vowel and the sibilant were considered for native and loan words.4 The proportional durations of vowels and sibilants in relation to the rest of the word, for native and loan words, were considered: the duration of the vowel (when it occurs), the duration of the sibilant, and the duration of the word. Table 2 presents the mean value for the each of the measurements for native and loan words, and whether or not a vowel was produced.
|vowel produced||word origin||vowel (ms)||sibilant (ms)||word (ms)|
Considering the results shown in Table 2, a ratio between the mean duration of the vowel and the sibilant vs. the mean duration of the whole word was calculated. The mean duration of the vowel and the sibilant was subtracted from the whole duration of the word, a value which was referred to as the “rest of the word”. The final result is the mean percentage that the vowel and the sibilant occupy in the word, as show in Figure 2.
The two top bars in Figure 2 report cases where the vowel was produced, and the two bottom bars present results when the vowel was not produced, in loan and native words respectively. The result expressed in the two top bars addresses the first experimental aspect to be investigated, showing that, when an initial vowel occurs, it is shorter in loan words (7.1%) than in native ones (9.7%). Thus, the duration of the vowel is different in cases of epenthesis and vowel deletion (χ2 = 18.73, p < 0.01). This result corroborates previous findings in the literature which found that epenthetic vowels are shorter than regular vowels in BP. Although vowel lengthening is not a contrastive property of BP phonology (Azevedo 2005), it can be posited that vowel duration plays a role in characterising the phonetic detail in word-initial sC-clusters in loan and native words in BP. Although vowel duration is a sub-phonemic property in BP, results show that shorter vowels occur in loan words, and longer vowels occur in native ones.
The representational model presented in Kaye (1992) cannot account for the patterns observed for vowel duration in sC-clusters, as sub-phonemic properties are not part of phonological representations in his model. On the other hand, Exemplar Models, which include redundant and variable properties as part of phonological representations, account for the experimental results that were just presented. The sub-phonemic property of vowel duration, where high vowels in loan words are shorter than in native words, offers evidence for sC-clusters in loan words to be grouped in a set of exemplars, which is different from the set of exemplars for native words. Within Exemplar Models approaches, vowel duration characterizes the fine phonetic detail related to high vowels in word-initial sC-clusters in BP. Thus, Exemplar Models account for the different patterns of vowel duration in sC-clusters in loan and native words. Exemplar Models also explain why loan words present a higher rate of vowel deletion (75,8%) than native words (39%). Loan words present short high vowels, which are more prone to devoicing, weakening and deletion than unstressed regular high vowels, which occur in native words (Cristófaro Silva & Almeida 2008; Cantoni 2015). The gestural reconfiguration that yields to vowel deletion is implemented at higher rates in loan words as the vowel is short. In native words, lower rates of vowel deletion are observed as the vowels subject to reductive processes are longer. Furthermore, as more and more words present a word-initial sibilant, i.e., the high vowel does not occur, Exemplar Models predict that more robust exemplar clusters are formed, which explains the general tendency of BP to favour word-initial sC-clusters (57.4%).
A comparison of sC-clusters in Brazilian and European Portuguese shows that different solutions were found to phonologically accommodate word-initial sC-clusters in each of these varieties. In EP the sibilant in loan words is an alveolar fricative, as in stress [ˈstɾɛs] or snob [ˈsnɔb], whereas in native words, an initial alveopalatal fricative occurs, as in esperado [ʃpeˈɾadu] ‘expected’ (D’Andrade & Rodrigues 1999). Alveolar and alveopalatal sibilants are in contrast in Portuguese, as in assa [ˈasə] ‘to bake’ and acha [ˈaʃə] ‘to find’. Therefore, the phonological solution to accommodate word-initial sC-clusters in EP involves phonemic differences.
Brazilian Portuguese also accommodates sC-clusters phonologically in different ways, and such a difference is expressed by the durational properties of the initial vowel in loan and native words, as show in the top two bars in Figure 2. This result indicates that not only contrastive properties of sounds are phonologically represented, but also that sub-phonemic properties captured by the experimental investigation may characterize phonetic detail that is relevant in shaping phonological representations, as predicted by Exemplar Models (Bybee 2001; Johnson 2007; Bermúdez-Otero 2007; Cole & Hualde 2011). The facts discussed above indicate that Exemplar Models account for the experimental results reported for vowel duration, whereas the representational model presented in Kaye (1992) does not.
Results presented in Figure 3 also show that sibilants in loan words have a longer duration when a vowel does not occur (16.8%), than when a vowel is produced (14.2%). In like manner, in native words, sibilants have a greater duration when a vowel is not produced (16.9%), than when a vowel is produced (13.9%). Thus, sibilants in sC-clusters are longer when the vowel is not produced (16.8% and 16.9%), than when the initial vowel is produced (14.2% and 13.9%), and that such a difference is statistically significant (χ2 = 110, p < 0.01 for loan words; χ2 = 123, p < 0.01 for native words). This result addresses the second aspect investigated experimentally, showing that sibilants are longer when a vowel does not occur.
Both Kaye (1992) and Exemplar Models account for the lengthening of the sibilant when the word-initial vowel is not manifested. Within Kaye’s (1992) model, the sibilant will occupy its original skeletal position and the nuclear position to its right. Thus, the sibilant occupies two skeletal positions (Polgárdi 2017), which can be interpreted as a long sibilant.
In Exemplar Models, the lengthening of the sibilant, which occurs when the vowel is not manifested, is accounted for by the gestural reconfiguration involving the weakened high vowel and the adjacent sibilant in sC-clusters (Meneses & Albano 2015). Gestural reconfiguration is attested in several other contexts in BP, and it is also observed in other languages (Dauer 1980; Tsuchida 1994; Fagyal & Moisset 1999; Kondo 2005; Andreeva & Koreman 2007; Chitoran & Iskarous 2008). Within Exemplar Models the gestural reconfiguration characterizes phonetic detail, which can be modelled in different sets of exemplars for loan and native words. Consider Figure 3.5
Figure 3 illustrates sets of exemplars for sC-clusters, where the diagram on the left represents exemplars for loan words and the diagram on the right represents a set of exemplars for native words. The vertical axis indicates the robustness of exemplars, where darker shades show more robust exemplars, i.e., exemplars with a high rate of experienced cases. Lighter shades reflect exemplars which have been experienced at lower rates, thus are less robust. The horizontal axis indicates segmental duration. Each segment was associated with a specific box, which appears to characterize discrete segments. In fact, this is a limitation of the graphic design, as the boxes can be understood as a continuum, which reflects the speech chain, where gestural reconfiguration of segmental material interacts dynamically (Meneses & Albano 2015).
In both diagrams in Figure 3, a high vowel is present in the top rows and no vowel is present in the bottom one. This expresses that the word-initial high vowel may or may not be manifested. The exemplars in between the top and bottom rows show the gradual weakening of the high vowel. The high vowel in loan words on the left present a shorter duration than the high vowel in native words on the right. This intends to capture the experimental result that epenthetic vowels that occur in loan words are shorter than regular vowels that occur in native words.
The diagrams in Figure 3 also show that a high front vowel is present at higher rates in native words than in loans. This is expressed by the grey shading, where light grey indicates a less robust pattern. The grey shades are intended to reflect the finding that word-initial sC-clusters are favoured in loan words (75.8%), when compared to native ones (39%). The diagram in Figure 5 also expresses that the sibilant has a longer duration when the vowel is not present – as can be seen in the bottom row – than when the vowel is present (cf. previous rows compared to the bottom one). The experimental results, together with the representation in Figure 5, indicate that Exemplar Models offer a comprehensive explanation for vowel-zero alternation in sC-clusters in BP.
Although Kaye’s (1992) model accounts for the sibilants being longer when a vowel it not present, it fails in explaining why the vowel is not manifested and why loan and native words present statistically significantly different rates of word-initial vowel deletion. Exemplar Models account for the different rates in the lack of vowels in sC-clusters, as loan and native words are grouped in different sets of independently organized exemplars, whose specific representations include phonetic detail regarding segmental duration’. Exemplar Models also accommodate experimental results, by explaining why high vowel reduction and loss occurs as a consequence of gestural reconfiguration, which is gradually implemented. Finally, Exemplar Models explain the lengthening of the sibilant by the interaction of the high vowel and adjacent sibilant.
Representational models, as the one proposed by Kaye (1992), contributed enormously to the development of Phonology. Besides the innovative representation in several multi-layered tiers, which provided a visual representation for phonological processes, such models contributed significantly to the expansion of knowledge of syllable typology, as well as to the understanding of the relationship between segments and prosody. However, a major limitation of representational models is that only discrete phonological segments, or empty skeletal positions, are lexically represented. In representational models, allophonic variation is derived as part of phonological processes, so that sub-phonemic, or fine phonetic detail, is not part of phonological representations.
Exemplar Models provide accounts for these limitations by assuming that detailed information is part of cognitive representations. In phonological terms, this can be translated as the incorporation of fine phonetic detail in phonological representations, as well as the lexical implementation of phonological phenomena. A prediction which follows from this view is that variation is intrinsic to language, where Phonetics and Phonology interact closely. Sets of exemplars have a dynamic organization, so that emergent phonological phenomena evolve gradually towards a specific direction, as words adopt the new pattern. In the case study presented in this paper, the emergent pattern is a word-initial (sibilant + consonant) sequence which occurs in native and loan words in BP. Variation involves whether or not a vowel is manifested in sC-clusters. The pattern with word-initial sC-clusters evolves in the lexicon, displaying phonetic gradual implementation, which can be captured by the experimental analysis presented.
An interesting aspect of experimental investigation in phonology is related to the presentation of results. Phonetics and phonology typically use discrete symbols to report results (Derwing 1992; Coleman 2002; Port 2007; 2011; Munson et al. 2010). Phonetic detail is typically reported in Phonetics, but not in Phonology. The different durational properties of vowels in loan and native words, as well as the lengthening of the sibilant when the vowel does not occur, cannot be expressed by IPA symbols, unless diacritics are used, which is not expected in phonological representations. Thus, IPA symbols do not characterize the phonological properties of vowel-zero alternation in sC-clusters reported in this paper. The full understanding of experimental results usually requires a model where Phonetics and Phonology are integrated (Demolin 2005). The trend towards experimental approaches in Laboratory Phonology suggests a new way not only to methodological design, but also to the presentation of results in graphs or numerical tables. A representational proposal for Exemplar Models is proposed in this paper. Like any preceding theoretical approach, Exemplar Models will uncover interesting and unknown facts about the sound systems of languages.
The investigation presented in this paper contributes to a better understanding of vowel-zero alternation in word-initial sC-clusters in BP. Vowel-zero alternation is an optional phenomenon in BP, where different rates of word-initial sibilants are attested in native (39%) and loan words (75.8%). The duration of sC-clusters was experimentally investigated. When the initial high vowel occurs, it is shorter in loan words than in native words. Results also showed that vowel weakening and loss is variable, which indicates that it is gradually implemented. Additionally, experimental results showed that sibilants are longer when a high vowel does not occur, rather than when it does.
Kaye’s (1992) representational model and Exemplar Models were considered to account for the experimental findings. Kaye’s (1992) representational model makes no prediction as to the different rates of vowel deletion. In his model, the vowel is either present or absent, which suggests that vowels would have a fixed duration. His model does not explain either why the vowel is not manifested. Exemplar Models, on the other hand, account for the higher rate of word-initial sibilants in loan words (75.8%) than native ones (39%), as the short duration of high vowels in loan words are more prone to weakening and loss, providing more robust exemplars without a word-initial vowel. Experimental results also showed that the duration of high vowels is variable, not fixed, which captures the gradual implementation of vowel-zero alternation, as predicted by Exemplar Models. Exemplar Models also account for why a high vowel is not manifested in word-initial sC-clusters, by suggesting that gestural reconfiguration takes place between the high vowel and the adjacent sibilant.
Kaye’s (1992) representational model accounts for the lengthening of sibilants when the vowel is not manifested, as a sibilant would occupy its position, as well as the position which was formerly occupied by the vowel. However, Kaye’s analysis fails in providing an explanation as to why sibilants are lengthened. Based on Exemplar Models, the proposal presented in this paper explains the lengthening of the sibilant by gestural reconfiguration involving the high vowel and the sibilant.
The durational properties of high vowels and sibilants discussed in this paper characterize phonetic detail, which plays an important role in the implementation of the vowel-zero alternation in sC-clusters in BP. Thus, sub-phonemic properties captured by the experimental investigation may characterize phonetic detail that is relevant in shaping phonological representations, as predicted by Exemplar Models (Bybee 2001; Johnson 2007; Bermúdez-Otero 2007; Cole & Hualde 2011). Additionally, results reported in this paper support the view that discrete phonetic symbols do not provide an accurate description of segmental strings (Derwing 1992; Coleman 2002; Port 2007; 2011; Munson et al. 2010).
A number of issues could be investigated in future studies, such as examining the lexical implementation of the vowel-zero alternation in SC-clusters, where word frequency and phonological similarity effects would be taken into consideration. Unfortunately, the present study was not methodologically designed to explore these issues.
It will also be pertinent to examine to what extent the phenomenon investigated in this paper will apply when the sibilant is voiced in loan words, as in slide [izˈlajʤi] ‘slide (presentation)’, and native words, as in esmola [izˈmɔla] ‘alms’. It is expected that the gestural reconfiguration involving the high vowel and sibilant will be less favoured when the sibilant and adjacent consonant are voiced. This will be in line with findings in the literature that high vowel devoicing is favoured when adjacent to voiceless segments (Kondo 2005; Leite 2006; Dias & Seara 2013; Souza 2014; Meneses & Albano 2015; Nascimento 2017).
As reported in the methodology, the stimuli were presented to participants with and without their corresponding orthographic form. Freitas’s (2019) results showed that whether or not the orthographic input was presented, it did not affect the production of sC-clusters in loan (χ2 = 0.96, p = 0.62) and native words (χ2 = 1.13, p = 0.57). Future work could consider in detail the role of orthography in the production of sC-clusters.
Future work could also consider the gestural reconfiguration of high vowel and sibilants in other varieties of BP in the several environments in which it takes place: word-finally as in passe [ˈpasi] ~ [ˈpas] ‘pass’ (Meneses & Albano 2015); word-medially as in similar [simiˈlar] ~ [smiˈlar] ‘similar’ (Souza 2014) and word-initially as in estado [isˈtadu] ~ [ˈstadu] ‘state’. It is expected that similar results will be found, although the lexical implementation may be different in other varieties.
This paper showed the relevance of experimental investigation in Phonology, as suggested by Laboratory Phonology approaches (Pierrehumbert, Beckman & Ladd 2000; Cohn, Fougeron & Huffman 2012; Albano 2017). Finally, the findings of this paper provide evidence for the role of phonetic detail in shaping phonological representations, as suggested by Exemplar Models (Bybee 2001; 2013; Hawkins 2003; Harris 2007; Bermúdez-Otero 2007; Cole & Hualde 2011).
1In BP in general, and specifically in the community used for the study, the sibilant is always pronounced as an alveolar fricative in a postvocalic or coda position. In some areas of Brazil, such as Rio de Janeiro or some of the Northern states, a sibilant may be alveopalatal in heterosyllabic clusters (Cristófaro Silva 2013). Voice agreement is also observed depending on the consonant that follows the sibilant. It was therefore decided that the experiment would be limited to just voiceless sibilants.
4The durational values of the plosives [p, t, k] were not considered in this paper due to methodological aspects. Cristófaro Silva et al. (2019) listed VOT values for voiceless plosives in BP, which were obtained in several works, showing a great variation in values: [p] 11 ms – 37.67 ms; [t]: 15 ms – 36.52 ms and [k]: 35 ms – 58.05 ms. Recent works showed an emergent aspiration in voiceless plosives in BP (Alves and Zimmer 2015; Kupske 2017). Furthermore, none of the works mentioned above refer to the BP variety reported in this paper. Considering these facts, the investigation was constrained to the initial high vowel and the sibilant in sC-clusters.
We are grateful to the editors for the organization of the present volume. We would like to thank the reviewers for their comments and suggestions which did contribute to improving the ideas presented in this paper. Thaïs Cristófaro Silva acknowledges funding from CNPq (PQ-309219/2016-7) and FAPEMIG (PPM-X-00702-18). Matheus Freitas acknowledges funding from CAPES (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior) Funding code: 001.
The authors have no competing interests to declare.
Alves, U., & Zimmer, M. (2015). Percepção e produção dos padrões de VOT do inglês por aprendizes brasileiros: O papel de múltiplas pistas acústicas sob uma perspectiva dinâmica [Perception and production of English VOT patterns by Brazilian learners: The role of multiple acoustic cues in a DST perspective]. Alfa: Revista de Linguística, 59, 155–175. DOI: https://doi.org/10.1590/1981-5794-1502-7
Andreeva, B., & Koreman, J. (2007). The status of vowel devoicing in Bulgarian: Phonetic or phonological? In G. Zybatow, L. Szucsich, U. Junghanns, & Roland Meyer (Eds.), Formal Description of Slavic Languages: The Fifth Conference (FDSL5) (pp. 81–91). Frankfurt: Peter Lang.
Azevedo, M. (2005). Portuguese: A Linguistic Introduction. Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511755057
Baayen, R., Davidson, D., & Bates, M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language, 59, 390–412. DOI: https://doi.org/10.1016/j.jml.2007.12.005
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software, 67(1), 1–48. DOI: https://doi.org/10.18637/jss.v067.i01
Bermúdez-Otero, R. (2007). Diachronic phonology. In P. de Lacy (Ed.), The Cambridge handbook of phonology (pp. 497–517). Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511486371.022
Boersma, P., & Weenink, D. (2019). Praat: Doing phonetics by computer [Computer program]. Version 5.4.08, retrieved March 24, 2015 from http://www.praat.org/.
Bybee, J. (1995). Regular morphology and the lexicon. Language and Cognitive Processes, 10(5), 425–455. DOI: https://doi.org/10.1080/01690969508407111
Bybee, J. (2001). Phonology and language use. Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511612886
Bybee, J. (2002). Phonological evidence for exemplar storage of multiword sequences. Studies in Second Language Acquisition, 24(2), 215–221. DOI: https://doi.org/10.1017/S0272263102002061
Bybee, J. (2006). From usage to grammar: The mind’s response to repetition. Language, 82(4). 711–733. DOI: https://doi.org/10.1353/lan.2006.0186
Bybee, J. (2010). Language, Usage and Cognition. Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511750526
Bybee, J. (2013). Usage-based Theory and Exemplar Representations of Constructions. In T. Hoffmann & G. Trousdale (Eds.), The Oxford Handbook of Construction Grammar (pp. 49–69). Oxford: Oxford University Press. DOI: https://doi.org/10.1093/oxfordhb/9780195396683.013.0004
Cantoni, M. M. (2015). A epêntese no português brasileiro em uma perspectiva multirrepresentacional [A multirepresentational approach to epenthesis in Brazilian Portuguese]. Gragoatá, 1(38), 231–246. DOI: https://doi.org/10.22409/gragoata.v20i38.33308
Chitoran, I., & Iskarous, K. (2008). Acoustic evidence for high vowel devoicing. In R. Sock, S. Fuchs & Y. Laprie (Eds.), Proceedings of the 8th International Seminar on Speech Production, Strasbourg, France, 8–12 December 2008 (pp. 93–96). Strasbourg: INRIA.
Cohn, A. C., Fougeron, C., & Huffman, M. K. (Orgs.). (2012). The Oxford Handbook of Laboratory Phonology. Oxford: Oxford University Press. DOI: https://doi.org/10.1093/oxfordhb/9780199575039.001.0001
Cole, J., & Hualde, J. (2011). Underlying representations. In M. van Oostendorp, C. J. Ewen, E. Hume & K. Rice (Eds.), Companion to Phonology (pp. 1–26), Wiley-Blackwell. DOI: https://doi.org/10.1002/9781444335262.wbctp0001
Collischonn, G. (2000). A epêntese vocálica no português do sul do Brasil: Análise variacionista e tratamento pela Teoria da Otimalidade [Vowel epenthesis in Southern Brazilian Portuguese: Variationist and Optimality Theory analyses]. Letras de Hoje, 35(1), 285–318.
Collischonn, G., & Schwindt, L. (2005). Considerações sobre a sequência /sC/ inicial em português brasileiro [Remarks on the initial /sC/ sequence in Brazilian Portuguese]. Revista Lingua(gem), 2(2), 249–266.
Cristófaro Silva, T. (2013). Fonética e fonologia do português: Roteiros de estudos e guias de exercícios [Portuguese phonetics and phonology: Study guides and guidelines for excersises]. São Paulo: Editora Contexto.
Cristófaro Silva, T., & Almeida, L. S. (2008). On the nature of epenthetic vowels. In L. Bisol, & C. Brescancini (Eds.), Contemporary Phonology in Brazil (pp. 193–212). Cambridge: Cambridge University Press.
Cristófaro Silva, T., Oliveira, N. A., Cantoni, M., & Seara, I. (2017). Segmental loss and phonological representation. In G. Lindsey & A. Nevins (Org.), Sonic Signatures (pp. 215–230). Amsterdam/Philadelphia: John Benjamins Publishing Company. DOI: https://doi.org/10.1075/lfab.14.c13
D’Andrade, E., & Rodrigues, M. C. (1999). Das Escolas e das Culturas: História de uma sequência consoântica [From schools and cultures: History of a consonantal sequence]. In A. M. Lopes & C. Martins (Eds.), Actas do XIV Encontro Nacional da Associação Portuguesa de Linguística (pp. 117–133). Lisboa: APL/Colibri.
Dauer, R. (1980). The reduction of unstressed high vowels in modern Greek. Journal of the International Phonetic Association, 10(1/2), 17–27. DOI: https://doi.org/10.1017/S0025100300002073
Derwing, B. (1992). Orthographic aspects of phonological competence. In P. Downing, S. Lima & M. Noonan (Eds.), The linguistics of literacy (pp. 193–210). Amsterdam: John Benjamins. DOI: https://doi.org/10.1075/tsl.21.16der
Dias, E., & Seara, I. (2013). Redução e apagamento de vogais átonas finais na fala de crianças e adultos de Florianópolis: Uma análise acústica [Word-final unstressed vowel reduction on children and adult speech from Florianópolis: An acoustic analysis]. Letrônica, 6(1), 71–93.
Dubiela, M. (2016). Vogal Frontal Átona Final Produzida por Falantes de Curitiba: Subsídios para Abordagem Dinâmica dos Sons da Fala [Final Unstressed Front Vowel Produced by Speakers from Curitiba: Evidence for a Dynamic Approach of Speech Sounds] (Unpublished master’s dissertation). Curitiba: Universidade Federal do Paraná.
Ernestus, M. (2014). Acoustic reduction and the roles of abstractions and exemplars in speech processing. Lingua, 142, 27–41. DOI: https://doi.org/10.1016/j.lingua.2012.12.006
Foulkes, P., & Docherty, G. (2006). The social life of phonetics and phonology. Journal of Phonetics, 24(4), 409–438. DOI: https://doi.org/10.1016/j.wocn.2005.08.002
Freitas, M. (2019). A redução segmental em sequências #(i)sC no português brasileiro [Segmental reduction in #(i)sC sequences in Brazilian Portuguese] (Unpublished master’s dissertation). Belo Horizonte: Universidade Federal de Minas Gerais.
Goad, H. (2011). The Representation of sC Clusters. In M. Oostendorp, C. J. Ewen, E. Humen & K. Rice (Eds.), The Blackwell Companion to Phonology (pp. 898–923). John Wiley & Sons. DOI: https://doi.org/10.1002/9781444335262.wbctp0038
Godoy, M., Weissheimer, J., & Mafra, M. (2018). When Grammar Meets Pragmatics: Subject Preference and Coherence Relations in Brazilian Portuguese Pronoun Interpretation. Journal of Portuguese Linguistics, 17(1), 6. DOI: https://doi.org/10.5334/jpl.197
Hanique, I., Aalders, E., & Ernestus, M. (2013). How robust are exemplar effects in word comprehension? The Mental Lexicon, 8(3), 269–294. DOI: https://doi.org/10.1075/ml.8.3.01han
Harris, J. (2007). Representation. In P. Lacy (Ed.), The Cambridge Handbook of Phonology (pp. 119–138). Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511486371.007
Hawkins, S. (2003). Roles and representations of systematic fine phonetic detail in speech understanding. Journal of Phonetics, 31(3–4), 373–405. DOI: https://doi.org/10.1016/j.wocn.2003.09.006
Henriques, I. (2012). A Fricativa Coronal /S/ em /#(∅)SC/ em Português Europeu [The coronal fricative /S/ in /#(∅)SC/ in European Portuguese] (Unpublished doctorate’s thesis). Porto: Universidade do Porto.
Hermes, A., Mücke, D., & Grice M. (2013). Gestural coordination of Italian word-initial clusters: The case of ‘impure s’. Phonology, 30(1), 1–25. DOI: https://doi.org/10.1017/S095267571300002X
Houaiss, A., & Villar, M. S. (2009). Dicionário Houaiss de Língua Portuguesa [Houaiss Dictionary of the Portuguese Language]. Elaborado pelo Instituto Antônio Houaiss de Lexicografia e Banco de Dados da Língua Portuguesa S/C Ltda. Rio de Janeiro: Objetiva.
Johnson, K. (1997). Speech perception without speaker normalization: An exemplar model. In K. Johnson & J. Mullenix (Eds.), Talker Variability in Speech Processing (pp. 145–165). San Diego: Academic Press.
Kondo, M. (2005). Syllable structure and its acoustic effects on vowels in devoicing environments. In J. Weijer, K. Nanjo & T. Nishihara (Eds.), Voicing in Japanese (pp. 32–37). De Gruyter Mouton. DOI: https://doi.org/10.1515/9783110197686.2.229
Kupske, F. (2017). Efeitos do contato entre categorias fonéticas distintas em contextos de imigração: Uma revisão sobre o atrito de língua materna [Effects of distinct phonetic categories contact in immigration contexts: A revision on native language attrition]. Gragoatá, 22(42), 85–106. DOI: https://doi.org/10.22409/gragoata.2017n42a903
Leite, C. (2006). Seqüências de (oclusiva alveolar + sibilante alveolar) como um padrão inovador no português de Belo Horizonte [(Alveolar stop + alveolar sibilant) sequences as an innovative pattern in the Portuguese spoken in Belo Horizonte]. (Unpublished master’s dissertation). Belo Horizonte: Universidade Federal de Minas Gerais.
Lennes, M. (2002). Calculate segment durations. Retrieved September 8, 2018 from http://www.helsinki.fi/~lennes/praat-scripts/public/calculate_segment_durations.praat/.
Meneses, F., & Albano, E. (2015). From Reduction to Apocope: Final Poststressed Vowel Devoicing in Brazilian Portuguese. Phonetica, 72, 121–137. DOI: https://doi.org/10.1159/000439599
Munson, B., Edwards, J., Schellinger, S., Beckman, M. E., & Meter, M. (2010). Deconstructing phonetic transcription: Language specificity, covert contrast, perceptual, bias, and an extraterrestrial view of vox humana. Clinical Linguistics and Phonetics, 24(4–5), 245–260. DOI: https://doi.org/10.3109/02699200903532524
Nascimento, K. (2017). Emergência de padrões silábicos no português brasileiro e seus reflexos no inglês língua estrangeira [Syllabic patterns emergency in Brazilian Portuguese and their impact on English as a foreign language]. (Unpublished PhD’s thesis). Universidade Estadual do Ceará, Fortaleza.
Oushiro, L. (2017). Introdução à Estatística para Linguistas [Introduction to Statistics for Linguists]. Material for the course LL900 Seminário de Linguística I (2018/1st semester) – PPG – Linguística – IEL/UNICAMP. DOI: https://doi.org/10.5281/zenodo.822069
Pierrehumbert, J. (1994). Syllable Structure and Word Structure: A Study of Triconsonantal Clusters in English. In P. A. Keating (Ed.), Papers in Laboratory Phonology III (pp. 168–188). Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511659461.011
Pierrehumbert, J. (2001). Exemplar dynamics: Word frequency, lenition and contrast. In J. Bybee & P. Hooper (Eds.), Frequency and the emergency of linguistic structure (pp. 137–157). Amsterdam: John Benjamins. DOI: https://doi.org/10.1075/tsl.45.08pie
Pierrehumbert, J. (2002). Word-specific phonetics. In C. Gussenhoven & N. Warner (Eds.), Laboratory Phonology VII (pp. 101–139). Berlin: Mouton de Gruyter. DOI: https://doi.org/10.1515/9783110197105.101
Pierrehumbert, J., Beckman, M., & Ladd, D. R. (2000). Conceptual Foundations of Phonology as a Laboratory Science. In N. Burton-Roberts, P. Carr & G. Docherty (Eds.), Phonological Knowledge (pp. 273–303), Oxford: Oxford University Press.
Polgárdi, K. (2017). The representation of sC-clusters: A CV analysis. In K. Bercés (Ed.), Proceedings of the Government Phonology Roundtable 2017, Budapest, Hungary, 18 November 2017. Budapest: Pázmány Péter Catholic University.
Port, R. F. (2007). How are words stored in memory? Beyond phones and phonemes. New Ideas in Psychology, 25(1), 143–170. DOI: https://doi.org/10.1016/j.newideapsych.2007.02.001
Sanoudaki, E. (2010). Towards a Typology of Word-initial Consonant Clusters: Evidence from the Acquisition of Greek. Journal of Greek Linguistics, 10, 74–114. DOI: https://doi.org/10.1163/156658410X495826
Souza, R. N. (2014). The emergence of syllable structure? Data from gradient vowel reduction in Brazilian Portuguese. In B. Anible, K. Beers, L. Hirrel, & D. Wager (Eds.), Proceedings of the tenth High Desert Linguistics Society Conference. (pp. 103–117). Albuquerque: High Desert Linguistics Society.
Topintzi, N. (2010). Onsets: Suprasegmental and prosodic behaviour. Cambridge Studies in Linguistics 125. Cambridge: Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511750700
Viana, A. G. (1904). Ortografia Nacional. Simplificação e uniformização das ortografias portuguesas [National Orthography. Simplification and standardization of Portuguese orthography]. Lisboa: Livraria Editora Viúva Tavares Cardoso.
Wedel, A. (2006). Exemplar models, evolution and language change. The Linguistic Review, 23, 247–274. DOI: https://doi.org/10.1515/TLR.2006.010
Zec, D. (1995). Sonority Constraints on Syllable Structure. Phonology, 12(1), 85–129. DOI: https://doi.org/10.1017/S0952675700002396