1. Introduction

It is well-known that infants have an attentional bias for faces (or face-like objects) since a very early stage of their life (e.g., Di Giorgio, Turati, Altoè & Simion, 2012; Gliga, Elsabbagh, Andravizou & Johnson, 2009; Johnson, Dziurawiec, Ellis & Morton, 1991), and that this bias becomes more robust with development (e.g., Amso, Haas & Markant, 2014; Frank, Vul & Johnson, 2009; Leppänen, 2016). Moreover, this bias is informative on infants’ social and language development, as its absence is considered a predictor for developmental disorders (Annaz, Karmiloff-Smith, Johnson & Thomas, 2009; Åsberg Johnels, Gillberg, Falck-Ytter & Miniscalco, 2014; Irwin & Brancazio, 2014, inter alia). Visual contributions to speech perception are well documented in literature, suggesting a crucial role for visual information in typical language development (e.g., Chandrasekaran, Trubanova, Stillittano, Caplier & Ghazanfar, 2009; Rosenblum, Schmuckler & Johnson, 1997; inter alia). By 2–4 months of age, infants can use visual information to discriminate vowels (Kuhl & Meltzoff, 1982; Patterson & Werker, 2003), consonants (Pons, Lewkowicz, Soto-Faraco & Sebastián-Gallés, 2009), and CVCV sequences with equal stress on both syllables (MacKain, Studdert-Kennedy, Spieker & Stern, 1983), i.e., they are able to match the heard and seen sounds, looking at women’s faces.

Some studies show that language (un)familiarity and age also play a crucial role in the use of visual cues to language discrimination. For instance, Weikum, Vouloumanos, Navarra, Soto-Faraco, Sebastián-Gallés & Werker (2007) report that monolingual infants can visually discriminate their native language (English) from an unfamiliar one (French) by 4–6 months, but not by 8 months, when exposed to silent talking faces. English-French bilingual infants, by contrast, still use visual information for language discrimination at 8 months (Weikum et al., 2007). However, by analyzing Spanish and Catalan monolingual and bilingual infants exposed to English and French for the first time, Sebastián-Gallés, Albareda-Castellot, Weikum & Werker (2012) conclude that language (un)familiarity is not crucial to discriminate a language, as Spanish-Catalan bilinguals behave similarly to English-French bilinguals at 8 months, even when exposed to two unknown languages.

By contrast, selective attention has been shown to predictably change with language familiarity and age (e.g., Atkinson & Braddick, 2012; Munhall & Johnson, 2012; Tenenbaum, Shah, Sobel, Malle & Morgan, 2013). It has been observed that children’s preference for a given face region varies along their developmental path and that it also depends on how familiar they are with the language they are being exposed to. Tsang, Atagi and Johnson (2018) show that, although the attention to the talker’s mouth increases with age, it is also associated with monolingual and bilingual infants’ linguistic development, namely, their concurrent expressive language skills. This relation between infants’ attention to the mouth (speech perception) and production abilities has also been explored in order to observe predictive language development and clinical outcomes (Tenenbaum, Sobel, Sheinkopf, Malle & Morgan, 2015; Young, Merin, Rogers & Ozonoff, 2009).

It has been found that bilinguals attend more and earlier to the mouth than monolinguals, indicating the need of an increased support from audiovisual cues to speech perception (e.g., Ayneto & Sebastián-Gallés, 2016; Fort, Ayneto-Gimeno, Escrichs & Sebastián-Gallés, 2017; Pons, Bosch & Lewkowicz, 2015). However, language proximity/distance was also shown to play an important role in selective attention to a talker’s mouth. Indeed, it was observed that, independently of age (4–6 months and 15 months), close-language bilinguals (Spanish-Catalan) attend more to the talker’s face than distant-language bilinguals (Spanish-other) (Birulés, Bosch, Brieke, Pons & Lewkowicz, 2018). A similar suggestion is raised by Pejovic (2019) and Pejovic, Yee & Molnar (in progress) for Spanish-Basque bilingual infants. Along similar lines, Morin-Lessard, Poulin-Dubois, Segalowitz & Byers-Heinlein (2019) observed that the developmental pattern of bilinguals and monolinguals may be similar, pointing to the possible effect of language proximity/distance which might modulate the impact of bilingualism on selective attention.

Most notably, Lewkowicz & Hansen-Tift (2012), exploring different face regions, showed that audiovisual speech processing in infants is characterized by a shift from attending more to the eyes to an increased attention to the mouth at 8 months of age, coinciding with the emergence of speech production (i.e., babbling). This increased preference for the mouth has been interpreted as infants’ ability to explore audiovisual cues in the mouth area, thus taking advantage of articulatory information to acquire their native language. This ability is also observed when a speaker’s face movements are not in synchrony with the auditory vocal production (e.g., Hillairet de Boisferon, Tift, Minar & Lewkowicz, 2017; Lewkowicz, 2010; Pons & Lewkowicz, 2014).

Overall, these findings suggest that eye gaze patterns relate to language development. More attention to the mouth seems to indicate ongoing acquisition, as in the case of the eyes to mouth shift and its relation to emerging articulatory abilities, and/or increased processing effort, as in the case of the earlier shift in bilingual infants, the different findings for close-language bilinguals and distant-language bilinguals, or the presence of asynchronous audiovisual cues. Along these lines, less attention to the mouth in same age infants could possibly indicate a more advanced learning stage, where some relevant aspect of language at that age is already acquired. This would parallel findings in motor brain responses to speech perception. It has been reported that motor areas get particularly involved during the perception of non-native speech compared to native speech, when the later has already been acquired (Kuhl, Ramírez, Bosseler, Lin & Imada, 2014). On the other hand, more attention to the mouth could indicate instead a more advanced learning stage, as some studies found a relationship between increased attention to the mouth and both concurrent and predictive expressive language skills (Tenenbaum et al., 2015; Tsang et al., 2018).

In the current study, we examine infants’ eye gaze patterns in a stress perception experiment, in which European Portuguese (EP) learning infants showed a clear preference for the iambic stress pattern (Frota, Butler, Uysal, Severino & Vigário, submitted). Specifically, we asked whether there was a relation between eye gaze patterns and the preferred stress pattern. This is the first study on EP-learning infants’ gaze patterns to talking faces, and, to the best of our knowledge, on the relation between infants’ eye gaze to communicative faces and the acquisition of lexical stress. Assuming that the preferred stress pattern is acquired earlier (Bhatara, Boll-Avetisyan, Hohle, & Nazzi, 2018; Jusczyk, Cutler & Redanz, 1993), an iambic preference indicates a more advanced learning stage. If increased attention to the mouth signals ongoing acquisition, we predict more attention to the mouth in EP-learning infants that do not show an iambic preference, and less attention to the mouth in infants showing the dominant iambic preference. If, by contrast, more attention to the mouth is related to more advanced language abilities, the reverse pattern is predicted. The current study will thus contribute to deepen the current understanding of the relation between eye gaze patterns and language development.

2. Method

2.1. Participants

Twenty-seven monolingual typically developing infants participated in this study (15 males; mean age: 5 months 14 days; age range: 5 months 2 days to 6 months 27 days), recruited from the wider Lisbon area. Five-six month olds were chosen for two main reasons. First, stress discrimination or a processing advantage for a given stress pattern has been reported already at such an early age, at least in languages like German and French (Friederici, Friedrich & Christophe, 2007; Weber, Hahne, Friedrich & Friederici, 2004), and the preference for a given stress pattern has been suggested to emerge between 4 and 6 months (Höhle, Bijeljac-Babic, Herold, Weissenborn & Nazzi., 2009, for German). Second, our age group is comparable to that of infants in other studies exploring the role of visual attention in speech perception, before the developmental shift towards the mouth.

Typical development was assessed after the experimental task, within a follow-up procedure that was part of the EBELa Project (http://labfon.letras.ulisboa.pt/babylab/EBELa/). Two screening tools were used: (i) the Communication and Symbolic Behavior Scales Developmental Profile (CSBS-DP) Checklist (Wetherby & Prizant, 2003) adapted for EP (Frota, Vicente, Filipe, & Vigário, 2014–2016), and filled in by all caregivers when infants were between 6 and 24 months of age; and (ii) The Portuguese MacArthur-Bates Communicative Development Inventories (CDI) short forms (Frota, Butler, Correia, Severino, Vicente, & Vigário, 2016), filled in by caregivers when infants were between 8 and 30 months. According to these developmental tools, all children exhibited social communication, language and symbolic functioning skills as expected for their age (including eye gaze, gestures, productive and receptive vocabularies).

Two infants were excluded from the analysis of eye gaze patterns due to poor tracking ratio (see section 2.2.1 for detailed information on exclusion criteria). From the 25 infants included in the gaze pattern analysis, three were excluded from the stress perception analysis due to poor tracking ratio in that task (Frota et al., submitted). A final sample of 22 infants was thus used for the analysis of eye gaze patterns in relation to stress perception.

2.2. Materials and procedure

Both stress perception and eye gaze data to talking faces were obtained from a perception experiment run in a sound attenuated room. Infants were sat in an appropriate chair or on their parent’s lap, at a distance of approximately 70 cm from the Dell LCD screen (1680 × 1050 pixel resolution) of a remote eye-tracker (SMI RED500). Stimuli presentation and data storage were performed with the SMI Experimenter Center and iView X software.

2.2.1. Eye gaze pattern to talking faces

The analysis of the gaze pattern was based on a 4-second long dynamic video of a talking movie character (Noddy), presented at the end of each block of the perception experiment to keep infants engaged in the task. The video was carefully chosen to allow measurement of eye gaze to talking faces and social gestures. The talking character moves its eyes, mouth, head, and arm while talking, against a colorful scenery (Figure 1). The choice of the video was also motivated by the participation of atypically developing populations, such as infants at risk for autism, given that the perception experiment was part of larger projects on typical and atypical development (EBELa, http://labfon.letras.ulisboa.pt/babylab/EBELa/; Horizon21, http://labfon.letras.ulisboa.pt/babylab/horizon21/; PLOs, http://labfon.letras.ulisboa.pt/babylab/PLOS/en/). Some atypical populations are known to prefer cartoon-faces over real-faces (Rosset et al., 2007; Van Der Geest, Kemner, Verbaten & Van Engeland, 2002; inter alia), and thus a talking movie character would allow comparing findings across the different groups of children.

Figure 1
Figure 1

AOIs defined in the video: face, eyes, mouth, and arm.

Four different exemplars of the video were created, each containing a different encouraging speech message while the visual features were kept the same. The four exemplars were carefully designed so that audio and video were synchronized, namely, the speech messages were carefully constructed to match the video movements (mouth, head, arm) and were then synchronized with the video. Speech messages were produced in child directed speech by a female native speaker of EP. Order of presentation of the videos was fixed within participants and randomized across participants.

For each video, three dynamic Areas of Interest (AOIs) – face, eyes, mouth – and one non-head dynamic AOI – waving arm – were defined, as illustrated in Figure 1. Considering the total screen, the face covers 8,2% of the screen, the eyes cover 2,4%, the mouth covers 0,5%, and the arm covers 2,3%. Eye gaze to the background was not included in this analysis; only visual communicative cues (face and arm) were explored. A sample video is provided as an Additional file.

A total of 101 trials were initially considered for analysis (mean 3,74 by infant). After inspection of the participants’ tracking ratio for each video trial, all trials with a tracking ratio below 50% were discarded. Thus, 34 videos were excluded (33,66% of the initial set). This resulted in the exclusion of two infants, for whom there was not enough data for analysis. The excluded videos had a mean tracking ratio of 9,55% (SD = 29,90%). The included videos had a mean tracking ratio of 89,04% (SD = 39,89%).

Net dwell time (in milliseconds) for each AOI was used as the eye gaze measure.

2.2.2. Stress perception

Infant stress perception was examined using a modified version of the Anticipatory Eye Movement paradigm. Infants were exposed to trials with two geometric images, each associated to a stress pattern (iambic or trochaic). In the test phase, both stress patterns were presented and infants were expected to look to the side of the screen where the image associated to the pattern they were listening to appeared in the training phase. Training and test phase constituted a block, and the experiment included 8 blocks. Each block ended with an exemplar of the Noddy video described above. Infants performed a minimum of two and a maximum of six blocks (with a mean of 4 blocks), depending on their interest in the task. Disyllabic segmentally varied nonsense words with either penult or final stress were used. Speech stimuli were uttered by a female native speaker of EP in child directed speech style (cf. Frota et al., submitted).

3. Results and discussion

Not all infants watched all video exemplars. Due to the unequal sample size per encouraging message across infants, a Wilcoxon test was run to compare net dwell times (ms) between each pair of video exemplars. The different encouraging messages were shown not to impact on the results, as overall eye gaze to the different videos was not significantly different (video 1 vs. video 2: Z = –.282, p > .05; video 1 vs. video 3: Z = –.784, p > .05; video 1 vs. video 4: Z = –.338, p > .05; video 2 vs. video 3: Z = –.664, p > .05; video 2 vs. video 4: Z = –.140, p > .05; video 3 vs. video 4: Z = –1.352, p > .05). Therefore, the analysis of eye gaze was run on the basis of all videos, independently of the message provided.

3.1. Eye gaze

The eye gaze patterns found are shown in Figure 2. Eye gaze was concentrated more on the face than on the arm (t(24) = 6.836, p < .001, d = 1.37; mean face: 2149 ms, mean arm: 284 ms). In addition, infants overall attended more to the eyes than to the mouth (t(24) = 4.219, p < .001, d = .84; mean eyes: 1026 ms, mean mouth: 241 ms).

Figure 2
Figure 2

Boxplots displaying net dwell time per AOI.

Thus, the eye gaze of EP-learning infants at 5–6 months of age shows a similar pattern as reported in other studies, with a general preference for the eye region at this young age (Ayneto & Sebastián-Gallés, 2017; Fort et al., 2018; Lewkowicz & Hansen-Tift, 2012; Pons, Bosch & Lewkowicz, 2015).

3.2. Eye gaze pattern and stress preference

Frota et al. (submitted) demonstrated that EP learning infants show a preference for the iambic stress pattern, as they attended longer to the iambic side than the trochaic side of the screen during the test phase, irrespective of being exposed to the iambic or trochaic stress pattern.

Considering that the preferred stress pattern has been reported to be acquired earlier (Jusczyk, Cutler & Redanz, 1993, among others), an iambic preference indicates a more advanced learning stage. If increased attention to the mouth is related to ongoing acquisition, more attention to the mouth is expected in EP-learning infants that do not show an iambic preference, and less attention to the mouth in infants showing the dominant iambic preference. If, by contrast, more attention to the mouth is related to more advanced language abilities, the reverse pattern is predicted.

To explore the relation between eye gaze patterns and stress preference, we used the net dwell time data by participant from Frota et al.’s (submitted) study. The data was divided into the following two groups: infants who looked longer to the iambic side constituted the ‘iamb’ group (15 infants); infants who looked longer to the trochaic side, and infants who showed a balanced looking to either side of the screen formed the ‘non-iamb’ group (7 infants). A difference in looking time of more than 90 ms was taken as an indication of preference for a given stress pattern. This threshold was obtained by considering the mean difference in looking time to the two stress patterns (376 ms) minus the standard deviation (266 ms), adjusted to the closest difference in looking time to the two stress patterns (93 ms).

A mixed ANOVA was run with AOI as a within-subjects factor (4 levels: face, eyes, mouth, arm), and stress preference as the between-subjects factor (2 levels: iamb, and non-iamb). A main effect of the AOI was observed (F(3,60) = 47.368, p < .001, η = .84). There was no effect of stress preference (F(1,20) = 2.113, p > .05, η = .32) and no interaction between AOI and stress preference (F(3,60) = 1.589, p > .05, η = .26). A closer look at the data shows a strong dominance of the face and eyes overall, which underlies this result (Figure 3). However, beyond the overall result, eye gaze to the mouth region (red bar) – and face (orange bar) – was found to be modulated by stress preference, as there is more attention to the mouth in infants that do not show an iambic preference (Figure 3). To ascertain the strength of this effect, each AOI was analyzed separately with stress preference as a between-subjects factor. A significant effect was found for mouth (F(1,20) = 9.831, p < .01, η = .57), but not for eyes (F(1,20) = 0.003, p > .05, η = .01), face (F(1,20) = 2.563, p > .05, η = .34), or arm (F(1,20) = 0.003, p > .05, η = .01).

Figure 3
Figure 3

Net dwell time per AOI (arm, mouth, eyes, face) by stress pattern preference (iamb and non-iamb).

Furthermore, a moderate correlation was found between stress preference (iamb, non-iamb) and eye gaze to the mouth region (r = .574, p < .01). A more detailed analysis of the correlation pattern showed that looks to the mouth were positively correlated with looking time results for the non-iambic pattern (r = .478, p < .05), as shown in Figure 4. Conversely, eye gaze to the eye region was positively correlated with looking time results for the iambic pattern (r = .469, p < .05), as illustrated in Figure 5. No other significant correlations were found.

Figure 4
Figure 4

Correlation between looking time (net dwell time – ms) to the non-iambic stress patterns and eye gaze (net dwell time – ms) to the mouth.

Figure 5
Figure 5

Correlation between looking time (net dwell time – ms) to the iambic stress pattern and eye gaze (net dwell time – ms) to the eyes.

These results show that infants’ eye gaze to talking faces is related to their stress preference in a perception task. EP-learning infants showing the dominant iambic preference attend less to the mouth; by contrast, infants that do not display a preference for iambic stress attend more to the mouth. Given that the preferred stress pattern is the first to be acquired, and its acquisition has been reported to develop between 4 and 6 months (depending on the language), this suggests that increased attention to the mouth is recruited to support ongoing language acquisition, in this particular case the development of lexical stress perception. Further support for the link between a more advanced learning stage, signaled by the iambic preference, and less attention to the mouth is provided by later language measures taken from the follow up study mentioned in section 2.1. Looks to the mouth at 5–6 months were found to be negatively correlated with receptive vocabulary measured at 12–18 months using the Portuguese CDI short forms (r = –.60, p < .05). In other words, the present findings show that less attention to the mouth at 5–6 months relates to more advanced language abilities.

4. Discussion

Eye gaze patterns to talking faces have been shown to relate to communicative and social issues, as well as to language development. However, previous studies have suggested different understandings of the relation between eye gaze patterns and language development. On the one hand, more attention to the mouth seems to signal a less advanced learning stage, or ongoing acquisition, as shown by its relation to emerging articulatory abilities and/or, increased processing effort (Ayneto & Sebastián-Gallés, 2017; Fort et al., 2018; Hillairet de Boisferon et al., 2017; Lewkowicz & Hansen-Tift, 2012; among others). On the other hand, more attention to the mouth seems to indicate a more advanced learning stage, given the relation between increased attention to the mouth and both concurrent and predictive expressive language skills (Tenenbaum et al., 2015; Tsang et al., 2018).

In the present study, we investigated European Portuguese learning infants’ eye gaze patterns to talking faces. Specifically, we examined 5–6 month old infants’ eye gaze using a communicative video included in a stress perception experiment, in which infants showed a preference for the iambic stress pattern (Frota et al., submitted). We asked whether there was a relation between eye gaze patterns to different areas of interest in the communicative video (face, eyes, mouth, arm) and the preferred stress pattern. To the best of our knowledge, no study had previously addressed the relation between infants’ eye gaze to communicative faces and the acquisition of lexical stress. Considering that the preferred stress pattern has been reported to be acquired earlier, an iambic preference indicates a more advanced learning stage. If increased attention to the mouth is indeed related to ongoing acquisition, or processing effort, more attention to the mouth was expected in EP-learning infants that do not show an iambic preference, and less attention to the mouth in infants showing the dominant iambic preference. If, by contrast, more attention to the mouth is related to more advanced language abilities, the reverse pattern was predicted.

The analysis of eye gaze patterns showed that typically developing EP-learning infants aged 5–6 months old looked more to the face than to the arm, and that their attention was focused on the eyes over the mouth. This is in line with studies for other languages, showing the same attentional bias for faces, already shown by newborns (Di Giorgio et al., 2012; Gliga et al., 2009; Johnson et al., 1991), and points to a similar developmental path as that of English-, Catalan- or Spanish-learning infants of the same age (Lewkowicz & Hansen-Tift, 2012; Pons, Bosch & Lewkowicz, 2015). Since this attentional bias is informative of infants’ social and language development, in further research we are examining whether eye gaze to talking faces in other groups of infants, including clinical populations and at risk infants, can be an early marker of (a)typical language development (Pejovic, Cruz, Severino & Frota, in progress), as it has been suggested (Annaz et al., 2009; Jones & Klin 2013; Åsberg Johnels et al., 2014; Irwin & Brancazio, 2014; inter alia).

Besides the overall dominance of eye gaze to the face and the eyes, infants’ gaze to the mouth region (and face) was found to be modulated by the stress pattern. More specifically, infants who did not show the general preference for iambic stress tended to look more to the mouth than infants who demonstrated a preference for iambic stress. The general preference for iambic stress in 5–6 month old EP-learning infants found in Frota et al.’s (submitted) is in line with recent behavioral and neurophysiological findings for EP adult speakers (Lu, Vigário, Correia, Jerónimo & Frota, 2018). Lu and colleagues reported a processing advantage for the iambic stress pattern over the trochaic pattern manifested in more accurate and fast responses in behavioral tasks, as well as in an increased MMN in the ERP task. Taken together, the results of both infants and adults indicate that iambic stress is the perceptually more salient pattern, and thus the pattern that is easier to process.

In short, the present eye gaze findings confirm the prediction that increased attention to the mouth is recruited to support ongoing language acquisition and processing effort evidenced by EP-learning infants who do not show an iambic preference. By contrast, less attention to the mouth in infants showing the dominant iambic preference supports the relationship between attending less to the mouth at 5–6 months of age and a more advanced learning stage.

In conclusion, the eye gaze exploration of talking faces by 5–6 month old EP-learning infants adds to previous findings suggesting that increased support from visual cues signals ongoing language acquisition, thus contributing to the current understanding of the relation between eye gaze patterns and language development. Additionally, the present study brings novel data relating eye gaze exploration of talking faces and early stress perception, that provide further support for infants’ use of eye gaze in early language development. The current findings also highlight the need for multimodal approaches for a more comprehensive study of language development. In the particular case of the challenging topic of the acquisition of stress in European Portuguese, they provide converging evidence for an advantage of iambic stress in early development.

Additional File

The additional file for this article can be found as follows:

Sample Video

Dynamic AOIs. DOI: https://doi.org/10.5334/jpl.240.s1

Acknowledgements

A special acknowledgement is due to all infants and their families for participating in this study. This research was developed within the projects Horizon21 – Early Language Development in Down Syndrome (PTDC/MHCLIN/3901/2014) and PLOs – Predictors of Language Outcomes (PTDC/LLT-LIN/29338/2017), both funded by FCTFundação para a Ciência e a Tecnologia.

Competing Interests

The authors have no competing interests to declare.

References

Amso, D., Haas, S., & Markant, J. (2014). An eye tracking investigation of developmental change in bottom-up attention orienting to faces in cluttered natural scenes. PLoS One, 9(1), e85701. DOI:  http://doi.org/10.1371/journal.pone.0085701

Annaz, D., Karmiloff-Smith, A., Johnson, M. H., & Thomas, M. S. (2009). A cross-syndrome study of the development of holistic face recognition in children with autism, Down syndrome, and Williams syndrome. Journal of experimental child psychology, 102(4), 456–486. DOI:  http://doi.org/10.1016/j.jecp.2008.11.005

Åsberg Johnels, J., Gillberg, C., Falck-Ytter, T., & Miniscalco, C. (2014). Face-viewing patterns in young children with autism spectrum disorders: speaking up for the role of language comprehension. Journal of Speech, Language, and Hearing Research, 57(6), 2246–2252. DOI:  http://doi.org/10.1044/2014_JSLHR-L-13-0268

Atkinson, J., & Braddick, O. (2012). Visual attention in the first years: typical development and developmental disorders. Developmental Medicine & Child Neurology, 54(7), 589–595. DOI:  http://doi.org/10.1111/j.1469-8749.2012.04294.x

Ayneto, A., & Sebastián-Gallés, N. (2017). The influence of bilingualism on the preference for the mouth region of dynamic faces. Developmental Science, 20, e12446. DOI:  http://doi.org/10.1111/desc.12446

Bhatara, A., Boll-Avetisyan, N., Höhle, B., & Nazzi, T. (2018). Early sensitivity and acquisition of prosodic patterns at the lexical level. The Development of Prosody in First Language Acquisition, 23, 37. DOI:  http://doi.org/10.1075/tilar.23.03bha

Birulés, J., Bosch, L., Brieke, R., Pons, F., & Lewkowicz, D. J. (2018). Inside bilingualism: Language background modulates selective attention to a talker’s mouth. Developmental science, 22(3), e12755. DOI:  http://doi.org/10.1111/desc.12755

Chandrasekaran, C., Trubanova, A., Stillittano, S., Caplier, A., & Ghazanfar, A. A. (2009). The natural statistics of audiovisual speech. PLoS computational biology, 5(7), e1000436. DOI:  http://doi.org/10.1371/journal.pcbi.1000436

CSBS DPTM Infant-Toddler Checklist from Communication and Symbolic Behavior Scales Developmental ProfileTM (CSBS DPTM), Prizant & Wetherby. © 2002 and Portuguese translation © 2014–2016 Paul H. Brookes Publishing Co., Inc. In S. Frota, S. Vicente, M. Filipe & M. Vigário (Eds.), Translated by permission granted to the EBELa project’s team (EXCL/MHC-LIN/0688/2012).

Di Giorgio, E., Turati, C., Altoè, G., & Simion, F. (2012). Face detection in complex visual displays: An eye-tracking study with 3-and 6-month-old infants and adults. Journal of experimental child psychology, 113(1), 66–77. DOI:  http://doi.org/10.1016/j.jecp.2012.04.012

Fort, M., Ayneto-Gimeno, A., Escrichs, A., & Sebastian-Galles, N. (2018). Impact of bilingualism on infants’ ability to learn from talking and nontalking faces. Language Learning, 68(s1), 31–57. DOI:  http://doi.org/10.1111/lang.12273

Frank, M. C., Vul, E., & Johnson, S. P. (2009). Development of infants’ attention to faces during the first year. Cognition, 110(2), 160–170. DOI:  http://doi.org/10.1016/j.cognition.2008.11.010

Friederici, A. D., Friedrich, M., & Christophe, A. (2007). Brain responses in 4-month-old infants are already language specific. Current Biology, 17(14), 1208–1211. DOI:  http://doi.org/10.1016/j.cub.2007.06.011

Frota, S., Butler, J., Correia, S., Severino, C., Vicente, S., & Vigário, M. (2016). Infant communicative development assessed with the European Portuguese MacArthur-Bates Communicative Development Inventories Short forms. First Language, 36(5), 525–545. DOI:  http://doi.org/10.1177/0142723716648867

Frota, S., Butler, J., Uysal, E., Severino, C., & Vigário, M. (Submitted). European Portuguese-learning infants look longer at iambic stress: New data on language specificity in early stress perception.

Frota, S., Vicente, S., Filipe, M., & Vigário, M. (2014–2016). Communication and Symbolic Behavior Scales Developmental Profile (CSBS-DP) Checklist (Wetherby & Prizant, 2003) adapted for EP.

Gliga, T., Elsabbagh, M., Andravizou, A., & Johnson, M. (2009). Faces attract infants’ attention in complex displays. Infancy, 14(5), 550–562. DOI:  http://doi.org/10.1080/15250000903144199

Hillairet de Boisferon, A., Tift, A. H., Minar, N. J., & Lewkowicz, D. J. (2017). Selective attention to a talker’s mouth in infancy: Role of audiovisual temporal synchrony and linguistic experience. Developmental science, 20(3), e12381. DOI:  http://doi.org/10.1111/desc.12381

Höhle, B., Bijeljac-Babic, R., Herold, B., Weissenborn, J., & Nazzi, T. (2009). Language specific prosodic preferences during the first half year of life: Evidence from German and French infants. Infant Behavior and Development, 32(3), 262–274. DOI:  http://doi.org/10.1016/j.infbeh.2009.03.004

Irwin, J. R., & Brancazio, L. (2014). Seeing to hear? Patterns of gaze to speaking faces in children with autism spectrum disorders. Frontiers in Psychology, 5, Article ID 397. DOI:  http://doi.org/10.3389/fpsyg.2014.00397

Johnson, M. H., Dziurawiec, S., Ellis, H., & Morton, J. (1991). Newborns’ preferential tracking of face-like stimuli and its subsequent decline. Cognition, 40(1–2), 1–19. DOI:  http://doi.org/10.1016/0010-0277(91)90045-6

Jones, W., & Klin, A. (2013). Attention to eyes is present but in decline in 2-6-month-old infants later diagnosed with autism. Nature, 504(7480), 427–431. DOI:  http://doi.org/10.1038/nature12715

Jusczyk, P., Cutler, A., & Redanz, N. (1993). Infants’ preference for the predominant stress patterns of English words. Child Development, 64(3), 675–687. DOI:  http://doi.org/10.2307/1131210

Kuhl, P. K., & Meltzoff, A. N. (1982). The bimodal perception of speech in infancy. Science, 218(4577), 1138–1141. DOI:  http://doi.org/10.1126/science.7146899

Kuhl, P. K., Ramírez, R. R., Bosseler, A., Lin, J.-F. L., & Imada, T. (2014). Infants’ brain responses to speech suggest Analysis by Synthesis. PNAS, 111(31), 11238–11245. DOI:  http://doi.org/10.1073/pnas.1410963111

Leppänen, J. M. (2016). Using eye tracking to understand infants’ attentional bias for faces. Child development perspectives, 10(3), 161–165. DOI:  http://doi.org/10.1111/cdep.12180

Lewkowicz, D. J. (2010). Infant perception of audio-visual speech synchrony. Developmental psychology, 46(1), 66. DOI:  http://doi.org/10.1037/a0015579

Lewkowicz, D. J., & Hansen-Tift, A. M. 2012. Infants deploy selective attention to the mouth of a talking face when learning speech. PNAS, 109(5), 1431–1436. DOI:  http://doi.org/10.1073/pnas.1114783109

Lu, S., Vigário, M., Correia, S., Jerónimo, R., & Frota, S. (2018) Revisiting Stress “Deafness” in European Portuguese – A Behavioral and ERP Study. Frontiers in Psychology, 9, 2486. DOI:  http://doi.org/10.3389/fpsyg.2018.02486

MacKain, K., Studdert-Kennedy, M., Spieker, S., & Stern, D. (1983). Infant intermodal speech perception is a left-hemisphere function. Science, 219(4590), 1347–1349. DOI:  http://doi.org/10.1126/science.6828865

Morin-Lessard, E., Poulin-Dubois, D., Segalowitz, N., & Byers-Heinlein, K. (2019). Selective attention to the mouth of talking faces in monolinguals and bilinguals aged 5 months to 5 years. Developmental psychology, 55(8), 1640–1655. DOI:  http://doi.org/10.1037/dev0000750

Munhall, K. G., & Johnson, E. K. (2012). Speech perception: When to put your money where the mouth is. Current Biology, 22(6), R190–R192. DOI:  http://doi.org/10.1016/j.cub.2012.02.026

Patterson, M. L., & Werker, J. F. (2003). Two-month-old infants match phonetic information in lips and voice. Developmental Science, 6(2), 191–196. DOI:  http://doi.org/10.1111/1467-7687.00271

Pejovic, J. (2019). The development of audiovisual vowel processing in monolingual and bilingual infants: A cross-sectional and longitudinal study. Unpublished PhD Dissertation. Universidad del País Vasco. https://addi.ehu.es/handle/10810/36143

Pejovic, J., Cruz, M., Severino, C., & Frota, S. (In progress). Selective attention to audiovisual communicative cues in infants with Down Syndrome. Paper to be submitted to an international journal.

Pejovic, J., Yee, E, & Molnar, M. (In progress). Selective attention to the eyes and the mouth during monolingual and bilingual audiovisual vowel matching. Evidence from a cross-sectional and longitudinal study. Paper to be submitted to an international journal.

Pons, F., Bosch, L., & Lewkowicz, D. J. (2015). Bilingualism modulates infants’ selective attention to the mouth of a talking face. Psychological Science, 26, 490–498. DOI:  http://doi.org/10.1177/0956797614568320

Pons, F., & Lewkowicz, D. J. (2014). Infant perception of audio-visual speech synchrony in familiar and unfamiliar fluent speech. Acta Psychologica, 149, 142–147. DOI:  http://doi.org/10.1016/j.actpsy.2013.12.013

Pons, F., Lewkowicz, D. J., Soto-Faraco, S., & Sebastián-Gallés, N. (2009). Narrowing of intersensory speech perception in infancy. Proceedings of the National Academy of Sciences, 106(26), 10598–10602. DOI:  http://doi.org/10.1073/pnas.0904134106

Rosenblum, L. D., Schmuckler, M. A., & Johnson, J. A. (1997). The McGurk effect in infants. Perception & Psychophysics, 59(3), 347–357. DOI:  http://doi.org/10.3758/BF03211902

Rosset, D. B., Rondan, C., Da Fonseca, D., Santos, A., Assouline, B., & Deruelle, C. (2007). Typical emotion processing for cartoon but not for real faces in children with autistic spectrum disorders. Journal of autism and developmental disorders, 38(5), 919–925. DOI:  http://doi.org/10.1007/s10803-007-0465-2

Sebastián-Gallés, N., Albareda-Castellot, B., Weikum, W. M., & Werker, J. F. (2012). A bilingual advantage in visual language discrimination in infancy. Psychological Science, 23(9), 994–999. DOI:  http://doi.org/10.1177/0956797612436817

Tenenbaum, E. J., Shah, R. J., Sobel, D. M., Malle, B. F., & Morgan, J. L. (2013). Increased focus on the mouth among infants in the first year of life: A longitudinal eye-tracking study. Infancy, 18(4), 534–553. DOI:  http://doi.org/10.1111/j.1532-7078.2012.00135.x

Tenenbaum, E. J., Sobel, D. M., Sheinkopf, S. J., Malle, B. F., & Morgan, J. L. (2015). Attention to the mouth and gaze following in infancy predict language development. Journal of Child Language, 42(6), 1173–1190. DOI:  http://doi.org/10.1017/S0305000914000725

Tsang, T., Atagi, N., & Johnson, S. P. (2018). Selective attention to the mouth is associated with expressive language skills in monolingual and bilingual infants. Journal of experimental child psychology, 169, 93–109. DOI:  http://doi.org/10.1016/j.jecp.2018.01.002

Van Der Geest, J. N., Kemner, C., Verbaten, M. N., & Van Engeland, H. (2002). Gaze behavior of children with pervasive developmental disorder toward human faces: A fixation time study. Journal of Child Psychology and Psychiatry, 43(5), 669–678. DOI:  http://doi.org/10.1111/1469-7610.00055

Weber, C., Hahne, A., Friedrich, M., & Friederici, A. D. (2004). Discrimination of word stress in early infant perception: Electrophysiological evidence. Cognitive Brain Research, 18(2), 149–161. DOI:  http://doi.org/10.1016/j.cogbrainres.2003.10.001

Weikum, W. M., Vouloumanos, A., Navarra, J., Soto-Faraco, S., Sebastián-Gallés, N., & Werker, J. F. (2007). Visual language discrimination in infancy. Science, 316(5828), 1159–1159. DOI:  http://doi.org/10.1126/science.1137686

Wetherby, A. M., & Prizant, B. M. (2003). CSBS manual: Communication and symbolic behavior scales. Brookes Pub. DOI:  http://doi.org/10.1037/t11527-000

Young, G. S., Merin, N., Rogers, S. J., & Ozonoff, S. (2009). Gaze behavior and affect at 6 months: predicting clinical outcomes and language development in typically developing infants and infants at risk for autism. Developmental science, 12(5), 798–814. DOI:  http://doi.org/10.1111/j.1467-7687.2009.00833.x