Hostname: page-component-7f64f4797f-bnl7t Total loading time: 0 Render date: 2025-11-11T21:18:14.685Z Has data issue: false hasContentIssue false

Linguopalatal contact differences between Japanese geminates and singletons across different places and manners

Published online by Cambridge University Press:  20 August 2025

Alexei Kochetov*
Affiliation:
Department of Linguistics, University of Toronto, Toronto, Canada
Rights & Permissions [Opens in a new window]

Abstract

This study uses electropalatography to examine linguopalatal contact differences between Japanese geminate and singleton consonants of various lingual places and manners of articulation. The analysis of over 8,000 tokens of these consonants produced by five Japanese speakers in three sets of stimuli (varying by the word lexical status, contrastive focus, and position within an utterance) showed significantly stronger constrictions for geminates of all places and manners, except for alveolopalatal fricatives. The geminate-singleton differences were the largest for alveolar and alveolopalatal nasals, while being the smallest for alveolopalatal affricates and velar stops. Durational differences between geminates and singletons were quite robust and tended to positively correlate with linguopalatal contact differences for most geminate and singleton consonants. No clear contact or durational differences were observed across the datasets, suggesting that the realization of the contrast is affected minimally by lexical status and position in the utterance. The findings for Japanese geminates are further discussed in the context of articulatory studies of similar contrasts in other languages.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The International Phonetic Association

1. Introduction

Japanese is known to exhibit a phonemic contrast between geminate and single consonants, as, for example, in [katːa] ‘bought’ vs. [kata] ‘mold’ and [kanːa] ‘plane’ vs. [kana] ‘syllabary’. This contrast has received considerable attention in phonetic literature, with most studies focusing on its durational properties (Kawahara Reference Kawahara and Kubozono2015a). A handful of available articulatory studies (as reviewed below in section 1.2) have uncovered some non-durational differences, and primarily the relative tightness/strength of the constriction for geminates. Comparisons across different manners of articulation, on the other hand, have produced conflicting results, suggesting that non-durational differences may be suspended in some consonant types, namely fricatives. To better understand the details of the phonetic implementation of the Japanese length contrast, this study seeks to investigate place- and manner-specific differences in gemination more systematically, by considering stops/affricates, fricatives and nasals produced at the alveolar, alveolopalatal, and velar places of articulation. Differences between geminate and singleton consonants of these types, produced by five speakers in a variety of materials, are examined for the amount of linguopalatal contact and constriction duration using electropalatography (EPG).

1.1 Non-durational articulatory correlates of geminates: A cross-linguistic perspective

While constriction duration is the primary and uncontroversial correlate of geminate-singleton contrasts, articulatory studies of geminates in several languages have also revealed some systematic non-durational differences. In particular, electropalatographic (EPG) investigations of coronal geminates in Cypriot Greek (Armosti Reference Armosti2009), Italian (Farnetani Reference Farnetani, Hardcastle and Marchal1990; Payne Reference Payne2006), Swiss German (Kraehenmann & Lahiri Reference Astrid and Lahiri2008), Tarifit Berber (Bouarourou et al. Reference Bouarourou, Vaxelaire, Ridouane, Hirsch, Sock, Fuchs and Laprie2008), and Tashlhiyt Berber (Ridouane Reference Ridouane2007; Ridouane & Hallé Reference Ridouane, Hallé and Kubozono2017) found that geminates were produced with a greater amount of tongue–palate contact than the corresponding singletons. Similarly tighter constrictions for geminates (coronals and/or labials) were observed using electromagnetic articulography (EMA) in Italian (Maspong, Burroni & Kirby Reference Maspong, Burroni and Kirby2024) and real-time MRI in Ikema Miyako Ryukyuan (Fujimoto, Shinohara & Mochihashi Reference Fujimoto, Shinohara and Mochihashi2023). Tighter constrictions for geminate consonants can be attributed to a greater articulatory force required to sustain longer constrictions (Ridouane Reference Ridouane2007; Kochetov & Kang Reference Kochetov and Kang2017). It should be noted that the relation between length and the tightness of constrictions is more general, not limited to geminates. That is, single consonants produced with relatively forceful constrictions (e.g., a voiceless stop in a stressed syllable) tend to exhibit longer closures as well (Vaxelaire Reference Vaxelaire1995 on French, X-ray; Byrd Reference Byrd1995 on English, EPG).

Studies that examined temporal articulatory characteristics of geminate consonants, however, have produced mixed results. Some studies reported length-related kinematic differences, such as greater temporal anticipation or faster gesture velocity (Lehiste, Morton & Tatham Reference Ilse, Morton and Tatham1973 on Estonian, EMG of bilabial consonants; Smith Reference Smith1995 on Japanese and Italian, EMA of bilabial and alveolar stops and nasals; Šimko, O’Dell & Vainio Reference Šimko, O’Dell and Vainio2014 on Finnish, EMA of labial stops; Türk, Lippus & Šimko Reference Türk, Lippus and Šimko2017 on Estonian, EMA of coronal and labial stops; Celata, Meluzzi & Bertini Reference Celata, Meluzzi and Bertini2022 on Italian, EMA of dental stops, nasals, and laterals). Other studies observed no clear movement kinematic differences between geminates and singletons (e.g., Dunn Reference Dunn1993 on Italian and Finnish, EMA of bilabial consonants) or noted those for some of the examined consonant pairs, languages, or speakers (Löfqvist Reference Löfqvist2005: Japanese but not Swedish, EMA of labial stops, nasals, and fricatives; Gili-Fivela et al. Reference Gili-Fivela, Claudio Zmarich, Savariaux and Tisato2007 on Italian, EMA of bilabial nasals, dental and velar stops, and alveolar laterals).

The lack of consistency in the kinematic results can be in part explained by differences among the studies in the methods employed (EPG, EMA, or MRI) and the articulatory gestures (the lips, the tongue tip, etc.) studied; but it may also be due to inherent differences among languages in how they implement geminate-singleton contrasts. For example, in some languages (e.g., Bengali, Berber, Hindi, and Italian) geminates shorten preceding vowels, while in others (e.g., Japanese) pre-geminate vowels get lengthened (see Kubozono Reference Kubozono and Kubozono2017 for a review). In contrast, the more consistent results of studies that examined constriction amount differences suggest that this parameter is likely a more robust articulatory correlate of the geminate-singleton contrast, regardless of the language.

1.2 The Japanese geminate-singleton contrast and articulatory investigations

The geminate-singleton contrast in Japanese is of particular interest, as it is highly frequent across the lexicon and involves consonants of most places and manners of articulation (Vance Reference Vance1987, Reference Vance2008; Kawagoe Reference Kawagoe and Kubozono2015; Kawahara Reference Kawahara and Kubozono2015a, among others). Segmental minimal pairs shown in Table 1 (adapted from Morimoto Reference Morimoto2020) illustrate the contrast across multiple consonant types.

Table 1. Examples of words illustrating the geminate-singleton contrast in Japanese (see footnote 1 on the phonemic status of alveolopalatals)

In phonological accounts, Japanese obstruent geminates are typically analyzed as a sequence of a syllable-final moraic phoneme /Q/ (called sokuon and considered to be underspecified for place) followed by a syllable-initial obstruent (stop, affricate, or fricative of a particular place; e.g., /kaQta/ [katːa]; Vance Reference Vance1987, Reference Vance2008, and citations therein). Japanese nasal geminates, on the other hand, are treated as a sequence of a syllable-final moraic nasal phoneme /N/ (called hatsuon and also considered to be underspecified for place) followed by a syllable-initial labial or coronal nasal (e.g., /kaNna/ [kanːa]). This analysis is also reflected in Japanese orthography: the initial portion of the geminate is rendered by the lowercase kana っ or ッ for obstruents and by the kana or ン for nasals.

An overwhelming majority of phonetic studies on the Japanese geminate-singleton contrasts have focused exclusively on duration differences. In these (mostly acoustic) investigations, geminates were found to be about twice as long (or even more) as single stops, confirming their phonological patterning as having extra prosodic length, a mora (Homma Reference Homma1981, Beckman Reference Beckman1982, Han Reference Han1994, Hirata & Whiton Reference Hirata and Whiton2005; Kawahara Reference Kawahara2006; Idemaru & Guion Reference Idemaru and Guion2008, among others; see Kawahara Reference Kawahara and Kubozono2015a for a review). Several studies also examined non-durational acoustic differences between geminates and singletons and found that geminate stops were produced with creakiness on the adjacent vowels (a shallower spectral tilt; Kawahara Reference Kawahara2006, Idemaru & Guion Reference Idemaru and Guion2008), which was attributed to the vocal folds being pressed tightly together for the geminates.

There is only a small number of articulatory investigations of Japanese geminates and singletons; these, however, do provide some evidence for durational differences being accompanied by certain non-durational differences, and particularly in terms of how tight the consonant constrictions are. Linguopalatal contact differences, for example, can be observed in EPG palatograms from the National Language Research Institute corpus (NLRI 1990) illustrated in Figure 1. These images represent the roof of the mouth of a Japanese speaker producing the geminate /tː, sː, ʨː, kː/ and their singleton counterparts in nonsense words of the type [aCːa] and [CaːCa] (second consonant), respectively. The dots indicate electrodes embedded in the artificial EPG palate that the speaker wore during the experiment. Dark lines show the extent of the contact between the tongue and the artificial palate at the maximum constriction during the consonant. We can see that the alveolar stops /tː/ and /t/ were produced with complete closure at the alveolar ridge and a side contact further back on the palate. The pattern for the alveolopalatal affricates /ʨː/ and /ʨ/ is similar to the stops, but with the closure extending further back and showing more side contact. Alveolar fricatives /sː/ and /s/ were produced with a central channel at the alveolar ridge and some posterior side contact, while velar stops /kː/ and /k/ showed some side contact at the back of the artificial palate (which reflects only a partial capture of the velar closure). Crucial for our purposes is the comparison between geminates and singletons of the same consonant type (place and manner). Note that the extent of the linguopalatal contact was slightly higher, covering a larger area, for /tː/ than /t/. Similar differences can be observed for the affricates. The two fricatives differ mainly in the width of the central channel, which is narrower for the geminate. No clear differences can be noted for the velars, which is possibly due to the limited extent of the artificial palate (as the accompanying X-ray tracings on p. 326 of NLRI 1990 do show a more extensive sagittal contact between the tongue dorsum and the velum). NLRI (1990), the goal of which was to provide articulatory and acoustic illustrations of various Japanese sounds, did not explicitly discuss these geminate-singleton differences, nor provided any quantitative analyses of the data. Based on the images in Figure 1, however, it is possible that geminate consonants in Japanese can be produced with greater linguopalatal contact compared to singletons. This generalization, nevertheless, is tentative at best, as the data presented in NLRI (1990) was obtained from a single speaker, and thus may in principle reflect individual, rather than language-general differences in the production of the contrast.

Figure 1. EPG frames overlaid over a palate cast for geminate and singleton consonants produced by a Japanese speaker; dots represent electrodes embedded into the artificial palate; the lines demarcate the areas where linguopalatal contact was present (dotted for geminates and plain black for singletons) (adapted from figures on pp. 135, 147, 169, 227, 321, 323, 325, 327 in NLRI 1990). For example, the contact for [tː] in atta was produced in the front (upper) region of the palate below the incisors, which corresponds to the alveolar ridge; additionally, the contact was at the sides of the palate, but not in its central region.

Following up on these observations, Kochetov & Kang (Reference Kochetov and Kang2017) examined the production of the length contrast in Japanese alveolar stops /tː, t/ and alveolopalatal affricates /ʨː, ʨ/ (comparing those to Korean fortis stops and affricates). Five Japanese speakers wearing EPG palates produced these sounds in nonsense words of the type [ma_a] (matta vs. mata, matcha vs. macha). The results showed that geminates were consistently produced with a greater amount of linguopalatal contact in the anterior portion of the palate (the alveolar and postalveolar regions). In addition, geminate stops (but not geminate affricates) showed greater contact in the posterior (palatal) portion of the palate. That is, geminates were produced with a tighter constriction and (for stops) with an overall higher position of the tongue. The geminate-singleton differences in linguopalatal contact, however, were relatively small compared to the extensive durational differences (the duration of the articulatory closure). Specifically, the geminate-to-singleton ratios in terms of the amount of contact across the palate were 1.13 for stops and 1.06 for affricates, compared to the durational ratios of 2.03 and 2.57, respectively.

In another recent EPG study, Matsui, Kawahara & Shaw (Reference Matsui, Kawahara and Shaw2016) studied Japanese geminates of several manners of articulation – stops /t, d/ (including their affricate allophones), fricatives /s, z/, and the rhotic /ɾ/. (It should be noted that the occurrence of geminate voiced obstruents and /ɾː/ in Japanese is limited to recent loanwords and onomatopoeia; Kawahara Reference Kawahara and Kubozono2015a; Morimoto Reference Morimoto2020.) Four speakers produced geminate and singleton variants of these sounds in reduplicated onomatopoeic words of the type CV_V-CVCV (e.g., katta-kata vs. kata-kata). The results showed significantly more contact in geminates than singletons for all consonant pairs, except those involving the voiceless alveolar fricatives (/sː/ vs. /s/). In a related report on data from two of the speakers Kawahara & Matsui (Reference Kawahara and Matsui2017: 16) concluded that ‘articulatory strengthening does not occur for voiceless fricatives in Japanese geminates’, noting the parallel results reported by Ridouane & Hallé (Reference Ridouane, Hallé and Kubozono2017) for Tashlhiyt Berber.

Beyond the EPG research, some evidence for stronger geminate constrictions comes from EMA studies by Löfqvist. Specifically, in his (2025) investigation, Löfqvist found that geminate bilabials (voiceless and voiced stops, nasal, and fricative) were produced in real words (e.g., samma vs. sama) by his four speakers with a higher lower lip than their singleton counterparts. Löfqvist’s (Reference Löfqvist2007) study examining five speakers producing alveolar stops, alveolar and alveolopalatal fricatives, and velar stops (e.g., hossa vs. hosa) showed a greater magnitude of tongue movement during the consonant constriction (as well as slower peak velocity) for geminates compared to singletons. While not interpreted as such by the author (given a different focus of the study), the higher lower lip position and a greater tongue movement magnitude observed here can be regarded as evidence for tighter constrictions for the geminates. Finally, while primarily focusing on the Japanese rhotic geminate /ɾː/, Morimoto (Reference Morimoto2020) used EMA to examine other coronal geminates and singletons: the stops /tː, t, dː, d/, the fricative /sː, s/, and the nasal /nː, n/ (including their vowel-conditioned allophones). The data were obtained from eight speakers producing these sounds in onomatopoeic words of the type CVCV-CVCV, (e.g., gatta-gata vs. gata-gata; cf. Matsui et al. Reference Matsui, Kawahara and Shaw2016). The results revealed a significantly higher position of the tongue tip and tongue body for geminates compared to singletons, however, only for some of the consonant types. Specifically, tongue tip height differences involved the voiceless stops /tː, t/ and the rhotic /ɾː, ɾ/, while the tongue body height differences involved the alveolar stops, the rhotics, and, to a lesser extent, the alveolar nasals. No significant differences were observed for alveolar fricatives (/sː, s/), echoing the results of Matsui et al. (Reference Matsui, Kawahara and Shaw2016), but seemingly contradicting those of Löfqvist (Reference Löfqvist2007) (who did find significant differences between /sː/ and /s/).

Finally, the most recent EMA investigation of Japanese length contrasts, conducted by Burroni, Kawahara & Shaw (Reference Burroni, Kawahara and Shaw2025), examined a large set of coronal geminates and their singleton counterparts ([t], [d], [ts], [ʨ], [s], [z]-[ʝ], [ɾ]). These consonants were produced in onomatopoeic words by seven speakers. The results, combined for all consonant pairs, revealed multiple differences in kinematic properties: geminates exhibited higher tongue tip positions indicative of tighter constrictions, overall longer gesture durations (both the movements towards the target and the constriction plateaus), greater movements, and lower stiffness (movement speed). They also showed longer times to constriction targets, resulting in longer preceding vowels. This study thus confirmed some previous findings (namely Löfqvist Reference Löfqvist2007), while also providing new insights into the mechanisms of the Japanese geminate production. It did not, however, examine possible differences in the realization of length across different manners and places of articulation, which is the focus of the current study.

1.3 This study

As reviewed above, the available articulatory research on the Japanese length contrasts has provided some evidence for non-durational differences between geminates and singletons, and primarily in terms of the strength of the constriction. This is consistent with findings for similar contrasts in other languages, as reviewed in section 1.1. At the same time the studies on Japanese geminates have produced somewhat contradictory results with respect to articulatory differences in certain manners of articulation, namely voiceless fricatives. The lack of significance in some cases may stem from the fact that constriction amount differences are in general relatively small (compared to the extensive durational differences) and difficult to capture based on small numbers of tokens or items. Some of the differences may also have arisen due to the use of different methods. While EMA can provide high resolution kinematic data for various articulators, comparing consonants in terms of their extent of lingual constrictions is more difficult, as observations are based on positions of a single flesh point for a given articulator (e.g., a single sensor on the tongue tip for /t/ or a single sensor for the tongue dorsum for /k/; see Kochetov Reference Kochetov2020 for an overview of EMA and EPG methods). In contrast, EPG provides considerably more information about the contact of the tongue with the palate (given multiple electrodes embedded in the artificial palate), and thus is more likely to detect small-scale differences. Finally, some differences in the results may also come from the types of materials used – nonsense words (NLRI 1990; Kochetov & Kang Reference Kochetov and Kang2017), onomatopoeia (Matsui et al. Reference Matsui, Kawahara and Shaw2016; Morimoto Reference Morimoto2020), or words of the core lexicon (Löfqvist Reference Löfqvist2007). It seems plausible to assume that the use of the less standard vocabulary may lead to greater variability in the data and either hyper- or hypo-articulation, depending on the speaker. Relatively frequent words with geminates and singletons, on the other hand, are likely to show more homogeneity, while at the same time showing propensity to lenition (see e.g., Gahl Reference Gahl2008; Tomaschek et al. Reference Tomaschek, Tucker, Fasiolo and Harald Baayen2018 on the role of word frequency in speech production). This may lead to reduced articulatory differences between geminates and singletons in real words compared to nonsense words.

The goal of this study is to employ EPG to more systematically examine Japanese length contrast by considering a range of consonant types (places and manners) and a variety of reading materials (nonsense words and real words in different prosodic contexts). With respect to consonant types, the study examines voiceless obstruents of three places – alveolar stops /tː/ vs. /t/ and fricatives /sː/ vs. /s/, alveolopalatal affricate /ʨː/ vs. /ʨ/, and fricatives /ɕː/ vs. /ɕ/, and velar stops /kː/ vs. /k/, as well as alveolar and alveolopalatal nasals /nː/ vs. /n/ and /ɲː/ vs. /ɲ/.Footnote 1 Note that some of these consonants, namely geminate alveolopalatals, velar stops, and nasals (regardless of their place) have received limited (if any) attention in previous articulatory studies, and thus the study expands the coverage of consonants involved in gemination. On the other hand, we chose not to include voiced obstruents and /ɾ/, as these exhibit gemination only in recent loanwords or expressive vocabulary (Kawahara Reference Kawahara and Kubozono2015a; Morimoto Reference Morimoto2020).

In general, while we expect geminates to show more linguopalatal contact than their singleton counterparts, this effect may be weaker or absent altogether for alveolar fricatives, given earlier, albeit contradictory results (cf. Löfqvist Reference Löfqvist2007; Matsui et al. Reference Matsui, Kawahara and Shaw2016; Morimoto Reference Morimoto2020). The effect may also be limited in alveolopalatal affricates (given the partial results in Kochetov & Kang Reference Kochetov and Kang2017: more geminate contact only in the anterior portion of the palate). If this is the case, it would confirm the characterization of alveolar fricatives and alveolopalatal affricates as highly resistant to positional effects or coarticulation. This is the conclusion reached by Recasens & Espinosa Reference Recasens and Espinosa2006 (cf. Recasens & Espinosa Reference Recasens and Espinosa2009): strengthening or weakening of consonants depends to a great degree on their specification for high-demand manner or place requirements such as frication, trilling, and tongue body involvement. Given this, alveolopalatal affricates and fricatives are the prime candidates to show resistance to gemination, and thus minimal (if any) geminate-singleton differences. Nasals, on the other hand, may be affected to a greater degree by length manipulations, given the generally weaker constrictions compared to obstruents (Gibbon et al. Reference Gibbon, Yuen, Lee and Adams2007; but see Morimoto Reference Morimoto2020 on rather limited differences for /nː/ vs. /n/). We may also expect smaller differences or their absence for velars, as the EPG provides only a partial view of their constrictions (Gibbon & Nicolaidis Reference Gibbon, Nicolaidis, Hardcastle and Hewlett1999; see also Figure 1).

In terms of the materials (as discussed further below), the study uses a variety of stimuli (nonsense words and real words in different prosodic contexts) with the goal to determine whether lexical status and position in the utterance affect gemination. In general, we may expect larger length differences in real words and especially in minimal pairs occurring in the same sentence or an utterance-initial prosodic context (e.g., Armosti Reference Armosti2009 on the greater linguopalatal contact in Cypriot Greek geminates utterance-finally vs. utterance-medially.).

2. Method

2.1 Speakers

The participants were five female native speakers of Japanese with the average age of 36.4 years (range 28–43 years) at the onset of the study. They were from several locations along the eastern/southern coast of the island Honshu: Shizuoka (JF1), Shiga (JF2), Ibaraki (JF3), Kyoto (JF4), and Hyogo (JF5). Dialectally, Shiga, Kyoto, and Hyogo are considered to be part of the Western Japanese dialect; Shizuoka and Ibaraki, on the other hand, belong to the Eastern dialect. We are not aware of any differences between the two dialects in terms of gemination and the examined consonants (while differences in pitch accent patterns are commonly noted; Onishi, Reference Onishi, Boberg, Nerbonne and Watt2018).

The speakers were residing in Toronto at the time of the experiment, living abroad for an average of 8.8 years (range 2–14 years). They reported using Japanese on a regular basis, on average 52% of their daily communication (range 40% to 60%), while using English (and Serbian for JF3) in the other cases. While relatively small for a phonetic study, the number of speakers involved here is comparable to or even higher than is typical for EPG research (e.g., the median of four participants across 54 EPG-based journal publications between 2000 and 2019; Kochetov Reference Kochetov2020).

2.2 Materials

The materials included three sets, all containing words with intervocalic geminates and singletons of seven types: alveolar stops /tː, t/, alveolopalatal affricates /ʨː, ʨ/, velar stops /kː, k/, alveolar nasals /nː, n/, alveolopalatal nasals /ɲː, ɲ/, alveolar fricatives /sː, s/, and alveolopalatal fricatives /ɕː, ɕ/.

Set 1 consisted of a combination of phonetically balanced nonsense and real words of the type [maCa], as shown in Table 2. The items were randomized and presented in Japanese orthography (katakana) to the participants in a carrier sentence sorede ___ mo dekiru (それで ___ も、出来る) ‘You can make ___ out of it’.Footnote 2 The participants were instructed to produce pitch accent (high tone) on the first syllable (see Kawahara Reference Kawahara and Kubozono2015b on Japanese pitch accent).Footnote 3

Table 2. Items used in Set 1, shown in Roman transliteration and the katakana syllabary; some of these are real words or personal names; others are nonsense words (indicated with ‘—’)

Sets 2 and 3 consisted of 13 segmental minimal pairs (the same for both sets), two per consonant type, except for /ɲː/ vs. /ɲ/ (which had only one pair). These 26 words are shown in Table 3.Footnote 4 Note that combinations of back and front vowels, in different orders, were used in all words except with velars. For the latter, words with two front vowels were selected to ensure that closures could be captured by EPG. While words within pairs were chosen to be segmentally the same, it was not always possible to control for the pitch accent placement. We are not, however, aware of any research on possible effects of pitch accent on lingual constrictions (cf. Kawahara Reference Kawahara and Kubozono2015b) and would therefore assume that these differences will be unlikely to play a role.

Table 3. Real words used in the study (Set 2 and Set 3), shown in Roman transliteration and Japanese orthography

In Set 2, words from Table 3 were randomized and presented in a carrier phrase kabe ni wa ___ mo kaite aru (壁には_______ も書いてある) ‘___ is also written on the wall’. In Set 3, each minimal pair was placed in the same sentence, creating contrastive focus – ___ dewa naku ___ to itta (___ ではなく___ と言った) ‘This is not ___; this is ___’, and presented in two different orders: with a singleton word first and the corresponding geminate word second (e.g., mate – matte) and with a geminate word first and the corresponding singleton word second (e.g., matte – mate). To take into consideration the position of the word within the sentence, we will refer to cases when the word is utterance-initial as Set 3a (e.g., [t] in mate – matte), and when it is utterance-medial as Set 3b (e.g., [tː] in mate – matte). These two sets with the manipulations in the order for Set 3 were selected to elicit potential differences in the realization of the geminate-singleton contrast given the contrastive focus. Thus, one may expect greater length differences when minimal pairs are juxtaposed, so that the target consonants are explicitly contrasted with each other (contrastive focus, Set 3). In the same vein, one may expect to find stronger realizations of contrast utterance-initially than utterance-medially (Set 3a vs. Set 3b). Somewhat clearer differences may be expected between Set 1 and the other two sets, given the more controlled phonetic contexts, as well as possible articulatory weakening in more frequent vocabulary items in Sets 2 and 3 (cf. Gahl Reference Gahl2008).

On average, 22 repetitions per word per speaker were elicited for Set 1 and 18 repetitions were elicited for Sets 2 and 3.Footnote 5 With a handful omissions due to recording errors, this gave us 1,511 tokens for Set 1 (14 items × 22 repetitions × 5 speakers), 2,340 tokens for Set 2 (26 items × 18 repetitions × 5 speakers), and 4,528 tokens for Set 3 (26 items × 2 orders × 22 repetitions × 5 speakers), giving in total 8,379 tokens. The data were recorded over multiple recording sessions together with stimuli for other experiments.

2.3 Instrumentation

A WinEPG system by Articulate Instruments (Wrench et al. Reference Wrench, Gibbon, McNeill, Wood, Hansen and Pellom2002) was used to collect articulatory data at a sampling rate of 100 Hz and audio data at 22,050 Hz. The system uses acrylic palates with 62 electrodes, custom-made for each participant. The traditional (University of Reading-style) EPG palates were used for participants JF1, JF2, and JF3, while newer ‘Articulate’ palates (Wrench Reference Wrench2007) were used for JF4 and JF5. As can be seen in Figure 2, Articulate palates extend further back, thus providing a better coverage for the velar region; otherwise, however, they are similar in capturing contact at the front and central portions of the palate (alveolar, postalveolar, and palatal regions) (see Tabain Reference Tabain2011; Kochetov, Colantoni & Steele Reference Kochetov, Colantoni and Steele2017 for a comparison of the two models).

Figure 2. Artificial palates and the accompanying palate casts of the speakers.

2.4 Annotation and analysis

Consonant closures/constrictions were annotated based on the waveform and spectrogram using the Articulate Assistant software (version 1.18; Wrench et al. Reference Wrench, Gibbon, McNeill, Wood, Hansen and Pellom2002). Closures of stops and affricates were taken to be intervals between the onset and offset of relative silence (thus excluding the stop release or affricate frication); constrictions of fricatives were taken to be intervals between the onset and offset of fricative noise; closures of nasals were taken to be intervals between the points of abrupt amplitude decrease and increase, also corresponding to the nasal murmur interval during the oral constriction. These acoustic events typically corresponded well to articulatory closures and constrictions of the consonants, as evidenced by EPG frames. Figure 3a presents a sample annotation – a waveform, a spectrogram, and a set of corresponding palate frames taken every 10 ms – for a token of sekki. The interval of the velar closure is indicated with a label “kk”, and the cursor is placed on the first frame of maximum contact during the closure (which in this case is also the midpoint of the consonant constriction). The image in (b) shows an enlarged view of the selected frame, with palate regions indicated to the left corresponding to rows 1–8 of the artificial palate. Note that the closure is produced in the velar region (row 8), with considerable side contact in the palatal region (rows 5–7) and extends into the alveolar region (to row 2). The image in (c) shows an average of all frames with the closure for the same token, while the image in (d) shows an average of nine tokens of this word produced by the speaker within a session. The close similarity between the (c) and (d) indicates that the velar stop closure was produced consistently across the sample.

Figure 3. (a) A sample annotated token: /kː/ in sekki by speaker JF3; (b) the frame taken at the point of maximum contact; (c) an average of all frames within the closure interval with numbers and shading indicating percentages of each electrode activation across frames; (d) an average of nine tokens of /kː/ (the entire closure intervals) in sekki by this speaker.

For each token, linguopalatal contact values (‘1’ or ‘0’ for each electrode) were automatically extracted from the first point of maximum contact frame as well as – for Set 1 – from five equally spaced points within the annotated interval. The choice to limit the temporal analysis to Set 1 was based on the more uniform phonetic context there (a_a), compared to the more variable environments in real words.

Two variables were used in the analysis. The first one was the amount of contact across the palate, Q (Quotient of maximum activation). It was calculated by dividing the number of contacts activated by the total number of contacts (62). For example, the frame shown in Figure 3b has the Q value of 0.484 (30 activated electrodes out of 62). (Note that we did not use Q at particular regions of the palate, e.g., in the first five rows as in Kochetov & Kang Reference Kochetov and Kang2017, since the consonants examined here differed in place.) The other variable was Duration of the annotated interval (in seconds), which was also automatically extracted for each token by the software.

The data were analyzed using linear mixed effects regression (LMER) models implemented with the lme4 package (Bates et al. Reference Bates, Martin Maechler, Steve Walker, Singmann and Grothendieck2017) using R (R Core Team 2014) separately for Q and Duration. Fixed factors were Length (two levels: geminate (CC), singleton (C)), Consonant Type (C_type, seven levels: alveolar stops, alveolopalatal stops (affricates), velar stops, alveolar nasals, alveolopalatal nasals, alveolar fricatives, alveolopalatal fricatives), and Set (four levels: Set 1, Set 2, Set 3a, Set 3b), with the first two factors included as an interaction; random factors were Speaker, Word, and Recording session, with random intercepts only.Footnote 6 Treatment-contrast coding was used for categorical variables, with the baseline condition being the geminate alveolar stop in Set 1. For each analysis, likelihood ratio tests were used to compare the full model to a nested model excluding the factor of interest, employing the Anova() function of the lmerTest package (Kuznetsova et al. Reference Kuznetsova, Brockhoff and Christensen2017). Pairwise comparisons and posthoc tests (with a Bonferroni correction) were performed using the phia package (De Rosario-Martinez Reference De Rosario-Martinez2015). Pearson correlations were further performed to examine the relation between Q and Duration, separately for geminates and singletons.

3. Results

We begin the presentation of the results by a qualitative overview of sample linguopalatal contact profiles in section 3.1. This is followed by the presentation of quantitative results for the amount of contact Q at the point of maximum contact in section 3.2 and an examination of temporal differences in Q in section 3.3. Further, quantitative durational results are presented in section 3.4, followed by an examination of correlations between amount of contact and duration in section 3.5.

Figure 4. Average palate profiles for geminate and singleton consonants from Set 2 produced by JF3 (based on nine repetitions within a single recording session).

3.1 Overview

Figure 4 presents sample average linguopalatal profiles for all consonant pairs produced by one of the speakers, JF3 (a subset of Set 2; see Figure A1 for examples of all words produced by five speakers with original-size images available in the Supplementary Materials). Considering the first two images, /tː/ in matte vs. /t/ in mate, we can see that both consonants were produced with a closure in the anterior portion and side contact throughout the artificial palate. The closure for the geminate, however, was more extensive, occupying three instead of two rows (the alveolar and partly postalveolar regions); the side contact for the geminate /tː/ was also slightly greater than for /t/. This indicates that the geminate was produced with a higher tongue position and stronger contact. Similar differences can be observed for the rest of the consonants, albeit not as robust in the cases of affricates and velar stops. In the case of fricatives, the central channel (alveolar or postalveolar) was narrower for geminates compared to singletons. (It should be kept in mind that EPG measures the extent of the contact but not the distance between the tongue and the palate, which can be also crucial for acoustic consequences of these articulations.)

3.2 Amount of contact (Q)

Results of a linear mixed effect model for Q produced significant effects of Length ( $\chi$ 2(1) = 1035.23, p <.0001), Consonant Type ( $\chi$ 2(6) = 135.28, p <.0001), and Set ( $\chi$ 2(3) = 24.79, p <.0001), as well as a significant interaction of Length and Consonant Type ( $\chi$ 2(6) = 332.05, p <.0001). Full model comparisons are presented in Table 4 and the model estimates in Table 5. The reason for the interaction can be observed in Figure 5a: geminates of all consonant types had on average higher Q values than their singleton counterparts, but the magnitude of these differences varied by type. Posthoc tests run on Consonant Type pairs (see Table 6) revealed that pairwise comparisons for Length were significant at the p <.0001 level for alveolar stops, alveolar nasals, alveolar fricatives, and alveolopalatal nasals; they were significant at the p <.05 level for alveolopalatal affricates and velar stops; the comparison for alveolopalatal fricatives was not significant (p >.05), albeit showing on average slightly more contact for the geminate (by 0.02). Differences for the other pairs were larger: 0.04 for alveolopalatal affricates and velar stops, 0.05 for alveolar fricatives, 0.09 for alveolar stops and alveolopalatal nasals, and 0.14 for alveolar nasals. The average difference across consonant types was 0.07, which means that geminates were on average produced with the contact with four to five extra electrodes on the artificial palate than singletons. Based on these differences, the overall geminate-singleton ratio was 1.13. Ratios for Consonant Types were (in increasing order) 1.05 for alveolopalatal fricatives, 1.07 for alveolopalatal affricates, 1.08 for velar stops, 1.14 for alveolar fricatives and alveolar stops, 1.16 for alveolopalatal nasals, and 1.26 for alveolar nasals.

Figure 5. Boxplots of the amount of linguopalatal contact (Q) (a) by Length and Consonant Type and (b) by Length and Set; CC = geminate, C = singleton, t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives.

The significant Length and Consonant Type interaction was also due to considerable differences in Q among different manners and place of articulation (as seen in Figure 5a). Generally, the amount of contact was higher for coronal (alveolar and alveolopalatal) stops than velar stops and coronal fricatives, as well as higher for alveolopalatals than alveolars. Most of these differences were significant; this was fully expected given the inherent place and manner differences and their linguopalatal realizations. For example, less contact was in general expected for velars given the partial coverage of their constrictions and for fricatives given the absence of closures. The results of pairwise comparisons of different Consonant Types by length are not shown here.

The above-mentioned significant effect of Set was largely due to differences between Set 1 and the other sets, as revealed by posthoc tests shown in Table 7. As also seen in Figure 5b, the amount of contact was lower in Set 1 than in Sets 2 and 3(a,b). This was likely due to the target consonants being placed in the context of the low vowel /a/, compared to the other sets having more varied phonetic contexts (see also Figure A2 in the Appendix). Unexpectedly, no significant differences were found among Sets 2, 3a, and 3b. There was also no significant interaction of Length and Set. On average, geminates had slightly more contact in Sets 3a,b (the contrastive condition) compared to Set 2 (the no-contrast condition): 0.62 and 0.61 vs. 0.59, but singletons showed a similar increase in Sets 3a and 3b compared to Set 2: 0.55 and 0.55 vs. 0.53.

Table 4. Model comparisons for Q (Analysis of Deviance Table, Type II Wald $\chi$ 2 tests); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Table 5. Summary of a linear mixed model for Q; formula: lmer(Q ∼ Length * C_type + Length * Set + (1|Speaker) + (1|Word) + (1|Recording_session), data); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1; the intercept is the geminate alveolar stop in Set 1

Table 6. Q results of posthoc tests for Length by C_type; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Table 7. Q results of posthoc tests for Set; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Considering individual differences, all speakers showed on average greater Q values for geminates than singletons (see Figure A3a), but to a different degree. JF1 and JF2 exhibited relatively low geminate-to-singleton ratios (1.04 and 1.08), while the other three speakers showed considerably higher ratios (1.15 for JF3, 1.18 for JF4, and 1.19 for JF5). Some of these differences can be attributed to the use of different artificial palate models, as the speakers showing the greatest contrast, JF4 and JF5, used the newer model with a better coverage of the velar region (see section 2.3). The speakers’ anatomical differences, however, could also be a factor: JF1 and JF2 had relatively flat hard palates, while the other speakers’ palates were more domed (see Figure 2). This may also explain the overall greater tongue–palate contact for JF1 and JF2, in particular for alveolars (see Figure A1). Another notable inter-speaker difference was in the realization of alveolar fricatives. These sounds were produced by JF1 with a very narrow channel, which given insufficient spatial resolution of the artificial palate was often rendered as a complete closure. In contrast, JF4’s alveolar fricatives appeared to be dental and thus showed very limited anterior contact (especially for the singleton /s/; see the displays for dase and esa in Figure A1).

In summary, our analysis of the amount of linguopalatal contact taken at the constriction maximum showed relatively small but consistently significant differences between geminates and singletons, with the exception of alveolopalatal fricatives. A closer examination of the data for the latter consonants showed considerable variation across the items. Specifically, length differences were near-absent in the pairs involving the words asshi and ashi (Sets 2 and 3). A series of follow-up t-tests by word pair, summarized in Table 8, confirmed the lack of significance for comparisons involving these two items. The geminate-singleton differences for the other pairs (massha vs. masha and issho vs. isho), however, were significant.

Table 8. Results of t-tests for Q for word pairs involving alveolopalatal fricatives; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

The lack of significance in the asshiashi pair can be attributed to the marginal status of the first word (see Footnote 4; cf. Shigeto Kawahara p.c.). A likely additional factor was the devoicing/deletion of the following vowel /i/ in Set 3. This segment was flanked there by voiceless obstruents, as in [aɕːi dewa naku aɕi̥ to itːa], and therefore subject to devoicing (see Fujimoto Reference Fujimoto and Kubozono2015 on Japanese high vowel devoicing). This, in turn, often resulted in the fricative being strongly coarticulated with /t/ (notably for JF1, JF4, and JF5). Taking into consideration these lexical and contextual factors, it can be concluded that the lack of significant length differences for the full set of alveolopalatal fricatives is partly due to the choice of the stimuli and carrier sentences. Finally, it should be noted that vowel devoicing/deletion also affected the item shike: it was consistently produced as [ɕ(i̥)ke] by all the speakers, with the /k/ often showing carryover coarticulation from the alveolopalatal /ɕ/ (see e.g., JF1’s shikeshikke in Figure A1) and thus reducing the length difference.Footnote 7

Figure 6. Trajectories of amount of contact (Q) based on five equally distant time points (normalized) during the closure for geminates (CC) and singletons (C), separately by Consonant Type; t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives; the function geom_smooth() using method = ‘loess’ and formula ‘y ∼ x’; confidence intervals are indicated in grey.

3.3 Temporal differences in the amount of contact (Set 1)

The analysis presented above was based on a single point within the consonant constrictions – the point of maximum contact. It is of interest to know, however, how the amount of contact differences were manifested throughout the consonant constriction. Figure 6 presents Q trajectories across five points during the annotated consonant interval (time-normalized). We can see that with the exception of fricatives and velar stops, Q values for geminates were clearly higher than those for singletons at least at points 2, 3, and 4 (i.e., at 25%, 50%, and 75% of their constriction). Differences for the fricatives were found more towards the end: points 4 and 5 (alveolar) or 3 and 4 (alveolopalatal). Note that the magnitude of the length difference for the alveolopalatal fricative was slightly higher than that for its alveolar counterpart (and much higher than for the velar stop), despite our finding of non-significant differences in the single-point analysis (over the entire data) in section 3.2. The overall low Q values for velars were due to the artificial palate capturing only a part of the closure in the low vowel context.

Altogether, these results show that contact differences between geminates and singletons in Japanese, as produced by our speakers, were not limited to a single point but spanned a large part of the consonant constriction for coronal stops/affricates and nasals, while being more temporally limited for fricatives and velar stops.

Table 9. Model comparisons for Duration (Analysis of Deviance Table, Type II Wald $\chi$ 2 tests); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

3.4 Consonant duration

Results of a linear mixed effect model for duration produced significant effects of Length ( $\chi$ 2(1) = 22301.78, p <.0001), Consonant Type ( $\chi$ 2(6) = 268.07, p <.0001), and Set ( $\chi$ 2(3) = 8.50, p = .0367), as well as a significant interactions of Length and Consonant Type ( $\chi$ 2(2) = 371.07, p <.0001) and Length and Set ( $\chi$ 2(3) = 17.15, p =.0007). Full model comparisons are presented in Table 9, while model estimates are shown in Table 10.

As seen in Figure 7a, the Length and Consonant Type interaction was likely due to Consonant Type differences in the magnitude of the durational Length contrast, as well as in the Consonant Type-specific differences by Length. Overall, geminates of all consonant types had much higher duration values than their singleton counterparts, which is not surprising. Posthoc tests run on consonant type pairs revealed that all pairwise comparisons were significant (p <.0001; see Table 11). Highest differences were observed for velar stops (100 ms), alveolar stops (87 ms), and alveolopalatal fricatives (81 ms), while lower differences were found for alveolopalatal nasals (76 ms), alveolar fricatives (72 ms), and alveolar nasals (66 ms). The overall geminate-singleton ratio was 2.20, ranging from 1.73 for alveolopalatal fricatives to 2.65 for alveolopalatal nasals. In other words, geminate consonants were on average more than twice as long as singletons. In terms of Consonant Type, nasals had shorter duration than stops and fricatives; fricatives tended to be longer than stops, although not consistently across the length categories (see Figure 7a).

Table 10. Summary of a linear mixed model for duration; formula: lmer(Duration ∼ Length * C_type + Length * Set + (1|Speaker) + (1|Word) + (1|Recording_session), data); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1; the intercept is the geminate alveolar stop in Set 1

Table 11. Duration results of posthoc tests for Length by C_type; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

The significant Length and Set interaction was due to Set 3a being significantly different from Set 1, however, only for geminates (p = 0.0348; see Table 12 and Figure 7b). A similar (but non-significant) tendency was also observed for the Set 3b vs. Set 1 pair. This can possibly reflect the greater effect of gemination in the contrastive context: geminates were produced stronger when they occurred in the same sentence with singletons. However, the difference between Sets 3a and 3b vs. Set 2 (where the same words appeared in the no contrast condition) was not significant, and thus the observed differences may reflect the different lexical status of words (nonsense words vs. real words).

Table 12. Duration results of posthoc tests for Set by Length categories; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

With respect to individual results, all speakers showed robust durational differences (see Figure A3b). On average, geminates were more than twice as long as singletons for four out of five speakers (with the ratios being 2.02 for JF1, 2.04 for JF2, 2.67 for JF4, and 2.09 for JF5); one speaker showed a lesser durational difference (JF3, the ratio of 1.91). Interestingly, the length differences did not seem to be affected by the participants’ speech rate. An examination of utterance durations showed that a slower rate was exhibited by JF1 and JF2 (on average 172 and 190 ms per mora), while JF3, JF4, and especially JF5 showed much faster rates (144, 156, and 134 ms per mora respectively).Footnote 8

3.5 Relation between the amount of contact and consonant duration

Plotting all tokens by amount of contact and duration separately by Consonant Type in Figure 8 Footnote 9 shows that the length categories were almost fully separated by duration, while indicating considerable overlap in Q. Interestingly, both geminates and singletons tended to show a positive correlation of the two variables. As seen in Table 13, significant positive correlations were obtained for four out of seven geminate consonants; these included all the alveolopalatals (/ʨː/, /ɕː/, and /ɲː/) and the velar stop /kː/, but not the alveolars. Somewhat unexpectedly, the nasal /nː/ showed a significant negative correlation. Among singletons, significant positive correlations were observed for five out of seven consonants, with only the alveolars /n/ and /s/ showing the lack of it. It should be noted that most of the significant correlations were either moderate (between 0.30 and 0.50) or weak (below 0.30), with only the affricates /ʨː/ and /ʨ/ showing strong correlations (above 0.50). Overall, this means that there was mainly weak-to-moderate association between consonant duration and its amount of linguopalatal contact, largely regardless of the geminate or singleton status. In other words, for most consonants, tokens with longer duration had a somewhat greater amount of contact. While these correlations do not show causation, it is reasonable to interpret greater linguopalatal contact as at least in part resulting from longer constriction durations: to produce a longer consonant duration, one has to apply greater strength to the active articulator, and this results in more contact with the passive articulator. It is important to note, however, that linear regression curves for geminate consonants in Figure 8 were not just a continuation of singleton regression curves; in fact, they were often parallel to each other (notably for the alveolopalatals and the velar stops). This shows that geminates were not simply longer versions of the corresponding singletons; rather, the production mechanisms of the two length categories are inherently distinct (cf. Löfqvist Reference Löfqvist2005, Reference Löfqvist2007; Burroni et al. Reference Burroni, Kawahara and Shaw2025).

Figure 7. Boxplots of consonant Duration in seconds (a) by Length and Consonant Type and (b) by Length and Set; CC = geminate, C = singleton; t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives.

Figure 8. Scatterplot of individual tokens by amount of contact (Q, normalized) and Duration (normalized) by Length and Consonant Type; CC = geminate, C = singleton; t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives; black lines represent linear regression curves for each consonant.

Table 13. Outputs of Pearson’s product-moment correlation analyses by consonant; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

4. Discussion and conclusion

The goal of this study was to provide a systematic investigation of linguopalatal contact differences in the production of the Japanese length contrast across various lingual places and manners of articulation. These consonants were produced by five native speakers in a variety of phonetic and prosodic contexts, as well as in words of different lexical status. Our analysis of the amount of contact across the palate, taken at the point of maximum contact during the constriction, showed consistently higher values for geminates of all places and manners of articulation, with the exception of alveolopalatal fricatives. On average, geminates were produced with an extra contact of four to five electrodes (out of 62) over the artificial palate, resulting in a geminate-to-singleton ratio of 1.13. For comparison, place differences between alveolopalatal and alveolar singletons of the same manner were indicated by on average seven to eight electrodes (ratio 1.27), while manner differences between stop/affricate and fricative singletons of the same place were produced with on average 12 extra electrodes (ratio 1.46). Thus, the length differences were smaller than place and manner differences, but nevertheless substantial. There was also considerable variation with respect to the realization of the length contrast depending on manner and place. The largest differences were exhibited by nasals and the smallest differences were exhibited by fricatives. Furthermore, in terms of place, overall smaller differences were exhibited by alveolopalatal and velar obstruents, compared to alveolar obstruents and nasals regardless of the place. The noted lack of significant difference for alveolopalatal fricatives can be in part attributable to the choice of stimuli and carrier sentences, as differences were found to be significant for some of the items or conditions. It also likely reflects a stronger resistance of these sounds to contact changes (as discussed further below).

The expected length differences in amount of contact were also observed across all three datasets (nonsense words, real words in contrast and without contrast), prosodic contexts, and across the five speakers. Our temporal analysis of consonant constrictions in Set 1 showed that Q differences between geminates and singletons extended through much of the consonant duration, albeit being skewed towards the consonant offset for fricatives. Duration differences in length were very large, showing an average ratio of 2.2 (i.e., with geminates being more than twice as long as singletons). Correlation analyses showed weak-to-moderate positive association between constriction duration and amount of contact, and this was the case for the majority of geminate and singleton consonants. That is, longer durations tended to imply greater linguopalatal contact for most consonant types. Of note, this was in particular the case for alveolopalatals rather than alveolars, possibly due to the overall greater contact involved in the production of the former. While both geminates and singletons showed duration-contact correlations, the two categories had different patterns, pointing to somewhat distinct sources of this relation. Presumably, the greater contact for geminates is at least partly due to their phonological specification for stronger constrictions or more distant ‘virtual targets’ (cf. Ridouane Reference Ridouane, D’Imperio and Vallée2010; Kochetov & Kang Reference Kochetov and Kang2017, as further discussed below), or perhaps is a consequence of their other kinematic and timing properties (cf. Burroni et al. Reference Burroni, Kawahara and Shaw2025).

It is of interest to note that our speakers produced clear length distinctions in both contact and duration despite being (late) bilinguals in English (the language that does not have a comparable contrast). This further indicates that the Japanese distinction between geminates and singletons is very robust and appears to be resistant to L2 influences (see Kartushina, Frauenfelder & Golestani Reference Kartushina, Frauenfelder and Golestani2016 on L2-to-L1 transfer). To further confirm this, however, it would be important to examine the production of geminates by Japanese speakers of different levels of proficiency in English.

Overall, these results confirm our observations of the first articulatory comparison of linguopalatal contact for Japanese consonants in NLRI (1990), and namely for the obstruents /tː, t, ʨː, ʨ, sː, s/ produced in that study by a single speaker. The results also confirm the findings of two other EPG studies of Japanese – Kochetov & Kang (Reference Kochetov and Kang2017; on /tː, t, ʨː, ʨ/ in the ma_a context in nonsense words) and Matsui et al. (Reference Matsui, Kawahara and Shaw2016; on /tː, t, sː, s/ in onomatopoeic words) – and extend our observations to other consonants, prosodic conditions, and kinds of lexical items. It is important to note that alveolopalatal affricates in Kochetov & Kang (Reference Kochetov and Kang2017) were not as clearly distinguished in amount of contact as alveolar stops (showing differences in the anterior part of the palate only). The same holds in the current study for alveolopalatal obstruents in general: while affricates showed significant contact differences between geminates and singletons, differences in alveolopalatal fricatives were not significant (at least for the entire dataset). This is consistent with the general characterization of alveolopalatals as highly resistant to positional variation and coarticulation and in fact already exhibiting an extensive linguopalatal contact (Recasens & Espinosa Reference Recasens and Espinosa2006, Reference Recasens and Espinosa2009; see also Payne Reference Payne2006 on a comparison of alveolopalatals and geminate dentals in Italian).

Another important finding of the current study is that alveolar fricatives showed relatively small, nevertheless significant length differences in Q. This is in agreement with Löfqvist’s (2007) finding based on EMA (and broadly with Burroni et al. Reference Burroni, Kawahara and Shaw2025, who had these consonants as part of a larger dataset) but contrary to the lack of observed differences for /sː, s/ in Matsui et al. (Reference Matsui, Kawahara and Shaw2016) based on EPG and in Morimoto (Reference Morimoto2020) based on EMA. It is possible that the between-study differences reflect individual variability among Japanese speakers in the presence/absence or the degree of the non-durational realization of the contrast. Alternatively, given the small magnitude of length differences in alveolar fricatives, previous studies could have lacked the required statistical power or adequate spatial resolution for the tongue–palate contact (in the case of EMA). Note also that despite a significant amount of contact differences, alveolar fricatives in the current study behaved differently from other manners in the lack of Q-Duration correlations. Further research is therefore needed into manner- and place-specific differences in the realization of Japanese geminates making use of larger participant samples, more thoroughly controlled materials, and ideally a combination of different methods.

Another manner-specific finding of the current study is that nasals showed the most robust length difference (the ratios of 1.16 and 1.26, compared to 1.05 to 1.14 for the other consonant types). This is a novel result, as none of the previous EPG studies of the Japanese length have examined lingual nasals. While alveolar nasals were examined in Morimoto’s (Reference Morimoto2020) EMA study, they were found to show intermediate tongue-tip position differences between stops and fricatives. It is also of interest to note that nasals in our data showed the shortest duration both for geminates and singletons, and that one of the nasal consonant types (alveolar nasal) did not show the expected correlation between Q and duration. Some of these manner-specific differences can be attributed to the motor control requirements for nasals: unlike stops, nasals do not require a fully sealed-off oral cavity, leading to weaker constrictions and partial lack of side contact with the palate (Gibbon et al. Reference Gibbon, Yuen, Lee and Adams2007). A related factor is the susceptibility of single nasals to intervocalic lenition (also observed in our data), which must have contributed to the greater contrast with geminates. Finally, it is worth noting that geminate nasals in Japanese are traditionally analyzed differently from geminate obstruents (see section 1.2). These phonological representational differences, if meaningful, may have also contributed to manner-specific phonetic differences.

As mentioned above, consistent non-durational differences were observed in our data largely regardless of the dataset and prosodic context. While we predicted this to be the case, we also expected to find some differences among the datasets – namely between nonsense words and real words and between real words in isolation and in a contrastive focus context. Additionally, we expected the contrast to be more prominent in utterance-initial position than in utterance-medial position. These predictions were confirmed for neither amount of contact nor duration, with the exception of differences involving Set 1 (nonsense words) and some of the other sets, which could be more plausibly explained by segmental context differences. It therefore seems that the geminate-singleton contrast in Japanese is relatively stable, hardly sensitive to prosodic differences and shows relatively minimal differences in their realization across the lexicon.

The results for non-durational differences for Japanese consonants fit into the growing body of findings for typology of gemination across world languages. Recall that similar effects, in terms of the tightness of constriction, were obtained in EPG studies of languages as diverse as Italian, Swiss German, and Tashlhiyt Berber (Farnetani Reference Farnetani, Hardcastle and Marchal1990; Kraehenmann & Lahiri Reference Astrid and Lahiri2008; Payne Reference Payne2006; Ridouane Reference Ridouane2007; Ridouane & Hallé Reference Ridouane, Hallé and Kubozono2017; see section 1.1). Studies that used other methods, such as EMA and MRI have produced similar results (cf. Maspong et al. Reference Maspong, Burroni and Kirby2024 on Italian; Fujimoto et al. Reference Fujimoto, Shinohara and Mochihashi2023 on Ikema Miyako Ryukyuan), while pointing to some language-particular differences in kinematic properties (cf. Löfqvist Reference Löfqvist2005 on Japanese vs. Swedish; see Kubozono Reference Kubozono and Kubozono2017). The reason for tighter constrictions for geminate consonants is presumably due to the need to apply greater articulatory force in order to sustain longer contact between lower and upper articulators.

The fact that the resulting articulatory differences are systematic and largely independent of pure constriction duration differences (which are also observed in singletons; see section 3.5) suggests that these effects are grammatical in nature, being part of phonetic implementation of geminate consonants. This was in fact proposed by Ridouane (Reference Ridouane, D’Imperio and Vallée2010) to explain non-durational differences in Berber initial and medial geminates. According to his analysis, the phonologically long consonants, associated with two timing slots, are enhanced at the phonetic level of representation by the feature [tense]. The implementation of this feature-filling rule results in geminates being different from singletons not only in duration, but also in the tightness of the oral constrictions (see also Kochetov & Kang Reference Kochetov and Kang2017). Taking this further, the implementation of [tense] can have somewhat different realization depending on the manner and place of target consonants, as related to articulatory constraints on their production (coarticulatory resistance; Recasens, Pallarès & Fontdevila Reference Recasens, Dolors Pallarès and Fontdevila1997). Specifically, these effects are expected to be stronger for the less resistant constrictions of nasals and weaker for the more resistant constrictions of fricatives; similar differences are expected of place differences, namely between alveolars and alveolopatals (Recasens & Espinosa Reference Recasens and Espinosa2006, Reference Recasens and Espinosa2009). How exactly length contrasts are enhanced by articulatory tensing across various consonant types and prosodic positions in various languages is a fruitful direction to pursue in the future.

Acknowledgements

The author would like to thank Shigeto Kawahara and three anonymous reviewers for multiple helpful suggestions and corrections. He is also grateful to Maho Kobayashi for assistance with the design of the materials and to Jacqueline Wong for help with data annotation. The work was supported by the Social Sciences and Humanities Research Council of Canada Standard Grant #416-2010-0959. A subset of the data presented here were originally analyzed and presented in Kochetov (Reference Kochetov2012; four word pairs with /tː, t, kː, k/ from Set 2, three speakers, first recording session) and Kochetov and Kang (Reference Kochetov and Kang2017; two word pairs with /tː, t, ʨː, ʨ/ from Set 1, five speakers).

Appendix

Figure A1. Average palate profiles (taken over entire constriction intervals) for geminate and singleton consonants from Set 2 produced by all speakers (based on nine repetitions within a single recording session for JF1, JF2, and JF3; based on 18 repetitions for JF4 and JF5). All original-size images are available in the Supplementary Materials.

Figure A2. Boxplots of Q by Length, Consonant Type, and Set; CC= geminate, C = singleton, t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives.

Figure A3. Boxplots of (a) Q and (b) consonant duration by Length and Speaker (JF1–JF5); CC= geminate, C = singleton.

Footnotes

1 The phonemic status of alveolopalatals in Japanese is somewhat debatable, as these can be interpreted as either independent phonemes or variants of alveolars in C+j,i contexts (Vance Reference Vance1987; Pintér Reference Pintér and Kubozono2015).

2 A subset of the data from Set 1 (the stops/affricates in 4 words: matta, mata, matcha, and macha) produced by the same speakers were used in Kochetov & Kang (Reference Kochetov and Kang2017). The author is thankful to Shigeto Kawahara and anonymous reviewers for suggestions on how to improve the presentation of the materials in this section.

3 Some of the initially-accented items that are marked here as nonsense words have non-initially-accented counterparts that are real words (mata, makka, and matcha). A review of the recordings showed that the participants produced the pitch accent as instructed, with a small number of occasional deviations in some of these words (e.g., matcha by JF5).

4 The items dasse and asshi can be regarded as lexically marginal. The first one is an Osaka dialect (Western Japanese) variant of desu yo ‘it is’ (colloquial). The second one is a sociolectal variant of the first-person singular pronoun. As pointed out by Shigeto Kawahara, the use of this variant is restricted to lower social class male speakers and is somewhat outdated.

5 Speaker-specific numbers of repetitions, however, differed due to the participants’ availability for recording sessions. For Set 1, 24 repetitions were obtained for JF1, 12 for JF2, 36 for JF3, and 18 for JF4 and JF5. For Set 2, 18 repetitions were obtained for JF1, 9 for JF2, 27 for JF3, and 18 for JF4 and JF5. Finally, for Set 3 (both orders), 18 repetitions were obtained for JF1, 9 for JF2, 27 for JF3, and 18 for JF4, and 15 for JF5.

6 Attempts to add random slopes to any of these factors produced model non-convergence; the same resulted from adding Set to the interaction with Consonant Type.

7 The finding of the greater overlap in the [ɕk] sequence is consistent with Shaw and Kawahara’s (Reference Shaw and Kawahara2023) finding that Japanese fricative-stop clusters resulting from vowel devoicing/deletion get gesturally reorganized, with their oral gestures being coordinated with each other.

8 It should be noted that these rates fall within the range reported for Japanese read speech by Campbell (Reference Campbell1999), where CV syllables were on average 112 to 186 ms long.

9 For the purpose of calculating correlations, Q and Duration values were z-scored by participant.

References

Armosti, Spyros. 2009. The phonetics of plosive and affricate gemination in Cypriot Greek. PhD dissertation, University of Cambridge.Google Scholar
Bates, Douglas, Martin Maechler, Ben Bolker, Steve Walker, Rune Haubo Bojesen Christensen, Singmann, Henrik, & Grothendieck, Gabor. 2017. lme4 package, version 1.1–13 [computer software]. Retrieved from https://cran.rproject.org/web/packages/lme4/index.html Google Scholar
Beckman, Mary. 1982. Segment duration and the mora in Japanese. Phonetica 39, 113135.10.1159/000261655CrossRefGoogle Scholar
Bouarourou, Fatssal, Vaxelaire, Béatrice, Ridouane, Rachid, Hirsch, Fabrice, & Sock, Rudolph. 2008. Gemination in Tarifit Berber: X-ray and acoustic data. In Rudolph Sock, Fuchs, Susanne & Laprie, Yves (eds.), Proceedings of the 8th International Seminar on Speech Production, 117–120. Rocquencourt: INRIA.Google Scholar
Burroni, Francesco, Kawahara, Shigeto, & Shaw, Jason A.. 2025. Articulatory correlates of consonantal length contrasts: The case of Japanese mimetic geminates. JASA Express Letters 5(1).10.1121/10.0034762CrossRefGoogle ScholarPubMed
Byrd, Dani. 1995. Articulatory characteristics of single and blended lingual gestures. Proceedings of the 13th International Congress of Phonetic Sciences, Vol. 2, 438441.Google Scholar
Campbell, Nick. 1999. A study of Japanese speech timing from the syllable perspective. Journal of the Phonetic Society of Japan 3(2), 2939.Google Scholar
Celata, Chiara, Meluzzi, Chiara, & Bertini, Chiara. 2022. Acoustic and kinematic correlates of heterosyllabicity in different phonological contexts. Language and Speech 65(3), 755780.10.1177/00238309211065789CrossRefGoogle ScholarPubMed
De Rosario-Martinez, Helios. 2015. Package ‘phia’, available at https://github.com/heliosdrm/phia Google Scholar
Dunn, Margaret Hall. 1993. The phonetics and phonology of geminate consonants: A production study. PhD dissertation, Yale University.Google Scholar
Farnetani, Edda. 1990. V-C-V lingual coarticulation and its spatiotemporal domain. In Hardcastle, William & Marchal, Alain (eds.), Speech production and speech modeling, 93–130. Dordrecht: Kluwer.Google Scholar
Fujimoto, Masako. 2015. Vowel devoicing. In Kubozono, Haruo (ed.), The handbook of Japanese phonetics and phonology, 167–214. Berlin: Mouton de Gruyter.Google Scholar
Fujimoto, Masako, Shinohara, Shigeko, & Mochihashi, Daichi. 2023. Articulation of geminate obstruents in the Ikema dialect of Miyako Ryukyuan: A real-time MRI analysis. Journal of the International Phonetic Association 53(1), 6993.10.1017/S0025100321000013CrossRefGoogle Scholar
Gahl, Susanne. 2008. “Thyme” and “time” are not homophones. Word durations in spontaneous speech. Language 84(3), 474496.CrossRefGoogle Scholar
Gibbon, Fiona & Nicolaidis, Katerina. 1999. Palatography. In Hardcastle, William & Hewlett, Nigel (eds.), Coarticulation: Data, theory and techniques, 229–245. Cambridge: Cambridge University Press.Google Scholar
Gibbon, Fiona E., Yuen, Ivan, Lee, Alice, & Adams, Lynne. 2007. Normal adult speakers’ tongue palate contact patterns for alveolar oral and nasal stops. Advances in Speech Language Pathology 9, 8289.10.1080/14417040600954824CrossRefGoogle Scholar
Gili-Fivela, Barbara, Claudio Zmarich, Pascal Perrier, Savariaux, Christophe, & Tisato, Graziano. 2007. Acoustic and kinematic correlates of phonological length contrast in Italian consonants. In Proceedings of the 16th International Congress of Phonetic Sciences, 469–472.Google Scholar
Han, Mieko S. 1994. Acoustic manifestations of mora timing in Japanese. Journal of the Acoustical Society of America 96, 7382.CrossRefGoogle Scholar
Hirata, Yukari & Whiton, Jacob. 2005. Effects of speaking rate on the single/geminate stop distinction in Japanese. Journal of the Acoustical Society of America 118, 16471660.CrossRefGoogle ScholarPubMed
Homma, Yayoi. 1981. Durational relationship between Japanese stops and vowels. Journal of Phonetics 9, 273281.10.1016/S0095-4470(19)30971-4CrossRefGoogle Scholar
Idemaru, Kaori & Guion, Susan G.. 2008. Acoustic covariants of length contrast in Japanese stops. Journal of International Phonetic Association 38, 167186.10.1017/S0025100308003459CrossRefGoogle Scholar
Kartushina, Natalia, Frauenfelder, Ulrich H., & Golestani, Narly. 2016. How and when does the second language influence the production of native speech sounds: A literature review. Language Learning 66, 155186.10.1111/lang.12187CrossRefGoogle Scholar
Kawagoe, Itsue. 2015. The phonology of sokuon, or geminate obstruents. In Kubozono, Haruo (ed.), The handbook of Japanese phonetics and phonology, 79–120. Berlin: De Gruyter Mouton.Google Scholar
Kawahara, Shigeto. 2006. A faithfulness ranking projected from a perceptibility scale: The case of [+voice] in Japanese. Language 82, 536574.10.1353/lan.2006.0146CrossRefGoogle Scholar
Kawahara, Shigeto. 2015a. The phonetics of sokuon, or geminate obstruents. In Kubozono, Haruo (ed.), The handbook of Japanese phonetics and phonology, 43–78. Berlin: De Gruyter Mouton.Google Scholar
Kawahara, Shigeto. 2015b. The phonology of Japanese accent. In Kubozono, Haruo (ed.), The handbook of Japanese phonetics and phonology, 445–492. Berlin: De Gruyter Mouton.Google Scholar
Kawahara, Shigeto & Matsui, F. Michinao. 2017. Some aspects of Japanese consonant articulation: A preliminary EPG study. ICU Working Papers in Linguistics (ICUWPL) 2, 9–20.Google Scholar
Kochetov, Alexei. 2012. Linguopalatal contact differences between Japanese geminate and singleton stops. Canadian Acoustics 40, 2829.Google Scholar
Kochetov, Alexei. 2020. Research methods in articulatory phonetics I: Introduction and studying oral gestures. Language and Linguistics Compass 14(4), 14:e12368.10.1111/lnc3.12368CrossRefGoogle Scholar
Kochetov, Alexei & Kang, Yoonjung. 2017. Supralaryngeal implementation of length and laryngeal contrasts in Japanese and Korean. Canadian Journal of Linguistics/Revue canadienne de linguistique 62(1), 1855.10.1017/cnj.2016.39CrossRefGoogle Scholar
Kochetov, Alexei, Colantoni, Laura, & Steele, Jeffrey. 2017. A comparison of articulate and reading EPG palates: Capturing place/manner contrasts. Poster presented at the 11th International Seminar on Speech Production (ISSP2017), Tianjin, China.Google Scholar
Astrid, Kraehenmann & Lahiri, Aditi. 2008. Duration differences in the articulation and acoustics of Swiss German word-initial geminate and singleton stops. Journal of the Acoustical Society of America 123, 44464455.Google Scholar
Kubozono, Haruo (ed.). 2017. Introduction to the phonetics and phonology of geminate consonants. In Kubozono, Haruo (ed.), The phonetics and phonology of geminate consonants, 1–10. Oxford: Oxford University Press.10.1093/oso/9780198754930.001.0001CrossRefGoogle Scholar
Kuznetsova, Alexandra, Brockhoff, Per B., & Christensen, Rune H. B.. 2017. lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software 82, 126.10.18637/jss.v082.i13CrossRefGoogle Scholar
Ilse, Lehiste, Morton, Katherine & Tatham, M. A. A.. 1973. An instrumental study of consonant gemination. Journal of Phonetics 1, 131148.Google Scholar
Löfqvist, Anders. 2005. Lip kinematics in long and short stop and fricative consonants. The Journal of the Acoustical Society of America 117(2), 858878.10.1121/1.1840531CrossRefGoogle ScholarPubMed
Löfqvist, Anders. 2007. Tongue movement kinematics in long and short Japanese consonants. Journal of the Acoustical Society of America 122, 512518.CrossRefGoogle ScholarPubMed
Maspong, Sireemas, Burroni, Francesco, & Kirby, James. 2024. CF0 effect and articulatory strength of geminate consonants. Proceedings of Speech Prosody 2024, 334338.10.21437/SpeechProsody.2024-68CrossRefGoogle Scholar
Matsui, Michinao, Kawahara, Shigeto, & Shaw, Jason A.. 2016. EPG-wo mochiita nihongo shikei sokuon-no chou-on tokuchoo [Articulatory nature of alveolar geminates in Japanese: EPG data]. Proceedings of the Phonetic Society of Japan Meeting 1, 132137.Google Scholar
Morimoto, Maho. 2020. Geminated liquids in Japanese: A production study. PhD dissertation, University of California Santa Cruz.Google Scholar
NLRI. 1990. Nihongo no boin, shiin, onsetsu: Chouon undou no jikken onseigakuteki kenkyuu [Japanese vowels, consonants, and syllables: Experimental phonetics research of articulatory movements]: Report of the National Language Research Institute, no.100. Tokyo: National Language Research Institute.Google Scholar
Onishi, Takuichiro. 2018. Dialects of Japanese. In Boberg, Charles, Nerbonne, John & Watt, Dominic (eds.). The handbook of dialectology, 559–570. Hoboken, NJ: John Wiley & Sons.Google Scholar
Payne, Elinor M. 2006. Non-durational indices in Italian geminate consonants. Journal of the International Phonetic Association 36(1), 8395.10.1017/S0025100306002398CrossRefGoogle Scholar
Pintér, Gabor. 2015. The emergence of new consonant contrasts. In Kubozono, Haruo (ed.), The handbook of Japanese phonetics and phonology, 121–165. Berlin: De Gruyter Mouton.Google Scholar
R Core Team. 2014. R: A language and environment for statistical computing [Computer program]. R Foundation for Statistical Computing. http://www.R-project.org/ Google Scholar
Recasens, Daniel & Espinosa, Aina. 2006. Articulatory, positional, and contextual characteristics of palatal consonants: Evidence from Majorcan Catalan. Journal of Phonetics 34, 295318.CrossRefGoogle Scholar
Recasens, Daniel & Espinosa, Aina. 2009. An articulatory investigation of lingual coarticulatory resistance and aggressiveness for consonants and vowels in Catalan. The Journal of the Acoustical Society of America 125, 22882298.10.1121/1.3089222CrossRefGoogle ScholarPubMed
Recasens, Daniel, Dolors Pallarès, Maria, & Fontdevila, Jordi. 1997. A model of lingual coarticulation based on articulatory constraints. The Journal of the Acoustical Society of America 102, 544561.10.1121/1.419727CrossRefGoogle Scholar
Ridouane, Rachid. 2007. Gemination in Tashlhiyt Berber: An acoustic and articulatory study. Journal of the International Phonetic Association 37(2), 119142.10.1017/S0025100307002903CrossRefGoogle Scholar
Ridouane, Rachid. 2010. Geminates at the junction of phonetics and phonology. In Cécile Fougeron, Barbara Kühnert, D’Imperio, Mariapaola & Vallée, Nathalie (eds.), Papers in Laboratory Phonology 10: Variation, phonetic detail and phonological representation, 61–90. Berlin: Mouton de Gruyter.Google Scholar
Ridouane, Rachid & Hallé, Pierre. 2017. Word-initial geminates from production to perception. In Kubozono, Haruo (ed.), The phonetics and phonology of geminate consonants, 66–84. Oxford: Oxford University Press.Google Scholar
Shaw, J. A. & Kawahara, S.. 2023. Limits on gestural reorganization following vowel deletion: The case of Tokyo Japanese. Laboratory Phonology: Journal of the Association for Laboratory Phonology 14(1), 133.Google Scholar
Šimko, Juraj, O’Dell, Michael & Vainio, Martti. 2014. Emergent consonantal quantity contrast and context-dependence of gestural phasing. Journal of Phonetics 44, 130151.10.1016/j.wocn.2013.11.006CrossRefGoogle Scholar
Smith, Caroline L. 1995. Prosodic patterns in the coordination of vowel and consonant gestures. In Bruce Connell & Amalia. Arvaniti (eds.), Laboratory Phonology IV: Phonology and Phonetic Evidence, 205–222. Cambridge: Cambridge University Press.Google Scholar
Tabain, Marija. 2011. Electropalatography data from Central Arrernte: A comparison of the new Articulate palate with the standard Reading palate. Journal of the International Phonetic Association 41(3), 343367.10.1017/S0025100311000132CrossRefGoogle Scholar
Tomaschek, Fabian, Tucker, Benjamin V., Fasiolo, Matteo, & Harald Baayen, R.. 2018. Practice makes perfect: The consequences of lexical proficiency for articulation. Linguistics Vanguard 4(s2), 20170018.CrossRefGoogle Scholar
Türk, Helen, Lippus, Pärtel, & Šimko, Juraj. 2017. Context-dependent articulation of consonant gemination in Estonian. Laboratory Phonology 8(1), 126.10.5334/labphon.117CrossRefGoogle Scholar
Vance, Timothy J. 1987. An introduction to Japanese phonology. New York: SUNY Press.Google Scholar
Vance, Timothy J. 2008. The sounds of Japanese. Cambridge: Cambridge University Press.Google Scholar
Vaxelaire, Béatrice. 1995. Single vs. double (abutted) consonants across speech rate: X-ray and acoustic data for French. Proceedings of the 13th International Conference of Phonetic Sciences, Stockholm, Vol. 1, 384–387.Google Scholar
Wrench, Alan A. 2007. Advances in EPG palate design. International Journal of Speech-Language Pathology 9, 312.Google Scholar
Wrench, Alan, Gibbon, Fiona, McNeill, Alison M. & Wood, Sara. 2002. An EPG therapy protocol for remediation and assessment of articulation disorders. In Hansen, J. H. L. & Pellom, B. (eds.), Proceedings of the 7th International Conference on Spoken Language Processing, 965–968.Google Scholar
Figure 0

Table 1. Examples of words illustrating the geminate-singleton contrast in Japanese (see footnote 1 on the phonemic status of alveolopalatals)

Figure 1

Figure 1. EPG frames overlaid over a palate cast for geminate and singleton consonants produced by a Japanese speaker; dots represent electrodes embedded into the artificial palate; the lines demarcate the areas where linguopalatal contact was present (dotted for geminates and plain black for singletons) (adapted from figures on pp. 135, 147, 169, 227, 321, 323, 325, 327 in NLRI 1990). For example, the contact for [tː] in atta was produced in the front (upper) region of the palate below the incisors, which corresponds to the alveolar ridge; additionally, the contact was at the sides of the palate, but not in its central region.

Figure 2

Table 2. Items used in Set 1, shown in Roman transliteration and the katakana syllabary; some of these are real words or personal names; others are nonsense words (indicated with ‘—’)

Figure 3

Table 3. Real words used in the study (Set 2 and Set 3), shown in Roman transliteration and Japanese orthography

Figure 4

Figure 2. Artificial palates and the accompanying palate casts of the speakers.

Figure 5

Figure 3. (a) A sample annotated token: /kː/ in sekki by speaker JF3; (b) the frame taken at the point of maximum contact; (c) an average of all frames within the closure interval with numbers and shading indicating percentages of each electrode activation across frames; (d) an average of nine tokens of /kː/ (the entire closure intervals) in sekki by this speaker.

Figure 6

Figure 4. Average palate profiles for geminate and singleton consonants from Set 2 produced by JF3 (based on nine repetitions within a single recording session).

Figure 7

Figure 5. Boxplots of the amount of linguopalatal contact (Q) (a) by Length and Consonant Type and (b) by Length and Set; CC = geminate, C = singleton, t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives.

Figure 8

Table 4. Model comparisons for Q (Analysis of Deviance Table, Type II Wald $\chi$2 tests); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 9

Table 5. Summary of a linear mixed model for Q; formula: lmer(Q ∼ Length * C_type + Length * Set + (1|Speaker) + (1|Word) + (1|Recording_session), data); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1; the intercept is the geminate alveolar stop in Set 1

Figure 10

Table 6. Q results of posthoc tests for Length by C_type; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 11

Table 7. Q results of posthoc tests for Set; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 12

Table 8. Results of t-tests for Q for word pairs involving alveolopalatal fricatives; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 13

Figure 6. Trajectories of amount of contact (Q) based on five equally distant time points (normalized) during the closure for geminates (CC) and singletons (C), separately by Consonant Type; t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives; the function geom_smooth() using method = ‘loess’ and formula ‘y ∼ x’; confidence intervals are indicated in grey.

Figure 14

Table 9. Model comparisons for Duration (Analysis of Deviance Table, Type II Wald $\chi$2 tests); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 15

Table 10. Summary of a linear mixed model for duration; formula: lmer(Duration ∼ Length * C_type + Length * Set + (1|Speaker) + (1|Word) + (1|Recording_session), data); significance codes: ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1; the intercept is the geminate alveolar stop in Set 1

Figure 16

Table 11. Duration results of posthoc tests for Length by C_type; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 17

Table 12. Duration results of posthoc tests for Set by Length categories; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 18

Figure 7. Boxplots of consonant Duration in seconds (a) by Length and Consonant Type and (b) by Length and Set; CC = geminate, C = singleton; t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives.

Figure 19

Figure 8. Scatterplot of individual tokens by amount of contact (Q, normalized) and Duration (normalized) by Length and Consonant Type; CC = geminate, C = singleton; t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives; black lines represent linear regression curves for each consonant.

Figure 20

Table 13. Outputs of Pearson’s product-moment correlation analyses by consonant; ‘***’ <0.001, ‘**’ <0.01, ‘*’ < 0.05, ‘.’ <0.1

Figure 21

Figure A1. Average palate profiles (taken over entire constriction intervals) for geminate and singleton consonants from Set 2 produced by all speakers (based on nine repetitions within a single recording session for JF1, JF2, and JF3; based on 18 repetitions for JF4 and JF5). All original-size images are available in the Supplementary Materials.

Figure 22

Figure A2. Boxplots of Q by Length, Consonant Type, and Set; CC= geminate, C = singleton, t = alveolar stops, ch = alveolopalatal affricates, k = velar stops, n = alveolar nasals, ny = alveolopalatal nasals, s = fricative alveolars, sh = alveolopalatal fricatives.

Figure 23

Figure A3. Boxplots of (a) Q and (b) consonant duration by Length and Speaker (JF1–JF5); CC= geminate, C = singleton.