Hostname: page-component-7dd5485656-bvgqh Total loading time: 0 Render date: 2025-10-24T17:52:03.194Z Has data issue: false hasContentIssue false

An acoustic study of tense, lax, and glottalized vowels in Chichicastenango K’iche’ (Maya)

Published online by Cambridge University Press:  02 October 2025

Elizabeth Anne Wood*
Affiliation:
The University of Texas at Austin
Rights & Permissions [Opens in a new window]

Abstract

This article presents an acoustic analysis of vowel quality and duration in Chichicastenango K’iche’ (Maya) tense, lax and glottalized vowels through a controlled speech production experiment. The results show that most of the five tense–lax pairs can be distinguished through F1 and F2, with the high and mid lax vowels lower than their tense counterparts and the low lax vowel higher than its tense counterpart. Glottalized high and mid vowels have lax quality while glottalized low vowels have tense quality. The high lax vowels /ɪ/ and /ʊ/ and their glottalized counterparts show a high degree of overlap with surrounding categories and appear to be in process of being lost, though they retain distinct phonological behavior. Glottalized vowels are longer than tense vowels, which are longer than lax vowels. The voice quality of glottalized vowels is highly variable and is influenced by context. Realizations with full closures are almost entirely absent. Neither vowel quality nor voice quality results show clear evidence in favor of either a one-segment or two-segment analysis for glottalized vowels.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The International Phonetic Association

1. Introduction

The Chichicastenango dialect of K’iche’ (ISO639-3 QUC) is known among Mayan languages for its tense–lax vowel system, which replaced a historical length contrast (López Ixcoy Reference López Ixcoy1994; Par Sapón & Can Pixabaj Reference Par Sapón and Angelina Can Pixabaj2000; Can Pixabaj Reference Can Pixabaj2017; England & Baird Reference England, Baird and Judith Aissen2017), as also occurred in the closely related language Kaqchikel (Majzul, García Matzar & Espantzay Serech Reference Majzul, Matzar and Espantzay Serech2000; Bennett Reference Bennett2016a). Like other Mayan languages, K’iche’ also has a set of words with vowels produced with laryngealized phonation towards the end of the vowel, which have been alternately analyzed as either two segments – a vowel followed by a glottal stop – or one segment – a phonemically glottalized vowel (Baird Reference Baird2011). In Chichicastenango K’iche’, these vowels appear to have lax quality for high and mid vowel pairs but tense quality for the central pair, and although they are perceptually distinct from plain (tense or lax) vowels, full glottal closures appear to be very uncommon. The only existing acoustic study of vowels in the language is a pilot study of tense and lax vowels (Wood Reference Wood2020). Therefore, many aspects of this system remain unclear. This paper presents an acoustic analysis of vowel quality, duration, and voice quality in Chichicastenango K’iche’ (henceforth CK) vowels through a controlled speech production experiment.

The paper is organized as follows. Section 2 provides background on the consonants and vowels of CK, contextualized within the Mayan language family and cross-linguistic patterns. Section 3 details the methods used for the production experiment, and Section 4 describes the results. Section 5 discusses some major aspects of these results, focusing on the evidence for ongoing mergers of the high lax vowels with surrounding vowels and the implications of the vowel quality and voice quality results for the analysis of glottalized vowels. Section 6 concludes the article. The appendix includes additional statistical results excluded from the main text. The supplementary materials include audio examples for all CK words cited throughout the paper. When possible, all examples come from the experimental data; otherwise, they come from elicitation sessions or spontaneous speech recorded by the author in Chichicastenango between 2018–2025.Footnote 1

2. Background: Chichicastenango K’iche’ sounds in Mayan and cross-linguistic perspective

The consonant inventory of K’iche’ is shown in Table 1. Where different from the IPA, the orthographic symbols are included in < >.

Table 1. K’iche’ consonant inventory

Stops and affricates contrast a plain series with a series referred to as ‘glottalized’ in the Mayanist literature (Bennett Reference Bennett2016b). In CK, like other Mayan languages, most of the glottalized consonants are ejective, but the bilabial is usually realized as a voiced implosive. With the exception of the bilabial implosive, all obstruents are voiceless and all sonorants voiced. All voiced consonants except for the nasals are sometimes devoiced in word-final position, though this appears to occur much less frequently than in related languages (Bennett Reference Bennett2016b).

Most K’iche’ dialects have vowels at five places of articulation /i e u o a/ with a phonemic length contrast (Par Sapón & Can Pixabaj Reference Par Sapón and Angelina Can Pixabaj2000). In CK, however, the length contrast has been replaced by a contrast in quality, with historic long vowels appearing as ‘tense’ /i e u o a/ and historic short vowels as ‘lax’ /ɪ ɛ ʊ ɔ ə/ (<ï eë ü ö ä>) (López Ixcoy Reference López Ixcoy1994; Par Sapón & Can Pixabaj Reference Par Sapón and Angelina Can Pixabaj2000; Can Pixabaj Reference Can Pixabaj2017; England & Baird Reference England, Baird and Judith Aissen2017). A pilot speech production study of CK showed that tense vowels are overall more peripheral and lax vowels more centralized, but the relative contributions of F1, F2, and duration could not be determined due to the small size of the dataset and considerable variation across speakers and vowel pairs (Wood Reference Wood2020). Furthermore, the high front lax vowel /ɪ/ in this data appeared to be merging with surrounding categories /i/ and /ə/, with variation observed across speakers and items.

The tense–lax contrast is important in the grammar of CK in several ways. The stress pattern of verbs prioritizes tense over lax vowels, e.g., stress is on the tense vowel /u/ in the penultimate syllable of jumaj /χumaχ/ [ˈχu.maχ] ‘to throw’, but on the final syllable of knaj /kʊnaχ/ [k.ˈnaχ] ‘to heal’, where the penultimate vowel is lax (see Wood Reference Wood2024 for a more detailed description of stress in the language). Lax vowels in CK are regularly deleted in unstressed, non-final open syllables, but tense vowels strongly resist deletion (Wood Reference Wood2024). Tense and lax vowels also alternate in a number of phonological and morphological contexts. Lax vowels are realized as the corresponding tense pair when word-initial, e.g. atz’yäq [at͡s'.ˈjəq] ‘clothing’ beginning with [a] vs. wätz’yaq [wət͡s'.ˈjaq] ‘my clothing’ where the first vowel is realized as its underlying lax quality [ə] due to the presence of an initial consonant. Many nouns have lax vowels in their final syllable which become the corresponding tense vowel when the noun is possessed; the same pattern is found with vowel length in other K’iche’ dialects (Can Pixabaj Reference Can Pixabaj2017). This alternation can be seen in the second syllable of the word for ‘clothing’ mentioned previously. Lax vowels in CVC transitive roots become tense in the passive form; this alternation again occurs with vowel length in other K’iche’ dialects, e.g., xäkk’äm [ʃək.ˈk'əm] ‘they took it’ with the lax vowel [ə] in the root /k'əm/ ‘take’ vs. kik’am [ki.ˈk'am] ‘they are taken’ with corresponding tense vowel [a].

The terms “tense” and “lax” have been used to describe a wide range of phonetic contrasts in different languages, including various combinations of vowel height (e.g., Dalton Reference Dalton2011), duration (e.g., Bennett Reference Bennett1968), tongue root position (e.g., Fulop, Kari & Ladefoged Reference Fulop, Kari and Ladefoged1998) and voice quality (e.g., Di Paolo & Faber Reference Di Paolo and Faber1990). In Kaqchikel and other Mayan languages with quality contrasts beyond the five canonical vowels /a e i o u/, the terms refer to a centralization contrast, where tense vowels are more peripheral and lax vowels are in some sense more central (closer to the center of gravity of the vowel space) (Bennett Reference Bennett2016a; England & Baird Reference England, Baird and Judith Aissen2017). It is worth noting that in this usage, lax vowels are not universally lower than their tense counterparts, as the lax partner of tense /a/ is /ə/, /ɨ/ or otherwise higher than /a/. Bennett (Reference Bennett2016a) finds a duration difference of about 20 ms between the tense and lax central vowel pair in Kaqchikel, but no significant differences between the tense and lax high and mid vowel pairs. This difference is much smaller than the 30–50 ms difference found for stressed tense and lax vowels in English (Peterson & Lehiste Reference Peterson and Lehiste1960; van Santen Reference van Santen1992), which, like those of Kaqchikel and CK, come historically from a length contrast (see e.g. Bennett Reference Bennett2016b on Mayan, Hogg Reference Hogg2012 on English). ATR systems, where tongue root position creates the contrast between the pairs, are not typically accompanied by duration contrasts (Hess Reference Hess1992; Guion, Post & Payn Reference Guion, Post and Payn2004; Przezdziecki Reference Przezdziecki2005; Starwalt Reference Starwalt2008; Koffi Reference Koffi2016; Kirkham & Nance Reference Kirkham and Nance2017). Because of the wide variety of ways in which different languages implement fine-grained quality contrasts, it is not immediately obvious whether duration differences should be expected between CK tense and lax vowels. However, if there are such differences, tense vowels are expected to be longer in keeping with their more peripheral vowel quality and historical origin as long vowels.

K’iche’, like other Mayan languages, has a number of words that have been analyzed as containing vowels followed by a complex coda in which the first consonant is a glottal stop, e.g., pö’t /pɔʔt/ ‘huipil’. These same words have alternately been analyzed as having phonemically glottalized vowels, e.g. /pɔ̰t/. Which analysis is preferable continues to be a contested question in Mayan linguistics (Sobrino Gómez & Bennett Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted). In K’iche’, glottal stop is clearly a consonant in some contexts. For instance, when epenthesized to stressed vowel-initial words, it permits the realization of the preceding indefinite article jün /χʊn/ as ju [χu], without its final consonant. This is not permitted before vowel-initial words, e.g., ju ‘ik’ [χu ʔik'] ‘a month’, ju kär [χu kəɾ] ‘a fish’, but jün ixöq [χʊn iʃɔq] ‘a woman’ (for evidence that word-initial glottal stops are epenthetic in these cases, see Wood Reference Wood2024 on CK and Bennett Reference Bennett2016b on cross-Mayan patterns). Glottal stop can also appear intervocalically, whereas vowel hiatus is strictly avoided within words, e.g., mi’al [miʔal] ‘daughter’, u’al [uʔal] ‘broth’. One method of resolving vowel hiatus is through glottal stop insertion, e.g., ki’el /kiel/ [kiʔel] ‘they go out’.Footnote 2 Finally, many CVC roots – the canonical root shape in Mayan languages (England & Baird Reference England, Baird and Judith Aissen2017) – have a glottal stop as the final consonant. Unlike the potentially phonemically glottalized vowels, which are always lax (with the exception of [a], discussed further below), the vowels in these roots may be tense or lax in CK, just as occurs preceding any other final consonant in the language, e.g., no’ [noʔ] ‘no’ vs. jö’ [χɔʔ] ‘let’s go’, si’ [siʔ] ‘firewood’ vs. kï’ [kɪʔ] ‘sweet’. In contrast to these clearly consonantal uses of glottal stop, it is not clear whether the glottalization that occurs in words like ‘huipil’ should be considered a consonant or a vocalic feature.

Acoustic analysis of the Ixtahuacán and Cantel dialects of K’iche’ shows that words with glottalized vowels are most commonly produced with a full glottal stop at or towards the end of the vowel, though sometimes with creaky voicing towards the middle or end of the vowel with no complete closure (Baird Reference Baird2011). No previous acoustic studies exist of glottalized vowels or post-vocalic glottal stop in CK. Wood (Reference Wood2020, Reference Wood2024) shows that word-initial glottal stops are rarely produced as a full closure in CK, with the most common realization being glottalized phonation on the initial vowel. (As will be shown in this study, ‘glottalized’ vowels in CK are also practically never realized with a full closure, despite the very similar research methods and expected speech genre compared to Baird’s (Reference Baird2011) work on other K’iche’ dialects.)

As noted above, the question of whether there are phonemically glottalized vowels, or rather these are simply sequences of vowels and glottal stops, is an issue across the Mayan language family, and both the surface facts as well as their phonological analysis remain controversial (Baird & Pascual Reference Baird and Francisco Pascual2011; Bennett Reference Bennett2016b; England & Baird Reference England, Baird and Judith Aissen2017; DiCanio & Bennett Reference DiCanio, Bennett, Gussenhoven and Chen2021; Sobrino Gómez & Bennett Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted). In some languages, various phonological patterns relating to stress assignment, neutralization in unstressed syllables, and root phonotactics suggest that [ʔ] behaves at least sometimes as a vowel feature (England & Baird Reference England, Baird and Judith Aissen2017; Sobrino Gómez & Bennett Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted). Few acoustic studies exist of glottal stop or vowel glottalization in Mayan languages. Baird & Pascual (Reference Baird and Francisco Pascual2011) show that in Q’anjob’al full closures are about as frequent as other types of glottalized phonation without a full closure, and full closures are most likely in word-final position. In Yucatec Maya, comparable tokens are usually produced with creaky voicing on the second part of the vowel but no full closure (Frazier Reference Frazier2009). A different phonological status does not necessarily correlate with a different phonetic realization in Mayan languages, as both full closures as well as reduced productions can be found for both clearly consonantal instances of [ʔ] as well as those that behave phonologically as vowel features (Sobrino Gómez & Bennett Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted).

Cross-linguistically, the phonetic realization of a glottal stop, even when clearly functioning as a consonant, is highly variable. In many different languages, it has been shown that what is analyzed phonologically as a glottal stop is often or usually realized as glottalized phonation (creaky voice, tense voice, aperiodic voice, or other realizations of greater glottal constriction) on the adjacent vowels or other sonorants, without a complete closure (Priestly Reference Priestly1976; Kohler Reference Kohler1994; Ladefoged & Maddieson Reference Ladefoged and Maddieson1996; Alber Reference Alber2001; Quick Reference Quick2003; Pompino-Marschall & Zygis Reference Pompino-Marschall and Zygis2011; DiCanio Reference DiCanio2012; Garellek Reference Garellek2014; Whalen et al. Reference Whalen, DiCanio, Geissler and King2016; Esling et al. Reference Esling, Moisik, Benner and Crevier-Buchman2019; Mitterer, Kim & Cho Reference Mitterer, Kim and Cho2019; Davidson Reference Davidson2021). Phonemically glottalized vowels and other sonorants may also have varied realizations, from apparently modal voicing, to creaky or other types of laryngealized phonation, to full glottal closures (Bird Reference Bird2011; Davidson Reference Davidson2020). The timing of the non-modal phonation also varies: within a glottalized vowel, the glottalized portion often does not cover the full vowel segment, but may be present only at the beginning, middle, or end of the vowel (Davidson Reference Davidson2020). Laryngeal phenomena of many types are very common in Mesoamerican languages, affecting both vowels and consonants, and many aspects of these systems remain poorly understood, including their timing and their interaction with tone and other prosodic phenomena (DiCanio & Bennett Reference DiCanio, Bennett, Gussenhoven and Chen2021).

The following section describes the methods used for the acoustic study of vowels in CK. This study addresses the following questions: (i) What is the difference in vowel quality between tense and lax vowels? (ii) Does the vowel /ɪ/ have a distinct quality or is it merged or merging with surrounding categories, as suggested by pilot data (Wood Reference Wood2020)? (iii) Are there duration differences between tense and lax vowels? (iv) Do glottalized vowels have either tense or lax vowel quality? (v) Are glottalized vowels produced with a full glottal closure or other types of glottalization? (vi) Is there any acoustic evidence that favors a one-segment or two-segment analysis of glottalized vowels?

For the purposes of disambiguating potentially phonemically glottalized vowels from cases that are clearly vowel–glottal stop sequences, the former will be represented as V̰ throughout the remainder of the text, and the latter as Vʔ. Whether glottalized vowels are in fact better analyzed as one segment, or a sequence of segments, will be returned to in the discussion.

3. Methods

3.1 Experimental design

Six words were included in the experiment for each tense or lax vowel phoneme, with the exception of lax /ɪ/ for which nine words were included due to its apparent ongoing merger with surrounding vowels. Each of these words were common monosyllabic nouns or adjectives with surface shape (C)CVC, chosen with the goal of being known by all speakers and easy to translate from Spanish.Footnote 3 Five items were included for each of the glottalized vowel phonemes due to the smaller number of words of this type and resulting difficulty finding appropriate items. Each of the words with glottalized vowels were monosyllabic or disyllabic nouns or adjectives, where the relevant syllable was of the shape (C)CV̰C. In the case of disyllables, the target vowel was always in the second, stressed syllable. The items are shown in Table 2, with the total number of tokens of each item included in the experiment (more on data inclusion and exclusion below).

Table 2. Items and token counts included in the experiment

Many of the experiment items include vowels adjacent to uvular consonants, which tend to have strong coarticulatory effects on vowels in many languages (e.g., Zawaydeh Reference Zawaydeh1997; Gordon, Barthmaier & Sands Reference Gordon, Barthmaier and Sands2002; Wilson Reference Wilson2007; Evans et al. Reference Evans, Sun, Chiu and Liou2016; Holliday & Martin Reference Holliday and Martin2018). Uvular consonants are very frequent in CK, and therefore it would be very difficult to exclude them entirely. In particular, many of the words with the mid front vowels /e/ and /ɛ/, which are especially uncommon in the language, have uvular consonants. In order to account for these and other potential coarticulatory effects, the place of articulation of adjacent consonants was included in the statistical models, as discussed further below, and an effort was made to include items with uvular consonants for every phoneme. Similarly, a number of items followed by glottal stop were included in order to facilitate understanding the behavior of the vowel /ɪ/, which appeared in the pilot study data to be merging with surrounding categories. It was noticed that words with /ɪ/ followed by a glottal stop tended to sound [e]-like, whereas in other contexts they tended to sound [i]- or [ə]-like. In addition to a number of /ɪ/ items preceding glottal stops, instances of other front and central vowels preceding glottal stops were included in the study where possible. Notably, no words were found with [əʔ] sequences, a fact which will be returned to in the discussion. [ɛʔ] sequences occur in the language but no suitable common nouns or adjectives were identified and therefore they are also absent in the experiment data.

The phonemic identity of each vowel was determined through perceptual judgments, supported, where possible, by cross-dialectal comparison. The majority of the words can be found in Ajpacajá Tum et al.’s (Reference Ajpacajá Tum, Chox Tum, Tepax Raxulew and Guarchaj Ajtzalam2005) dictionary, with lax vowels corresponding to short vowels, tense vowels to long vowels, and glottalized vowels to vowel–glottal stop sequences (all of which include short vowels). Others appear in Christenson’s (unpublished) dictionary, although this source does not distinguish tense and lax vowels except for the central pair. A few items show some differences compared to the forms found in the dictionaries: in the dictionary entries, /kʊ̰t/ ‘one-handed’ has no glottalization, /t͡ʃ'ɔχ/ ‘fight’ does have glottalization, /mɔ̰s/ ‘ladino’ appears with a high rather than mid vowel, /k'ɛq/ ‘flea’ and /kɛq'/ ‘guava’ appear with low vowels, and /t͡sɛ̰n/ ‘laughter’ has no final nasal.Footnote 4 Finally, some words appear to be specific to CK or missing from the dictionaries for unknown reasons: /tuʃ/ ‘hen’, /qʊ̰l/ ‘turkey’, and /mam/ ‘rooster’.

Tense, lax, and glottalized vowels occur in both stressed and unstressed syllables in CK. Examples of each type of vowel in an unstressed syllable include ch’ab’äl [t͡ʃ'a.ˈɓəl] ‘language’, wächb’äl [wət͡ʃ.ˈɓəl] ‘image’, na’tsb’äl [na̰t.s(ə).ˈɓəl] ‘reminder’. However, the experimental data is restricted to stressed syllables due to time constraints and with the goal of comparing the most hyperarticulated tokens, where the contrast between categories is expected to be clearest.

The words were elicited through a translation task from the Spanish, the contact language in which most K’iche’ speakers, and all experiment participants, are fluent.Footnote 5 Each word with a tense or lax vowel was placed into a frame sentence of the structure ‘There is (a) [target noun] [locative phrase]’ or ‘There is (a) [target adjective] [noun] [locative phrase]’. Each word with a glottalized vowel was placed into a frame sentence of the structure ‘I [past tense verb] (a/the) [target noun] [oblique phrase]’ or ‘I [past tense verb] (a/the) [target adjective] [noun] [oblique phrase]’. Full verbs were used instead of existential structures for words with glottalized vowels to allow greater flexibility in the meanings of the target words, e.g., including some abstract nouns such as ‘hatred’ and ‘laughter’. However, the basic prosody and word order of both sentence structures is the same.

The sentences were shown on a computer screen with an accompanying image of the target word intended to stimulate lexical access. The sentences were displayed in a random order for each participant. Examples are shown in Figure 1.

Figure 1. Example stimuli for a noun with a plain vowel (sak’ ‘grasshopper’, top left), adjective with a plain vowel (q’ëq ‘black’, top right), noun with a glottalized vowel (xpa’ch ‘lizard’, bottom left), and adjective with a glottalized vowel (chqï’j ‘dry’, bottom right)

Participants were asked to read the sentence in Spanish (in their head or out loud) and then produce the K’iche’ translation, saying it twice. Participants were recorded with an H4n digital recorder at a sampling rate of 44.1 kHz and wore a Shure SM10A headset microphone to limit background noise. If the participant could not think of the translation of a word, the researcher (a non-native speaker of K’iche’) hesitantly suggested the target word, such as ‘¿se podría decir algo como X? (‘would it be possible to say something like X?’). Otherwise, the researcher did not interfere in the productions.

In total, 11 participants were included in the study (eight female, three male). Their ages ranged from young adults to middle aged. All were native speakers of K’iche’ from the Chichicastenango area (city center or dependent rural communities). Within Chichicastenango, K’iche’ is the default language among most adults both at home and in public, and all experiment participants regularly used the language in their daily lives. All participants provided informed consent to participate, and the study was conducted under the approval of the University of Texas at Austin, Institutional Review Board. All data was collected in Chichicastenango in March 2023.

3.2 Data inclusion

Participants often produced variations of the target sentence, such as changing the order of constituents (K’iche’ has a relatively free word order), adding morphemes to the target word (possessive prefixes on nouns and superlative suffixes or status suffixes on adjectives), adding additional words or phrases, or substituting words for synonyms or related words. All productions that included the target word were included in the study unless they were nouns produced with a possessive prefix where the possessed form of the noun has a different vowel quality, e.g. chkäch [t͡ʃkət͡ʃ] ‘basket’ vs. qächkach [qət͡ʃkat͡ʃ] ‘our basket’ (15 exclusions). With this exclusion and missing data from participants who did not produce the target word in the sentence, a total of 1,235 tokens were included in the analysis. The number of tokens for each item is shown in Table 2 above. It may be noted that for two items, ‘money’ and ‘nice’, no speakers produced the intended target word without a change in vowel quality caused by possession, and therefore no tokens of these two items were ultimately included in the analysis.

3.3 Segmentation and annotation

The beginning and end of each included vowel segment was marked in Praat (Boersma & Weenink Reference Boersma and Weenink2023). Following a glide, the boundary was placed 30% of the way through the glide–vowel portion, as the transition between the two segments was typically gradient and smooth.Footnote 6 The boundary between a vowel and a glottal stop realized with a full closure was placed at the onset of silence (at least 20 ms). Otherwise portions of glottalized phonation were included in the vowel. If two vowels were separated by a glottal stop realized as a period of glottalized phonation without complete closure, the boundary between them was placed at the beginning of the longest pulse in the glottalized section. This metric was chosen in order to approximate the point of greatest glottal constriction as well as to have a consistent and replicable metric, as the transition between glottalized and modal phonation was typically gradient. In all other contexts the segmentation was done with reference to the amplitude: where the amplitude begins to increase following an affricate, fricative, or sonorant, after the burst following a stop, and where the amplitude stops decreasing preceding a stop, affricate, fricative, or sonorant. These changes in amplitude seemed to the author to be more consistent, visually salient, and objectively determinable than changes in the formant structure, e.g., F2, commonly used in phonetic research (Machač & Skarnitzl Reference Machač and Skarnitzl2009). Examples of segmentation are shown in Figure 2. The spectrograms shown here and throughout the paper were made with Elvira García’s (Reference Elvira García2022) Create pictures with tiers Praat script.

Figure 2. Examples of vowel segmentation. After a glide and before a fricative in wäj [wəχ] ‘my fresh corn’ (top left), after a glottal stop and before a fricative in ju oj [χu ʔoj] ‘an avocado’ (top right), before and after an ejective stop in t’ot’ [t’ot'] ‘snail’ (bottom left), after an affricate and before a nasal in chim [t͡ʃim] ‘bag’ (bottom right).

Each vowel was categorized according to the following factors relevant to the statistical analysis, as explained below:

  • Speaker and item

  • Category: tense /i u e o a/, lax /ɪ ʊ ɛ ɔ ə/, glottalized (/ɪ̰ ʊ̰ ɛ̰ ɔ̰ a̰/)

  • Height: high (/i u ɪ ʊ ɪ̰ ʊ̰/), mid (/e o ɛ ɔ ɛ̰ ɔ̰/), low (/a ə a̰/)

  • Frontness: front (/i e ɪ ɛ ɪ̰ ɛ̰/), central (/a ə a̰/), back (/u o ʊ ɔ ʊ̰ ɔ̰/)

  • Place of articulation of preceding and following consonant: bilabial, alveolar, postalveolar, velar, uvular, glottal

  • Laryngeal state of preceding and following consonant: voiced sonorant, voiceless obstruent, glottalized obstruent

  • Order of production (instance of the sentence produced by the speaker): first, second

  • Following pause (at least 200 ms): yes, no

  • Added suffix: yes, no

  • Added prefix: yes, no

Table 3. Coding of different types of glottalization

Additionally, glottalized vowels and plain vowels followed by a glottal stop coda were categorized according to the phonetic type of glottalization observed in that token, as detailed in Table 3. Classification was carried out through visual inspection of the waveform and spectrogram, paying special attention to the second half of the vowel, as glottalized phonation occurred primarily towards the middle or end of the vowel segment across tokens. When multiple types were observed in a given token, it was coded according to the dominant pattern.

Examples of each of these categories are shown in Figure 3.

Figure 3. Full closure in ch’a’k /t͡ʃ'a̰k/ ‘sore’ (top left; 22 ms between two pulses towards the center of the vowel); creaky voice in pö’t /pɔ̰t/ ‘huipil’ (top right); intensity dip in sü’t /sʊ̰t/ ‘cloth’ (bottom left); apparently modal voice in the second syllable of t’isö’n /t’isɔ̰n/ ‘sewing’ (bottom right).

It is worth noting that it is possible that glottalized vowels might have additional acoustic cues not considered in this article. For instance, these vowels sometimes sound diphthongal, with a change in vowel quality in the glottalized portion. In some cases, glottalization is perceptible, but not visually detectable in the waveform or spectrogram. A more thorough investigation of all of the possible realizations of glottalization is left for future projects.

3.4 Measurements

A Praat (Boersma & Weenink Reference Boersma and Weenink2023) script was used to measure F1 and F2 at the midpoint of the vowel interval as well as the duration of the interval. High vowels with F1 values above 750 Hz (female speakers) or 700 Hz (male speakers), as well as back vowels with F2 values above 2,000 Hz (female speakers) or 1,500 Hz (male speakers) were individually checked. If these measurements were a result of a formant tracking error, all formant values for that token were manually corrected. Measurements were taken at the midpoint in order to minimize potential coarticulatory effects. Since only one point was measured per vowel, the results do not provide any information on any potential dynamic cues to the relevant contrasts over the course of the vowel.

In order to permit comparisons across speakers, F1 and F2 values were normalized according to the Lobanov method (Z-scores) (Lobanov Reference Lobanov1971; Adank, Smits & van Hout Reference Adank, Smits and van Hout2004; Bennett Reference Bennett2016a). The number of tokens in each category is not fully balanced, which can result in distortions of the vowel space when using Lobanov normalization (Barreda & Neary Reference Barreda and Nearey2018). However, the Lobanov values are highly correlated with formant ratio values (dividing F1 and F2 over F3 for each token), which are completely independent of the number of tokens in a given category and have been shown to remove speaker-dependent variation from vowel productions (Monahan & Idsardi Reference Monahan and Idsardi2010). Pearson’s correlation between the two normalization methods, calculated in R (R Core Team 2020), is 0.926 for the F1 dimension and 0.952 for the F2 dimension.

3.5 Statistical analysis

The vowel quality and duration measurements were analyzed statistically using linear mixed effects models, modeled with the package lme4 (Bates et al. Reference Bates, Mächler, Bolker and Walker2015) in R (R Core Team 2020), with pairwise comparisons done with the package emmeans (Lenth Reference Lenth2024). Data visualization was done with the package ggplot2 (Wickham Reference Wickham2016).

Two models of vowel quality were created, with dependent variables F1 and F2. The potential predictors considered in the F1 model included fixed effects phoneme category (tense, lax, glottalized), height (high, mid, low), the interaction between them, preceding consonant place of articulation (alveolar, bilabial, postalveolar, velar, uvular, glottal), following consonant place of articulation (alveolar, bilabial, postalveolar, velar, uvular, glottal), preceding consonant laryngeal state (glottalized, voiceless obstruent, voiced sonorant), and following consonant laryngeal state (glottalized, voiceless obstruent, voiced sonorant). Hypotheses about these factors are shown in Table 4.

Table 4. Factors and hypotheses for the F1 model

The factors considered in the model of F2 included fixed effects phoneme category (tense, lax, glottalized), frontness (front, central, back), the interaction between them, preceding consonant place of articulation (alveolar, bilabial, postalveolar, velar, uvular, glottal), following consonant place of articulation (alveolar, bilabial, postalveolar, velar, uvular, glottal), preceding consonant laryngeal state (glottalized, voiceless obstruent, voiced sonorant), and following consonant laryngeal state (glottalized, voiceless obstruent, voiced sonorant). Hypotheses are shown in Table 5.

Table 5. Factors and hypotheses for the F2 model

The factors considered in the model of duration included category (tense, lax, glottalized), order (first production, second production), following pause (yes, no), prefix (yes, no), suffix (yes, no), preceding and following consonant place of articulation (alveolar, bilabial, postalveolar, velar, uvular, glottal), and preceding and following consonant laryngeal state (glottalized, voiceless obstruent, voiced sonorant). Additionally, an interaction between category and pair (high front, high back, mid front, mid back, low central) was considered. The hypotheses about these factors are shown in Table 6.

Table 6. Factors and hypotheses for the duration model

All models also included random effects speaker and lexical item. For all models, significance level was set at α = 0.05. The statistical models used treatment (“dummy”) coding.

3.6 Data accessibility

The audio recordings, experimental stimuli, Praat TextGrids, Praat script, and data spreadsheet associated with this study can be accessed through the Texas Data Repository (DOI: 10.18738/T8/YSWMIC).

4. Results

4.1 Vowel quality

4.1.1 Data visualization

Figure 4 shows the normalized data for tense and lax vowels, with ellipse level 0.67 (showing about one standard deviation). Tense vowels are shaded in dark gray and lax vowels in light gray.

Figure 4. Normalized tense and lax vowels for all speakers.

Overall, the tense vowels /i e u o a/ occur along the edges of the vowel space, while the lax vowels /ɪ ɛ ʊ ɔ ə/ are closer to the center. However, this greater closeness to the center is primarily in the F1 dimension and not the F2 dimension. The lax vowels /ɛ ʊ ɔ/ occur along the edges of the vowel space, just like their tense counterparts, but they are lower. The lax vowel /ə/ and its tense counterpart /a/ are both in the middle of the F2 dimension. The only lax vowel which is closer to the center in terms of frontness in a way not expected from the natural triangular shape of the vowel space is /ɪ/. However, it is also immediately apparent that the ellipse marking one standard deviation of the normalized values for /ɪ/ is very large compared to the other vowel categories, overlapping considerably with categories from /i/ to /ə/. There is also a large amount of overlap between the categories /o/ and /ʊ/, with the ellipse for /o/ falling almost entirely inside of the /ʊ/ ellipse.

Figure 5 shows the glottalized back vowels compared to the tense and lax back vowels. In order to facilitate reading the plot, which has many categories crowded closely together, only the tense and lax vowels are shown on the left. On the right, the glottalized vowels are added. The glottalized tokens cluster with the lax group in each case: glottalized mid back vowels with /ɔ/ and glottalized high back vowels with /ʊ/ (and /o/). They are labeled on the graph, therefore, as /ɔ̰/ and /ʊ̰/, respectively.

Figure 5. Normalized tense, lax and glottalized back vowels for all speakers.

Figure 6 shows the comparison between plain and glottalized front and central vowels. The mid front glottalized vowels cluster again with the lax group /ɛ/, and are labeled as /ɛ̰/. The central glottalized vowels, in contrast, cluster with the tense category /a/, and are labeled as /a̰/. The high front glottalized tokens cover a large area contained mostly within the space of /ɪ/, stretching from /i/ to /ɛ/, and are labeled /ɪ̰ /.

Figure 6. Normalized tense, lax and glottalized front and central vowels for all speakers.

4.1.2 Statistical results

The original model specification for F1 is shown in (1).

  1. (1) Initial model specification for F1

    F1 ∼ height * category + preceding consonant POA + following consonant POA + preceding consonant laryngeal state + following consonant laryngeal state + (1|speaker) + (1|item)

The original model of F1 had a singular fit if the random effect of speaker was included, possibly due to the normalization, and therefore it was removed. Additionally, model comparison using the AIC (Akaike’s information criterion) and BIC (Bayesian information criterion) functions in R was performed to assess whether the inclusion of each of the factors for preceding and following consonant characteristics sufficiently improved the fit of the model in light of the added complexity. The full model was compared to a model with each one of the factors removed. As shown in Table 7, the values for the full model are higher than for any of the reduced models, indicating that the removal of these factors improves the fit. Therefore, each of the factors for preceding and following consonant characteristics were removed from the final model.

Table 7. Comparison of F1 models

The final model specification for F1 is shown in (2).

  1. (2) Final model specification for F1

    F1 ∼ height * category + (1|item)

The results of the final model are shown in Table 8, where the baseline for height is high and for phoneme category tense.

Table 8. Results of the final model of F1

As expected, there is a significant effect of height, where both mid and low (tense) vowels have higher F1 values than high (tense) vowels. There is also a significant effect of category, with both lax and glottalized (high) vowels having higher F1 values than tense (high) values. There are significant interactions between height and category as well. The difference in F1 for tense and lax low vowels is different than for tense and lax high vowels. The effect of glottalized category also differs for low vowels.

Pairwise comparisons of the height by category interaction in this model show that all contrasts are significant except for the following: low tense vs. low glottalized (p = .994), mid lax vs. mid glottalized (p = .101), high lax vs. high glottalized (p = .995), mid tense vs. high lax (p = .999), mid tense vs. high glottalized (p = 1.000), high lax vs. low lax (p = .077), and low lax vs. mid lax (p = .806). That is, low tense and glottalized vowels are at the same height, whereas high and mid lax vowels share the same height as the corresponding glottalized vowels, confirming their perceptual impressions. Additionally, mid tense and high lax/glottalized vowels are at the same height. Finally, low lax vowels cannot be significantly distinguished from either high or mid lax vowels. In contrast to all other comparisons, the difference between high lax and low lax approaches significance, and likely reflects the large range of the high lax vowel /ɪ/, many of whose realizations overlap with /ə/. The full pairwise results can be found in the Appendix (Table A1).

The original model specification for F2 is shown in (3).

  1. (3) Initial model specification for F2

    F2 ∼ frontness * category + preceding consonant POA + following consonant POA + preceding consonant laryngeal state + following consonant laryngeal state + (1|speaker) + (1|item)

This initial model had a singular fit if the random effect of speaker was included, again possibly due to normalization of the data, and therefore it was removed. As was done for the model of F1, the AIC and BIC functions were used in R to determine whether the inclusion of the preceding and following consonant place of articulation and laryngeal state factors sufficiently improved the model considering the added complexity. As shown in Table 9, the values for the full model are higher than for any of the other models, indicating that removing each of these factors improves the fit of the model. Therefore, these factors were removed in the final model.

Table 9. Comparison of F2 models

The final model specification for F2 is shown in (4).

  1. (4) Final model specification for F2

    F2 ∼ frontness * category + (1|item)

The results of the final F2 model are shown in Table 10, where the baseline for frontness is back and for category tense.

Table 10. Results of the final model of F2

As expected, there is a significant effect of frontness, with central and front (tense) vowels having higher F2 values than back (tense) vowels. There is no significant effect of category. There is, however, a significant interaction between frontness and category, with the effect of lax category being different for front vowels than for back vowels.

Pairwise comparisons of frontness by category in this model show that all differences are significant except for the following. There was no significant difference between back tense and lax (p = .849), back tense and glottalized (p = 1.000), or back lax and glottalized (p = .887). Similarly, there was no significant difference between central tense and lax (p = 1.000), central tense and glottalized (p = 1.000), or central lax and glottalized (p = 1.000). Finally, there was no significant difference between front lax and glottalized (p = .977) or front tense and glottalized (p = .154). In sum, all back vowels have similar F2 values, all central vowels have similar F2 values, and front glottalized vowels cannot be distinguished in F2 from either tense or lax front vowels. The difference between front tense and glottalized vowels approaches significance, and the lack of a significant difference likely reflects the wide range of /ɪ̰ /. The full results for the pairwise comparisons are shown in the Appendix (Table A2).

In sum, F1 distinguishes four height categories: high tense > high lax/mid tense > mid lax/low lax > low tense. F2 distinguishes four frontness categories: front tense > front lax > central > back – in line with the usual expectations of the vowel space, where the front line is sloped and the back line straighter. Glottalized vowels pattern with lax vowels in the high and mid sets, but tense vowels in the low set. Brought together, this creates the system shown in Table 11.

Table 11. Contrasts in vowel height and frontness according to the statistical results

4.1.3 Apparent vowel mergers

There is no statistically significant difference between the vowels /ʊ/ and /o/ in either F1 or F2. Figure 7 shows these two vowels for each speaker (values as recorded, not normalized). It can be seen that it is not the case that individual speakers merely have different strategies for distinguishing these two vowels. Instead, there is near-complete overlap between the two categories for practically every speaker.

Figure 7. Vowels /o/ and /ʊ/ by speaker.

Furthermore, a difference between /o/ and /ʊ/ does not appear when the overall duration of the vowel is longer, allowing a more hyperarticulated production. This is shown in Figure 8. For both F1 and F2, the regression lines for the two phonemes over time are very close, and their confidence intervals overlap extensively.

Figure 8. Normalized F1 and F2 vowels for the phonemes /o/ and /ʊ/ as duration changes.

The /ɪ/ category, in contrast, is statistically significantly different in F1 and/or F2 from all other vowel categories except for /ɪ̰ /. However, as noted above, it shows considerable overlap with several other categories, from /i/ to /ə/. It is not likely that this overlap results merely because /ɪ/ has a particularly large vowel space, because its realization changes very predictably according to the lexical item, as shown in Figure 9 (this will be further discussed below – see Table 15).

Figure 9. Normalized tense and lax front and central vowels, showing the realizations of /ɪ/.

The vowel /ɪ/ is consistently realized as [i] in kïk’ /kɪk'/ blood’, as [e] in b’ï’ /ɓɪʔ/‘name’ and kï’ /kɪʔ/‘sweet’, as [ɛ] in tz’ï’ /t͡s'ɪʔ/ ‘dog’, and as [ə] in nïm /nɪm/ ‘big’, sïb’ /sɪɓ/ ‘smoke’ and snïk /s(ə)nɪk/ ‘ant’. Realizations of the vowel in pïx /pɪʃ/ ‘tomato’ vary across speakers, with some producing [i] and others [ə].

Similarly, Figure 10 shows the distribution of the words with /ɪ̰ /.

Figure 10. Normalized front and central tense, lax and glottalized vowels, showing the realizations of /ɪ̰/.

This vowel is realized as [i] in tï’t /tɪ̰ t/ ‘hatred’, as [e] in chqï’j /t͡ʃ(ə)qɪ̰ χ/ ‘dry’ and usually as [e] in rï’j /ɾɪ̰ χ/ ‘old’. Realizations of the same vowel in kotz’ï’j /kot͡s'ɪ̰ χ/ ‘flower’ are mixed between [e] and [ɛ], and in sqï’n /sqɪ̰ n/ ‘a little’ they are consistently [ɛ].

4.2 Duration

4.2.1 Data visualization

Figure 11 shows the duration of tense, lax and glottalized vowels in each vowel pair.

Figure 11. Durations of tense, lax and glottalized vowels in each place of articulation set.

4.2.2 Statistical results

The initial model specification for duration is shown in (5).

  1. (5) Initial model specification for duration

    duration ∼ category * vowel pair + order + pause + prefix + suffix + preceding consonant POA + following consonant POA + preceding consonant laryngeal state + following consonant laryngeal state + (1|speaker) + (1|item)

In order to determine which factors included in the initial model improved the fit of the model despite the cost of added complexity, the AIC and BIC values of the full model were compared to simpler models removing each of the fixed effects other than category. As shown in Table 12, the AIC and BIC values for the full model are higher than those for the models which remove the factors prefix, suffix, following consonant place of articulation, preceding consonant laryngeal state and following consonant laryngeal state, indicating that the removal of these factors improves the fit of the model. Therefore, the final model removed each of these fixed effects. Contrary to the expectation based on the pilot study results (Wood Reference Wood2020), removing the fixed effect of vowel pair (high front, high back, mid front, mid back, central) or the interaction between category and vowel pair also resulted in lower values, so the factor vowel pair was also removed.

Table 12. Comparison of duration models

The final model specification for duration is shown in (6).

  1. (6) Final model specification for duration

    duration ∼ category + pause + order + preceding consonant POA + (1|speaker) + (1|item)

The results of the final model of vowel duration are shown in Table 13. The baseline level for category is tense, for pause none, for order first production, and for preceding consonant place of articulation alveolar.

Table 13. Results of final model of vowel duration (estimates in seconds)

There is a significant effect of category, where lax vowels are shorter and glottalized vowels longer than tense vowels. The significant positive effect of pause shows that vowels are longer preceding a pause, and the significant negative effect of order that vowels are shorter in the repetition of the sentence than in the initial sentence. These effects are as expected. Finally, there is a significant positive effect of preceding glottal consonant, indicating that vowels are longer in this context. This may be due to the fact that prevocalic glottal stops are often realized as creaky phonation at the beginning of a vowel, which was segmented together with the vowel, and therefore the vowel segmentation may include part of the production of this consonant.

4.2.3 Apparent vowel mergers

Because the high lax vowels could not be distinguished from surrounding categories in quality, an additional ad hoc investigation was conducted of the duration of these vowels compared to the categories they appear to be merged with. A model of duration was created for only the /ʊ o/ subset, including the same factors as the main duration model. In this model, there was no significant effect of vowel quality, although it did approach significance (p = .058). The full results can be found in the Appendix (Table A3). Another model of duration was created for the /i ɪ e ɛ ə/ subset with the same factors. The full results can be seen in the Appendix (Table A4). Pairwise comparisons between the phonemes in this model, with the vowel /ɪ/ separated into several subcategories according to its realization, showed no significant difference between the vowels /i/ and /ɪ/ in words where the realization is [i] (p = .410), no significant difference between the vowels /e/ and /ɪ/ in words where the realization is [e] (p = .953), no significant difference between the vowels /ɛ/ and /ɪ/ in words where the realization is [ɛ] (p = .695), and no significant difference between the vowels /ə/ and /ɪ/ in words where the realization is [ə] (p = .999). The full results can be found in the Appendix (Table A5).

4.3 Voice quality

The number of tokens with each type of glottalization is shown in Table 14, which compares plain (tense or lax) vowels followed by a simple glottal stop coda (C𐁖ʔ) with ‘glottalized’ vowels (CV̰C). Remember that, as described above, a single glottal stop or glottalized phonation following a vowel with no additional consonants in the word was categorized as a simple glottal stop coda, whereas a glottal stop or glottalized phonation on a vowel followed by another coda consonant was categorized as a glottalized vowel. Glottalized vowels are further categorized according to the type of consonant that follows: an obstruent (voiceless) or a sonorant (voiced).

Table 14. Rates of each type of glottalization by context

Figure 12. Rates of each type of glottalization by context.

Plain vowels followed by a simple glottal coda occurred with a full closure (> 20 ms) about 10% of the time, creaky voice (or other types of laryngealized phonation) about 62% of the time, an intensity dip about 14% of the time, and apparently modal voice about 14% of the time. Glottalized vowels showed much lower rates of strong glottalization, influenced by the type of consonant that followed. When followed by a voiceless obstruent, there was one instance of a full closure, about 35% of the time creaky voice, 45% of the time an intensity dip, and 19% of the time apparently modal phonation. When followed by a voiced sonorant, there were no instances of full closures, about 14% of the time creaky voice, 43% of the time an intensity dip, and 44% of the time apparently modal voice. (Note that the instances of apparently modal voice included in this table are of lexical items categorized as having a following glottal stop coda or glottalized vowel phonologically because other instances of the same word are typically produced with visually detectable glottalization or audible glottalization that is not visually detectable.)

In the categorization of the data reflected in Table 14, a token was considered to have a full closure if it presented a period of silence of at least 20 ms or a single pulse followed by silence with a total duration of at least 20 ms, following previous research on glottal stops and glottalized vowels in Mayan languages (Frazier Reference Frazier2009; Baird Reference Baird2011; Baird & Pascual Reference Baird and Francisco Pascual2011; Wood Reference Wood2023). However, a pause of over 20 ms between pulses may occur not only due to a true full glottal closure but also in other realizations of glottalized phonation, such as particularly low-pitched creaky voice, meaning that some or all of the tokens categorized as full closures in the data may not result from true glottal closures. An exploration of these tokens supports this concern. Of the 16 instances of ‘full closures’, only one has a duration over 30 ms, and all occur in the context of strong creaky voice (see Figure 3 above). Therefore, the distinction between full closures and creaky voice initially made in the categorization of the data is not well supported. Combining these two categories, the results for voice quality are summarized in Figure 12.

5. Discussion

The results of the study show that most of the 10 tense and lax vowel phonemes can be distinguished in quality, through F1 and F2. The tense vowel in most pairs is further from the center of the vowel space than the lax vowel, although this greater distance mostly reflects vowel height rather than frontness. Glottalized high and mid vowels have the relevant lax quality, while glottalized central vowels coincide in quality with the tense vowel. However, there are two areas of the vowel space that show considerable overlap between categories: the high front lax vowel /ɪ/ and its glottalized counterpart with surrounding vowels, and the back vowels /o/ and /ʊ/. Glottalized vowels are longer than tense vowels, which are longer than lax vowels. Finally, the voice quality of glottalized vowels varies considerably and shows an effect of following context, with stronger realizations before (voiceless) obstruents and weaker realizations before (voiced) sonorants. Realizations with full closures are practically non-existent. The following sections discuss in more detail the apparent vowel mergers and the status of glottalized vowels.

Table 15. Phonetic realization of lax and glottalized high front vowels by item (glottal stop included in following context for glottalized vowels)

5.1 Vowel mergers

The vowel /ɪ/, though on average different from all other vowel categories, overlaps with /i/, /e/, /ɛ/ and /ə/, with the realization being mostly predictable based on the lexical item. Similar variation is found for the glottalized version /ɪ̰ /. The typical realization of each item is summarized in Table 15.

Some tendencies based on the segmental context are observable. When followed by a glottal stop (either as a simple glottal stop coda or in a glottalized vowel/followed by complex glottal stop coda), the realization is nearly always [e] or [ɛ], whereas in other contexts it is [i] or [ə]. The realization appears to be [ə] adjacent to a nasal. However, it is not clear that it is possible to predict the realization based entirely on the phonological context, as similar contexts occur with different realizations in different words. For instance, the vowel is realized as [e] in /t͡ʃqɪ̰ χ/ ‘dry’ but [ɛ] in /sqɪ̰ n/ ‘a little’. /tɪ̰ t/ ‘hatred’ has a glottalized vowel but its realization is typically [i]. A more detailed study with a larger number of words with lax /ɪ/ would be necessary to determine whether a predictable phonological environment can be found for the different realizations.

The data does not show any quality differences between /ɪ/ realized as one of the other phonemes and the vowels actually belonging to that phoneme category. However, the vowel /ɪ/ does still seem to exist as a phonemic category in the language, since it behaves phonologically as a high front lax vowel irrespective of its specific realization in a given word. For instance, as noted above, many nouns have lax vowels which become the corresponding tense vowel when the noun is possessed. Irrespective of the realization of /ɪ/ in a given word, the possessed form (if it changes) seems to be realized as tense [i], not [e] or [a] as would be expected for a word with a phonemic /ɛ/ or /ə/, respectively. E.g. tz’ï’ /t͡s'ɪʔ/ [t͡s'ɛʔ] ‘dog’ vs. ntz’i’ /nt͡s'iʔ/ [nt͡s'iʔ] ‘my dog’, rjïl /ɾχɪl/ [ɾχəl] ‘money’ vs. nrjil /nɾχil/ [nɾχil] ‘my money’.

A similar argument may be made for the back vowels. Although there is no statistically significant difference in quality between the vowels /o/ and /ʊ/, these vowels behave differently phonologically. When a word with lax /ʊ/ changes under possession, it is realized as [u], not [o]. E.g., süb’ /sʊɓ/ [soɓ] ‘tamalito’ vs. qsub’ /qəsuɓ/ [qsuɓ] ‘our tamalito’.

These facts may be understood as representing an intermediate stage in the loss of the high lax phonemes /ɪ/ and /ʊ/. These categories still exist in the phonological system of the language, and exhibit different behaviors from the categories they overlap with phonetically, but they are no longer distinct in quality nor duration. It is possible that in the future, words with /ʊ/ will be reanalyzed as containing underlyingly /o/, and begin to pattern as tense vowels rather than lax vowels. Similarly, words with /ɪ/ may be reanalyzed as containing /i/, /e/, /ɛ/ or /ə/, and consequently shift their phonological behavior.

Vowel mergers resulting in the loss of the mid front lax vowel /ɛ/ or all of the non-central lax vowels are common in closely related languages (Majzul et al. Reference Majzul, Matzar and Espantzay Serech2000; Bennett Reference Bennett2016a; Bennett Reference Bennett2016b). The loss of /ɪ/ (merged with /i/) is attested in Sololá Kaqhickel (Bennett Reference Bennett2016a) and the high lax vowels are used inconsistently by some speakers in San Martín Jilotepeque Kaqchikel (Majzul et al. Reference Majzul, Matzar and Espantzay Serech2000). However, the loss of the high lax phonemes, and in particular /ɪ/, is in general uncommon in K’ichean languages: /ɪ/ is the only lax vowel other than /ə/ to be maintained in some related languages, such as the Kaqchikel dialects of San Miguel Pochuta and San Pedro Yopocapa (Majzul et al. Reference Majzul, Matzar and Espantzay Serech2000).

5.2 Glottalized vowels

The glottalized vowels of CK can be analyzed as either phonemically glottalized vowels or vowels followed by a complex coda in which the first consonant is a glottal stop. These vowels correspond to short vowels in other K’iche’ dialects. Therefore, they would be expected to surface as lax vowels in CK.Footnote 7 However, their phonetic realization is not lax in all cases: in the case of the central pair, it is tense [a] rather than lax [ə]. The central pair is the only one of the tense–lax pairs where the tense vowel is lower than the lax vowel. Therefore, the realization of the glottalized vowels could be seen as selecting the lower of the two options. A restriction against schwa preceding a glottal stop is found in unrelated languages such as those of the Salish family (Brunner & Zygis Reference Brunner and Zygis2011). Vowels are lowered adjacent to glottal stops in many languages and there is a general correlation between glottals and low vowels (Brunner & Zygis Reference Brunner and Zygis2011; Moisik, Czaykowska-Higgins & Esling Reference Moisik, Czaykowska-Higgins and Esling2021). These facts can be explained as resulting from the epilaryngeal constriction that accompanies and contributes to vocal fold adduction (Moisik Reference Moisik2013; Moisik et al. Reference Moisik, Esling, Crevier-Buchman, Amelot and Halimi2015; Moisik et al. Reference Moisik, Czaykowska-Higgins and Esling2021; Garellek Reference Garellek2022). Therefore, the appearance of [a̰] instead of [ə̰] could be a phonologization of a coarticulatory process: initially the articulation of the glottal and epilaryngeal constriction during or at the end of the vowel resulted in lower realizations of the vowel, which was then reanalyzed as belonging to the /a/ category. There is a large difference in the height of lax and glottalized central vowels, whereas glottalized high and mid vowels are only slightly lowered compared to their lax counterparts. Therefore, the very low quality of /a̰/ is not likely due to phonetic coarticulation alone.

Notably, the effects of a clearly consonantal following glottal stop on vowels that come historically from short /a/, and are therefore expected to surface as [ə] in CK, is the same as what is found for glottalized vowels. Tense (historically long) and lax (historically short) low vowels are neutralized in this context, e.g. ka’ *kaːʔ [kaʔ] ‘grinding stone’, ja’ * χaʔ [χaʔ] ‘water’ (reconstructions from Kaufman & Justeson Reference Kaufman and Justeson2003). There are no attested words with the sequence [əʔ] in CK. In contrast, the realization of high and mid glottalized vowels is lax and there is no restriction against lax (or tense) high or mid vowels preceding a simple glottal stop coda. E.g., tense si’ /siʔ/ [siʔ] ‘firewood’ and che’ /t͡ʃeʔ/ [t͡ʃeʔ] ‘tree’ vs. lax jö’ /χɔʔ/ [χɔʔ] ‘let’s go!’ and të’ /tɛʔ/ [tɛʔ] ‘then’, or the many examples with /ɪʔ/ discussed in the previous section.

The voice quality results show that the realization of glottalized vowels varies according to the following context, and is different from what is found for plain (tense or lax) vowels followed by a simple glottal coda. Stronger realizations (creaky voice) are more common in the latter context, whereas weaker realizations (intensity dip or apparently modal phonation) are more common for glottalized vowels.Footnote 8 However, it is possible that complex codas are simply more reduced than simple codas in this language, and this difference does not indicate a phonemic contrast in the vowels. Among the glottalized vowels, stronger realizations are more common when preceding a (voiceless) obstruent than a (voiced) sonorant, showing that the context can have a significant effect on the realization of glottal/glottalized segments. In some cases, glottalization is detectable in the following voiced consonant when the vowel itself appears to be modal, so it is possible that the weaker realization of glottalization in this context is facilitated by the fact that it can still be detected in the following consonant.Footnote 9

These results show some similarities to what Baird (Reference Baird2011) found about the realization of 𐁖ʔ sequences and glottalized vowels in the Ixtahuacán and Cantel dialects of K’iche’. In these dialects, realizations as creaky voice were much more common preceding voiceless fricatives and stops than preceding nasals. Baird did not distinguish 𐁖ʔ sequences and glottalized vowels in the analysis, treating all as sequences, however all items in the provided word list where the environment is preceding another consonant would be considered glottalized vowels according to the classification of the CK study. However, the CK results are strikingly different from Baird’s (Reference Baird2011) results in other ways. The near complete lack of realizations with full closures among the glottalized vowels in CK, despite the very low benchmark (> 20 ms), contrasts with what occurs in these other K’iche’ dialects, where a full closure was the most common realization: 81% of the time in Ixtahuacán K’iche’ and about 65% of the time in Cantel K’iche’. Furthermore, in these dialects, glottalized vowels preceding a nasal consonant were more likely to have a full closure than those preceding a voiceless consonant. Apparently modal realizations were very rare (less than 5% of the data from each dialect). Therefore, the contexts where weaker realizations (apparently modal phonation or intensity dip) occur most frequently in CK are the same contexts where full closures are most common in these other dialects.

A difference in registers cannot explain this contrast, as the experiment data described in this paper, like Baird’s data, consists of elicitations in a fairly formal context likely to induce clear, careful speech. Nevertheless, other environments where glottal stop is clearly a consonant rather than a vocalic feature, such as word-initial position, are also rarely realized with full closures in CK (Wood Reference Wood2023). Word-final glottal stops were almost always realized with a full closure in Baird’s (Reference Baird2011) data, whereas they are again practically never realized in this way in the present study. Therefore, the differences observed for glottalized vowels may simply reflect a difference in the typical realization of the glottal stop consonant across dialects. The lack of full closures is not surprising from a cross-linguistic perspective, as glottal stops, including those which clearly behave as consonants rather than vocalic features, are often realized in many different languages as creaky or other types of laryngealized phonation on adjacent vowels without a full closure (Priestly Reference Priestly1976; Kohler Reference Kohler1994; Ladefoged & Maddieson Reference Ladefoged and Maddieson1996; Alber Reference Alber2001; Quick Reference Quick2003; Pompino-Marschall & Zygis Reference Pompino-Marschall and Zygis2011; DiCanio Reference DiCanio2012; Garellek Reference Garellek2014; Whalen et al. Reference Whalen, DiCanio, Geissler and King2016; Esling et al. Reference Esling, Moisik, Benner and Crevier-Buchman2019; Mitterer et al. Reference Mitterer, Kim and Cho2019; Davidson Reference Davidson2021). Variation in the typical realization of glottalization across closely related languages is also attested, e.g. glottalization tends to be realized quite weakly in the Salish language SENĆOT–EN compared to related languages such as Hul’q’umi’num, where its realization tends to be strong (Bird Reference Bird2020; Bird, Czaykowska-Higgins & Leonard Reference Bird, Czaykowska-Higgins and Leonard2012; Percival Reference Percival2024).

In sum, neither vowel quality nor voice quality results show strong reason to prefer a one-segment (phonemically glottalized vowel) vs. two-segment (plain vowel followed by a glottal stop) analysis. In both respects, there are no differences in the behavior of glottalized vowels as compared to lax vowels followed by glottal stops that cannot be explained through contextual factors. It is possible that other types of acoustic differences might be determinable though quantitative analysis of voice quality in glottalized vowels compared to vowel–glottal stop sequences. This would be an interesting question for future research.

In some other K’ichean languages, phonological evidence shows that a glottal stop acts as a consonant in some contexts and a vocalic feature in others; nevertheless, this distinction does not correlate with a phonetic difference, as the same phonetic properties are found in both cases (Bennett Reference Bennett2024; Sobrino Gómez & Bennett Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted). For example, Sobrino Gómez & Bennett (Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted) show that in Kaqchikel lax vowels are neutralized to tense when outside of the final (stressed) syllable, as in k’äy [ˈk’ɨj̥] ‘bitter’ vs. ruk’ayil [ru.k’a.ˈjil̥] ‘its bitterness’. Glottalized vowels are also neutralized to tense in this position, as in chaqi’j [͡tʃa.ˈqiʔχ] ‘dry’ vs. chaqijirisab’äl [͡tʃa.qi.χi.ri.sa.b’əl̥] ‘dryer’. In stressed syllables only a tense vowel can precede a glottal stop (Sobrino Gómez & Bennett Reference Sobrino Gómez, Bennett, Arellanes, Hernández and Hernándezsubmitted). The same patterns are not found in CK, however, where neither tense, lax, nor glottalized vowels are restricted to stressed syllables, as in ch’ab’äl [t͡ʃ'a.ˈɓəl] ‘language’, wächb’äl [wət͡ʃ.ˈɓəl] ‘image’, na’tsb’äl [na̰t.ˈsɓəl] ‘reminder’. As shown above, either tense or lax vowels can precede a glottal stop coda. Thus, although ultimately distinguishing between a one-segment or two-segment analysis for CK glottalized vowels will likely rely on phonological rather than phonetic evidence, the existing phonological evidence is inconclusive.

6. Conclusion

This article details the results of a controlled speech production experiment focused on CK tense, lax and glottalized vowels. The results show that overall tense vowels are longer than lax vowels and occupy further positions from the middle of the vowel space, with this distance mostly in the height dimension. The lax high front vowel /ɪ/ varies in realization as [i], [e] [ɛ] or [ə], mostly predictable based on the item and suggesting possible phonological conditioning that cannot be fully elucidated with the available data. The lax high back vowel /ʊ/ is not significantly different from the tense mid back vowel /o/. However, both /ɪ/ and /ʊ/ behave phonologically like lax vowels rather than tense vowels, suggesting that they may be at an intermediate stage of merging with surrounding categories.

The voice quality results show that glottalized vowels (vowels alternately analyzed as phonemically glottalized or as plain vowels followed by a glottal stop) are practically never produced with full glottal stops. Strong realizations of glottalization, such as creaky voice or other types of laryngealized phonation, are more common when followed by a voiceless consonant than a voiced one; in the latter context, weak realizations, such as a dip in intensity or apparently modal phonation, are more common. Plain (tense or lax) vowels followed by a simple glottal stop coda show stronger realizations of glottalization than glottalized vowels, though full closures are again very rarely produced. The vowel quality of glottalized vowels is lax for the high and mid vowels (/ɛ̰/, /ɪ̰ /, /ɔ̰/, /ʊ̰/) but tense for the central vowel (/a̰/), mirroring the neutralization of tense and lax central vowels when preceding a simple glottal stop coda. In cross-linguistic and historical context, the vowel quality and voice quality results do not show clear evidence in favor of either a one-segment or two-segment analysis for glottalized vowels.

Acknowledgments

I would like to thank all of the K’iche’ speakers who contributed to this project. Thanks also to the Associate Editor and two anonymous reviewers, who provided valuable feedback which significantly improved this article. This work was supported by the University of Texas at Austin Carlota Smith Fellowship and College of Liberal Arts Office of Research and Graduate Studies Carl J. and Tamara M. Tricoli Endowed Fellowship.

Competing interests

The author declares none.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/S0025100325100698

Appendix: Full statistical results

Table A1. Pairwise comparisons of vowel height by phoneme category in the F1 model

Table A2. Pairwise comparisons of vowel frontness by phoneme category in the F2 model

Table A3. Results of the duration model of the /o ʊ/ subset

Table A4. Results of the duration model of the /i ɪ e ɛ ə/ subset

Table A5. Pairwise comparisons in the /i ɪ e ɛ ə/ duration model

Footnotes

1 The following abbreviations are used throughout this document: dB = decibel(s), C = consonant, CK = Chichicastenango K’iche’, DF = degrees of freedom, f0 = fundamental frequency, F1 = first formant, F2 = second formant, ms = millisecond(s), Hz = Hertz, Int. = Intensity, POA = place of articulation, s = second(s), SE = standard error, V = vowel, * = p < .05, ** = p < .01, *** = p < .001.

2 Note that the glottal stop in ki’el /kiel/ [kiʔel] ‘they go out’ must be epenthetic because it is not present in other forms of the 3rd person plural set B (absolutive) prefix /i/ nor the verb root /el/ ‘go out’, except to prevent hiatus, e.g., kicha /kit͡ʃa/ [kit͡ʃa] ‘they speak’, köjel /kɔχel/ [kɔχel] ‘we go out’.

3 Some of the items have an underlying vowel between the first two consonants, present in other dialects of K’iche’ but deleted in the surface form in Chichicastenango K’iche’, e.g.: tz’nün /t͡s'ʊnʊn/ ‘hummingbird’.

4 The vowel change from /ə/ to /ɛ/ in /k'ɛq/ ‘flea’ and /kɛq'/ ‘guava’ can be understood as a further step beyond the pattern of dissimilatory palatalization of velar stops when followed by short /a/ and then a uvular consonant which has occurred in several K’ichean languages (England & Baird Reference England, Baird and Judith Aissen2017). This appears to be a regular sound change undergone in CK: all words which historically would have had a velar stop followed by /ə/ and then a uvular stop show the palatalization of the velar and fronting of the vowel, e.g. këq /kɛq/ ‘red’ from proto-K’iche *kaq, k’ëq /k'ɛq/ ‘to throw’ from proto-K’iche’ *k’aq.

5 Most K’iche’ speakers are not literate in K’iche’, and those that have learned to read K’iche’ read a standardized form of the language whose pronunciation is very different from Chichicastenango K’iche’. Because of this, a reading task, as is commonly used in speech production experiments, was impossible in this context.

6 None of the experiment items have a glide before or after the target vowel. However, sometimes participants produced target items with the first person singular possessive prefix /w/, e.g. wäj [wəχ] ‘my fresh corn’.

7 Note that the neutralization of the tense–lax contrast in glottalized vowels might appear to be an argument for considering glottalized vowels one segment. However, this may simply reflect an inherited phonotactic constraint: only short vowels are allowed before a complex coda. When followed by the change from contrastive vowel length to quality, this would result in only lax glottalized vowels.

8 It is assumed here that the cases of apparently modal phonation reflect a weaker constriction. However, it is possible (and in fact likely) that these are not cases of truly modal phonation, but may be tense or pressed voice or other types of non-modal phonation that are more difficult to distinguish visually from modal voice. Many of these tokens are perceived as glottalized despite the lack of visual evidence of glottalization. This would be a very interesting area for future research, in particular supported by a quantitative analysis of voice quality.

9 The appearance of glottalization on a following sonorant rather than a vowel itself is reminiscent of the timing of Danish stød (Peña 2022).

References

Adank, Patti, Smits, Roel & van Hout, Roeland. 2004. A comparison of vowel normalization procedures for language variation research. The Journal of the Acoustical Society of America 116(5), 30993107. https://doi.org/10.1121/1.1795335 CrossRefGoogle ScholarPubMed
Ajpacajá Tum, Pedro Florentino, Chox Tum, M. I., Tepax Raxulew, F. L. & Guarchaj Ajtzalam, D. A.. 2005. Diccionario K’iche’. 2nd edition. Guatemala: Proyecto Lingüístico Francisco Marroquín.Google Scholar
Alber, Birgit. 2001. Regional variation at edges: Glottal stop epenthesis and dissimilation in Standard and Southern varieties of German. Zeitschrift für Sprachwissenschaft 20(1), 341.Google Scholar
Baird, Brandon O. 2011. Phonetic and phonological realizations of ‘broken glottal’ vowels in K’ichee’. Proceedings of formal approaches to Mayan linguistics: MIT working papers in linguistics 63, 3949.Google Scholar
Baird, Brandon & Francisco Pascual, Adan. 2011. Realizaciones fonéticas de /Vʔ/ en Q’anjob’al (Maya). Proceedings of the Conference on Indigenous Languages of Latin America (CILLA) V.Google Scholar
Barreda, Santiago & Nearey, Terrance M.. 2018. A regression approach to vowel normalization for missing and unbalanced data. Journal of the Acoustical Society of America 144(1), 500520.10.1121/1.5047742CrossRefGoogle ScholarPubMed
Bates, Douglas, Mächler, Martin, Bolker, Ben & Walker, Steve. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67.1, 148. https://doi.org/10.18637/jss.v067.i01 CrossRefGoogle Scholar
Beguš, Gašper. 2017. Effects of ejective stops on preceding vowel duration. The Journal of the Acoustical Society of America 142(4), 21682184. https://doi.org/10.1121/1.5007728 CrossRefGoogle ScholarPubMed
Bennett, David C. 1968. Spectral form and duration as cues in the recognition of English and German vowels. Language and Speech 11(2), 6585.10.1177/002383096801100201CrossRefGoogle Scholar
Bennett, Ryan. 2016a. La tensión vocálica en el kaqchikel de Sololá, Guatemala: un estudio preliminar. Las Actas del Seminario Phonologica. Google Scholar
Bennett, Ryan. 2016b. Mayan Phonology. Language and Linguistics Compass, 469514. https://doi.org/10.1111/lnc3.12148 CrossRefGoogle Scholar
Bennett, Ryan. 2024, February. Segmentos y rasgos laríngeos en la familia maya: evidencia que la fonología es abstracta, y distinta de la fonética. Presentation at Form and Analysis in Mayan Linguistics (FAMLi) VII, Universidad Nacional Autónoma de México (UNAM), Mexico City.Google Scholar
Bird, Sonya. 2011. The nature of laryngealization in St’át’imcets laryngealized resonants. International Journal of American Linguistics 77(2), 159184. https://doi.org/10.1086/658893 CrossRefGoogle Scholar
Bird, Sonya, Czaykowska-Higgins, Ewa & Leonard, Janet. 2012. To reduce or not to reduce: Evidence from SENĆOT–EN storytelling. Canadian Acoustics 40(3), 1415.Google Scholar
Bird, Sonya. 2020. Pronunciation among adult indigenous language learners: The case of SENĆOT–EN /t’/. Journal of Second Language Pronunciation 6(2), 148179.10.1075/jslp.17042.birCrossRefGoogle Scholar
Boersma, Paul & Weenink, David. 2023. Praat: doing phonetics by computer. [Computer program]. Version 6.3.14. http://www.praat.org/ Google Scholar
Brunner, Jana & Zygis, Marzena. 2011. Why do glottal stops and low vowels like each other? Proceedings of the 17th International Congress of Phonetic Sciences, pp. 376379.Google Scholar
Can Pixabaj, Telma A. 2017. K’iche’. In: The Mayan languages. Ed. by Judith Aissen, Nora C. England, and Roberto Zavala Maldonado. 1st edition. London: Routledge Ltd., Chap. 18, pp. 461499. https://doi.org/10.4324/9781315192345-18 CrossRefGoogle Scholar
Dalton, Will. 2011. An ultrasound imaging study of the tense-lax distinction in Canadian French vowels. Canadian Acoustics 39(3), 168169.Google Scholar
Davidson, Lisa. 2020. The versatility of creaky phonation: Segmental, prosodic, and sociolinguistic uses in the world’s languages Wiley Interdisciplinary Reviews: Cognitive Science 12(3), e1547.Google ScholarPubMed
Davidson, Lisa. 2021. Effects of word position and flanking vowel on the implementation of glottal stop: Evidence from Hawaiian. Journal of Phonetics 88, 101075.10.1016/j.wocn.2021.101075CrossRefGoogle Scholar
DiCanio, Christian. 2012. Coarticulation between tone and glottal consonants in Itunyoso Trique. Journal of Phonetics 40, 162176.10.1016/j.wocn.2011.10.006CrossRefGoogle Scholar
DiCanio, Christian & Bennett, Ryan. 2021. Prosody in Mesoamerican languages. In: The Oxford handbook of language prosody. Ed. by Gussenhoven, Carlos and Chen, Aoju, pp. 408427. https://doi.org/10.1093/oxfordhb/9780198832232.013.25 CrossRefGoogle Scholar
Di Paolo, Marianna and Faber, Alice. 1990. Phonation differences and the phonetic content of the tense-lax contrast in Utah English. Language Variation and Change 2(2), 155204.10.1017/S0954394500000326CrossRefGoogle Scholar
Elvira García, Wendy. 2022. Create pictures with tiers v.6. Praat script. https://github.com/wendyelviragarcia/create_pictures Google Scholar
England, Nora C. & Baird, Brandon O.. 2017. Phonology and phonetics. In: The Mayan languages. Ed. by Judith Aissen, Nora C. England, and Roberto Zavala Maldonado. 1st edition. London: Routledge Ltd., Chap. 7, pp. 175200. https://doi.org/10.4324/9781315192345-7 CrossRefGoogle Scholar
Esling, John, Moisik, Scott, Benner, Allison & Crevier-Buchman, Lise. 2019. Voice quality: The laryngeal articulator model. Cambridge: Cambridge University Press.10.1017/9781108696555CrossRefGoogle Scholar
Esposito, Anna. 2002. On vowel height and consonantal voicing effects: Data from Italian. Phonetica 59, 197231.10.1159/000068347CrossRefGoogle ScholarPubMed
Evans, Jonathan P., Sun, Jackson T.-S., Chiu, Chenhao & Liou, Michelle. 2016. Uvular approximation as an articulatory vowel feature. Journal of the International Phonetic Association 46(1), 131. https://doi.org/10.1017/s0025100315000146 CrossRefGoogle Scholar
Everett, Caleb. 2013. Evidence for direct geographic influences on linguistic sounds: The case of ejectives. PLoS One 8(6), e65275. https://doi.org/10.1371/journal.pone.0065275 CrossRefGoogle ScholarPubMed
Frazier, Melissa. 2009. The production and perception of pitch and glottalization in Yucatec Maya. Ph.D. thesis. The University of North Carolina at Chapel Hill.Google Scholar
Fulop, Sean, Kari, Ethelbert & Ladefoged, Peter. 1998. An acoustic study of the tongue root contrast in Degema vowels. Phonetica 55(1–2), 8098.10.1159/000028425CrossRefGoogle ScholarPubMed
Garellek, Marc. 2013. Production and perception of glottal stops. Ph.D. thesis, University of California, Los Angeles.Google Scholar
Garellek, Marc. 2014. Voice quality strengthening and glottalization. Journal of Phonetics 45, 106113.10.1016/j.wocn.2014.04.001CrossRefGoogle Scholar
Garellek, Marc. 2022. Theoretical achievements of phonetics in the 21st century: Phonetics of voice quality. Journal of Phonetics 94, 101155.10.1016/j.wocn.2022.101155CrossRefGoogle Scholar
Gordon, Matthew. 2001. Linguistic aspects of voice quality with special reference to Athabaskan. Proceedings of the 2001 Athabaskan languages conference. Los Angeles: University of California.Google Scholar
Gordon, Matthew, Barthmaier, Paul & Sands, Kathy. 2002. A cross-linguistic acoustic study of voiceless fricatives. Journal of the International Phonetic Association 32(2), 141174. https://doi.org/10.1017/s0025100302001020 CrossRefGoogle Scholar
Guion, Susan, Post, Mark & Payn, Doris. 2004. Phonetic correlates of tongue root vowel contrasts in Maa. Journal of Phonetics 32, 517542.10.1016/j.wocn.2004.04.002CrossRefGoogle Scholar
Hess, Susan. 1992. Assimilatory effects in a vowel harmony system: An acoustic analysis of advanced tongue root in Akan. Journal of Phonetics 20, 475492.10.1016/S0095-4470(19)30651-5CrossRefGoogle Scholar
Hillenbrand, James M. & Clark, Michael J.. 2000. Effects of consonant environment on vowel formant patterns. Journal of the Acoustical Society of America 109(2), 748763.10.1121/1.1337959CrossRefGoogle Scholar
Hogg, Richard. 2012. Introduction to Old English. Edinburgh: Edinburgh University Press.10.1515/9780748654314CrossRefGoogle Scholar
Holliday, Nicole & Martin, Sean. 2018. Vowel categories and allophonic lowering among Bolivian Quechua–Spanish bilinguals. Journal of the International Phonetic Association 48(2), 199222. https://doi.org/10.1017/s0025100317000512 CrossRefGoogle Scholar
Jun, Sun-Ah. 2005. Prosodic typology. In: Prosodic typology: The phonology of intonation and phrasing. Ed. Jun, Sun-Ah, pp. 430458. Oxford: Oxford University Press.10.1093/acprof:oso/9780199249633.003.0016CrossRefGoogle Scholar
Kaufman, Terrence & Justeson, John S.. 2003. A preliminary Mayan etymological dictionary. Foundation for the Advancement of Mesoamerican Studies. http://www.famsi.org/reports/01051/ Google Scholar
Keating, Patricia A., Garellek, Marc & Kreiman, Jody. 2015. Acoustic properties of different kinds of creaky voice. Proceedings of the 18th International Congress of Phonetic Sciences, pp. 27.Google Scholar
Kirkham, Sam & Nance, Claire. 2017. An acoustic-articulatory study of bilingual vowel production: Advanced tongue root vowels in Twi and tense/lax vowels in Ghanaian English. Journal of Phonetics 62, 6581.10.1016/j.wocn.2017.03.004CrossRefGoogle Scholar
Koffi, Ettien. 2016. The acoustic correlates of [±ATR] vowels: An analysis by reference levels of Anyi vowels. Linguistic Portfolios 5(1), 9.Google Scholar
Kohler, Klaus J. 1994. Glottal stops and glottalization in German. Phonetica 51, 3851.10.1159/000261957CrossRefGoogle Scholar
Ladefoged, Peter & Maddieson, Ian. 1996. The sounds of the world’s languages. Oxford: Blackwell.Google Scholar
Lehiste, Ilse. 1972. The timing of utterances and linguistic boundaries. The Journal of the Acoustical Society of America 51(6B), 20182024.10.1121/1.1913062CrossRefGoogle Scholar
Lenth, Russell V. 2024. emmeans: Estimated marginal means, aka least-squares means. R package. Version 1.10.4.900003. https://rvlenth.github.io/emmeans/ Google Scholar
Lobanov, Boris. 1971. Classification of Russian vowels spoken by different speakers. The Journal of the Acoustical Society of America 49(2B), 606608. https://doi.org/10.1121/1.1912396 CrossRefGoogle Scholar
López Ixcoy, Saqijix Candelaria Dominga. 1994. Las vocales en K’ichee’. Guatemala: OKMA.Google Scholar
Machač, Pavel & Skarnitzl, Radek. 2009. Principles of phonetic segmentation. Prague, Czechia: Epocha.Google Scholar
Majzul, Filiberto Patal, Matzar, Pedro Oscar García & Espantzay Serech, Carmelina. 2000. Rujunamaxik ri Kaqchikel chi’: variación dialectal en Kaqchikel. Ciudad de Guatemala, Guatemala: Cholsamaj.Google Scholar
Mitterer, Holger, Kim, Sahyang & Cho, Taehong. 2019. The glottal stop between segmental and suprasegmental processing: The case of Maltese. Journal of Memory and Language 108, 104034.10.1016/j.jml.2019.104034CrossRefGoogle Scholar
Moisik, Scott. 2013. The epilarynx in Speech. Ph.D thesis, University of Victoria.Google Scholar
Moisik, Scott, Esling, John H., Crevier-Buchman, Lise, Amelot, Angélique & Halimi, Philippe. 2015. Multimodal imaging of glottal stop and creaky voice: Evaluating the role of epilaryngeal constriction. Proceedings of the 18 th International Congress of Phonetic Sciences. Google Scholar
Moisik, Scott, Czaykowska-Higgins, Ewa & Esling, John H.. 2021. Phonological potentials and the lower vocal tract. Journal of the International Phonetic Association 51(1). https://doi.org/10.1017/S0025100318000403 CrossRefGoogle Scholar
Monahan, Philip J. & Idsardi, William J.. 2010. Auditory sensitivity to formant ratios: Toward an account of vowel normalization. Language and Cognitive Processes 25(6), 808839.10.1080/01690965.2010.490047CrossRefGoogle ScholarPubMed
Öhman, Sven E. G. 1966. Coarticulation in VCV utterances: Spectrographic measurements. The Journal of the Acoustical Society of America 39(1), 151168. https://doi.org/10.1121/1.1909864 CrossRefGoogle ScholarPubMed
Par Sapón, María Beatriz & Angelina Can Pixabaj, Telma. 2000. Ujunamaxiik ri K’ichee’ Ch’ab’al: Variación Dialectal en K’ichee’. Guatemala: OKMA, Cholsamaj.Google Scholar
Peña, Jailyn M. 2022. Stød timing and domain in Danish. Languages 7(1), 50.10.3390/languages7010050CrossRefGoogle Scholar
Percival, Maida. 2024. Production and perception of ejective stops across languages. Ph.D. thesis, University of Toronto. TSpace Institutional Repository. https://utoronto.scholaris.ca/items/577327fa-c531-423a-a857-b84916b6f431 Google Scholar
Peterson, Gordon & Lehiste, Ilse. 1960. Duration of syllable nuclei in English. The Journal of the Acoustical Society of America 32(6), 693703.10.1121/1.1908183CrossRefGoogle Scholar
Pompino-Marschall, Bernd & Zygis, Marzena. 2011. Glottal marking of vowel-initial words in German. Proceedings of the 17th International Congress of Phonetic Sciences, pp. 16261629.Google Scholar
Priestly, Tom. 1976. A note on the glottal stop. Phonetica 33, 268274.10.1159/000259775CrossRefGoogle Scholar
Przezdziecki, Marek. 2005. Vowel harmony and coarticulation in three dialects of Yoruba: Phonetics determining phonology. Ph.D. thesis, Cornell University.Google Scholar
Quick, Phil. 2003. A grammar of the Pendau language. Ph.D. thesis, Australian National University.Google Scholar
R Core Team. 2020. R: A language and environment for statistical computing. [Comupter program]. Version 4.0.3. Vienna, Austria. https://www.R-project.org/ Google Scholar
Redi, Laura & Shattuck-Hufnagel, Stefanie. 2001. Variation in the realization of glottalization in normal speakers. Journal of Phonetics 29(4), 407429. https://doi.org/10.1006/jpho.2001.0145.CrossRefGoogle Scholar
Sobrino Gómez, Martín & Bennett, Ryan. Submitted. Fonética y fonología de los rasgos y segmentos laríngeos en la familia maya. In: Rasgos laríngeos en lenguas indomexicanas. Eds. Arellanes, Francisco, Hernández, Mario, and Hernández, Fidel. Berlin, Germany: Language Science Press. Google Scholar
Starwalt, Coleen. 2008. The acoustic correlates of ATR harmony in seven- and nine-vowel African languages: A phonetic inquiry into phonological structure. Ph.D. thesis, University of Texas, Arlington.Google Scholar
Stevens, Kenneth N. & House, Arthur S.. 1963. Perturbation of vowel articulations by consonantal context: An acoustical study. Journal of Speech and Hearing Research 6, 111128.10.1044/jshr.0602.111CrossRefGoogle ScholarPubMed
van Santen, Jan P. H. 1992. Contextual effects on vowel duration. Speech Communication 11(6), 513546.10.1016/0167-6393(92)90027-5CrossRefGoogle Scholar
Whalen, Dough, DiCanio, Christian, Geissler, Christopher & King, Hannah. 2016. Acoustic realization of a distinctive, frequent glottal stop: The Arapaho example. Journal of the Acoustical Society of America 139(4), 22122213.10.1121/1.4950615CrossRefGoogle Scholar
Wheeler, Paige Erin. 2024. Vowels in naturalistic Enenlhet speech: Quality, duration, and phonation. Ph.D. thesis, University of Texas at Austin.Google Scholar
Wickham, Hadley. 2016. ggplot2: Elegant graphics for data analysis. New York: Springer-Verlag. https://ggplot2.tidyverse.org 10.1007/978-3-319-24277-4_9CrossRefGoogle Scholar
Wilson, Ian. 2007. The effects of post-velar consonants on vowels in Nuu-chah-nulth: Auditory, acoustic, and articulatory evidence. Canadian Journal of Linguistics 52(1–2), 243270. https://doi.org/s0008413100004199 CrossRefGoogle Scholar
Wolf, Catherine G. 1978. Voicing cues in English final stops. Journal of Phonetics 6, 299309.10.1016/S0095-4470(19)31162-3CrossRefGoogle Scholar
Wood, Elizabeth Anne. 2020. Vowels and stress in Chichicastenango K’iche’. M.A. thesis, University of Texas at Austin.Google Scholar
Wood, Elizabeth. 2023. Vowel-initial words and glottalization: A corpus study of Chichicastenango K’iche’. Proceedings of the 20th International Congress of Phonetic Sciences, pp. 17661770.Google Scholar
Wood, Elizabeth Anne. 2024. Prosodic structure in spontaneous speech: Phrase-final marking, word-initial glottalization and vowel deletion in Chichicastenango K’iche’. Ph.D. thesis, University of Texas at Austin.Google Scholar
Zawaydeh, Bushra Adnan. 1997. An acoustic analysis of uvularization spread in Ammani Jordanian Arabic. Studies in the Linguistic Sciences 27(1).Google Scholar
Figure 0

Table 1. K’iche’ consonant inventory

Figure 1

Table 2. Items and token counts included in the experiment

Figure 2

Figure 1. Example stimuli for a noun with a plain vowel (sak’ ‘grasshopper’, top left), adjective with a plain vowel (q’ëq ‘black’, top right), noun with a glottalized vowel (xpa’ch ‘lizard’, bottom left), and adjective with a glottalized vowel (chqï’j ‘dry’, bottom right)

Figure 3

Figure 2. Examples of vowel segmentation. After a glide and before a fricative in wäj [wəχ] ‘my fresh corn’ (top left), after a glottal stop and before a fricative in ju oj [χu ʔoj] ‘an avocado’ (top right), before and after an ejective stop in t’ot’ [t’ot'] ‘snail’ (bottom left), after an affricate and before a nasal in chim [t͡ʃim] ‘bag’ (bottom right).

Figure 4

Table 3. Coding of different types of glottalization

Figure 5

Figure 3. Full closure in ch’a’k /t͡ʃ'a̰k/ ‘sore’ (top left; 22 ms between two pulses towards the center of the vowel); creaky voice in pö’t /pɔ̰t/ ‘huipil’ (top right); intensity dip in sü’t /sʊ̰t/ ‘cloth’ (bottom left); apparently modal voice in the second syllable of t’isö’n /t’isɔ̰n/ ‘sewing’ (bottom right).

Figure 6

Table 4. Factors and hypotheses for the F1 model

Figure 7

Table 5. Factors and hypotheses for the F2 model

Figure 8

Table 6. Factors and hypotheses for the duration model

Figure 9

Figure 4. Normalized tense and lax vowels for all speakers.

Figure 10

Figure 5. Normalized tense, lax and glottalized back vowels for all speakers.

Figure 11

Figure 6. Normalized tense, lax and glottalized front and central vowels for all speakers.

Figure 12

Table 7. Comparison of F1 models

Figure 13

Table 8. Results of the final model of F1

Figure 14

Table 9. Comparison of F2 models

Figure 15

Table 10. Results of the final model of F2

Figure 16

Table 11. Contrasts in vowel height and frontness according to the statistical results

Figure 17

Figure 7. Vowels /o/ and /ʊ/ by speaker.

Figure 18

Figure 8. Normalized F1 and F2 vowels for the phonemes /o/ and /ʊ/ as duration changes.

Figure 19

Figure 9. Normalized tense and lax front and central vowels, showing the realizations of /ɪ/.

Figure 20

Figure 10. Normalized front and central tense, lax and glottalized vowels, showing the realizations of /ɪ̰/.

Figure 21

Figure 11. Durations of tense, lax and glottalized vowels in each place of articulation set.

Figure 22

Table 12. Comparison of duration models

Figure 23

Table 13. Results of final model of vowel duration (estimates in seconds)

Figure 24

Table 14. Rates of each type of glottalization by context

Figure 25

Figure 12. Rates of each type of glottalization by context.

Figure 26

Table 15. Phonetic realization of lax and glottalized high front vowels by item (glottal stop included in following context for glottalized vowels)

Figure 27

Table A1. Pairwise comparisons of vowel height by phoneme category in the F1 model

Figure 28

Table A2. Pairwise comparisons of vowel frontness by phoneme category in the F2 model

Figure 29

Table A3. Results of the duration model of the /o ʊ/ subset

Figure 30

Table A4. Results of the duration model of the /i ɪ e ɛ ə/ subset

Figure 31

Table A5. Pairwise comparisons in the /i ɪ e ɛ ə/ duration model

Supplementary material: File

Wood supplementary material

Wood supplementary material
Download Wood supplementary material(File)
File 8.2 MB