Hostname: page-component-7dd5485656-npwhs Total loading time: 0 Render date: 2025-10-21T13:00:44.822Z Has data issue: false hasContentIssue false

Correspondence at the acoustic boundaries: cluster simplification, syncope and P-map correspondence constraints

Published online by Cambridge University Press:  20 October 2025

Yeong-Joon Kim*
Affiliation:
Department of Linguistics, Seoul National University , Seoul, South Korea
Rights & Permissions [Opens in a new window]

Abstract

The C2 dominance effect in cluster simplification, in which the second consonant is preserved over the first (V1C1C2V2 → V1C2V2), has been attributed to the perceptual salience of prevocalic consonants. However, this P-map account fails in classic Optimality Theory when syncope feeds cluster simplification (V1C1V0C2V2 → V1C1C2V2 → V1C2V2), as the input and output contexts do not differentiate C1 from C2. This article proposes a solution to this problem using correspondence constraints that reference acoustic transitions from and to vowels. Since syncope removes the targeted vowel and its associated transitions, Ident [transition] constraints cannot refer to the eliminated transitions. Specifically, the transition from C2 is protected by the relevant Ident [release transition] constraint, while C1’s transition is not. Thus, under the ranking of Ident [release transition] over Ident [closure transition], C1, despite being underlyingly prevocalic, remains subject to the C2 dominance effect and is targeted for deletion. This proposal also addresses interactions between syncope and other cluster reduction processes, such as major place assimilation and debuccalisation.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (https://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

1. Introduction

When a medial consonant cluster is simplified by removing one of its members, it is typically achieved by deletion of the first consonant (C1):

In theory, cluster simplification can also be achieved by the deletion of the second consonant (C2). However, the cross-linguistic preference for the deletion of C1 over C2 is dominant (Wilson Reference Wilson2001; Jun Reference Jun2002, Reference Jun, van Oostendorp, Ewen, Hume and Rice2011; McCarthy Reference McCarthy2008, Reference McCarthy2011, among others). Cluster simplification in Basque and Diola Fogny exemplifies this effect. In both languages, when the simplification process targets a medial cluster, C1 is always deleted, while C2 remains unaffected.

As pointed out by Wilson (Reference Wilson2001), the asymmetric positional effect in cluster simplification stems from the lower perceptibility of C1. In medial cluster simplification, the prevocalic C2 is more likely to be preserved because of its perceptual salience, while the perceptually weak preconsonantal C1 is more prone to deletion or alteration. This Licensing-by-Cue idea (Steriade Reference Steriade1999) can be implemented in classic Optimality Theory (OT) (Prince & Smolensky [1993] Reference Prince and Smolensky2004) using contextual faithfulness constraints. These constraints are ranked differently based on their contexts, assuming that a speaker’s knowledge of contrast distinctiveness across positions is mapped onto the relative constraint ranking (P-map theory; Steriade Reference Steriade, Hanson and Inkelas2009). For example, Max-C /__V should outrank Max-C /__C, as consonants are perceptually more salient prevocalically. Under this ranking, the V1C1C2V2 cluster is neutralised to V1C2V2 rather than V1C1V2 when simplification occurs, as schematised in (3).

This C2 dominance effect is evident even when syncope creates the target cluster for simplification. The following examples from Carib of Suriname demonstrate this interaction between syncope and cluster simplification:

In this language, a stem-final high vowel is deleted when it is followed by a CV-initial suffix. Then, the consonant preceding the deleted vowel is also removed to avoid creating a geminate or a medial cluster. Notably, the C2 dominance effect remains consistent regardless of whether syncope has occurred: the simplification process removes C1 rather than C2. This asymmetric effect is not limited to Carib, but is also observed in other languages where syncope interacts with cluster simplification, such as Tangale (Kidda Reference Kidda1993), Tunica (Haas Reference Haas and Hoijer1946) and Nishnaabemwin (Valentine Reference Valentine2001).

However, within classic OT, the contextual faithfulness analysis in (3) encounters a significant challenge in capturing the C2 dominance effect when syncope is involved (e.g., Wilson Reference Wilson2001; McCarthy Reference McCarthy2011). The main issue stems from the contextual information specified in the faithfulness constraints. In simple cases where cluster simplification occurs without syncope, it is sufficient to assume that the faithfulness constraints specify the input context: C1 is deleted because it is preconsonantal in the input. However, this assumption fails when syncope feeds simplification, as both members of the targeted cluster are prevocalic in the underlying representation. Consider the following tableau with an ad hoc Syncope constraint that removes a medial vowel between two consonants (V0):

The tableau in (5), where the relevant faithfulness constraints refer to the input contexts, predicts that deleting either C1 or C2 is equally viable, as both consonants are protected by the same constraint (i.e., Max-C /__V input ). The given faithfulness constraints and their ranking cannot distinguish between C1 and C2 by penalising only the deletion of C2, unlike in (3).

An alternative approach that assumes faithfulness constraints specifying the surface context also fails:

In (6), neither deletion of C1 nor of C2 is penalised by the relevant faithfulness constraints. In both (6c) and (6d), a deleted consonant is absent from the surface forms, making it impossible to penalise consonant deletion based on the output context.

This study aims to capture the patterns of cluster simplification, regardless of whether syncope is involved or not, keeping the P-map approach’s core idea that faithfulness constraints are ranked according to the perceptual hierarchy (Steriade Reference Steriade, Hanson and Inkelas2009). The central claim is that the C2 dominance effect can be consistently captured through correspondence constraints that refer to subsegments. Specifically, following Flemming (Reference Flemming2008b), this article proposes that P-map correspondence constraints can refer to phonetic details conveyed by transitions from and out of a consonant, and these constraints are ranked hierarchically according to the perceptual salience of the phonetic properties they refer to. For instance, Ident [release transition], which penalises changes in the transitional phase from a consonant to a following vowel, should be ranked over Ident [closure transition], which evaluates changes in the transition from a vowel to a following consonant. This ranking is necessary because the release transition from a consonant to a vowel carries more perceptually reliable cues for the consonant than the closure transition from a vowel to a consonant. Given this ranking, the C2 dominance effect is easily captured: when cluster simplification is enforced, C1 is removed instead of C2 to avoid violating the higher-ranked Ident [release transition] constraint, at the cost of violating the lower-ranked Ident [closure transition].

These P-map constraints that refer to subsegmental transitions have advantages over conventional faithfulness constraints when cluster simplification interacts with syncope. This is because the changes in consonant properties are governed by correspondence constraints that evaluate the acoustic transitions occurring outside the consonant itself. In the context of syncope, C1 and C2 are differentiated by the fact that the underlying vowel following C1 is removed on the surface, whereas the vowel following C2 remains intact. As a result, potential C2 deletion is still penalised by Ident [release transition], while C1 deletion is not evaluated by the same constraint due to the lack of a surface correspondent of the input transition. Similarly, the transition from the preceding vowel to C1 is preserved on the surface, whereas the transition to C2 is removed by syncope. Therefore, Ident [release transition] selectively penalises only changes related to C2, while Ident [closure transition] only concerns changes related to C1. Such an evaluation is not possible with contextual faithfulness constraints, as outlined above. Thus, the hypothesis that correspondence constraints refer to acoustic transitions is crucial for capturing the C2 dominance effect in the context of syncope.

The remainder of the article is structured as follows. §2 outlines the proposal, which serves as the basis for the analyses of various cluster simplification processes provided in §3. §4 extends this analysis to other cluster reduction processes, such as major place assimilation and debuccalisation, where the C2 dominance effect is also observed in the context of syncope. §5 discusses exceptions to the C2 dominance effect and provides solutions to them. §6 compares the current proposal with two alternatives: the derivational framework of Harmonic Serialism (HS) (McCarthy Reference McCarthy2008, Reference McCarthy2011) and Contiguity constraints (e.g., Kenstowicz Reference Kenstowicz1994; McCarthy & Prince Reference McCarthy and Prince1995). Finally, §7 concludes the article.

2. Proposal

This section consists of three subsections. In §2.1, the core elements of the proposal are outlined in detail, emphasising the crucial role of subsegmental transitions in analysing the C2 dominance effect in the syncope context. The remaining two subsections address topics that refine the theoretical aspects of the proposal. §2.2 discusses the representation of features within subsegments, adopting the view that these features are phonetically detailed. Then, §2.3 provides a brief overview of Flemming’s (Reference Flemming2008b) Realised Input model, within which the main proposal is couched.

2.1. Key proposal

The central claim of the current proposal is that transitions into or out of consonants are represented in the relevant forms, and their correspondence relations are accessible to faithfulness constraints. This claim raises three related questions. First, how are these transitions represented? Second, why should faithfulness constraints evaluate these transitions? Third, and most importantly, how do these faithfulness constraints for transitions solve the syncope problem? This subsection focusses on these questions, while related but less central aspects of the proposal, such as the representational assumptions with regard to features and the grammatical architecture to be adopted, are explored separately in subsequent subsections.

First, regarding how the transitions are represented, it is assumed here, following Flemming (Reference Flemming2002: 23 ff., Reference Flemming2006: 19, Reference Flemming2008b: 16 ff.), that they are specified as subsegments occupying time intervals starting or ending at consonant–vowel boundaries. These subsegments can contain relevant auditory features, such as F2 frequencies. Additionally, it is assumed that subsegments are subordinate to their host phonological segments: no subsegment can stand alone without a host segment.Footnote 1 The schema in Figure 1 illustrates this with a hypothetical configuration of a medial cluster composed of two stops differing in place of articulation (cf. Flemming Reference Flemming2002: 24). Dotted lines indicate associations between subsegments and their host phonological segments.

Figure 1 The auditory subsegments and features of the sound string [atpu] (NF = noise frequency; NL = noise loudness).

Each segment illustrated in Figure 1 comprises at least one subsegment. For example, the initial vowel [a] consists of two distinct subsegments: Vmid, which represents the vowel’s steady state, and the closure transition, which is the transitional phase from the vowel to [t]. Likewise, the final vowel [u] also has two subsegments: the release transition, the transitional phase from the preceding consonant [p] to the vowel, and Vmid.Footnote 2 Crucially, these subsegments, rather than the host segments themselves, serve as the repositories of features, the building blocks of phonetic detail. This implies that changes in feature values are computed in the grammar at the subsegmental level rather than the segmental level.

This assumption that features are computed at the subsegmental level answers the second question: why do faithfulness constraints need to refer to transitions? As illustrated above, most consonant features are represented in the adjacent transitions rather than the phonological consonant itself. Thus, changes in consonant properties due to cluster simplification or other phonological processes are likely to be manifested as changes in feature values within the transitions. For instance, when cluster simplification occurs, a sequence like V1C1C2V2 is neutralised to V1C2V2, invoking changes in the closure transition carried by V1 due to the removal of C1. If the sequence [atpu] in Figure 1 neutralises to [apu], the closure transition into [t] in the source representation differs from its output correspondent aiming at [p], specifically in terms of F2 frequencies, which provide primary place cues to a consonant (Dorman et al., Reference Dorman, Studdert-Kennedy and Raphael1977). Thus, for faithfulness constraints to properly assess the changes in consonantal place of articulation resulting from cluster simplification, they must be capable of referring to the F2 values present in the transitions.Footnote 3

To illustrate how faithfulness constraints that refer to auditory features can effectively regulate cluster simplification, suppose that changes in release and closure transitions are evaluated by constraints of types schematised in (7) and (8). Adopting Correspondence Theory (McCarthy & Prince Reference McCarthy and Prince1995), I use Ident constraints for featural changes while reserving Max and Dep constraints for the deletion and insertion of segmental and subsegmental units.Footnote 4

The tableau in (9) illustrates how these Ident [transition] constraints evaluate the relevant changes (the transitions are denoted by superscripts). When a cluster consisting of two consonants, such as a [tp] sequence, is simplified, C2 deletion would change the transition from [p] to [u] into a transition from [t] to [u], thus violating Ident [release transition: F2]. On the other hand, C1 deletion would alter the transition from [a] to [t] into a transition from [a] to [p], consequently incurring a violation of Ident [closure transition: F2].

The C2 dominance effect in cluster simplification can be captured by the hierarchical relationship between the Ident [release transition] and Ident [closure transition] constraints. The P-map idea is crucial here, stating that a constraint evaluating a perceptually more significant change must outrank a constraint evaluating a less significant change. The ranking Ident [release transition] ≫ Ident [closure transition] can be set based on the fact that listeners weigh onset signals more heavily than offset signals in perception (e.g., Fujimura et al. Reference Fujimura, Macchi and Streeter1978; Phillips et al. Reference Phillips, Hall and Boehnke2002; Wright Reference Wright, Hayes, Kirchner and Steriade2004: 43). For instance, release cues are perceptually more important than closure cues in detecting a consonant and its features (Ohala Reference Ohala, Kingston and Beckman1990; Kingston Reference Kingston and Keating1994). These faithfulness constraints formulated in (7) and (8) are thus P-map correspondence constraints, and they are universally ranked based on perceptual differences across distinct contexts:

The tableau in (11) illustrates how this ranking condition captures the C2 dominance effect:

In the given grammar, candidate (11a) is ruled out by the top-ranked *CC constraint, which prohibits consonant sequences. Then, the consonant to be deleted is determined by the ranking condition in (10). Candidate (11c), which removes the first stop, is preferred over candidate (11b), which deletes the second. The former candidate only violates the lower-ranked Ident [closure transition: F2], while the latter is penalised by the higher-ranked Ident [release transition: F2].

Now that correspondence constraints capable of addressing the C2 dominance effect have been established, the final question can be tackled: how do these correspondence constraints for transitions solve the syncope problem? To answer this, imagine a hypothetical source sequence [atipu] with a high vowel between two consonants, which ultimately neutralises to [apu]. The tableau in (12) describes how the C2 dominance effect in this neutralisation can be captured. In this tableau, neither Ident [closure transition] nor Ident [release transition] is violated due to vowel deletion, as both the release transition from the first consonant and the closure transition into the second consonant are removed together with the medial vowel by syncope. In other words, Ident [release transition] is only violated by the changes in the release transition that are present both in the source and the output, as in (12c). Likewise, Ident [closure transition] penalises only the closure transition whose correspondence relationship is maintained, as in (12d). The transitions removed by syncope cannot affect the evaluations since they lack their correspondents on the surface. Consequently, (12d) is selected as the optimal output by the ranking condition Ident [release transition] ≫ Ident [closure transition].

To sum up, the C2 dominance effect can be captured by the ranking of Ident [release transition] over Ident [closure transition], regardless of whether syncope is involved. In the current proposal, vowel deletion entails the removal of the transitions contained by the deleted vowel; thus, the correspondence in transitions between the source and output representations cannot be maintained. Consequently, Ident [release transition] and Ident [closure transition] are vacuously satisfied with regard to the removed transitions. The remaining transitions are evaluated by the correspondence constraints because they are present in the adjacent vowels even after one member of the cluster is removed. In this sense, syncope provides an environment that is not technically different from the condition where a vowel does not intervene between the two consonants.

2.2. Representation of the features

In (7) and (8), I formalised the Ident constraints for subsegmental features without detailing their nature. Since the crucial aspect of the proposal is the presence of consonantal features in transitions, the proposal is compatible with various feature specifications, provided that transitions are represented in relevant forms and referenced by faithfulness constraints. For instance, transitions may be represented with conventional SPE-style binary features like [±labial] or [±nasal] instead of the auditory features in Figure 1. Using these conventional features raises no critical issue in addressing the syncope problem.

Consider in detail how such binary features can be employed with the current proposal. For this purpose, it is assumed that release and closure phases are inserted during consonant adjacency, inheriting relevant binary features from it. Specifically, in some existing subsegmental theories, the proposal could be implemented with the assumption that featural inheritance is controlled by surface correspondence relationships, as in Agreement by Correspondence (ABC) theories (e.g., Walker Reference Walker2000; Rose & Walker Reference Rose and Walker2004). The following schema illustrates a potential subsegmental representation within a framework drawing on ABC+Q-theory, which combines ABC with Q-theory (Shih & Inkelas Reference Shih and Inkelas2019).Footnote 5 Consonantal subsegments and transitions are coindexed, allowing features to be inherited from a consonant by its adjacent subsegmental transitions. This inheritance occurs when two conditions are met: first, a constraint enforcing surface agreement between these subsegments is ranked high; and second, a change in transition is preferred over a change in the consonantal subsegment itself.Footnote 6

With these binary features and surface correspondence relationships, the analytic consequences are the same as in (12), as illustrated in the tableau in (14):

Alternatively, auditory dimensions like F1 or F2 can be represented using simple discrete values (Flemming Reference Flemming2002). For instance, the F2 feature in a transition adjacent to [t] can be specified with a higher value (e.g., 4), while a transition adjacent to [p] can be assigned a lower value (e.g., 1), as in (15). These values reflect the F2 loci of the relevant consonants. This strategy remains compatible with the current proposal, provided that changes in these discrete values are penalised by the relevant P-map constraints.

Transition features may not have specific values but rather exist as unary phonetic cues. For example, coronal transitions can be manifested by [high F2 frequency], while labial transitions are represented by [low F2 frequency], without the need for specific feature values. Within the Bidirectional Phonetics and Phonology (BiPhon) framework (Boersma Reference Boersma2007, Reference Boersma, Benz and Mattausch2011, Reference Boersma, Noske and Botma2012; Boersma & Hamann Reference Boersma and Hamann2008, Reference Boersma, Hamann, Boersma and Hamann2009; Boersma & Leussen Reference Boersma and Leussen2017), cue constraints that refer to such transitional cues can theoretically capture the C2 dominance effect in the syncope context. This is illustrated in the tableau in (16), where [t] represents [high F2 frequency], and [p] denotes [low F2 frequency].

In (16), the BiPhon model is implemented within parallel OT. For the underlying form |atipu|, potential pairs of a surface form (in slashes) and phonetic form (in square brackets) are given as candidates. The faithfulness constraints presented here are of two types: those evaluating the mapping from an underlying representation to a surface form (e.g., Max-C) and cue constraints regulating the mapping from a surface form to a phonetic form (e.g., */pV/[_tV] and */Vt/[Vp_]). Similarly, markedness constraints operate on different representational levels: some evaluate surface forms (e.g., Syncope), while others evaluate phonetic forms (e.g., *[Vt_pV]). With these constraints, the grammar correctly selects candidate (16d) as optimal. Details are as follows. For |atipu|, its surface representation should be /atpu/ due to the high-ranked Syncope constraint and Max-C. This surface form is then phonetically realised as [ap_pu] under the pressure of the constraint that demands the same frequency feature on both sides of the acoustic silence (i.e., *[Vt_pV]). The alternative phonetic form [at_tu] in (16c) is ruled out due to the P-map ranking between the cue constraints, in which */pV/[_tV], which protects the release transition, outranks */Vt/[Vp_], which preserves the closure transition. Once again, the crucial role is played by the distinction between the release transition and the closure transition, as determined by the constraint ranking reflecting the P-map hierarchy.

Although there are various potential ways to specify transitional features, including those previously mentioned, I assume that considerable phonetic detail is represented in relevant forms for the complete formulation of the P-map constraints. Given that subsegmental transitions are assumed to represent real acoustic events at consonant–vowel boundaries rather than being abstract units, it is natural to suppose that features encoded by such subsegmental units reflect detailed acoustic events. Therefore, I assume that transitions encode auditory features like F1 or F2, rather than abstract features.

In addition, it is assumed that these auditory features are context-dependent and continuous. This is based on the phonetic observation that there are no clear invariant acoustic cues for some features, a phenomenon known as the lack of invariance in the speech signal (see Delattre et al. Reference Delattre, Liberman and Cooper1955). For example, a consonant’s place of articulation can be detected by F2 transition properties, but there are no absolute values serving as invariant landmarks for each place feature, since relevant F2 frequencies vary as a function of the vocalic context. In fact, not only the F2 transition but also many other acoustic cues are context-dependent, while invariant acoustic cues are rare (Reetz & Jongman Reference Reetz and Jongman2020: 227; cf. Blumstein & Stevens Reference Blumstein and Stevens1980). Note that this assumption has additional advantages: grammatically controlled language-specific phonetic events, such as coarticulation, can be consistently analysed within the same framework as their phonological counterparts, such as place assimilation or cluster simplification (e.g., Flemming Reference Flemming2001, Reference Flemming, Embarki and Dodane2011). While this article focusses on phonological processes with contrastive outcomes, the boundary between such processes and their corresponding phonetic events is often unclear. Given this, I maintain the position that phonological processes, such as cluster simplification, can be derived through the computation of continuous auditory features, leaving open the possibility of a unified model of phonetics and phonology.Footnote 7 However, as mentioned earlier, the assumptions about the nature of these features are not central to the proposal. Readers should have no problem understanding the analyses from §3 onward, even if they assume one of the alternatives mentioned above instead of the specific suggestion that the features are continuous and context-dependent.

On the assumption that features are considerably detailed, I propose that the correspondence constraints for the transitions, previously introduced in (7) and (8), can be more precisely defined as follows:

These constraints evaluate the perceptual differences in the release and closure transitions between representations i and j. Assuming that the transitions contain only features representing the first three formant frequencies (F1, F2 and F3) at the onset or offset of the trajectory, these specifications can be quantified as in (19).Footnote 8 The d-value denotes the perceptual distance between two transitions.

This models the perceptual distances of Bark-scaled formant frequencies (Traunmüller Reference Traunmüller1990) between i and j. To illustrate, consider English preconsonantal t/d deletion, such as /bæd bɔɪ/ → [bæbbbɔɪ] (Dilley & Pitt Reference Dilley and Pitt2007). In English, the onset frequencies of the first three formants in Barks for [bɔ] are 3.80 (F1), 9.26 (F2) and 14.17 (F3), while the values for [dɔ] are 3.53 (F1), 11.66 (F2) and 14.63 (F3) (Kewley-Port Reference Kewley-Port1982). Using these measurements and the metric in (19), the d-value between [bɔ] and [dɔ] at the onset of [ɔ] can be calculated as 1.42, assuming $a = b = c = 0.\bar {3}$ (summing to 1.00). Thus, the change from [bɔ] to [dɔ] violates Ident [release transition] > 0.5, which penalises onset changes with d-values exceeding 0.5, but not Ident [release transition] > 1.5, as the d-value is less than 1.5. Similarly, the d-value for the perceptual difference between [æb] and [æd] can be obtained as 0.94, given the formant values in Barks: 4.32 (F1), 11.21 (F2) and 13.51 (F3) for [æb] and 3.73 (F1), 12.25 (F2) and 14.62 (F3) for [æd]. In principle, the English cluster simplification case can be analysed based on these d-values, as in the tableau in (20).

In (20), the faithfulness constraints are ranked hierarchically according to the P-map principle. In other words, this stratification means that more perceptually distinct changes incur more severe faithfulness violations. In addition to this basic principle that larger changes in the transition correspond to a larger d-value, considering that onset changes are perceptually weighted more than offset changes, the P-map principle implies the following hierarchical relationship among the Ident [transition] > $d_{ij}$ constraints:

The condition in (21) states that changes in the release transition are perceptually more significant than those in the closure transition for the same d-value, allowing the possibility that a minor change in the release transition can be overridden by a more significant change in the closure transition.

This formalisation directly implements the P-map by being sensitive to actual acoustic-perceptual differences. The core proposal of the P-map is that it guides speakers to identify the smallest perceptual change from the input, minimising overall deformation (Steriade Reference Steriade, Hanson and Inkelas2009). This idea can, in theory, be grammatically encoded using correspondence constraints in various formalisations. However, the extent to which these different forms of faithfulness constraints reflect the P-map can vary. For instance, a P-map ranking of contextual faithfulness constraints such as Ident (place) /__V ≫ Ident (place) /__C suggests that the perceptual difference for the same place contrast is invariably greater in the prevocalic context than in the preconsonantal context. However, in fact, small differences in CV transitions can be overshadowed by significant differences in VC transitions. As Steriade (Reference Steriade, Hume and Johnson2001) points out, the t/ʈ apical contrast relies primarily on formant differences in the closure transition, particularly in F3, while the release transition’s contribution is negligible because the tongue tip slides towards the alveolar position during the retroflex closure. Consequently, in a Swedish word like /væɖslig/ ‘worldly’, realised as [væɖʂɭig] (Eliasson Reference Eliasson and Andersen1986: 280; Hamann Reference Hamann2003), apical assimilation shows a progressive direction, indicating a preference for a minimal change in the release transition over significant changes in the closure transition.Footnote 9 In contrast, the Ident [transition] constraints, which reflect the exact acoustic difference between forms, more effectively explain such progressive apical assimilation. One hypothesis consistent with (21) is that changes in the closure transition take precedence over changes in the release transition when the d-value associated with the closure transition is significantly larger. For example, while Ident [closure transition] > 0.5 should be outranked by Ident [release transition] > 0.5 according to the condition in (21), Ident [closure transition] > 2.5 does not necessarily need to be outranked by Ident [release transition] > 0.5. A ranking in which Ident [closure transition] > 2.5 dominates Ident [release transition] > 0.5 could lead to progressive assimilation. Therefore, this formulation, which directly refers to the d-value, more realistically reflects the P-map.

However, despite these considerations, it is true that (21) is too intricate for analytical convenience, requiring exact d-values for each change and the threshold where strict onset dominance can be subverted. I henceforth assume that the perceptual effects of potential onset changes from cluster simplification generally exceed those of offset changes, as shown in (22).Footnote 10

This allows further simplifications of the relevant constraints, abstracting away from detailed measurements. Consider again the English cluster simplification case analysed in (20). Given the condition in (22), the tableau in (20) can be reduced to the simpler grammar in (23):

This is similar to (11), except that more features, that is, F1 and F3, are considered here according to the metric in (19). By assuming the ranking condition in (22), it is possible to reduce the effort involved in measuring d-values and thresholds while maintaining formal rigour.

Considering the above, for the remainder of this article, I will adopt two simplifications for analytical convenience. First, given that the relevant onset changes are, in general, perceptually more significant than the offset changes in the cluster simplification contexts, the d-values are not fully specified. Second, all relevant feature specifications are calculated by a metric, so features are not manifested on constraints unless necessary. These strategies allow capturing C2 dominance with a simple ranking of Ident [release transition] over Ident [closure transition], abstracting away from detailed measurements.

2.3. Framework: the Realised Input model

I adopt the Realised Input model of phonological grammar (Flemming Reference Flemming2008b) as a framework for the current proposal. This decision is not based on the superiority of this model over alternatives in treating the syncope problem with subsegmental features. Rather, it is due to the fact that the Realised Input model is specifically designed to allow P-map correspondence constraints to reference phonetic details, thereby connecting the P-map principle with the roles of phonetic details in phonology. Since the current proposal argues for the role of subsegmental details in conjunction with the P-map principle, it would be a good choice to rely on a framework designed to address faithfulness to phonetic details, without further assumptions. Additionally, adopting the Realised Input model provides an ancillary benefit in the context of cluster simplification, because cross-linguistic patterns of cluster reduction processes, including cluster simplification, are determined by phonetic generalisations beyond simple phonological considerations: cluster reduction is not reported in languages where C1 has a phonetically audible release (e.g., Jun Reference Jun2002; Flemming Reference Flemming2008b).Footnote 11 If cluster reduction is at least partially determined by such phonetic factors, the grammar needs to refer to those facts. Thus, introducing subsegments alone may be insufficient for modelling cluster reduction, as it only provides a solution to the syncope problem, which is just one of several issues related to cluster reduction processes. Although this article focusses on solving the syncope problem, it is ideal if the solution to the syncope problem is carried out within a framework that is claimed to provide a solution to other issues concerning cluster reduction processes.Footnote 12

Figure 2 illustrates the basic architecture of a simplified version of the Realised Input model. For the current purposes, the model’s details are simplified by assuming there are only two grammatical components: Phonetic Realisation and Phonotactics. For the full version of the model and a detailed discussion, see Flemming (Reference Flemming2008b) or Kim (Reference Kim2024c: ch. 2).

Figure 2 Simplified version of the Realised Input model of phonology.

In this model, raw inputs with abstract sound strings are first sent to the Phonetic Realisation component, which assigns phonetic substance to a raw input and returns a realised input. This component is called Phonetic Realisation in that it determines language-specific phonetic patterns, resulting in phonetically enriched representations. The Phonotactics component receives the realised input and returns the surface output. This component essentially determines a language’s surface phonological patterns through the interplay between markedness constraints that encode phonotactic requirements and faithfulness constraints that refer to phonetic details. This component is referred to as Phonotactics to differentiate it from Phonetic Realisation, as it regulates surface phonotactic patterns. Although these two components share constraints and their rankings, they differ in that Phonetic Realisation activates only a subset of those constraints, while Phonotactics activates all constraints. Following Kim (Reference Kim2024c), it is assumed here that Phonetic Realisation activates, in particular, two types of constraints: phonetic articulatory constraints that determine gestural coordination or segmental duration, and cue realisation constraints that demand the realisation of target phonetic cues.

To illustrate how this architecture works and how phonology can refer to language-specific phonetic details, consider a hypothetical example where a coronal stop is deleted before another stop (as in /atpa/ → [apppa]). As already mentioned, empirical evidence suggests that cluster reduction processes, including simplification, largely depend on the presence or absence of an audible release of C1. For a language to allow cluster reduction, it should not have a release of C1, possibly due to the fact that such a release strongly signals the presence of C1 (e.g., Jun Reference Jun2002; Flemming Reference Flemming2008b). The tableau in (24) illustrates how such phonetic details are provided to /atpa/ by the Phonetic Realisation component:

When the two consonants are not overlapped, subsegmental transitions could be realised by connecting them to the adjacent consonants even without a host vowel (see fn. 1). However, in the given grammar, such an open transition is not allowed because the Overlap constraint, which requires adjacent consonants in the input to be articulatorily overlapped, outranks the cue realisation constraint for transitions, ruling out candidate (24a) (cf. Gafos Reference Gafos2002). Although the cue realisation constraint requires the realisation of release and closure transitions with target feature values, such as F2 target values, for each consonant in the input, the enforced gestural overlap between two consonants prevents /t/ from realising its release transition and /p/ from realising its closure transition.Footnote 13 Consequently, the grammar selects (24b) as optimal, where the consonants are articulatorily overlapped without the realisation of the transitions between them.

Then, constraints in Phonotactics, including the P-map correspondence constraints and *CC, can access the relevant phonetic detail: whether C1 is released or not. This component is where our main interest lies: all processes that have contrastive and phonological consequences, such as neutralisation, deletion, or insertion, take place as a result of interactions between the relevant markedness constraints and the faithfulness constraints in this component. The tableau in (25) shows how cluster simplification can occur in Phonotactics when C1 is not released. Suppose that Ident [release transition] is universally undominated, in that there is no convincing evidence demonstrating that cluster reduction processes effectively remove buccal C2 or alter its major place of articulation.Footnote 14 This implies that other constraints, including *CC, are generally dominated by Ident [release transition], or at best, tied with it as in (25). Suppose that *CC is sensitive only to clusters without C1 release, reflecting the observation that cluster reduction processes do not affect clusters where C1 is released. Given these assumptions and the fact that Overlap outranks Realise [transition], when *CC triggers cluster reduction by dominating Ident [closure transition], the outcome must be (25d), which removes C1.

In contrast, the Phonotactics component does not yield cluster simplification when the given realised input form has an open transition between the two consonants. Given that a realised input form with a released C1 suggests that the Realise [transition] constraint outranks the Overlap constraint in Phonetic Realisation (cf. (24), where Overlap outranks the cue realisation constraints), Realise [transition] is also ranked over Overlap in Phonotactics. Under this ranking condition, although the relative ranking of other constraints, including *CC, remains unchanged, the result is now different: cluster simplification is blocked by Realise [transition]. Since Realise [transition] requires the realisation of the release and closure transitions of the consonants in the input forms, the grammar opts to maintain the open transition of the realised input form, ruling out candidates that do not meet this requirement.

Although a serial implementation of the grammatical architecture has been shown thus far for illustrative purposes, it is, in principle, possible to perform a parallel implementation without separating the two components of Phonetic Realisation and Phonotactics, as these two components share the same constraint ranking (see also Kim Reference Kim2024a,Reference Kimb,Reference Kimc). The tableau in (27) shows an example of such a parallel implementation.

Constraints in the tableau are categorised into two groups: phonetic realisation constraints (marked with a superscript pr) and general phonological markedness and correspondence constraints. Phonetic realisation constraints, such as Overlap and Realise [transition], were the only activated constraints in the Phonetic Realisation component in the serial implementation (see (24)). It is worth noting that the correspondence constraints here evaluate the relationship between the designated candidate (here (27a), marked with D) and other candidates rather than directly referring to the input (cf. McCarthy Reference McCarthy1999; Jun Reference Jun2002). The designated candidate is chosen only by phonetic realisation constraints from candidates that have not undergone contrastive changes like neutralisation, deletion or insertion (Flemming Reference Flemming2008b; Kim Reference Kim2024a,Reference Kimb,Reference Kimc). In this tableau, (27a) and (27b) are such candidates. Between these two, which differ only in the phonetic release of C1, (27a) is selected as the designated candidate because it satisfies the higher-ranked Overlap constraint among the phonetic realisation constraints. The optimal candidate is then determined by evaluating all constraint violations, with (27d) preserving the release transition specified in (27a).

Within this Realised Input model, whether serial or parallel, the introduced constraint set generates a stable prediction about potential typological patterns involving syncope. When the top-ranked Syncope constraint enforces interconsonantal vowel deletion, only three patterns can emerge from the given constraints and their P-map ranking: [atttpppa] (syncope with released C1), [att˺ppa] (syncope with unreleased C1), and [apppa] (syncope with C1 deletion). The grammar does not predict the potential output with C2 deletion, *[attta], due to the ranking Ident [release transition] ≫ Ident [closure transition]. For instance, when Ident [release transition] is ranked highest with Syncope, and Ident [closure transition] is dominated by the constraints in the top and second tiers, it is possible to posit the factorial typology in (28) based on the relative ranking of the remaining constraints (Max-C is assumed to be ranked lowest, ensuring cluster simplification as a response to the *CC constraint).Footnote 15 The model is implemented in a parallel way, as in (27).

From the typology presented above, it is evident that cluster simplification is possible only when both Overlap and *CC dominate Realise [transition]. The ranking of Overlap over Realise [transition] means that a consonant cluster could be realised without C1 release if there were no cluster simplification, aligning with the observation that cluster reduction processes only affect clusters without C1 release. In other words, the typology predicts that cluster simplification, even in the syncope context, can only take place in a grammar where it could result in a cluster without C1 release if there were no cluster simplification (i.e., only in the grammar where OverlapRealise [transition]). In contrast, a grammar that would result in a cluster with C1 release never undergoes cluster simplification, even with a higher ranked *CC, as in (28e). When Realise [transition] outranks Overlap, the expected outcome is a cluster with C1 release, even after the application of syncope.

Given that our primary focus is on the interaction between cluster simplification and syncope, subsequent sections will operate under the assumption that in the full grammar *CC is undominated and Overlap outranks Realise [transition], invariably resulting in cluster simplification, as in (28f). Under this assumption, I will not present constraints that are not central to the discussion, such as Overlap and Realise [transition], and will instead focus on the P-map ranking of the Ident [transition] constraints and their interaction with phonotactic markedness constraints.

In summary, the Realised Input model provides a formal apparatus for P-map constraints that require reference to substantial phonetic details. Regardless of whether the model is implemented serially or in parallel, it allows phonological patterns to be determined based on phonetic details such as the realisation of C1 release or formant transitions. In this respect, the Realised Input model is beneficial in explaining the syncope problem within a broader context involving phonetic details such as C1 release. It can be used not only to solve the syncope problem but also to provide a grammatical account of the phonetic generalisation underlying cluster reduction processes.

3. Analysis

This section illustrates how the current proposal can successfully be applied to cluster simplification processes in various contexts. In §3.1, I begin with analyses of typical medial cluster simplification cases from Basque and Tunica, with the latter showing an interaction between syncope and cluster simplification. Then, §3.2 demonstrates that the proposal consistently accounts for triconsonantal cluster simplification, using Hungarian and Tangale as primary examples. Finally, §3.3 analyses final cluster simplification in Lardil, which is fed by apocope.

3.1. Medial cluster simplification

As a motivation for cluster simplification, I reformulate the *CC constraint to penalise only sequences of supralaryngeal heterorganic consonants, excluding a cluster with a glottal consonant.

Languages may remove one consonant in the target cluster to satisfy this constraint unless a potential violation of Max-C is fatal. However, this markedness constraint can also be satisfied by a geminate or a homorganic cluster, which preserves the total duration of the consonants without a violation of Max-C. Therefore, it is necessary to consider factors that favour deletion over forming a geminate or homorganic cluster. The constraint in (30) penalises lengthened gestures formed by geminates and homorganic clusters:

Ranking this constraint over Max-C enables cluster simplification instead of place assimilation. For example, consider the Basque cluster simplification in (2a), repeated in (31):

In this language, neither heterorganic clusters nor geminates are preferred over consonant deletion, suggesting that both *CC and *LongC outrank Max-C.Footnote 16 The tableau in (32) illustrates this:

As presented in the tableau, consonant deletion results in changes in the relevant transitions.Footnote 17 Deleting [t] in the realised input can change the closure transition, as in candidate (32e), while removing [k] can alter the following release transition as in (32d). The perceptual significance of these changes determines which consonant is targeted for deletion. The ranking of Ident [release transition] over Ident [closure transition] reflects the perceptual difference between release and closure transition, favouring (32e) over (32d).

Note that potential geminate formation can also result in changes in transitions. This suggests that the directional asymmetry in place assimilation can also be accounted for by the ranking of the same Ident constraints. This possibility will be discussed in detail in §4. In the present case, the C2 dominance effect is accompanied by cluster simplification. Candidate (32c), which preserves consonant duration, is ruled out by *LongC, although both (32c) and (32e) satisfy Ident [release transition].

An example of cluster simplification fed by syncope can be found in Tunica. In this language, word-final high vowels are syncopated when they are followed by another word with initial-syllable stress, creating a potential VʰkCV cluster, which is illicit in the language. Thus, to avoid this illegal cluster, a further application of the simplification process is enforced, removing the velar stop. If the second consonant is another stop, only [k] is deleted, leaving its pre-aspiration as in (33a). If the second consonant is a sonorant, then both [k] and the pre-aspiration are removed as in (33b).Footnote 18

The key to capturing the C2 dominance effect in the syncope context is identifying the relevant correspondence relationships. Since syncope removes the interconsonantal high vowel, its associated transitions are also absent from the surface. The Ident [transition] constraints only evaluate changes in transitions with correspondences between the relevant representations, meaning that the removed transitions do not affect the evaluations given by these correspondence constraints. In other words, the Ident [transition] constraints do not penalise the loss of the transitions in the syncopated vowels, although they may be penalised by other constraints such as Max [transition] or Realise [transition].

In (34), the vowel deletion process is controlled by the top-ranked Syncope constraint, which eliminates candidate (34a), and *CC disallows an intervocalic heterorganic cluster as in (34b). The remaining grammar determines which consonants are affected and how the cluster is repaired. The ranking of Ident [release transition] over Ident [closure transition] identifies C1 as the target based on changes in transitions present in both the realised input and the provided surface candidates. The ranking of *LongC over Max-C favours cluster simplification over place assimilation as a repair strategy.

3.2. Simplification of triconsonantal clusters

The P-map principle predicts that, in VC1C2C3V clusters, the second consonant is the target for deletion because it is less distinct from $\varnothing $ than the first and the third consonants are. Experimental literature supports this prediction (Haggard Reference Haggard1973; Klatt Reference Klatt1973), as discussed by Steriade (Reference Steriade, Hanson and Inkelas2009: 169). When VC1C2C3V clusters are shortened, they are confused with VC1ːC3V and, in turn, with further simplified VC1C3V. This prediction is derivable from the current implementation of the P-map theory. To illustrate this more clearly, consider optional interconsonantal stop deletion in Hungarian, in which a stop surrounded by two non-approximant consonants may undergo optional deletion, reducing triconsonantal clusters to biconsonantal clusters (Côté Reference Côté2000; Siptár & Törkenczy Reference Siptár and Törkenczy2000).

To analyse this triconsonantal cluster simplification, following Steriade (Reference Steriade, Hanson and Inkelas2009: 170), I assume that the process is motivated by the size-of-cluster constraint C//V, defined in (36):

With this markedness constraint, the default ranking of Ident [transition] constraints predicts C2 to be the target consonant, since its deletion does not yield a change in transitional properties. In other words, C2 deletion in the triconsonantal cluster does not violate any Ident [transition] constraints because this consonant lacks transitions into or out of it.Footnote 19

Here, the Ident [transition] constraints are specified for F2 and nasality, meaning that the perceptual difference between transitions is evaluated mainly based on factors such as nasality and second formant frequencies. The emphasis on nasality is due to the homorganic nature of the first two consonants in the cluster, which have similar formant values, with the main difference being their potential to nasalise adjacent transitions. As can be seen in the tableau, the optimal output is (37d), because deleting C2 does not result in any changes to the F2 or nasality of the transitions.

The current proposal also captures the interaction between syncope and simplification of triconsonantal clusters. Consider cluster simplification in Tangale, a West Chadic language spoken in Nigeria. In Tangale, a stem-final vowel is removed when it is followed by an inflectional suffix or an NP complement. As illustrated in (38), when this syncope process results in a triconsonantal cluster whose first two members are homorganic, cluster simplification occurs as a repair strategy.

The analysis of this process is illustrated in the tableau in (39).

In the analysis, the top-ranked Syncope constraint removes the stem-final interconsonantal vowel, eliminating candidate (39a) and the C//V constraint disallows the resulting triconsonantal cluster in (39b). The consonant to be deleted is determined by the hierarchy of the Ident [transition] constraints. C3 cannot be removed, as in (39c), as it causes a change in F2 values, violating Ident [release transition: F2 & nasal]. Similarly, removing C1 in (39d) yields a change in the nasality of the closure transition, violating Ident [closure transition: F2 & nasal]. Since removing the second consonant does not violate any Ident [transition] constraints, candidate (39e) is selected as the optimal output.

Note that the contextual faithfulness analysis fails to predict the Tangale pattern. Although a ranking of the relevant P-map constraints for segmental contexts – such as Max-C/V__VMax-C/C__VMax-C/V__CMax-C/C__C – can generally capture the simplification process targeting triconsonantal clusters, these constraints lose their analytical power when the target cluster is fed by syncope, as in Tangale. The tableaux in (40) show that both input- and output-oriented contextual faithfulness constraints fail to predict the correct optimal output.

3.3. Apocope and final cluster simplification

A final cluster can also be targeted by simplification processes, as seen in Lardil. In this language, a non-apical second consonant of a final cluster cannot surface due to restrictions on complex final clusters. This final cluster simplification can be fed by apocope, which targets nominals with more than two morae. The examples in (41) demonstrate that final vowels and C2s in clusters, which are present in future and non-future forms, are absent in uninflected forms. In these uninflected forms, the vowels and C2s are not followed by a suffix and are therefore subject to apocope and final cluster simplification.

Because final cluster simplification in Lardil involves apocope, conventional contextual faithfulness constraints face again the same problem as in syncope contexts. If the relevant faithfulness constraints are input-oriented, the grammar predicts that C1 will be the target consonant rather than C2, since C2 can be protected by higher-ranked Max-C/__V input as the final vowel is present in the input. This is shown in (42):

The output-oriented alternative also cannot rescue the contextual faithfulness constraints. The tableau in (43) shows that output-oriented faithfulness constraints cannot identify the target consonant, since whichever of C1 and C2 is retained in the surface form will be postvocalic and word-final, as in candidates (43c) and (43d).

In contrast to the contextual faithfulness account, the interaction with apocope is not problematic in the current approach, as illustrated in (44):

As shown in (44), since the release transition hosted by the final vowel in the realised input form is removed by apocope, there is no surface correspondent of the release transition out of the derived final consonant. Therefore, the removal of the final consonant in candidate (44c) does not critically violate any Ident [transition] constraints. In contrast, a potential deletion of the first consonant of the cluster, as in candidate (44d), is correctly ruled out because it fatally violates Ident [closure transition].

It is worth noting that what happens in Lardil is, in fact, more complicated than what has been analysed above. In Lardil, labial and velar consonants are deleted in final position even when they are not part of final clusters (e.g., /wuŋkunuŋ/ → [wuŋkunu] ‘queenfish’; Staroverov Reference Staroverov, Assmann, Bank, Georgi, Klein, Weisser and Zimmermann2014: 431).Footnote 20 This invokes a complex opacity problem called ‘fed counterfeeding’ by Kavitskaya & Staroverov (Reference Kavitskaya and Staroverov2010), in which apocope and final deletion have both feeding and counterfeeding relationships. As a result of this opaque interaction, apocope underapplies when it and final consonant deletion have already applied, as in (45).

The mapping from /dibiɾdibi/ to [dibiɾdi] is opaque because both the input and output forms end in a vowel, demonstrating the underapplication of the apocope process. In other words, although the final vowel in [dibiɾdi] meets the environmental condition for apocope, it remains unaffected, resulting in opacity. This is because apocope has already occurred before this final vowel emerges in the derivation. While such an opaque interaction is challenging to analyse within classic OT, the current approach, in which correspondence constraints can reference phonetically detailed realised inputs, handles it effectively. In recent studies (Kim Reference Kim2024a,Reference Kimb,Reference Kimc), I have demonstrated that faithfulness to phonetically enriched representations provides a comprehensive solution to opacity. For instance, in the Lardil case, it can be assumed that phonetically longer vowels are more strongly protected by the relevant faithfulness conditions than phonetically shortened vowels, which may instead undergo complete deletion. Indeed, cross-linguistically, vowels in final positions frequently have shortened durations due to factors such as devoicing (e.g., Myers & Hansen Reference Myers and Hansen2007). If the Phonetic Realisation component derives this vowel shortening, the resulting shortened vowels can be clearly distinguished from regular vowels that fully maintain their duration in realised input forms. The tableau in (46) illustrates how these shortened vowels are derived within the Phonetic Realisation component:

Suppose that the shortened vowel has a duration between zero and that of the full vowel [i] (e.g., [i] = 80ms and [i̯] = 40ms). This phonetically shortened final vowel is subsequently eliminated on the surface by the *V̯ constraint, which penalises overly shortened vowels. Ident [V: duration] > 30ms, which penalises changes in length of the degree of the difference between [i̯] and $\emptyset $ or [i̯] and [i] (40ms), is ranked lower than Ident [V: duration] > 60ms, which penalises changes of a greater degree, such as a change from [i] to $\emptyset $ (80ms), according to the P-map hierarchy. Consequently, in Phonotactics, short vowels (i.e., underlyingly word-final vowels) are deleted, whereas vowels of normal length (including vowels whose word-final position is derived) are preserved. The tableau in (47) illustrates how this surface opacity is derived. Note that a transparent output like candidate (47e) could be optimal if Final V Shortening were ranked above all Ident [V: duration] constraints.

4. Extension

This section extends the current proposal to major place assimilation and debuccalisation. Like cluster simplification, these processes usually target medial clusters and display the C2 dominance effect (Jun Reference Jun2002, Reference Jun, van Oostendorp, Ewen, Hume and Rice2011; McCarthy Reference McCarthy2008, Reference McCarthy2011). It is argued here that place assimilation and debuccalisation result in changes in transitions and thus can also be regulated by Ident [transition] constraints. This allows us to analyse the patterns of these processes in the same way as cluster simplification, regardless of whether syncope is involved or not.

4.1. Place assimilation

Cross-linguistically, place assimilation in medial clusters is typically regressive, targeting the first member of the cluster rather than the second (Webb Reference Webb1982; Mohanan Reference Mohanan and Goldsmith1993; Jun Reference Jun1995, Reference Jun, Hayes, Kirchner and Steriade2004). This directional asymmetry in place assimilation is illustrated in (48) with Korean examples, where non-velar nasals and stops assimilate in place to the following stops.

Unlike cluster simplification, the emergence of place assimilation as a repair strategy for *CC can be attributed to the ranking of Max-C over *LongC. To satisfy the higher-ranked Max-C constraint, this language prefers forming a geminate consonant or homorganic sequence over deleting a consonant, as shown in (49):

In (49), place assimilation is triggered by the high-ranked *CC constraint, which rules out the faithful candidate (49a), similarly to typical cluster simplification cases. The C2 dominance effect is also captured by the same Ident constraints, and their ranking, that were implemented for the analyses of cluster simplification (cf. Flemming Reference Flemming2008a, who employs similar correspondence constraints to explain the manner asymmetry in place assimilation). C1 is the target of place assimilation, since changing its place incurs only a non-fatal violation of Ident [closure transition], as in the winning candidate (49c), while changing the place of C2, as in (49b), is penalised by the higher-ranked Ident [release transition] constraint.

Place assimilation fed by syncope can also be captured by the same analysis. Keley-i (Austronesian, spoken in the Philippines) presents such a case. In this language, when an infix with a nasal attaches to referent-focus past roots with CeCeC or CeCaC forms, the first root vowel is removed, and the infix-final nasal assimilates in place to the following consonant.

The same set of Ident constraints accounts for the C2 dominance effect in both cluster simplification and place assimilation, so the regressive direction of place assimilation conditioned by syncope should also be explainable by these constraints. This is depicted in the tableau in (51). The ranking of the Ident [transition] constraints determines the direction of the process, and Max-C outranking *LongC prevents deletion from being a potential repair for heterorganic clusters.

4.2. Debuccalisation

In some languages, cluster reduction is achieved by debuccalising the first member of the cluster, showing the C2 dominance effect (McCarthy Reference McCarthy2008). For instance, when a medial cluster undergoes debuccalisation, it always results in the form [VʔCV], not [VCʔV], as observed in languages such as Arbore (Hayward Reference Hayward1984), Cariban languages (Gildea Reference Gildea1995), Malay, Thai (Lodge Reference Lodge1992) and Toba Batak (Hayes Reference Hayes1986). This subsection demonstrates that debuccalisation affects adjacent transitions and can be regulated by the Ident [transition] constraints, resulting in the C2 dominance effect. This effect is consistent regardless of the presence of syncope.

Before performing a specific analysis, let us consider the acoustic characteristics that make glottal consonants distinct from buccal ones. As is well-known, glottal stops are usually not realised with complete closure in most languages (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996: 74–76). These articulatory properties of glottal stops also influence their acoustic realisation. Although they inherently impose characteristics such as creaky voice or stiff phonation, in intervocalic positions they are likely to display formant structures mediating between surrounding vowels. Similarly, in consonant clusters like [VʔCV], glottal stops exhibit evident formant transitions reflecting the characteristics of the adjacent supralaryngeal consonant (cf. Foulkes & Docherty Reference Foulkes, Docherty, Ohala, Hasegawa, Ohala, Granville and Baily1999). Glottal fricatives also show similar transitional characteristics. The vocal tract shape of the glottal fricative is usually similar to its neighbouring vowels, and its transitional characteristics reflect its surrounding environment (Keating Reference Keating1988). Although [h] is voiceless, major formant landmarks are apparent during its production, suggesting that glottal fricatives in clusters can also mediate transitions between vowels and adjacent buccal consonants. These acoustic facts demonstrate that debuccalising a consonant can affect the properties of the adjacent transition by projecting the adjacent consonant’s characteristics to a vowel. For instance, when [attppa] undergoes C1 debuccalisation, the properties of C2 [p] can be projected across the resulting glottal consonant, as in [apʔppa] or [aphppa].

These acoustic properties of glottal consonants suggest that debuccalisation can also be regulated by Ident [transition] constraints. To illustrate, consider the debuccalisation process in Toba Batak, exemplified in (52). In this case, a word-final stop undergoes debuccalisation when it is followed by another consonant.

To analyse this pattern, I introduce a constraint that penalises voiceless transitions. This constraint reflects the need to ensure that the transitions are as amplified as possible, keeping the higher perceptual resolution. To put it more simply, this constraint prohibits consonantal properties from being projected onto transitions in a weakened manner as they pass through the glottal consonant.

With this constraint, debuccalisation in Toba Batak can be analysed as in (54). In this tableau, the ranking of *LongC over *VlsTrans determines that reduction is achieved through debuccalisation rather than place assimilation. Then, the ranking of Ident [release transition] over Ident [closure transition] captures the C2 dominance effect.

Syncope does not pose a problem for the analysis of debuccalisation. Consider the Panare examples in (55), where syncope feeds debuccalisation. In this language, when the verb -utu- ‘give’ is followed by a CV-initial suffix, the stem’s final high vowel is deleted, feeding debuccalisation of the preceding stop.

The tableau in (56) analyses this pattern. First, candidate (56a), which preserves the realised input structure, violates the top-ranked Syncope. Since this syncope removes all correspondence relationships the deleted vowel could host, further deletion of a consonant does not affect the evaluation provided by Ident [release transition] or Ident [closure transition]. Then, C1 is determined as the target by the violations related to the remaining surface vowels. C1 undergoes debuccalisation, not place assimilation, since *LongC outranks *VlsTrans.Footnote 21

5. Exceptions to the C2 dominance effect

While the C2 dominance effect is observed in most languages exhibiting cluster simplification, place assimilation or debuccalisation, it is not invariably absolute across all languages. Two primary exceptions are attested cross-linguistically: cases in which a glottal C2 undergoes deletion or assimilation, and instances where the C2 dominance effect is overridden by morphological asymmetry.

Consider the C2 glottal deletion first. The C2 dominance effect fails to manifest when C2 is glottal, as observed in Afar (Bliese Reference Bliese1981), Arbore (Hayward Reference Hayward1984), Chitimacha (Swadesh Reference Swadesh and Hoijer1946) and Tonkawa (Hoijer Reference Hoijer and Hoijer1946), among others. For example, in Arbore, underlying /h/ following another consonant undergoes total assimilation to the preceding consonant, as in (57).

This assimilatory process targets glottal fricatives, not buccal consonants, possibly to preserve subsegmental transitions. As hypothesised in §4.2, glottal consonants lack inherent transitions adjacent to vowels; such transitions exist only when an influential buccal consonant is nearby. Thus, removing the buccal consonant from a buccal–glottal sequence eliminates the transitions. Constraints protecting subsegments, such as Max [transition], can prevent this by requiring the preservation of realised input transitions. In the Arbore case, Max [transition] compels the preservation of the buccal consonant and the change of the glottal consonant when *Voiceless Transition prohibits buccal–glottal sequences.Footnote 22

This formalisation yields an intriguing prediction: when a cluster containing a glottal consonant undergoes change, the glottal consonant, rather than the buccal consonant, is invariably targeted (see also McCarthy Reference McCarthy2007, who makes the same prediction). This is supported by the fact that most, if not all, non-morphological exceptions to the C2 dominance effect involve the deletion or modification of glottal consonants. Additionally, to the best of my knowledge, there are no known cases where a buccal consonant invariably undergoes reduction before a glottal consonant.

In summary, the current approach differentiates between the reduction of supralaryngeal consonant sequences and the modification of sequences containing glottal consonants. General reduction processes, including cluster simplification, are motivated by *CC to avoid heterorganic buccal consonant sequences, and consistently target C1 by the ranking of the P-map constraints. In contrast, the modification of sequences containing glottals is triggered by constraints like *Voiceless Transition , with Max [transition] ensuring that the glottal consonant is targeted regardless of its position.

The second type of exception to the C2 dominance effect comprises cases in which the positional asymmetry is overturned by a morphological asymmetry (e.g., Jun Reference Jun1995; McCarthy Reference McCarthy2007; Lamont Reference Lamont2015). For instance, when C1 belongs to a root and C2 is at the beginning of a suffix, the affixal C2 may undergo deletion or assimilation instead of the root C1. Such a morphological dominance effect is found in many languages, including Dutch (McCarthy Reference McCarthy2007), Nankina, Bari dialects, Nungon and Ma Manda (see Lamont Reference Lamont2015 and the references therein). An example of the morphological dominance effect is provided in (59) with place assimilation in the Kukú dialect of Bari, where the initial palatal consonant of qualitative suffixes assimilates to the place of the preceding root-final consonant.

This morphological dominance effect can be captured by introducing P-map faithfulness constraints sensitive to morphology: Ident root [release transition] ≫ Ident root [closure transition] ≫ Ident affix [release transition]. This ranking prioritises the root’s closure transition over the affix’s release transition.Footnote 23

In the tableau in (60), the optimal output is (60d), where C2 assimilates its place to C1. This is because the root closure transition is prioritised over the affixal release transition by the ranking of the morphologically indexed P-map constraints. Candidate (60e), which modifies the root transition by assimilating C1 to C2, cannot win because it violates Ident root [closure transition].

6. Alternatives

This section compares the current proposal with two alternative approaches to the syncope problem, namely HS (McCarthy Reference McCarthy2010, Reference McCarthy, McCarthy and Pater2016) and Contiguity constraints (Kenstowicz Reference Kenstowicz1994; McCarthy & Prince Reference McCarthy and Prince1995).Footnote 24 It provides a concise overview of how these alternative approaches treat the syncope problem and examines potential issues that have been or could be raised regarding these alternatives in cluster reduction contexts. It is important to emphasise that this discussion is not intended to invalidate alternative approaches in a general sense. Rather, the aim is to argue that a broader and more coherent explanation for cluster reduction processes can be given by the current approach, particularly in light of the P-map principle, by addressing aspects that may not have been fully or explicitly considered in these alternative approaches.

6.1. Harmonic Serialism

McCarthy (Reference McCarthy2008, Reference McCarthy2009, Reference McCarthy2011) argues that C2 dominance effects in cluster reduction processes and the associated syncope problem can be accounted for within the framework of HS. As outlined in McCarthy (Reference McCarthy2011), P-map correspondence constraints in the form of contextual faithfulness constraints can be implemented in HS to derive the expected results correctly. The tableaux in (61) illustrate how McCarthy’s suggestion addresses the syncope problem.Footnote 25

In the HS grammar illustrated in (61), Syncope must outrank *CC to allow syncope to create a cluster targeted by *CC. At Step 1, the grammar selects [atpa] as the output, which serves as the input for Step 2. The optimal output at Step 2 is a candidate with C1 deletion, that is, [apa], by the ranking condition Max-C/_VMax-C/_C. Crucially, syncope applies at Step 1, and because the Syncope constraint is ranked highest, it can condition the environment for further cluster reduction processes.

Although implementing P-map constraints as contextual faithfulness constraints in HS seems to work for the core patterns, there are situations where HS appears to be incompatible with the P-map principle. One such case arises when dealing with instances of irreducible parallelism, a phenomenon known to be challenging for HS (Flemming Reference Flemming2013; Adler & Zymet Reference Adler and Zymet2021). An illustrative example of this problem is root fusion in Sino-Japanese vocabulary (Itô Reference Itô1986; Itô & Mester Reference Itô, Mester, Otake and Cutler1996; Kurisu Reference Kurisu2000; Adler & Zymet Reference Adler and Zymet2021, among others). In Japanese, a sequence of one- or two-syllable roots fuses, deleting the first root’s final vowel and resulting in a voiceless geminate with place assimilation. However, syncope is blocked if it conditions place assimilation that yields a voiced geminate. For example, /betu-kaku/ ‘different style’ undergoes both syncope and place assimilation, resulting in [bekkaku], while /betu-bin/ ‘separate carrier’ does not undergo syncope, as it would create an illegal voiced geminate. Thus, it is realised as [betsubin] instead of *[bebbin] (examples from Itô & Mester Reference Itô, Mester, Otake and Cutler1996: 15). This type of interaction is recognised as being problematic for HS because of the restriction that only a single operation can be applied at each derivational step.Footnote 26

The issue can be illustrated as follows. As demonstrated in the tableaux in (62), Syncope must be ranked high enough to create the environment for place assimilation between voiceless obstruents.

However, the same grammar fails to capture the case where syncope is blocked to avoid creating a voiced geminate. Since Syncope is ranked high and Gen in HS allows only one derivational change per step, the grammar returns a candidate with vowel deletion at Step 1, failing to block syncope. The constraint that penalises voiced geminates, *VdGem, cannot play any role in the evaluation.

In principle, this issue could be circumvented by allowing /u/ before a voiceless obstruent and that before a voiced obstruent to interact differently with syncope through constraint rankings rather than derivational steps. For instance, an undominated contextual faithfulness constraint like Max-V/__C [+voi] could block the vowel deletion process before a voiced obstruent, resulting in (63a) rather than (63b). However, while such an analysis is technically possible, it remains unclear how such a constraint and its ranking would be motivated under the P-map principle: why, for instance, is Max-V/__C[−voi] outranked by Max-V/__C [+voi], rather than vice versa? As long as the P-map principle is assumed to be the key element for analysing the syncope problem, introducing such a constraint becomes problematic. Given that this constraint refers to perceptually arbitrary contexts, its justification under the P-map principle is difficult.

Contrary to HS, such a situation is not a problem for the current model, where syncope and place assimilation can be applied simultaneously, as shown in (64):

A similar situation where the explanatory powers of HS and the P-map principle are difficult to reconcile can be found in opaque contexts. Recall from §3.3 that in Lardil, /dibiɾdibi/ is realised as [dibiɾdi], with underapplication of apocope. In HS, the apocope constraint must be top-ranked to induce final vowel deletion in the underlying form at the first step, but this hierarchy forces the deletion of [i] in [dibiɾdi] as well. Although introducing a hierarchy of contextual faithfulness constraints (e.g., Max-V/_CApocopeMax-V) may avoid this issue in HS (cf. Hauser & Hughto Reference Hauser and Hughto2020), such an approach encounters a similar problem to the one identified for the irreducible parallelism case: it is unclear why this preconsonantal-specific faithfulness constraint should dominate other constraints. Justifying this ranking proves difficult. In particular, granting legitimate status to P-map faithfulness constraints as an HS solution to the syncope problem, as illustrated in (61) above, directly conflicts with attempts to incorporate contextual faithfulness constraints into HS to resolve the opacity problem. This conflict arises because it is difficult to establish perceptual justification for assigning greater protection to underlyingly preconsonantal vowels than to final vowels.

In summary, the aforementioned issues show that, in certain contexts, the compatibility between the HS model and the P-map principle is questionable. It is worth noting, however, that this discussion does not preclude the possibility of reconciling P-map faithfulness constraints with HS in contexts of irreducible parallelism or opacity through alternative means. The focus of the argument is not to demonstrate an inherent structural flaw in HS, but rather to highlight that the current approach, centred on a set of proposed P-map constraints, offers greater flexibility in analysing cluster reduction processes across diverse environments, without introducing additional assumptions or mechanisms to deal with irreducible parallelism or opacity.

6.2. Contiguity

As discussed throughout this article, subsegments are crucial in analysing the C2 dominance effect in the syncope context. However, one may ask whether the proposal can be more formally implemented without referring directly to transitions, using a P-map version of Contiguity constraints. This is a valid question, and at least for cluster simplification, it is possible to account for the C2 dominance effect by ranking Contiguity (CV), which specifically preserves a CV sequence, above Contiguity (VC), which protects a VC sequence, as shown in (65) (cf. Kenstowicz Reference Kenstowicz1994; McCarthy & Prince Reference McCarthy and Prince1995: 123; Lamontagne Reference Lamontagne1997). Suppose that these constraints assign a violation mark specifically when an input consonant originally adjacent to a vowel surfaces adjacent to a different consonant or vowel, thus losing its original adjacency relationship.

However, this analysis has a critical problem. With this type of Contiguity constraints, cluster simplification is enforced in the syncope context regardless of the relative ranking of *CC. This predicts unconditional cluster simplification whenever syncope occurs because, for an input form like /atipu/, when /i/ is deleted, a surface output like [atpu] will inevitably violate both Contiguity (CV) and Contiguity (VC). First, when the CV sequence /ti/ loses its vowel, [t] becomes adjacent to [p], violating Contiguity (CV). Similarly, the loss of the vowel in the VC sequence /ip/ leads to a violation of Contiguity (VC), as [p] becomes adjacent to [t]. The tableau in (66) illustrates this:

This result is undesirable because not all languages with a syncopated interconsonantal vowel undergo cluster simplification. Using Contiguity constraints sacrifices explanatory power for cases of syncope without simplification, in return for resolving the syncope problem.

To address this issue while retaining the Contiguity constraints, one might consider a scenario where Max-C is also ranked high, thereby excluding candidates with cluster simplification such as (66c) or (66d). However, this approach proves ineffective, as the top-ranked Max-C constraint actually results in place assimilation rather than preserving a cluster without reduction, as demonstrated in (67).

Again, in response to this, one could propose introducing a new faithfulness constraint, such as Ident [place], to make (67b) optimal. However, the introduction of Ident [place] engenders additional complexities. For instance, the Ident [place] constraint fails to capture the C2 dominance effect in place assimilation: it cannot differentiate between candidates (67e) and (67f). Attempts to resolve this issue by refining Ident [place] through stratified contextual faithfulness constraints (e.g., Ident [place]/_V ≫ Ident [place]/_C) would reintroduce the original problem, in which these contextual faithfulness constraints function inadequately in syncope contexts.

7 Conclusion

In this article, it has been argued that the syncope problem in cluster simplification can be resolved by constraints that refer to the transitions into and out of the vowels adjacent to the target cluster. Cluster simplification results in changes in the adjacent transitions; therefore, Ident [transition] constraints that evaluate such changes can also regulate the process and its patterns. Since syncope removes not only the targeted vowel itself but also the transitions contained in that vowel, Ident [transition] constraints cannot enforce faithfulness to those transitions, as they have no output correspondents. This explains why the C2 dominance effect can be consistent even in the syncope context. It has also been shown that the same approach accounts for the syncope problem not only in cluster simplification, but also in major place assimilation and debuccalisation, demonstrating the extensibility of the current proposal.

Acknowledgements

I would like to express my sincere gratitude to the associate editor and three anonymous reviewers for their constructive and insightful comments. Special thanks go to Edward Flemming, Donca Steriade and Jongho Jun for generously sharing their valuable insights on this research. Any remaining errors are, of course, my own responsibility.

Competing interests

The author declares no competing interests.

Footnotes

1 As an anonymous reviewer noted, no current subsegmental theories propose that subsegments can exist independently of segments; thus, discussing this assumption in detail may be unnecessary. However, certain scenarios remain where this assumption becomes relevant. For instance, release transitions could be maintained within an open transition or a significantly reduced vowel rather than a full vowel. In such cases, if subsegments cannot exist independently, the only remaining possibility is that the transition loses its connection to the original host vowel and attaches to an adjacent consonant to survive, thereby ruling out the possibility that subsegmental transitions stand alone without a host segment.

2 In Figure 1, the transitional subsegments belong only to the vowels, not to the consonants.

3 Other auditory qualities in transitions can also be affected, depending on the consonants involved in simplification. For example, when [anpu] is neutralised to [apu], not only the F2 values but also the nasality of the transition is affected (e.g., Hawkins & Stevens Reference Hawkins and Stevens1985; see also Stanton Reference Stanton2017 and the references therein). Similarly, changes in laryngeal status (F1 with F0 and VOT; e.g., Lisker Reference Lisker1975; Abramson & Lisker Reference Abramson, Lisker and Fromkin1985) or changes related to the distinction between approximants and other consonants (continuity and gradualness of the transition; e.g., Liberman et al. Reference Liberman, Delattre, Gerstman and Cooper1956; see also Kawahara & Pangilinan Reference Kawahara, Pangilinan and Kubozono2017) may be observed.

4 The correspondence between input and output subsegments is based on the principle of one-to-one correspondence: each input subsegment corresponds to a single output subsegment, and vice versa. Therefore, one input subsegment cannot correspond to multiple output subsegments, nor can one output subsegment correspond to multiple input subsegments. Additionally, it is assumed that correspondence occurs between subsegments of the same type. Consequently, an input release transition corresponds only to an output release transition, and an input closure transition corresponds only to an output closure transition.

5 In standard Q-theory, each segment consistently has three subsegments. For the sake of concise description, I have focussed on representing only the transitional subsegments. It is also worth noting here that Q-Theory does not necessarily assume binary features, as an anonymous reviewer points out. Indeed, in Q-Theory, subsegments are assumed to be associated with articulatory phases, such as vowel targets, release transitions and closure transitions (Shih & Inkelas Reference Shih and Inkelas2019). Thus, it should be acknowledged that the binary feature analysis couched within the framework of Q-Theory is not the only option within Q-Theory. More phonetically detailed features, such as those proposed in this article, would also be compatible with the theory.

6 Although the inheritance of features through surface correspondence is assumed here to demonstrate that the current proposal can be implemented within pre-existing subsegmental theories, such as Q-Theory or Aperture Theory (Steriade Reference Steriade, Huffman and Krakow1993), this article does not necessarily assume that subsegmental transitions obtain features in this way. Instead, the manifestation of consonantal properties in transitions is captured through a cue realisation constraint, rather than relying on surface correspondences. See §2.3 for details.

7 Dilley & Pitt (Reference Dilley and Pitt2007) demonstrate that when a heterorganic cluster is reduced in English, the result is phonetically varied rather than consistently phonological. The /np/ sequence in pinpoint /pɪnpɔɪnt/ may maintain the alveolar gesture ([pɪnpɔɪnt]), undergo place assimilation ([pɪmpɔɪnt]), or be fully simplified ([pɪpɔɪnt]). Notably, the F2 value of the closure transition from [ɪ] varies gradually for each outcome. In [pɪnpɔɪnt], it is around the midpoint between coronal and labial transitions. In [pɪmpɔɪnt] or [pɪpɔɪnt], it is higher than a typical labial transition, suggesting remnant alveolar gesture. This unclear boundary between phonology and phonetics can be uniformly dealt with if the two share continuous features. For instance, it is possible to imagine a version of Harmonic Grammar with weighted phonetic realisation constraints that evaluate such continuous values (e.g., Flemming Reference Flemming2001). In such an approach, cluster reduction could be triggered by a constraint (F2C1−F2C2)2 requiring F2 assimilation in C1 closure and C2 release transitions, countered by other constraints (F2C1−Target F2C1)2 and (F2C2−Target F2C2)2 that demand preserving target F2 values. The result depends on their relative weighting. For instance, even if (F2C2−Target F2C2)2 always outweighs (F2C1−Target F2C1)2 according to the P-map, a higher-than-expected weight for the latter would yield more coronal F2 transition maintenance and a higher F2 value compared to a typical labial transition. While this article does not develop such a sophisticated model, it is advisable to keep open the possibility of implementing the current proposal within such a more refined framework in future research.

8 Other potential transitional properties, such as nasality or continuity of the transition, can be integrated into this formalisation with appropriate constants that mediate the differences in the measurement scales. For instance, nasality of the transition might be quantitatively translated in terms of the nasal proportion within a given interval (cf. Stanton Reference Stanton2017: 79 ff.), or changes in the formant properties by the nasality of the adjacent consonant (Wright Reference Wright, Ohala and Jaeger1986; Flemming Reference Flemming2008a: 14). This means that a squared difference in the nasalised transitions, multiplied by an appropriate constant, can be added to the metric given in (19). These properties, particularly nasality, will be mentioned when necessary.

9 Although this progressive apical assimilation can, in principle, be explained by introducing very specific contextual faithfulness constraints targeting only clusters involving a retroflex, such as Ident (anterior) /V[__, apical, stop]C ≫ Ident (anterior) /C[__, apical, stop]V (e.g., Steriade Reference Steriade, Hume and Johnson2001: 240), it remains difficult to directly compare these P-map constraints with others referring to different contexts.

10 In most situations, it can be safely assumed that release transitions generally dominate closure transitions. Since this article does not directly address apical assimilation and our primary concern is the C2 dominance effect in cluster reduction processes, the condition in (22) will adequately serve our purposes without causing any particular issues.

11 For example, in languages where C1 stops are canonically released, such as Arabic, Chontal, Hindi, Kutenai, Motilone, Russian, Tillamook, Upper Chehalis, Wikchamni and Zoque, place assimilation or consonant deletion is reportedly unattested. In contrast, in languages where C1 stops are not audibly released, such as Basque, Catalan, German, Diola-Fogny, English, Korean, Yakut and West Greenlandic, place assimilation or cluster simplification is widely attested (Jun Reference Jun2002: 5 and the references therein).

12 Adopting the Realised Input model also facilitates the analysis of opaque interactions (Kim Reference Kim2024a,Reference Kimb,Reference Kimc). In §3.3, it will be shown that Lardil, where apocope and final cluster simplification interact opaquely, can be effectively analysed within this framework.

13 Since Realise [transition] constraints require transitions to have target feature values of the adjacent consonant, realising a release transition like [pta] instead of [ppa] does not, in principle, satisfy this constraint.

14 It is worth noting that some reported cases prefer preservation of root C1 features over affixal C2 features. However, these cases can be attributed to the morphological root–affix asymmetry rather than to any inherent perceptual asymmetry between C1 and C2. Importantly, the root C2 remains unaffected in contexts where the affixal C2 may be subject to the reduction process. For further discussion of these cases and analyses, see §5.

15 Note that, although the factorial typology presented here is based on the relative ranking changes among Realise [transition], Overlap and *CC as an example, an even more complex factorial typology constructed by flexibly altering the rankings of Ident [closure transition] and Max-C still does not predict outcomes such as *[attta], where C2 deletion applies (cf. McCarthy Reference McCarthy2009, who provides a comparable typology within the framework of HS).

16 Geminates are interpreted here as the preservation of the existing consonant, rather than the deletion of a consonant accompanied by compensatory lengthening of the adjacent consonant. Therefore, the change from a heterorganic cluster to a geminate does not violate Max-C.

17 Although not represented in the tableau, as explained in §2.3, such changes are controlled by the Realise [transition] constraint, which requires the realisation of transitions with target feature values of the adjacent consonant. Even when the Realise [transition] constraint is ranked lower than the Ident [transition] constraints, there should be no instances where [atk] is realised instead of [akk] to satisfy Ident [transition]. This is naturally regulated by assuming that constraints such as *ShortGesture, which prohibit excessively short consonantal gestures (like those that occur briefly in the transition between vowels and consonants), are generally undominated. However, it is possible to consider cases where *ShortGesture is ranked lower. In such instances, remnant consonantal gestures may appear, similar to how English pinpoint can be realised as [pɪ npɔɪnt]. In this case, the deletion is executed incompletely (see also fn. 7).

18 I interpret [h] that precedes [k] as a pre-aspiration, following Anderson’s (Reference Anderson2017: 25) description of Tunica phonetics and phonology. In a medial cluster, this [h] systematically occurs only before a voiceless stop (i.e., [p, t, k] and [tʃ]). Why this pre-aspiration is removed before a sonorant but preserved before a stop could be understood in terms of the general dispreference for a pre-aspirated sonorant (e.g., *ʰC[+son]).

19 In Hungarian, when a stop consonant appears as the second element in a sequence of three consecutive consonants, it may exhibit a very brief release accompanied by a weak burst. However, even in such instances, it is evident that the stop is not followed by an open transition interval (see Bárkányi & Kiss Reference Bárkányi and Kiss2020: Figures 7 and 8).

20 I assume that cluster simplification and velar or labial deletion are regulated by different purposes in Lardil, considering them as separate phonological processes. In fact, laminal palatal consonants such as [tj ɲ] are always deleted when they are the second member of the final cluster, but can be preserved when they occupy the final position alone (e.g., /piɳtaɲi/ → [piɳtaɲ] ‘rainbow type’, Staroverov Reference Staroverov, Assmann, Bank, Georgi, Klein, Weisser and Zimmermann2014: 432), suggesting such a possibility. Compare this with (41d), in which the final palatal [tj] is removed as a consequence of apocope. In contrast, when a final homorganic consonant cluster whose members are both non-coronal is formed by apocope, the whole cluster is removed, as in /muŋkumuŋku/ → muŋkumuŋk → [muŋkumu] ‘wooden axe’ (Staroverov Reference Staroverov, Assmann, Bank, Georgi, Klein, Weisser and Zimmermann2014: 431).

21 It is unclear why an oral stop is mapped onto a glottal fricative instead of a glottal stop in this context, given that a glottal stop exists in the phonemic inventory of Panare. One possible explanation for this mapping could be the avoidance of a glottal stop before an obstruent consonant. Payne & Payne (Reference Payne and Payne2013) hypothesise that /h/ is realised as [ʔ] before C[+son] while remaining [h] before C[-son]. Based on this observation, it is possible to posit a higher-ranked constraint such as *ʔC[−son], which prohibits the realisation of a glottal stop before an obstruent. This constraint would result in the mapping of an oral stop to a glottal fricative rather than a glottal stop.

22 Alternatively, these transitions may be protected by Realise [transition] instead of Max [transition]; see §3.2.

23 Split root/affix faithfulness constraints have already been discussed by McCarthy (Reference McCarthy2008) as one option to address cluster reduction processes targeting C2. The novel aspect of the constraints used in (60) is that they refer to acoustic transitions rather than to the segments themselves.

24 Due to space limitations, it is not possible to discuss all major alternative approaches here. Readers interested in the Targeted Constraint approach to the syncope problem, which is not covered in this article, are referred to Wilson (Reference Wilson2001) for an outline of the proposal, and to McCarthy (Reference McCarthy2002) for a critical discussion of this approach.

25 The following is a brief description of how HS works: HS makes optimisation a sequential process by changing the nature of Gen, in contrast to the single global optimisation process in parallel OT. In order to enable the grammar to choose an optimal output that differs from the underlying form by more than one step, optimisation applies serially. The output generated from the first step serves as the input for the subsequent optimisation step, in which all candidates that are a single step away from this new input are taken into consideration. This iterative process continues until the input and the optimised output converge, meaning they become identical.

26 Many recent studies have raised the issue of irreducible parallelism in HS (Adler Reference Adler2017; Wei Reference Wei, Gallagher, Gouskova and Yin2018; Wei & Walker Reference Wei and Walker2018, Reference Wei and Walker2020; Stanton Reference Stanton2020; Adler & Zymet Reference Adler and Zymet2021, among others). However, Lamont (Reference Lamont2023) suggests that, at least in some previously cited cases involving reduplication, such as those in Mbe and Logoori, these putative derivational lookahead effects do not actually pose a challenge for HS.

References

Abramson, Arthur & Lisker, Leigh (1985). Relative power of cues: F0 shift versus voice timing. In Fromkin, Victoria (ed.) Phonetic linguistics: essays in honor of Peter Ladefoged. New York: Academic Press, 2533.Google Scholar
Adler, Jeffrey (2017). The nature of conspiracy: implications for parallel versus serial derivation. Master’s thesis, University of California, Santa Cruz.Google Scholar
Adler, Jeffrey & Zymet, Jesse (2021). Irreducible parallelism in phonology: evidence for lookahead from Mohawk, Maragoli, Sino-Japanese, and Lithuanian. NLLT 39, 367403.Google Scholar
Anderson, Patricia M. (2017). Yanatame nisa luhchi yoroni: lexicography, language revitalization, and the new Tunica dictionary. PhD dissertation, Tulane University.Google Scholar
Bárkányi, Zsuzsanna & Kiss, Zoltán G. (2020). Neutralisation and contrast preservation: voicing assimilation in Hungarian three-consonant clusters. Linguistic Variation 20, 5683.10.1075/lv.16010.barCrossRefGoogle Scholar
Bliese, Loren F. (1981). A generative grammar of Afar. Arlington, TX: Summer Institute of Linguistics and University of Texas at Arlington.Google Scholar
Blumstein, Sheila E. & Stevens, Kenneth N. (1980). Perceptual invariance and onset spectra for stop consonants in different vowel environments. JASA 67, 648662.10.1121/1.383890CrossRefGoogle ScholarPubMed
Boersma, Paul (2007). Some listener-oriented accounts of h-aspiré in French. Lingua 117, 19892054.10.1016/j.lingua.2006.11.004CrossRefGoogle Scholar
Boersma, Paul (2011). A programme for bidirectional phonology and phonetics and their acquisition and evolution. In Benz, Anton & Mattausch, Jason (eds.) Bidirectional Optimality Theory. Amsterdam: Benjamins, 3372.10.1075/la.180.02boeCrossRefGoogle Scholar
Boersma, Paul (2012). A constraint-based explanation of the McGurk effect. In Noske, Roland & Botma, Bert (eds.) Phonological architecture: empirical, theoretical and conceptual issues. Berlin: Mouton de Gruyter, 299312.Google Scholar
Boersma, Paul & Hamann, Silke (2008). The evolution of auditory dispersion in bidirectional constraint grammars. Phonology 25, 217270.CrossRefGoogle Scholar
Boersma, Paul & Hamann, Silke (2009). Cue constraints and their interactions in phonological perception and production. In Boersma, Paul & Hamann, Silke (eds.) Phonology in perception. Berlin: Mouton de Gruyter, 55110.10.1515/9783110219234.55CrossRefGoogle Scholar
Boersma, Paul & Leussen, Jan-Willem (2017). Efficient evaluation and learning in multi-level parallel constraint grammars. LI 48, 349388.Google Scholar
Cohen, Kevin B. (2000). Aspects of the grammar of Kukú. Munich: Lincom Europa.Google Scholar
Côté, Marie-Hélène (2000). Consonant cluster phonotactics: a perceptual approach. PhD dissertation, Massachusetts Institute of Technology.Google Scholar
Courtz, Henk (2008). A Carib grammar and dictionary. Toronto: Magoria Books.Google Scholar
Delattre, Pierre C., Liberman, Alvin M. & Cooper, Franklin S. (1955). Acoustic loci and transitional cues for consonants. JASA 27, 769773.10.1121/1.1908024CrossRefGoogle Scholar
Dilley, Laura C. & Pitt, Mark A. (2007). A study of regressive place assimilation in spontaneous speech and its implications for spoken word recognition. JASA 122, 23402353.10.1121/1.2772226CrossRefGoogle ScholarPubMed
Dorman, Michael F., Studdert-Kennedy, Michael & Raphael, Lawrence J. (1977). Stop consonant recognition: release bursts and formant transitions as functionally equivalent context-dependent cues. Perception and Psychophysics 22, 109122.10.3758/BF03198744CrossRefGoogle Scholar
Eliasson, Stig (1986). Sandhi in peninsular Scandinavian. In Andersen, Henning (ed.) Sandhi phenomena in the languages of Europe. Berlin: de Gruyter, 271300.10.1515/9783110858532.271CrossRefGoogle Scholar
Flemming, Edward S. (2001). Scalar and categorical phenomena in a unified model of phonetics and phonology. Phonology 18, 744.10.1017/S0952675701004006CrossRefGoogle Scholar
Flemming, Edward S. (2002). Auditory representations in phonology. New York: Routledge.Google Scholar
Flemming, Edward S. (2006). The role of distinctiveness constraints in phonology . Ms, Massachusetts Institute of Technology.Google Scholar
Flemming, Edward S. (2008a). Asymmetries between assimilation and epenthesis . Ms, Massachusetts Institute of Technology.Google Scholar
Flemming, Edward S. (2008b). The realized input . Ms, Massachusetts Institute of Technology.Google Scholar
Flemming, Edward S. (2011). La grammaire de la coarticulation. In Embarki, Mohamed & Dodane, Christelle (eds.) La coarticulation: des indices, direction et representation. Paris: L’Harmattan, 189211.Google Scholar
Flemming, Edward S. (2013). A note on parallel and serial derivations. Class notes, Massachusetts Institute of Technology.Google Scholar
Foulkes, Paul & Docherty, Gerard J. (1999). Sociophonetic variation in ‘glottals’ in Newcastle English. In Ohala, John J., Hasegawa, Yoko, Ohala, Manhari, Granville, Daniel & Baily, Ashley C. (eds.) Proceedings of the 14th International Congress of Phonetic Sciences (ICPhS). Berkeley, CA: University of California, Berkeley, 10371040.Google Scholar
Fujimura, Osamu, Macchi, Marian J. & Streeter, Lynn A. (1978). Perception of stop consonants with conflicting transitional cues: a cross-linguistic study. Language and Speech 21, 337346.10.1177/002383097802100408CrossRefGoogle ScholarPubMed
Gafos, Adamantios I. (2002). A grammar of gestural coordination. NLLT 20, 269337.Google Scholar
Gildea, Spike (1995). A comparative description of syllable reduction in the Cariban language family. IJAL 61, 62102.Google Scholar
Haas, Mary R. (1946). A grammatical sketch of Tunica. In Hoijer, Harry (ed.) Linguistic structures of Native America. New York: Viking Fund, 337366.Google Scholar
Haggard, Mark (1973). Abbreviation of consonants in English pre- and post-vocalic clusters. JPh 1, 924.Google Scholar
Hale, Kenneth (1973). Deep-surface canonical disparities in relation to analysis and change: an Australian example. In Hoenigswald, Henry M. (ed.) Diachronic, areal and typological linguistics. The Hague: Mouton de Gruyter, 401458.10.1515/9783111418797-018CrossRefGoogle Scholar
Hamann, Silke (2003). The phonetics and phonology of retroflexes. PhD dissertation, University of Utrecht.Google Scholar
Hauser, Ivy & Hughto, Coral (2020). Analyzing opacity with contextual faithfulness constraints. Glossa 5, Article no. 82 (33 pp.).Google Scholar
Hawkins, Sarah & Stevens, Kenneth N. (1985). Acoustic and perceptual correlates of the non-nasal–nasal distinction for vowels. JASA 77, 15601575.10.1121/1.391999CrossRefGoogle ScholarPubMed
Hayes, Bruce (1986). Assimilation as spreading in Toba Batak. LI 17, 467499.Google Scholar
Hayward, Richard J. (1984). The Arbore language: a first investigation. Hamburg: Buske.Google Scholar
Hoff, Berend J. (1968). The Carib language. The Hague: Martinus Nijhoff.Google Scholar
Hohulin, Lou & Kenstowicz, Michael (1979). Keley-i phonology and morphophonemics. South-East Asian Linguistic Studies 4, 241254.Google Scholar
Hoijer, Harry (1946). Tonkawa. In Hoijer, Harry (ed.) Linguistic structures of Native America. New York: Viking Fund, 289311.Google Scholar
Hualde, José I. (1991). Basque phonology. New York: Routledge.Google Scholar
Itô, Junko (1986). Syllable theory in prosodic phonology. PhD dissertation, University of Massachusetts Amherst.Google Scholar
Itô, Junko & Mester, Armin (1996). Stem and word in Sino-Japanese. In Otake, Takashi & Cutler, Ann (eds.) Phonological structure and language processing: cross-linguistic studies. Berlin: Mouton de Gruyter, 1344.10.1515/9783110815825.13CrossRefGoogle Scholar
Jun, Jongho (1995). Perceptual and articulatory factors in place assimilation: an Optimality Theoretic approach. PhD dissertation, University of California, Los Angeles.Google Scholar
Jun, Jongho (2002). Positional faithfulness, sympathy and inferred input. Ms, Yeungnam University.Google Scholar
Jun, Jongho (2004). Place assimilation. In Hayes, Bruce, Kirchner, Robert & Steriade, Donca (eds.) Phonetically based phonology. Cambridge: Cambridge University Press, 5886.10.1017/CBO9780511486401.003CrossRefGoogle Scholar
Jun, Jongho (2011). Positional effects in consonant clusters. In van Oostendorp, Marc, Ewen, Colin J., Hume, Elizabeth V. & Rice, Keren (eds.) The Blackwell companion to phonology, volume 2, chapter 46. Oxford: Wiley-Blackwell, 11031123.Google Scholar
Kavitskaya, Darya & Staroverov, Peter (2010). When an interaction is both opaque and transparent: the paradox of fed counterfeeding. Phonology 27, 255288.10.1017/S0952675710000126CrossRefGoogle Scholar
Kawahara, Shigeto & Pangilinan, Melanie (2017). Spectral continuity, amplitude changes, and perception of length contrasts. In Kubozono, Haruo (ed.) Phonetics and phonology of geminate consonants. Oxford: Oxford University Press, 1333.Google Scholar
Keating, Patricia A. (1988). Underspecification in phonetics. Phonology 5, 275292.10.1017/S095267570000230XCrossRefGoogle Scholar
Kenstowicz, Michael (1994). Syllabification in Chukchee: a constraints-based analysis. In Proceedings of the Formal Linguistics Society of the Midwest. Iowa City: Department of Linguistics, University of Iowa, 160181.Google Scholar
Kewley-Port, Diane (1982). Measurement of formant transitions in naturally produced stop consonant–vowel syllables. JASA 72, 379389.10.1121/1.388081CrossRefGoogle ScholarPubMed
Kidda, Mairo E. (1993). Tangale phonology: a descriptive analysis. Berlin: Dietrich Reimer.Google Scholar
Kim, Yeong-Joon (2024a). Overapplication opacity as a consequence of phonetic faithfulness . Paper presented at WCCFL 42, University of California, Berkeley, April 2024.Google Scholar
Kim, Yeong-Joon (2024b). Phonetic faithfulness in counterfeeding opacity . Paper presented at NELS 54, Massachusetts Institute of Technology, January 2024.Google Scholar
Kim, Yeong-Joon (2024c). Phonetic faithfulness in phonological opacity. PhD dissertation, Massachusetts Institute of Technology.Google Scholar
Kingston, John (1994). Change and stability in the contrasts conveyed by consonant releases. In Keating, Patricia A. (ed.) Phonological structure and phonetic form: papers in laboratory phonology III. Cambridge: Cambridge University Press, 354361.10.1017/CBO9780511659461.020CrossRefGoogle Scholar
Klatt, Dennis (1973). Durational characteristics of pre-stressed word initial C clusters in English. MIT RLE Quarterly Progress Report 108, 253260.Google Scholar
Kurisu, Kazutaka (2000). Richness of the Base and root fusion in Sino-Japanese. Journal of East Asian Linguistics 9, 147185.10.1023/A:1008323215522CrossRefGoogle Scholar
Ladefoged, Peter & Maddieson, Ian (1996). The sounds of the world’s languages. Oxford: Blackwell.Google Scholar
Lamont, Andrew (2015). Progressive place assimilation in Optimality Theory. Master’s thesis, Eastern Michigan University.Google Scholar
Lamont, Andrew (2023). Serial reduplication is empirically adequate and typologically restrictive. LI 54, 797839.Google Scholar
Lamontagne, Greg (1997). Relativized contiguity part I: contiguity and syllabic prosody. Ms, University of British Columbia. ROA #150.Google Scholar
Liberman, Alvin M., Delattre, Pierre C., Gerstman, Louis J. & Cooper, Franklin S. (1956). Tempo of frequency change as a cue for distinguishing classes of speech sounds. Journal of Experimental Psychology 52, 127137.10.1037/h0041240CrossRefGoogle ScholarPubMed
Lisker, Leigh (1975). Is it VOT or a first-formant transition detector? JASA 57, 15471551.10.1121/1.380602CrossRefGoogle ScholarPubMed
Lodge, Ken (1992). Assimilation, deletion paths and underspecification. JL 28, 1352.10.1017/S0022226700014985CrossRefGoogle Scholar
McCarthy, John J. (1999). Sympathy and phonological opacity. Phonology 16, 331399.CrossRefGoogle Scholar
McCarthy, John J. (2002). On targeted constraints and cluster simplification. Phonology 19, 273292.10.1017/S0952675702004347CrossRefGoogle Scholar
McCarthy, John J. (2007). Slouching towards optimality: coda reduction in OT-CC. Phonological Studies 7, 89104.Google Scholar
McCarthy, John J. (2008). The gradual path to cluster simplification. Phonology 25, 271319.10.1017/S0952675708001486CrossRefGoogle Scholar
McCarthy, John J. (2009). The P-Map in Harmonic Serialism. Ms, University of Massachusetts Amherst.Google Scholar
McCarthy, John J. (2010). An introduction to harmonic serialism. Language and Linguistics Compass 4, 10011018.10.1111/j.1749-818X.2010.00240.xCrossRefGoogle Scholar
McCarthy, John J. (2011). Perceptually grounded faithfulness in Harmonic Serialism. LI 42, 171183.Google Scholar
McCarthy, John J. (2016). The theory and practice of Harmonic Serialism. In McCarthy, John J. & Pater, Joe (eds.) Harmonic Grammar and Harmonic Serialism. Sheffield: Equinox, 4787.Google Scholar
McCarthy, John J. & Prince, Alan (1995). Faithfulness and reduplicative identity. Ms, University of Massachusetts Amherst and Rutgers University. ROA #60.Google Scholar
Mohanan, K. P. (1993). Fields of attraction in phonology. In Goldsmith, John (ed.) The last phonological rule: reflections on constraints and derivations. Chicago: University of Chicago Press, 61116.Google Scholar
Myers, Scott & Hansen, Benjamin B. (2007). The origin of vowel length neutralization in final position: evidence from Finnish speakers. NLLT 25, 157193.Google Scholar
Ohala, John J. (1990). The phonetics and phonology of aspects of assimilation. In Kingston, John & Beckman, Mary E. (eds.) Papers in Laboratory Phonology I: between the grammar and physics of speech. Cambridge: Cambridge University Press, 258275.10.1017/CBO9780511627736.014CrossRefGoogle Scholar
Payne, Thomas E. & Payne, Doris L. (2013). A typological grammar of Panare: a Cariban language of Venezuela. Leiden: Brill.10.1163/9789004242197CrossRefGoogle Scholar
Phillips, Dennis P., Hall, Susan E. & Boehnke, Susan E. (2002). Central auditory onset responses, and temporal asymmetries in auditory perception. Hearing Research 167, 192205.10.1016/S0378-5955(02)00393-3CrossRefGoogle ScholarPubMed
Prince, Alan & Smolensky, Paul ([1993] 2004). Optimality Theory: constraint interaction in generative grammar. Oxford: Blackwell. Originally published in 1993 as technical report no. 2 of the Rutgers Center for Cognitive Science.10.1002/9780470759400CrossRefGoogle Scholar
Reetz, Henning & Jongman, Allard (2020). Phonetics: transcription, production, acoustics, and perception. 2nd edition. Hoboken, NJ: Wiley-Blackwell.Google Scholar
Rose, Sharon & Walker, Rachel (2004). A typology of consonant agreement by correspondence. Lg 80, 475531.Google Scholar
Sapir, J. David (1965). A grammar of Diola-Fogny. Cambridge: Cambridge University Press.Google Scholar
Shih, Stephanie S. & Inkelas, Sharon (2019). Autosegmental aims in surface-optimizing phonology. LI 50, 137196.Google Scholar
Siptár, Péter & Törkenczy, Miklós (2000). The phonology of Hungarian. Oxford: Oxford University Press.10.1093/oso/9780198238416.001.0001CrossRefGoogle Scholar
Stanton, Juliet (2017). Constraints on the distribution of nasal-stop sequences: an argument for contrast. PhD dissertation, Massachusetts Institute of Technology.Google Scholar
Stanton, Juliet (2020). Gurindji nasal cluster dissimilation as trigger deletion. JL 56, 157195.10.1017/S0022226718000506CrossRefGoogle Scholar
Staroverov, Peter (2014). Opacity in Lardil: stratal vs. serial derivations in OT. In Assmann, Anke, Bank, Sebastian, Georgi, Doreen, Klein, Timo, Weisser, Philipp & Zimmermann, Eva (eds.) Topics at InfL, number 92 in Linguistische Arbeits Berichte. Leipzig: Institut für Linguistik, Universität Leipzig, 427458.Google Scholar
Steriade, Donca (1993). Closure, release and nasal contours. In Huffman, Marie K. & Krakow, Rena A. (eds.) Nasals, nasalization, and the velum. San Diego, CA: Academic Press, 401470.Google Scholar
Steriade, Donca (1999). Phonetics in phonology: the case of laryngeal neutralization. UCLA Working Papers in Linguistics 2, 25145.Google Scholar
Steriade, Donca (2001). Directional asymmetries in place assimilation: a perceptual account. In Hume, Elizabeth & Johnson, Keith (eds.) The role of speech perception in phonology. San Diego, CA: Academic Press, 219250.10.1163/9789004454095_013CrossRefGoogle Scholar
Steriade, Donca (2009). The phonology of perceptibility effects: the P-map and its consequences for constraint organization. In Hanson, Kristin & Inkelas, Sharon (eds.) The nature of the word: studies in honor of Paul Kiparsky. Cambridge, MA: MIT Press, 151180.Google Scholar
Swadesh, Morris (1946). Chitimacha. In Hoijer, Harry (ed.) Linguistic structures of Native America. New York: Viking Fund, 312336.Google Scholar
Traunmüller, Hartmut (1990). Analytical expressions for the tonotopic sensory scale. JASA 88, 97100.10.1121/1.399849CrossRefGoogle Scholar
Valentine, J. Randolph (2001). Nishnaabemwin reference grammar. Toronto: University of Toronto Press.Google Scholar
Walker, Rachel (2000). Long-distance consonantal identity effects. WCCFL 19, 532545.Google Scholar
Walker, Rachel (2020). Gradient activity in Korean place assimilation. NELS 50, 207220.Google Scholar
Webb, Charlotte (1982). A constraint on progressive consonantal assimilation. Linguistics 20, 309321.10.1515/ling.1982.20.3-4.309CrossRefGoogle Scholar
Wei, Wei (2018). Lookahead effect in Mbe reduplication. In Gallagher, Gillian, Gouskova, Maria & Yin, Sora Heng (eds.) Supplemental proceedings of the 2017 Annual Meeting on Phonology. Washington: Linguistic Society of America, 8 pp.Google Scholar
Wei, Wei & Walker, Rachel (2018). Lookahead effects in the reduplication-phonology interface. NELS 48, 183196.Google Scholar
Wei, Wei & Walker, Rachel (2020). A lookahead effect in Mbe reduplication: implications for Harmonic Serialism. LI 51, 845859.Google Scholar
Wilson, Colin (2001). Consonant cluster neutralization and targeted constraints. Phonology, 18, 147197.10.1017/S0952675701004043CrossRefGoogle Scholar
Wright, James T. (1986). The behavior of nasalized vowels in the perceptual vowel space. In Ohala, John J. & Jaeger, Jeri J. (eds.) Experimental phonology. Orlando, FL: Academic Press, 4568.Google Scholar
Wright, Richard A. (2004). A review of perceptual cues and robustness. In Hayes, Bruce, Kirchner, Robert & Steriade, Donca (eds.) Phonetically based phonology. Cambridge: Cambridge University Press, 3457.10.1017/CBO9780511486401.002CrossRefGoogle Scholar
Figure 0

Figure 1 The auditory subsegments and features of the sound string [atpu] (NF = noise frequency; NL = noise loudness).

Figure 1

Figure 2 Simplified version of the Realised Input model of phonology.