Hostname: page-component-745bb68f8f-s22k5 Total loading time: 0 Render date: 2025-02-11T17:57:31.683Z Has data issue: false hasContentIssue false

How tight is the link between alternations and phonotactics?

Published online by Cambridge University Press:  11 February 2025

Jongho Jun*
Affiliation:
Department of Linguistics, Seoul National University, Seoul, Republic of Korea.
Hanyoung Byun
Affiliation:
Department of Linguistics, Seoul National University, Seoul, Republic of Korea.
Seon Park
Affiliation:
Department of Linguistics, Seoul National University, Seoul, Republic of Korea.
Yoona Yee
Affiliation:
Department of Linguistics, Seoul National University, Seoul, Republic of Korea.
*
Corresponding author: Jongho Jun; Email: jongho@snu.ac.kr
Rights & Permissions [Opens in a new window]

Abstract

This study tests the hypothesis that alternation patterns with strong lexical support are more robust than those with no, or weak, lexical support. Focusing on three alternation patterns in Korean with varying productivity and generality, we measured lexical support in two ways. First, we conducted an acceptability-rating experiment investigating Korean speakers’ judgements on non-words with and without violations of the phonotactic constraints motivating the alternations. In addition, we performed a simulation of learning a maximum entropy (MaxEnt) Harmonic Grammar from a dictionary corpus. The results of the experiment and computational modelling confirmed the hypothesis by showing that if an alternation is robust, its associated phonotactic constraint is learned with a high weight from the MaxEnt simulation, and it affects the participants’ well-formedness ratings for non-words. Consequently, the results of this research support the claim of a tight link between alternations and phonotactics.

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

1. Introduction

This study addresses the relationship between alternations and phonotactics. In standard Optimality Theory (Prince & Smolensky, [Reference Prince and Smolensky1993] 2004), the same phonological constraints are employed to account for both alternations and their corresponding phonotactic restrictions, and thus a very tight link between alternations and phonotactics is assumed by default. This assumption is in line with previous studies (Hayes Reference Hayes, Kager, Pater and Zonneveld2004; Prince & Tesar Reference Prince, Tesar, Kager, Pater and Zonneveld2004; Jarosz Reference Jarosz2006, Reference Jarosz, Danis, Mesh and Sung2011; Pater & Tessier Reference Pater, Tessier, Slabakova, Montrul and Prévost2006; Chong Reference Chong2017, Reference Chong2021) showing that alternations are easier to learn if they enforce lexically supported phonotactic patterns than if they do not. One well-known alternation pattern with mismatching phonotactics is derived environment, or non-derived environment blocking, effects (Kiparsky Reference Kiparsky and Fujimura1973 and others). Derived environment effect phenomena, in particular morphologically derived environment effects, have been described in the literature as phonological processes applying across morpheme boundaries in languages where their potential target sequences are attested within morphemes. In languages with derived environment effect patterns, the same sequences would be well-formed in lexical phonotactics but ill-formed across morphemes. Chong’s (Reference Chong2017; Reference Chong2019) recent studies on two well-known cases of derived environment effects, namely Korean palatalisation and Turkish velar deletion, suggest that derived environment effect phenomena do not necessarily pose a problem for the claim of a link between phonotactics and alternations. The results of Chong’s corpus studies show that the constraint for Korean palatalisation is general and active both across stem–suffix boundaries and within morphemes whereas the constraint for Turkish velar deletion is active only in a small number of morphological contexts, and inactive within morphemes, indicating that phonotactics and alternations do not completely mismatch. In addition, the results of Chong’s (Reference Chong2017; Reference Chong2021) artificial language learning experiment show, in line with the previous studies mentioned above, that alternations with no phonotactic support are harder to learn than those with phonotactic support. Accordingly, alternations with no phonotactic support are predicted to be rare or unlikely.

However, not all previous phonological studies argue or assume that there is a tight link between alternations and phonotactics. Note that in early generative phonology (Chomsky & Halle Reference Chomsky and Halle1968), in which lexical phonotactics is expressed by morpheme structure constraints whereas alternations result through the application of phonological rules, phonotactics may in principle be independent of alternations. Some recent studies (e.g., Paster Reference Paster2013; Do & Yeung Reference Do and Yeung2021) claim that alternations with matching phonotactics are accidental and language-specific, and thus there is no universal link between them.

Given the two opposing views on the connection between phonotactics and alternations, we test the following hypothesis for a link between phonotactics and alternations.

We focus on three alternation patterns in Korean: palatalisation, vowel harmony and laryngeal obligatory contour principle (OCP), which are known to have derived environment effects, and which vary in productivity and generality. As will be presented in §2, in Korean, palatalisation is highly productive and general whereas vowel harmony and laryngeal OCP are limited in both productivity and generality.

The test is carried out in two ways. First, we measure lexical support for each of the three alternation patterns, using Korean speakers’ judgement ratings on non-words, under the assumption that speakers’ judgements are a good proxy for lexical phonotactics. The hypothesis in (1) predicts that the more robust an alternation pattern is, the more its associated phonotactic restriction will affect participants’ non-word ratings. Second, we measure lexical support for the same alternation patterns by conducting a simulation of learning a maximum entropy Harmonic Grammar (MaxEnt; e.g., Goldwater & Johnson Reference Goldwater, Johnson, Spenader, Eriksson and Dahl2003; Hayes & Wilson Reference Hayes and Wilson2008). Specifically, we trained the UCLA Phonotactic Learner (Hayes & Wilson Reference Hayes and Wilson2008) on a Korean monomorpheme lexicon, on the assumption that the learned weights of constraints reflect over- or under-representation of a pattern in the lexicon. The hypothesis in (1) predicts that the more robust an alternation pattern is, the more likely its associated phonotactic constraint is to be learned and the greater its weight will be.

In order to measure lexical support for the alternation patterns, we conduct an acceptability judgement experiment on Korean non-words in addition to MaxEnt modelling on a Korean corpus for the following reasons. First, the corpus adopted in the present study might be flawed in part, and therefore, it may not be a good representative of the linguistic environment of an average Korean speaker. More importantly, speakers’ phonotactic knowledge might not directly reflect statistical patterns in the raw lexicon if the speakers are biased towards under- or over-learning certain lexical patterns (Hayes et al. Reference Hayes, Zuraw, Siptar and Londe2009; Becker et al. Reference Becker, Ketrez and Nevins2011; Hayes & White Reference Hayes and White2013, and others). Finally, by conducting both MaxEnt modelling on lexical data and an acceptability judgement test, we are able to find out to what extent speakers’ phonotactic grammar is derived from the lexical data.

The results of the experiment and computational modelling, which are highly correlated with each other, are overall consistent with the predictions made from the hypothesis in (1), although some results are unclear. Specifically, the participants’ ratings were lower for non-words violating the palatalisation constraint, and a high-weighted general palatalisation constraint was learned in the MaxEnt simulation. Further, in contrast, the participants’ ratings were not lower for non-words violating vowel harmony constraints than for those satisfying them, and no vowel harmony constraints were learned in the simulation. Finally, the results of the acceptability-rating experiment showed no indication of laryngeal OCP, which might be due to a floor effect, and laryngeal OCP constraints prohibiting only some subsets of the potential target sequences were learned with low weights in the simulation. Based on these results, we conclude that there is a tight link between alternations and phonotactics, which we believe should be captured in the grammar and/or in its learning algorithm.

The rest of this article is organised as follows. In §2, we present background information of the selected alternation phenomena in Korean, discussing their productivity and generality. In §§3 and 4, we conduct an experiment checking Korean speakers’ phonotactic judgements and perform computational modelling of Korean phonotactic grammar using a dictionary corpus. We report the procedures and results of the experiment and computational modelling, and compare their results to see if both methods of inquiry yield convergent results or not. In §5, we discuss some alternative analyses and interpretations of the results of the experiment and computational modelling. The final section concludes the article.

2. Some alternation patterns in Korean

This section presents background information about the three alternation patterns in Korean that we will focus on to test the hypothesis in (1). Our main concern will be to examine how productive and general each of these patterns is. As illustrated below, two of the three cases (palatalisation and vowel harmony) concern the constraint that drives the alternation itself, while the remaining case (laryngeal OCP in compound tensing) involves a constraint that blocks the alternation. Consequently, in our discussion, we will concentrate on the effect of application of the process in question in the former cases and the effect of blocking of the process in question in the latter case.

2.1. Palatalisation

In Korean, coronal stops before high front vocoids (henceforth TI) become alveo-palatal affricates, as shown in (2a) and (2b). This palatalisation does not apply within morphemes, as shown in (2c), and thus it has been considered an instance of a derived environment effect.

Korean palatalisation applies obligatorily at stem–suffix boundaries, regardless of whether the suffixes are inflectional, as in (2a), or derivational, as in (2b) (Cho Reference Cho, Hanson and Inkelas2009: 466). This indicates that Korean palatalisation in morphologically derived environments is not only productive but also general.

In order to have a better understanding of the productivity and generality of palatalisation, we need to consider two complications involving other processes which may interact with palatalisation. First, the palatalised forms occurring before inflectional suffixes (2a) are not the only attested surface forms. In Korean, noun stem-final coronal obstruents are in variation mainly with [s] before vowel-initial suffixes. Thus, when noun stems ending in coronal stops are combined with /i/-initial suffixes, [s]-final variants may occur in addition to the palatalised forms, as shown in (3a). This stem-final obstruent variation is not limited to the palatalisation context, as shown in (3b) (see Jun Reference Jun2010 for more details of Korean noun stem-final obstruent variation).

Given that either palatalised forms or [s]-final variants, not [tʰ]-final forms, are attested in a palatalising context, as can be seen in (3a), we may consider palatalisation to apply obligatorily, and the resulting palatalised forms such as [pacʰi] may further be subject to free variation targeting stem-final obstruents, optionally yielding [s]-final forms such as [pasi]. Note that although innovative [s]-final forms may occur as free variants of the palatalised forms, none of the attested surface forms in the palatalisation context violate the palatalisation constraint. Thus, it is true that the palatalisation constraint, that is, *TI, is never violated across stem–suffix boundaries. This palatalisation, optionally followed by noun stem-final obstruent variation, is quite robust, and is extended to loanwords. When English words ending in stops are borrowed into Korean, either vowel epenthesis or final coda unreleasing applies: for example, Matt [mɛtʰɨ] \textasciitilde [mɛt˺] (personal name).Footnote 1 Such loanword forms take different allomorphs of the nominative and accusative case markers, following a general allomorphic rule of Korean: vowel- and consonant-initial suffix allomorphs are used after consonant- and vowel-final stems, respectively. Specifically, when loanwords for English words ending in a coronal stop are inflected with nominative case markers, stem forms ending in an epenthetic vowel are suffixed with -ka (4a). In contrast, stem forms ending in an unreleased stop are suffixed with -i (4b). Here, stem forms before -i can end in either [cʰ] or [s], and thus sequences violating the palatalisation constraint, such as [tʰ-i], are unlikely to occur.Footnote 2

Note that because loanword stem forms with vowel epenthesis end in [tʰ], /tʰ/ can be considered as the underlying form of the loanword stem-final consonant in (4). Then, it is plausible to assume that [cʰ]-final forms occurring before -i are the result of applying palatalisation, and [s]-final forms occur due to the stem-final obstruent variation. This indicates that palatalisation, optionally followed by stem-final obstruent variation, applies to loanwords, confirming its high productivity.

The second complication is that TI sequences occurring in compounding and prefix–stem combinations usually undergo n-insertion, in which /n/ is inserted at the juncture of two morphemes when the first morpheme ends with a consonant and the following morpheme begins in a high front vocoid /i/ or /j/. This n-insertion is often optional, and thus TI sequences, which violate the palatalisation constraint, may surface, as shown in (5).Footnote 3 However, it seems that such non-palatalised forms are minor variants in the n-insertion context.Footnote 4

Consequently, it seems that there are no morphologically derived environments in Korean where the palatalisation constraint is violated frequently.

In summary, Korean palatalisation is highly productive at stem–suffix boundaries, and can also be considered general, given that it applies in all morphologically derived environments, except where n-insertion may apply.

2.2. Vowel harmony

To aid the understanding of Korean vowel harmony, let us first examine the vowel phoneme inventory of a dialect of Korean spoken in the Seoul-Gyeonggi area (henceforth, Seoul Korean), shown in (6).

This 10-vowel inventory can be considered a conservative one. The front rounded vowels /y/ and /ø/ are often diphthongised as [wi] and [we], respectively, surfacing faithfully only in some limited environments. In addition, it is controversial whether /e/ and /ɛ/ are merged completely or not. It seems true that the distinction between /e/ and /ɛ/ is completely lost at least among many younger speakers of Seoul Korean (see Lee Reference Lee1996 and Cho Reference Cho and Aronoff2016 for relevant discussion).

Korean has two types of vowel harmony, which hold in different domains: the inflection of verbs and adjectives (henceforth verbal inflection) and ideophones. The harmonic groups of vowels, traditionally called ‘light’ and ‘dark’, are different between the two vowel harmonies. The light vowel group consists only of /a o/ in the vowel harmony system of verbal inflection, as shown in (7a). In contrast, the light vowel group for ideophones is a larger set, as shown in (7b).

Since alternations involving vowel harmony occur only in verbal inflection, we will focus on the type of vowel harmony in verbal inflection in the remainder of this article.

In Korean, verb stems never occur in isolation; they are always suffixed. Verbal inflectional suffixes may be classified into the following four types, based on Choi (Reference Choi1985) and Kang (Reference Kang2006).

In Korean verbal paradigm, all vowel-initial inflectional suffixes have two basic allomorphs, [a]- and [ʌ]-initial, as illustrated in (9).Footnote 5

The choice between the two allomorphs depends on which harmonic group the last stem vowel belongs to. The specific selection mechanism is illustrated in (10).

[a]-initial forms are chosen when the last stem vowel is either /a/ or /o/, whereas [ʌ]-initial forms are chosen after any other stem vowel.

As discussed by Kang (Reference Kang2012: 7–12), in Middle Korean, vowel harmony was widely attested. Not only verbal but also nominal inflections showed harmonic alternations. Harmony also held within native Korean roots. Sino-Korean roots did not obey vowel harmony, but they constituted only a minority of the Middle Korean vocabulary (Kim Reference Kim2007). Thus, it is generally accepted that vowel harmony was pervasive and productive in Middle Korean. Since then, a variety of historical changes, such as vowel shift, vowel loss, monophthongisation, and the expansion of Sino-Korean vocabulary, have disrupted the vowel harmony system, resulting in its limited application in Contemporary Korean. Specifically, vowel harmony alternation takes place only in verbal inflection, not in nominal inflection.Footnote 6 In addition, there exist disharmonic root morphemes in the Korean lexicon, as shown in (11), and thus Korean vowel harmony can be considered an instance of derived environment effect.

Since Korean vowel harmony applies only in verbal inflection, it cannot be considered a general process. Moreover, even within verbal inflection, the domain of application of vowel harmony is subject to several further limitations. First, only vowel-initial, not consonant-initial, suffixes participate in vowel harmony: for example, /mʌk-A/ [mʌkʌ] *[mʌka] ‘eat-decl.ind’, but /mʌk-ca/ [mʌkc’a] *[mʌkc’ʌ] ‘eat-hort’ (where /A/ represents the underlying form of the alternating suffix-initial vowel).Footnote 7 Second, only the initial vowels of vowel-initial suffixes alternate due to vowel harmony, and non-initial suffix vowels have no harmonic variants, as can be seen in (9c) and (9d). Third, only the last vowel of the stem can trigger vowel harmony, and stem vowels other than the last one do not affect vowel harmony: for example, /k’apul-A/ [k’apulʌ], *[k’apula] ‘behave badly-decl’. Consequently, vowel harmony holds only between the last vowel of the verbal stem and the initial vowel of the following vowel-initial suffixes. Given that vowel harmony applies only in such a restricted domain, the generality of Korean vowel harmony is quite limited.

In order to ascertain how productive vowel harmony in Korean verbal inflection is, let us begin by considering which vowel, /a/ or /ʌ/, is the underlying form of the alternating suffix-initial vowel. Recall that [ʌ]-initial forms are used for a larger set of trigger stem vowels, that is, all vowels other than /a o/. This is not the only reason why /ʌ/ can be considered as the underlying vowel. The surface forms of the alternating suffixes not adjacent to the preceding stems provide more definite evidence for the underlying form of the suffix-initial vowel. As discussed above, vowel harmony applies only when the alternating suffixes are adjacent to stems. For instance, as shown in (12a), when the decl.ind suffix is immediately preceded by a stem with a light vowel /a o/, it is realised as [a]. In contrast, as shown in (12b), when the same suffix follows another suffix, it is always realised as [ʌ]. Note that the preceding suffix is realised as a light alternant, [as’], and thus the [ʌ]-initial form of the decl.ind suffix in (12b) cannot be due to vowel harmony. This clearly suggests that the underlying vowel of the decl.ind suffix is /ʌ/.

Given that it is possible that all attested [ʌ]-initial forms of the alternating suffixes are simply the underlying forms, it is only in [a]-initial suffix forms that we can be sure vowel harmony has applied.

Since vowel harmony shows exceptions and variations, [a]-initial suffix forms are not as frequent as expected when the last stem vowel is a light vowel /a o/, as experimentally shown in some previous studies on novel Korean words (Kang Reference Kang2012, Reference Kang2016; Jang Reference Jang and Erlewine2017) and as recently discussed and emphasised by Jo (Reference Jo2023). The most prominent exceptional pattern is that when the last stem vowel is /a/, disharmonic forms are attested often: for example, /mac-A/ [mac-a] \textasciitilde [mac-ʌ] ‘correct-decl’. Even more frequent exceptions occur when the stems are multi-syllabic p-irregular (in which stem-final /p/ appears as [w] before vowel-initial suffixes), regardless of whether the last stem vowel is /a/ or /o/: for example, /kølop-A/ [kølow-ʌ] (\textasciitilde[kølow-a]) ‘painful-decl’ and /alɨmtap-A/ [alɨmtaw-ʌ] (\textasciitilde[alɨmtaw-a]) ‘beautiful-decl’. Accordingly, the only case where vowel harmony is known to apply obligatorily is with regular verb and adjective stems with /o/ as the last stem vowel: for example, /nok-A/ [nok-a], *[nok-ʌ] ‘melt-decl.ind’. However, the results of some recent experimental studies on Korean vowel harmony (Kang Reference Kang2012; Jang Reference Jang and Erlewine2017) show that exceptional disharmonic forms occur even after novel stems with /o/ as the last stem vowel, and that the rate of vowel harmony in novel Korean words does not differ depending on the quality of the last stem vowel, whether /a/ or /o/. Thus, the exceptionless harmony pattern triggered by /o/ is not extended to novel Korean words.

Some quantitative information about how often Korean speakers apply vowel harmony in speech can be obtained from the results of Kang’s (Reference Kang2012) production experiment.Footnote 8 As shown in (13), in Kang’s experiment, exceptional disharmonic forms occurred much more often for novel Korean words than for real Korean words.

Furthermore, at least nine out of 30 Korean participants produced only [ʌ]-initial disharmonic forms for novel Korean stems with a light vowel, and additional six participants produced ʌ-initial forms for over 80% of the same novel stems (Kang Reference Kang2012: 51, Figure 20).Footnote 9 A similar pattern indicating complete loss of vowel harmony was observed in the results of Jang’s (Reference Jang and Erlewine2017) production experiment in which three out of 13 Seoul Korean participants produced only [ʌ]-initial forms for 12 verb stems with light vowels (four real and eight novel stems). Consequently, the results of the previous experimental studies using novel Korean words suggest that the true productivity of Korean vowel harmony is quite low, or even zero for some speakers.

In summary, Korean vowel harmony is neither general nor productive. The generality of vowel harmony is clearly limited in that it applies only between stem-last vowels and initial vowels of vowel-initial suffixes in verbal inflection. The productivity of vowel harmony is also limited since exceptions are frequent in existing words, and even more frequent in novel words.

2.3. Laryngeal OCP in compound tensing

For an effective illustration of phonation alternations in Korean compound tensing, let us start by examining the Seoul Korean consonant phoneme inventory, which is shown in (14).

Note that obstruents in Seoul Korean and many other dialects of Korean show three-way phonation contrasts between lenis, aspirated and tense. Among them, the laryngeally marked consonants (aspirated and tense) will be the focus of the current discussion.

In Korean, when a compound consists of two nouns, and the second noun stem (N2) begins with a lenis obstruent, tensing can occur as shown in (15).

This tensing is often optional, and it shows lexical variation. One interesting tendency found in previous studies on Seoul Korean compound tensing (Zuraw Reference Zuraw2011; Kim Reference Kim2016) is the avoidance of tensing in words with laryngeally marked segments. Specifically, tensing is less likely when N2 has tense or aspirated consonants, as in (15d) and (15e). This trend can plausibly be interpreted as the effect of laryngeal co-occurrence restrictions or laryngeal OCP. According to Ito’s (Reference Ito2014) study on compound tensing in Yanbian Korean (a variety spoken in China’s Yanbian Korean Autonomous Prefecture), this laryngeal OCP effect is more extensive in Yanbian Korean than in Seoul Korean, in that tensing is less likely even when the first component noun stem (N1) has a laryngeally marked segment. Moreover, the laryngeal OCP effect that was found significant in existing compound nouns was mirrored in novel compounds. The laryngeal OCP effect has also been observed in Kang & Oh’s (Reference Kang and Mira2019) acceptability rating experiments for existing and novel Korean compounds, which employed speakers of Jeolla Korean (a variety spoken in the southwestern province of South Korea). Thus, it seems that the laryngeal OCP effect is active and productive in compounds in various dialects of Korean.

Note that there exist monomorphemes with multiple laryngeally marked consonants, as shown in (16), and thus the laryngeal OCP effect can be considered an instance of a derived environment effect.

Let us now consider how general and how productive the laryngeal OCP is in Korean. We are not aware of any morphological domains other than noun–noun compounding in which laryngeal OCP is active. In Korean, there are several tensing processes other than compound tensing which may create forms with multiple laryngeally marked onset consonants. First, post-obstruent tensing, in which a lenis obstruent becomes tense after an obstruent, is known to be an automatic process, and thus it is generally assumed to apply obligatorily. To the best of our knowledge, no previous studies report the blocking of post-obstruent tensing by the laryngeal OCP. In addition, the initial lenis consonants of adjective, noun and verb stems in Korean become tense optionally, as shown in (17).

The lexical meaning of adjectives and some nouns, but not verbs, can be intensified with this word-initial tensing. Kang & Oh (Reference Kang and Mira2016) collected (Jeolla) Korean speakers’ acceptability ratings on word-initial tensing by employing a set of real Korean words which do not undergo word-initial tensing. Based on their statistical analysis of the experiment results, Kang & Oh argue that word-initial tensing is preferred in words with tense consonants. This is the opposite of the laryngeal OCP effect.Footnote 10 Thus, the results of Kang & Oh’s experiment show no laryngeal OCP effect in word-initial tensing. Consequently, to the best of our knowledge, compound tensing is the only clear case, in which laryngeal OCP is effective in Seoul Korean and other dialects of Korean, and thus laryngeal OCP cannot be considered general in Korean.

Let us now consider how productive the laryngeal OCP is in Korean compound tensing. Note that tensing in compounding is variable, and laryngeal OCP decreases the tensing rate when stems have laryngeally marked consonants, as can be seen in (18a).Footnote 11 Using the tensing rate data from the previous studies in (18a), we have calculated the rates of blocking by the laryngeal OCP, as shown in (18b).

The resulting rates of blocking in (18b) indicate that the blocking by the laryngeal OCP is far from obligatory in both Seoul and Yanbian Korean. This partial blocking effect is extended to novel compounds, as reported by the experimental studies on Yanbian Korean (Ito Reference Ito2014) and Jeolla Korean (Kang & Oh Reference Kang and Mira2019). Kang & Oh (Reference Kang and Mira2016) discuss their experimental results by providing mosaic plots and statistical tests, not detailed rates of compound tensing, and so we cannot calculate the rate of blocking of compound tensing using their published results. However, Kang & Oh clearly show that the presence of aspirated or tense onsets in N2 do not completely block compound tensing, regardless of whether N2 is real or novel, and that the blocking effect is higher in real compounds than in novel compounds. In summary, the results of the previous experimental studies on Korean compound tensing suggest that the laryngeal OCP blocks compound tensing, even when N2 is a novel stem, but the blocking effect is only partial, not complete.

In this section, we have discussed how general and how productive the laryngeal OCP is in Korean. We may conclude that both generality and productivity of the laryngeal OCP effect are limited, since the effect is attested only in noun–noun compounding, and it is only partial, not even close to complete.

2.4. A summary

Based on the discussions presented in the previous sections, we may determine how the three alternation patterns in Korean differ in robustness which we consider to be a function of productivity (namely, rate of application) and generality (namely, breadth of application domain). It seems clear that Korean palatalisation is highly robust, given that both its productivity and generality are high. In contrast, Korean vowel harmony and laryngeal OCP are low in robustness, since they are limited in both productivity and generality, as summarised in (19).

The robustness summarised in (19) may lead us to make the following prediction based on the hypothesis in (1): in Korean, palatalisation will have more lexical support compared to vowel harmony and laryngeal OCP. To test this prediction, we conduct an acceptability rating experiment for non-words in §3 and a computational modelling of learning a phonotactic grammar in §4, modelled loosely on Daland et al. (Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011).

3. Experiment

In this section, to measure lexical support for each of the three alternation patterns described in the previous section, we perform an acceptability-rating experiment investigating Seoul Korean speakers’ judgements on non-words. From the hypothesis in (1), it is predicted that the more robust an alternation is, the more the corresponding phonotactic pattern will influence participants’ non-word ratings. More specifically, participants’ ratings for target non-words violating a phonotactic constraint will be lower than those for control non-words satisfying it only when the associated alternation is robust. This prediction will be tested against the results of the current experiment.

3.1. Method

This section provides information about the stimuli, participants, and procedure utilised in the experiment.

3.1.1. Stimuli

In the experiment, we created 201 Korean non-words, of which eight were practice items and the rest were test items. (A list of test non-words is provided in the Supplementary Material.) The practice items were presented to participants before the main experiment. They were chosen to expose experimental participants to a wide range of acceptability, from high to intermediate to low.

The test stimuli consisted of three sets, each of which was intended to investigate phonotactics related to one of the three effects (palatalisation, vowel harmony and laryngeal OCP). We did our best to ensure several conditions. First, the words do not sound like any particular real Korean words or phrases. They also do not violate any major phonotactic constraints of Korean other than those we were examining in this study.Footnote 12 Further, all stimuli consist of two open syllables, of which the second may include a glide.Footnote 13 Thus, the stimuli can be schematised as C1V1C2(G)V2, where C = consonant, V = vowel and G = glide. For V1 and V2, we use six simple vowels: /i e a o ʌ u/. Specific details about each set are provided in the following paragraphs (see the Supplementary Material for more details about them).

Sixty-six Korean non-words were used to explore the palatalisation effect. In this set, the lenis and aspirated coronal stops, /t tʰ/, and their non-coronal counterparts, /p k pʰ kʰ/, occur in C2 position. The high front vocoids, /i j/, and the other glide, /w/, occur in V2 and G positions. As shown in (20), 20 target non-words with TI sequences and 20 control non-words with non-coronal stops followed by high front vocoids such as /pi/ (henceforth PI) were created. In addition, control non-words with coronal stops followed by vocoids other than high front vocoids such as /ta/ (henceforth TA) were created. Specifically, 10 control non-words with vowels other than /i/ and four control non-words with /w/ were added to the list.

Six non-words with non-coronal stops followed by a labial-velar glide such as /pw/ were adopted to reduce the imbalance of the stimuli set in which a palatal glide /j/ was used predominantly over a labial-velar glide /w/. In addition, six non-words which were originally created to investigate laryngeal OCP and/or vowel harmony turned out to be useful to investigate palatalisation, and thus were also considered in the analysis as part of the current set of non-words. As a result, among 66 non-words in this set, 20 were target items with TI, 17 control items with TA, 23 control items with PI and 6 fillers with non-coronal stops followed by a labial-velar glide.

Eighty Korean non-words were used to explore the vowel harmony effect. The V1–V2 sequences used exhausted all possible sequence permutations of the six vowels, /a e i o u ʌ/, and for each V1–V2 sequence, two non-words were created, thus yielding 72 non-words in total (6 vowels × 6 vowels × 2 items), as shown in (21).

Eight non-words which were originally created to investigate palatalisation turned out to be useful to investigate vowel harmony, and thus were also considered in the analysis as part of the relevant set of non-words. Among 80 non-words in this set, 33 were disharmonic target items and 47 were harmonic control items.

Seventy-five Korean non-words were used to explore the laryngeal OCP effect in Korean phonotactics. The C1–C2 onset sequences used exhausted all possible sequence permutations of three consonant phonation types, that is, lenis, aspirated and tense. For each C1–C2 phonation type sequence except for lenis–lenis, non-words with eight different V1–V2 sequences were created. Eight lenis–lenis non-words had already been created for the other two conditions, thus making up 72 non-words (3 phonations × 3 phonations × 8 vowel sequences), as shown in (22).

For C1 and C2, we used four lenis obstruents /p t c k/ as well as their aspirated and tense counterparts. Among the non-words created to investigate vowel harmony and/or palatalisation, 11 of them, including eight items with lenis–lenis sequences, met the relevant conditions required for this set of non-words. These items were thus also considered in the analysis as part of this set of non-words. The resulting set consists of 32 target non-word items with two laryngeally marked consonants (henceforth ‘double-lar’) and 43 control items which can be further divided into eight non-words with no laryngeally marked consonants (henceforth ‘no-lar’) and 35 non-words with a single laryngeally marked consonant (henceforth ‘single-lar’).

All the Korean non-words from the three sets were recorded as stimuli in a soundproof booth with a high-quality microphone. One female speaker of Seoul Korean produced all of the stimuli in isolation with declarative intonation. The recorded stimuli were RMS equalised in Praat (Boersma & Weenink Reference Boersma and Weenink2020) and checked to ensure that they could be heard as intended. Five Seoul Korean speakers were employed to transcribe all recorded audio stimuli in the standard Korean orthography. Non-words which were incorrectly transcribed were re-recorded until all five of the transcribers identified them correctly. The stimuli were pseudo-randomised in order, such that stimuli of the same set never followed each other and were always separated by stimuli of different sets.Footnote 14 We controlled the presentation-order effect by using four different pseudo-randomised orders.

3.1.2. Participants

Participants were recruited from the community at Seoul National University through online advertising. They were restricted to native speakers of Seoul Korean with ages of at least 18 years. Well-formedness ratings to the test stimuli and demographic information of participants were collected through an online Google Forms survey. All participants ( $n=57$ ) were compensated with online gift cards for their participation.

Data from participants who reported themselves as being non-native speakers of Seoul Korean or who did not agree to the IRB statement were excluded from the analysis. In addition, data from participants who chose the same response option for two-thirds or more of the stimuli were excluded from the analysis. The final set of data included responses from 49 participants (38 female and 11 male; mean age = 25.37).

3.1.3. Procedure

The participants were instructed to rate the stimuli, on a 1–6 Likert scale, according to how acceptable they sounded as standard Seoul Korean words, not loanwords.Footnote 15 A rating of 1 meant ‘unlikely as a Korean word’, whereas 6 meant ‘likely as a Korean word’. The stimuli were provided in the form of a sound file, which was hosted on YouTube and linked to each question. The instruction did not include any statement about morphological composition of the stimuli, under the expectation that the participants would view the stimuli spoken in isolation as monomorphemic, based on their disyllabic length, as mentioned in footnote 13. The participants proceeded through the experiment at their own pace. The experiment lasted approximately 25 minutes.

3.2. Results

The mean of all the received responses was 3.05 (see the Supplementary Material for the mean response score, shown under ‘human rating’, for each test non-word). As anticipated, the mean well-formedness ratings of the eight practice items covered a relatively wide range of the scale, as shown in Figure 1.

Figure 1 A plot of ratings of practice items.

Three practice items, [neho] (4.0), [kʌta] (3.55) and [pahe] (3.41), which we, as native speakers of Korean, consider perfectly well-formed, received clearly higher scores than three practice items, [kocɨ] (2.45), [les’ʌ] (2.39) and [pot’ja] (1.63), which we consider ill-formed.Footnote 16 Excluding the practice items, we analysed participant ratings on the rest of the stimuli. The total number of responses was 9,457 (49 participants × 193 test non-words). Recall that the test non-words were created in three sets to investigate the three effects – palatalisation, vowel harmony and laryngeal OCP – in the lexicon. In the following subsections, we will analyse and discuss the results separately for each set of test non-words.

3.2.1. Palatalisation

This section concerns the analysis of the results pertaining to palatalisation. The participants gave different ratings for non-words with different consonant–vocoid (CV) sequence types, as shown in Figure 2.Footnote 17

Figure 2 Palatalisation: Mean Likert rating in the survey of Korean non-words, by CV sequence types. Smaller semitransparent points represent individual non-words, averaged across participants, and larger points represent overall averages. The number of non-words of the corresponding type is shown in parentheses.

Note in Figure 2 that the target non-words with TI (2.45) were rated lower than the control non-words with TA or PI (3.13), as would be expected if the palatalisation constraint affects the participants’ phonotactic judgements. In order to provide a statistical analysis of the observed differences, we fitted a linear mixed effects model in a Bayesian framework, using the brms package (Bürkner Reference Bürkner2018) in R (R Core Team 2022). We chose a Bayesian implementation of the model for a variety of reasons: it allows us to retrieve quantitative information about a hypothesis of interest, to make direct comparisons between levels of an independent variable, and to fit complex random effects structures, not worrying about convergence issues (Nicenboim & Vasishth Reference Nicenboim and Vasishth2016; Franke & Roettger Reference Franke and Roettger2019). For a hypothesis that a parameter is larger, or smaller, than zero, we will conclude that there is compelling evidence for this hypothesis if the 95% credible interval (CI) for the parameter does not include zero, and/or if the posterior probability of the hypothesis is close to one (Franke & Roettger Reference Franke and Roettger2019). The model used the brms default priors, with four sampling chains of 2,000 iterations each and a warm-up period of 1,000 iterations. (The same Bayesian setting was used in the statistical model analyses conducted in the rest of the article.) Response ratings and the CV sequence type were included as dependent and independent variables, respectively. The CV sequence type with three levels, TA, PI and TI (baseline), was dummy coded. Following Barr et al. (Reference Barr, Levy, Scheepers and Tily2013), we included the maximal random effect structure justified by the data. Specifically, random intercepts were included for participant and non-word stimulus, and a random slope was included for participant. The results of the Bayesian analysis, presented in (23), show that it is likely true that the target non-words with TI are rated lower than control items with TA and PI, given that 95% CIs for the coefficients TA and PI do not include zero.Footnote 18

In order to retrieve more quantitative information, we accessed the samples of the fitted model. The sample distributions show that the posterior probabilities of the propositions that TI < TA and that TI < PI are 99.98% and 98.55%, respectively, suggesting that there is compelling evidence for the differences.

In order to have a more accurate understanding of the current ratings pattern, let us consider whether the participants’ ratings differ depending on the specific target TI sequence. Four TI sequences, /ti/, /tʰi/, /tj/ and /tʰj/, were used in the target non-words. As can be seen in Figure 3, the mean ratings for three of them, that is, /tʰi/, /tj/ and /tʰj/, are clearly low, below 2.5, and we can consider them to be ill-formed, given that the three ill-formed practice items received scores below 2.5, as shown in Figure 1.

Figure 3 Palatalisation: Mean Likert rating of target non-words with TI sequences and control non-words with tA and Pi.

The non-words with /ti/ received relatively higher mean ratings, that is, 3.3, than the rest of the target non-words, but they received relatively lower ratings than their control non-words with tA and Pi. To provide a statistical analysis of the observed differences, we fitted a Bayesian mixed effects linear regression model to the data under consideration. Response ratings and consonant–vocoid sequence type – /ti/ (reference), /tʰi/, /tj/, /tʰj/, /tA/ and /Pi/ – were included as the dependent and independent variables, respectively. The consonant–vocoid sequence type was dummy coded. Random intercepts were included for participant and non-word stimulus, and a random slope was included for participant. The results of the Bayesian analysis, presented in (24), show that it is likely true that the target non-words with /ti/ are rated higher than the rest of target non-words with /tʰi tj tʰj/, given that 95% CIs for the corresponding coefficients do not include zero. The sample distributions also show that the posterior probabilities of the propositions that non-words with /ti/ are higher in ratings than those with /tʰi tj tʰj/ are 100%, suggesting that there is compelling evidence for the differences. These differences can be understood if we consider that aspirated consonants and complex onsets are more marked than lenis consonants and simplex onsets, respectively.

Let us now compare /ti/ and its controls, /tA/ and /Pi/. As can be seen in (24), it is likely true that the target non-words with /ti/ are rated lower than those with /tA/, given that the 95% CI for the relevant coefficient, that is, tA, does not include zero. This conclusion is confirmed by the sampling distributions showing that the posterior probability of the proposition that non-words with /ti/ are lower in ratings than those with /tA/ is 100%. On the other hand, it is not clear that the target non-words with /ti/ are rated lower than those with /Pi/, given that the 95% CI for the coefficient Pi includes zero. Also, the sampling distributions show that the posterior probability of the proposition that non-words with /ti/ are lower in ratings than those with /Pi/ is 92.2%. These results suggest that there is some evidence that target items with /ti/ are lower than controls with /Pi/, but the difference is not large enough to be compelling. Thus, it seems that non-words with /ti/ were dispreferred by Korean participants, as suggested by the comparison with controls with /tA/, and that this pre-/i/ dispreference might not be confined to coronal consonants, as suggested by the comparison between targets with /ti/ and controls with /Pi/.

In summary, most of the target TI sequences received clearly low mean ratings, signalling ill-formedness. /ti/, which received the highest ratings among the TI sequences, can still be considered ill-formed, mainly based on its comparison with /tA/ and other clearly well-formed sequences which will be presented in the sections below. These results indicate that the robust alternation patterns of Korean palatalisation are phonotactically supported.

3.2.2. Vowel harmony

If vowel harmony affected the participants’ judgements, the disharmonic target non-words would be rated lower than harmonic control ones. This expectation was not met, as can be seen in Figure 4.

Figure 4 Vowel harmony: mean Likert rating in the survey of Korean non-words, by vowel sequence type (disharmonic and harmonic).

Note that the mean response scores were higher for non-words with disharmonic vowel sequences than for those with harmonic vowel sequences. Thus, there is no indication that disharmonic words were less acceptable than harmonic ones. This is confirmed by the results of a Bayesian linear mixed effects model analysis of relevant ratings data. Response ratings and the vowel sequence type were included as dependent and independent variables, respectively. The vowel sequence type with two levels, harmonic and disharmonic (baseline), was dummy coded. Random intercepts were included for participant and non-word stimulus, and a random slope was included for participant. The results of the Bayesian analysis, presented in (25), suggest that there is no vowel harmony effect, given that the coefficient harmonic is negative. This is confirmed by the sample distributions showing that the posterior probability that harmonic sequences are higher in ratings than disharmonic ones is only 6.4%.

In summary, disharmonic non-words were not lower in the participants’ ratings than harmonic ones. This leads us to conclude that Korean vowel harmony, which is limited in both generality and productivity, is not supported by phonotactics.

3.2.3. Laryngeal OCP

This section discusses the results pertaining to laryngeal OCP. The participants gave different ratings for non-words with different number of laryngeally marked consonants, as shown in Figure 5.

Figure 5 Laryngeal OCP: Likert rating in the survey of Korean non-words, by number of laryngeally marked consonants.

Note in Figure 5 that as the number of laryngeally marked consonants increases, the mean ratings decrease, and that the double-lar non-words were rated the lowest. Most of double-lar items (28/32) received scores below 2.5. It seems clear that they were judged to be ill-formed, given that, as shown in Figure 1, the three ill-formed practice items received scores below 2.5. In contrast, all eight no-lar items received scores above 3.5. In addition, single-lar items received lower mean ratings, which span a wide range of the scale, compared to no-lar items. This suggests that single-lar items may be somewhat ill-formed, possibly violating a constraint that prohibits a (single) laryngeally marked consonant. The key question is whether double-lar items with two laryngeally marked consonants exhibit a superadditive effect, one that exceeds the cumulative effect of two segmental markedness constraints or two instances of the same segmental constraint violation. Such superadditivity is expected under the influence of the Laryngeal OCP which disallows the co-occurrence of two laryngeally marked consonants. (For an in-depth discussion of superadditive effects in phonological data, see Shih Reference Shih2017.)

In order to determine if there is an effect of laryngeal OCP on participants’ judgements, we conducted a Bayesian linear mixed-effects model analysis using relevant ratings data. The dependent variable was the response ratings, while the independent variables included the phonation type (lenis vs. laryngeal) of C1 and C2, as well as their interaction. It should be noted that the interaction term, which is adopted to capture superadditivity, would be statistically significant if the laryngeal OCP is truly effective. We dummy coded the phonation type with two levels, lenis (baseline) and laryngeal. We included random intercepts for participant and non-word stimulus. All fixed factors were included as by-participant random slopes.

The results of the Bayesian analysis, presented in (26), show that the coefficients for the two fixed factors, the C1 and C2 phonation types, are negative, and their 95% CIs do not include zero. This suggests that it is likely true that the target non-words are rated lower when either C1 or C2 is laryngeally marked compared to when neither is lenis. Thus, the effects of (positional) markedness constraints prohibiting (single) laryngeally marked consonants are confirmed. However, considering that the coefficient for the interaction term is positive – which is the opposite of the prediction under the influence of the Laryngeal OCP – it seems unlikely that a laryngeal OCP effect exists in the ratings data. This conclusion is supported by the sample distributions, which show that the posterior probability of the interaction being below 0 is a mere 3.2%.

For a more accurate understanding of the current ratings data, let us consider whether the participants’ ratings differ depending on the specific phonation type of an onset consonant. As shown in Figure 6, all three different types of double-lar items with (i) tense, (ii) aspirated and (iii) tense and aspirated (henceforth tense//asp) onset sequences were lower in ratings than single-lar non-words with an aspirated onset (henceforth single-asp) which were in turn lower than no-lar items. This is consistent with the response pattern shown in Figure 5. However, single-lar items with a tense onset (henceforth single-tense, $\textrm {mean rating} = 2.29$ ) were almost as bad as double-lar items, especially items with two aspirated onsets (henceforth double-asp, $\textrm {mean rating} = 2.22$ ).

Figure 6 Laryngeal OCP: Likert rating in the survey of Korean non-words, by onset type.

In summary, the participants’ ratings patterns in Figure 6 indicate the following hierarchy: no-lar > single-asp > single-tense and double-lar.

To check whether there is a laryngeal OCP effect, we focus on the response results for double-lar items with at least one tense onset, that is, double-tense and tense//asp, which are parallel in onset combination with compounds where blocking of tensing by the laryngeal OCP may occur. If double-tense items are subject to laryngeal OCP, the reduction in ratings from single-tense to double-tense items would be larger than the reduction from no-lar to single-tense items. This predicted superadditive effect did not occur, as can be seen in (27).

If tense//asp items are subject to laryngeal OCP, the reduction from no-lar items to tense//asp items would be larger than the sum of the reductions from no-lar to single-tense items and to single-asp items. As above, the predicted superadditivity effect did not emerge, as can be seen in (28).

All the above comparisons show subadditivity, not superadditivity, which is not consistent with the prediction under the effect of laryngeal OCP. This, along with the results of the Bayesian mixed effects linear regression analysis shown in (26), indicates that the laryngeal OCP did not affect the participants’ phonotactic judgements.

However, before we conclude that there is no laryngeal OCP effect in Korean speakers’ phonotactic judgements, note in Figure 6 that the ratings assigned to single-tense items are already quite low, and the ratings of double-lar items can hardly be lowered further. Floor effects of this type might prevent us from finding the superadditive effect. Consequently, based on the results presented in this section, we do not know for sure whether the laryngeal OCP in Korean is phonotactically supported or not.

3.2.4. Summary

Mean participant ratings of the target and control items of each of the three sets of non-words are provided in (29).

In the set for palatalisation, mean ratings are lower for target items than for control items. Patterns vary depending on the type of TI sequences. Specifically, non-words with /ti/ received higher ratings than the rest of target items, but Korean participants did not judge them to be well-formed. Target non-words with /ti/ (3.3) were clearly lower in mean ratings than their control items with /tA/ (4.29), as well as no-lar items in the laryngeal OCP set (4.14) and non-words in the vowel harmony set, whether disharmonic (3.99) or harmonic (3.8), all of which can be plausibly assumed to be well-formed. Thus, it seems that Korean participants considered all different types of target non-words in the palatalisation set to be ill-formed. This leads us to conclude that Korean cross-morpheme palatalisation with high generality and productivity has strong phonotactic support.

In contrast, in the set for vowel harmony, target items with disharmonic sequences received slightly higher ratings than control items with harmonic sequences, which is the opposite of the prediction under the hypothesis that vowel harmony holds in Korean speakers’ phonotactic judgements. The two sets, disharmonic (3.99) and harmonic (3.8), were close in mean ratings to no-lar items of the laryngeal OCP set (4.14), but much higher than single-lar (2.79) and double-lar (2.02) items. It seems that the set of non-words created to test Korean vowel harmony, whether disharmonic or harmonic, were judged to be well-formed. Thus, we may conclude that Korean vowel harmony in verbal inflection with limited generality and productivity has no phonotactic support.

Finally, in the set for laryngeal OCP, the superadditive effect, expected under the effect of laryngeal OCP, did not arise. However, even single-lar control items with a tense onset were very low in the adopted acceptability scale, and thus it was almost impossible to capture the superadditive effect. Consequently, the current response data are unclear about whether the laryngeal OCP active in compound tensing with limited generality and moderate productivity is phonotactically supported or not.

4. Computational modelling

In this section, to test our research hypothesis presented in (1), we measure the lexical support for the three alternation patterns in Korean by conducting a simulation of learning a MaxEnt grammar (Goldwater & Johnson Reference Goldwater, Johnson, Spenader, Eriksson and Dahl2003; Hayes & Wilson Reference Hayes and Wilson2008). Specifically, we train the UCLA Phonotactic Learner (Hayes & Wilson Reference Hayes and Wilson2008) on a Korean lexicon. Given that observed/expected (O/E) values, used in some previous studies (Pierrehumbert Reference Pierrehumbert1993; Frisch et al. Reference Frisch, Pierrehumbert and Broe2004 and others) for examining lexical statistics, are mathematically flawed (Wilson & Obdeyn Reference Wilson and Obdeyn2009), it seems that the weights of constraints in the learned MaxEnt grammar are the only reliable indicator of over- or underrepresentation of a lexical pattern currently available. It is predicted from the hypothesis in (1) that the more robust an alternation pattern is, the more likely its associated phonotactic constraint is to be learned and the higher its weight will be.

The learned grammar with weighted constraints will assign well-formedness ratings to the non-words adopted in the acceptability-rating experiment. The resulting model ratings will be used in testing a prediction from our research hypothesis. More specifically, model ratings for target non-words violating a phonotactic constraint will be lower than those for control non-words satisfying it only when the associated alternation is robust.

In addition, the model ratings will be compared to the participants’ ratings (which will be referred to as ‘human ratings’) presented in the previous section to figure out whether and how much the participants’ phonotactic judgements are based on over- or underrepresentation of a pattern in the lexicon. It is often assumed in the phonological literature that human speakers make phonotactic judgements on the basis of their phonotactic grammar, which in turn reflects lexical frequency (Hayes & Wilson Reference Hayes and Wilson2008; Albright Reference Albright2009; Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011; Chong Reference Chong2017, and others). If it is true that Korean participants responded using their phonotactic grammars that are constructed based on lexical representation, the model ratings should correlate well with the participants’ ratings.

In the remainder of this section, the method of the modelling is first described, and then the results of the modelling are reported and analysed to test our main research hypothesis in (1) and its predictions.

4.1. Method

In order to construct a Seoul Korean lexicon, we first selected words listed as standard Korean in the Standard Korean dictionary (henceforth the SKD, National Institute of the Korean Language 1999; downloaded from https://stdict.korean.go.kr/mypage/download/downloadList.do on 17 September 2020). Among them, we further selected words with a frequency of at least one occurrence in the Sejong corpus. We extracted the citation-form pronunciations of the selected words from the SKD. In addition, we determined their morphological composition (simplex/complex) and etymological origin (native/Sino-Korean/loan) mainly based on the SKD. We prepared three monomorphemic word lexicons, not weighted by token frequency: (i) one with native Korean words ( $n = 5,335$ ), (ii) one with Sino-Korean words ( $n = 32,702$ ) and (iii) one with both native and Sino-Korean words ( $n = 38,037$ ).Footnote 19 Each of these was used as training data for modelling phonotactic grammars.

We conducted a simulation of learning a MaxEnt grammar by using UCLA phonotactic learner (Hayes & Wilson Reference Hayes and Wilson2008). The feature system fed to the learner was constructed mainly based on Cho (Reference Cho2012). Aspirated and tense consonants had different values for the feature [aspirated], but the two were specified as [+tense], thus forming a natural class. The feature [+spread glottis] was assigned only to /h/. Departing from Cho, we distinguished between onset and coda consonants by using the feature [rhyme], as in Daland et al. (Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011) and Park (Reference Park2020). For the vowels, we adopted a nine-vowel system, /i e a ʌ o u ɨ y ø/, not distinguishing between /e/ and /ɛ/, which are known in the literature to be no longer contrastive, at least in younger Korean speakers’ speech. Although the front rounded vowels, /y ø/, are known to be diphthongised often, it is still possible that they are pronounced as monophthongs at least in some words, and thus we decided to adopt a conservative choice by distinguishing them in the vowel inventory. As recently discussed by Jun (Reference Jun and Aronoff2018), it is quite controversial what the feature involved in Korean vowel harmony is. A variety of features, such as backness, lowness, ATR and RTR, have been proposed. Given this indeterminacy, we specified /a o/ as [+RTR] and the rest of the Korean vowel phonemes as [−RTR], but make no theoretical commitment to this identification of the harmonising feature.

The learner settings were as follows. The length of constraint (i.e., the maximum gram size), the maximum number of constraints to discover and the highest O/E threshold were set at 2, 300 and 0.3, respectively.Footnote 20 Constraints including complement natural classes (designated by the symbol \^~) were allowed to be found. Autosegmental tiers, called ‘projections’, were created and fed to the learner so that constraints for nonlocal processes like vowel harmony and laryngeal OCP could be found. Specifically, constraints can be defined on three different projections: (i) default segmental projection, that is, full representation; (ii) vowel projection, that is, the substring consisting of all and only vowels; and (iii) laryngeal projection, that is, the substring of all and only obstruent onsets. Palatalisation, vowel harmony and laryngeal OCP constraints were expected to be found from default, vowel and laryngeal projections, respectively. Finally, no prespecified constraints were fed to the learner.

4.2. Simulation results

The UCLA Phonotactic Learner creates phonotactic constraints through an inductive learning algorithm and assigns numerical weights to them according to the principle of maximum entropy (see Hayes & Wilson Reference Hayes and Wilson2008 for relevant details). If a form is underrepresented in the training data, a constraint prohibiting it will likely be learned. The more underrepresented a form is, the higher the weight of the constraint prohibiting it. To illustrate, some high-weighted constraints learned from the lexicon of native and Sino-Korean monomorphemes are shown in (30).

The two constraints in (30) reflect well-known phonological restrictions in Korean. In Korean, the velar nasal and true consonant clusters are not allowed at all in the onset.

The learner gave penalty scores to each test non-word depending on the weight of constraints violated by it and the number of violations, as defined by Hayes & Wilson (Reference Hayes and Wilson2008: 383, (4)). Note that such penalty scores (often called harmony scores) are known to match with human well-formedness intuitions (Breiss & Hayes Reference Breiss and Hayes2020: 362). For model predictions about acceptability of a non-word which will be compared to its human ratings, we used negative values of the penalty scores assigned to it which increase as acceptability increases like human ratings. Henceforth, the negative penalty scores will be referred to as ‘model ratings’. Since we trained the learner on three different lexicons, we were provided with three sets of model ratings.

Which lexicon and which set of model ratings should we rely on? We opt for the lexicon consisting of native and Sino-Korean monomorphemes, based on a plausible assumption, adopted by Chong (Reference Chong2019), that ordinary Korean speakers cannot reliably distinguish between native and Sino-Korean words. Still, since we were curious about whether and how the three lexicons differ in explaining the human ratings, we calculated the correlations (Spearman) between human ratings and each of the three sets of model ratings.Footnote 21 As can be seen in (31), the ratings given by the MaxEnt trained on both native and Sino-Korean words show the highest correlation with the human ratings although there is only a small difference between the lexicons consisting of only Sino-Korean words and both native and Sino-Korean words, which is not surprising if we consider the fact that the latter lexicon consists mostly of Sino-Korean words (85.97%, i.e., 32,702 out of 38,037).

In the remainder of this article, the set of native and Sino-Korean monomorphemes in our Seoul Korean corpus will represent the Seoul Korean lexicon, and the ratings given by a MaxEnt model trained on it will be referred to as ‘model ratings’.

As shown under ‘native and Sino-Korean’ in (31), the correlation between human and model ratings (0.73) is high. This high correlation between human and model ratings indicates that the Korean participants’ well-formedness judgements were heavily based on the frequency distribution of the Korean lexicon, confirming the assumption that speakers’ judgements are a good proxy for lexical phonotactics. In Figure 7, the human ratings for the entire test set of non-words, are plotted against the model ratings. (Human and model ratings for all test non-words are provided in the Supplementary Material.)

Figure 7 All test non-words: Correlation between human and model ratings (z-transformed; $n = 193$ ; dashed line = regression line).

The scatter plot in Figure 7 shows an overall trend that the human ratings increase as the model ratings increase, as is consistent with the high positive correlation between the human and model ratings, mentioned above.

Let us now consider how the correlations between human and model ratings differ depending on the stimuli. For each set of test stimuli, the correlation (Spearman) between observed human ratings, presented in the previous section, and the corresponding model ratings were calculated as shown in (32).

The correlation between human and model ratings is high for the non-words in the laryngeal OCP set, and moderate for the non-words in the palatalisation set. However, the correlation for non-words in the set of vowel harmony (0.20) is low. As will be shown below, the main reason for this relatively low correlation is that many of the non-words in the vowel harmony set were predicted to be rated as almost perfect, and thus the model rating values did not vary sufficiently, leaving little room for correlations with human ratings. In the following subsections, we discuss model ratings for each set of test non-words, testing the predictions from our research hypothesis in (1), and comparing them with the corresponding human ratings.

4.2.1. Palatalisation

In this section, we report and discuss the modelling results pertaining to palatalisation to test the specific predictions, shown in (33), which can be made from the hypothesis that Korean palatalisation, which is general and highly productive across morphemes, has strong lexical support.

As shown in (34), the first prediction in (33a) seems to be confirmed since a general palatalisation constraint prohibiting TI sequences was learned in the current simulation, and its weight (3.286) is relatively high.

The weight 3.286 is not as high as the weights of constraints reflecting exceptionless generalisations in Korean, for example, no velar nasal in the onset (6.605) and no true onset consonant clusters (7.682), as shown in (30). However, as will be shown below, it is much higher than the weights of any constraints with the effect of vowel harmony or laryngeal OCP.

The second prediction in (33b) is also confirmed by the model ratings in the violin plot in Figure 8.

Figure 8 Palatalisation: Model ratings of non-words, by CV sequence type. Crossbars represent mean ratings.

Note that target non-words with TI sequences are clearly lower in mean ratings than control non-words with TA and PI sequences. Recall that the observed lower ratings of non-words with TI sequences were also true for human ratings. The specific word-by-word correspondence can be seen in Figure 9, in which the human ratings for the palatalisation set are plotted against the model ratings.

Figure 9 Palatalisation: Correlation between human and model ratings ( $n = 66$ ; dashed line = regression line; filled circle = target with TI, empty triangle = control with TA, empty circle = control with PI).

Note that about half of the non-words with TI sequences, that is, target items (marked with filled circle) are located in the lower left quadrant of the plot, indicating that they are both predicted and judged to be ill-formed. Most of the control items with TA or PI sequences (marked with empty triangle and circle, respectively) are both predicted and judged to be better formed than the target items.

For a more accurate understanding of the current model ratings pattern, let us consider whether the model ratings differ depending on the specific target TI sequence. Recall that four TI sequences, /ti/, /tʰi/, /tj/ and /tʰj/, were used in the target non-words. As can be seen in Figure 10, non-words with no glide (/ti tʰi/) received higher ratings than those with a glide (/tj tʰj/). Among target non-words with no glide, those with a lenis onset (/ti/) received higher ratings than those with an aspirated onset (/tʰi/). Non-words with /ti/ were lower in mean ratings than control non-words with /tA/ and /Pi/.

Figure 10 Palatalisation: Mean model rating of target non-words with TI sequences and control non-words with tA and Pi. Smaller semitransparent points represent individual non-words, and larger points represent overall averages.

To provide a statistical analysis of the observed differences, we fitted a Bayesian linear regression model to the current model ratings data. Model ratings and consonant–vocoid sequence type – /ti/ (reference), /tʰi/, /tj/, /tʰj/, /tA/ and /Pi/ – were included as the dependent and independent variables, respectively. The consonant–vocoid sequence type was dummy coded. The results of the Bayesian analysis, presented in (35), suggest that it is likely true that the target non-words with /ti/ are rated higher than target non-words with /tj tʰj/, and that the target non-words with /ti/ were rated lower than control non-words with /tA Pi/, given that 95% CIs for the corresponding coefficients do not include zero. The sample distributions confirm all these conclusions, and, in addition, show that the posterior probability of the proposition that non-words with /ti/ are higher than those with /tʰi/ is 95.38%, suggesting that we can be 95.38% certain that items with /ti/ sequences are more well-formed than those with /tʰi/.

The most important observation here is that model ratings of target non-words with /ti/, as well as other TI, sequences are meaningfully lower, compared to control non-words with /tA Pi/, confirming the prediction in (33b). This is in part consistent with the corresponding results of an acceptability-rating experiment, presented in §3.2.1. Recall that there is compelling evidence that the participants gave lower ratings to the non-words with /ti/ relative to /tA/, not /Pi/. Thus, it seems that the lexical support for palatalisation is even stronger in the MaxEnt simulation than in speakers’ phonotactic judgements. An additional discrepancy between model and human ratings for non-words in the palatalisation set can be seen; the difference between non-words with /tʰi/ vs. /tj tʰj/ is larger in model ratings than in human ratings. This might be in part due to a floor effect in which the human ratings were already low for non-words with /tʰi/ (2.44), and so the participants could not give much lower ratings for non-words with /tj tʰj/.

In conclusion, the two predictions in (33) are confirmed by the results of the current simulation, indicating that lexical items with TI sequences in the Korean lexicon are underrepresented, and thus that general and productive cross-morphemic palatalisation in Korean has strong lexical support.

4.2.2. Vowel harmony

In this section, we report and discuss the modelling results pertaining to vowel harmony to test the specific predictions presented in (36) which can be made from the hypothesis that Korean vowel harmony with limited generality and productivity has no, or weak, lexical support.

To test the prediction in (36a), we investigated the vowel projection constraints learned in the current simulation. All vowel projection constraints learned either were trivial (such as *## ‘no word without a vowel’) or else involved front rounded vowels [y ø] or the high back unrounded vowel [ɨ], all of which have highly limited distributions in Korean. Note that none of these vowels were used in the current set of non-words. What is important here is that no vowel harmony constraints prohibiting disharmonic vowel sequences were learned, confirming the prediction in (36a). This indicates that disharmonic vowel sequences are not underrepresented in the Korean lexicon.

To test the second prediction in (36b), we investigated the distribution of model ratings of the non-words in the vowel harmony set. As can be seen in Figure 11, the mean model ratings are slightly higher for disharmonic non-words than for harmonic ones, thus confirming the prediction in (36b).Footnote 22

Figure 11 Vowel harmony: Model ratings of Korean non-words, by the type of vowel sequence (disharmonic vs. harmonic). Crossbars represent mean ratings.

Recall from §3.2.2 that higher scores for disharmonic words were also found in the human ratings. The specific word-by-word correspondence can be seen in Figure 12, in which the human ratings for the vowel harmony set are plotted against the model ratings.

Figure 12 Vowel harmony: Correlation between human and model ratings ( $n = 80$ ; dashed line = regression line; filled circle = disharmonic, empty circle = harmonic).

Note in Figure 12 that many items, either disharmonic (filled circle) or harmonic (empty circle), were located near the high end of the model rating scale. Specifically, as shown in (37), the majority of the test items in the vowel harmony set (68 out of 80 items) were predicted to be perfect (i.e., 0) or near-perfect (i.e., $-1.62$ and $-1.69$ ). The limited variation due to the dominance of such (almost) perfectly well-formed non-words can explain why the correlation between human and model ratings was relatively low (0.20) for the non-words in the vowel harmony set, presented in (32).

In conclusion, the two predictions in (36) are confirmed by the results of the current simulation, indicating that lexical items with disharmonic vowel sequences in the Korean lexicon are not underrepresented, and thus that Korean vowel harmony with limited generality and productivity has no lexical support.

4.2.3. Laryngeal OCP

In this section, we report and discuss the modelling results pertaining to laryngeal OCP to test the specific predictions presented in (38) which can be made from the hypothesis that laryngeal OCP in Korean with limited generality and productivity has no, or weak, lexical support.

To test the first prediction in (38a), we investigated the laryngeal projection constraints learned in the current simulation. No general laryngeal OCP constraint (i.e., *[+tense][+tense]) was learned. However, two of the three constraints learned on the laryngeal projection, (39a) and (39b), express more specific laryngeal OCP generalisations. In (39a) *[+tense][+aspirated], with a weight of 1.256, penalises aspirated–aspirated and tense–aspirated onset sequences, and in (39b) *[+aspirated][−aspirated,+tense], with a weight of 0.564, penalises aspirated–tense onset sequences. No constraint prohibiting tense–tense sequences was learned on the laryngeal projection. In summary, relatively low-weighted laryngeal OCP constraints were learned only for some potential target sequences, which we may consider to be consistent, at least in part, with the prediction in (38a). The set of learned laryngeal OCP constraints and their weights suggest that only some of onset sequences violating the laryngeal OCP are weakly underrepresented and the rest are not underrepresented in the Korean lexicon. This may indicate that laryngeal OCP in Korean compounding has weak lexical support.

To test the second prediction in (38b), we investigated the distribution of model ratings of the non-words in the laryngeal OCP set. The model ratings differ for non-words with different numbers of laryngeally marked consonants, as shown in Figure 13.

Figure 13 Laryngeal OCP: Model rating in the MaxEnt simulation by number of laryngeally marked consonants.

Note in Figure 13 that as the number of laryngeally marked consonants increases, the model ratings decrease, with the double-lar non-words being predicted to have the lowest ratings. Recall from §3.2.3 that the observed hierarchy, no-lar > single-lar > double-lar, was also true for human ratings. The more specific word-by-word correspondence can be seen in Figure 14, in which the human ratings for the laryngeal OCP set are plotted against the model ratings.

Figure 14 Laryngeal OCP: Correlation between human and model ratings ( $n = 75$ ; dashed line = regression line; empty circle = no-lar ( $n = 8$ ); grey circle = single-lar ( $n = 35$ ); black circle = double-lar ( $n = 32$ )).

Note that many of the double-lar items, that is, target items (marked with black circles), are located in the lower left quadrant of the plot, indicating that they are both predicted and judged to be ill-formed. Among control items, all eight no-lar items (marked with empty circles) are located at the upper right quadrant of the plot, indicating that they are both predicted and judged to be well-formed, whereas single-lar items (marked with grey circles) show an intermediate pattern between double and no-lar items, covering a relatively wide range.

In order to have a more accurate and better understanding of the ratings pattern under consideration, let us consider how the patterns differ depending on the specific phonation type of an onset consonant. From Figure 15, in which model ratings of test non-words are plotted by the onset type, we can make some observations. First, single-lar items are not as good as no-lar items, as we have seen in Figure 13. Second, single-asp items are higher in model ratings than double-asp (and all other double-lar) items. Third, single-tense items are higher in model ratings than double-tense (and tense//asp) items, but lower than double-asp items. Thus, having one tense onset is already quite bad in the simulated grammar, as is also the case in the human judgements presented in §3.2.3. As we can see by comparing Figure 6 and Figure 15, one notable difference between human and model ratings is that the differences between single-tense vs. double-tense and tense//asp items are smaller in human ratings than in model ratings, which can be attributed to a floor effect.

Figure 15 Laryngeal OCP: Model rating in the MaxEnt simulation, by the onset type.

As we have done in §3.2.3, let us now check whether there is a superadditive effect, focusing on the modelling results for double-lar items with at least one tense onset, that is, double-tense and tense//asp. If double-tense items are subject to laryngeal OCP, the reduction in ratings from single-tense to double-tense items would be larger than the reduction from no-lar to single-tense items. This predicted superadditivity effect did not occur, as can be seen in (40).

If tense//asp items are subject to laryngeal OCP, the reduction in ratings from no-lar items to tense//asp items would be larger than the sum of the reductions from no-lar to single-tense items and from no-lar to single-asp items. This predicted superadditivity emerged, as can be seen in (41).

Thus, only non-words with tense//asp, not double-tense, sequences show a superadditive effect in mean model ratings, which is consistent with the laryngeal OCP constraints in (39a) and (39b), learned in the MaxEnt simulation. However, the evidence for a superadditive effect in (41) is not compelling, according to the results of a Bayesian linear regression model analysis (not shown), in which model ratings and the onset consonant type were included as the dependent and independent variables, respectively. The sample distributions show that the posterior probability of the proposition that Δ(no-lar, tense//asp) > Δ(no-lar, single-tense) + Δ(no-lar, single-asp) is only 90% (where Δ(A, B) represents the difference between A and B). Consequently, model ratings data indicate that the laryngeal OCP is not active in the Korean lexicon.

In summary, the results of the MaxEnt simulation indicate that only some subsets of potential target sequences of laryngeal OCP are possibly slightly underrepresented in the Korean lexicon, since co-occurrence constraints were learned only for those subsets (and with relatively low weights), not for the rest of the target sequences, and model ratings for double-lar non-words show no compelling evidence for superadditivity, which is predicted to occur under the hypothesis that laryngeal OCP is active in the Korean lexicon. These results lead us to conclude that the laryngeal OCP in Korean has no, or possibly weak, lexical support.

4.2.4. Summary

In this section, we have reported and discussed the results of the MaxEnt simulation, investigating whether target sequences are underrepresented in the Korean lexicon. The investigation was performed in two parts.

First, we investigated the learned constraints and their weights, summarised in (42).

Note that a general constraint was learned only for palatalisation, and its weight is clearly higher than the weights of the constraints learned for other effects. For vowel harmony, no constraint, general or specific, was learned. For laryngeal OCP, constraints were learned, but they prohibit only some subsets of the target double-lar sequences. These modelling results indicate that TI sequences are underrepresented; disharmonic vowel sequences are not underrepresented; and some, but not all, double-lar sequences are underrepresented, and those only weakly so.

Second, we investigated the patterns of well-formedness ratings assigned to the test non-words by the learned MaxEnt model. To test the effects of palatalisation and vowel harmony, we compared the model ratings between target and control items. In contrast, to investigate the effect of laryngeal OCP, we focussed on whether there is a superadditive effect in the model ratings of double-lar items. The relevant model ratings along with the corresponding human ratings are summarised in (43).

In the set for palatalisation, mean model ratings are clearly lower for target items than for control items. Model ratings vary depending on the type of TI sequences. Non-words with /ti/ are higher in model ratings than the rest of target items, as in human ratings, but they are still lower than the corresponding control items. Thus, all different types of target non-words in the palatalisation set received low model ratings, indicating that palatalisation affected the model’s ratings for non-words. In contrast, in the set for vowel harmony, target items with disharmonic sequences (−0.25) are slightly higher in model ratings than control items with harmonic sequences (−0.71), as in human ratings. This indicates that vowel harmony did not affect the model’s ratings. Finally, in the set for laryngeal OCP, we found that there is no compelling evidence for a superadditive effect, indicating that the laryngeal OCP did not affect the model’s ratings.

As summarised in (44), the above two sets of investigation together indicate that target sequences of palatalisation are underrepresented in the Korean lexicon, but those of vowel harmony are not. In addition, only some subsets of target sequences of laryngeal OCP may be underrepresented. These indications lead us to conclude that Korean palatalisation has strong lexical support whereas vowel harmony and laryngeal OCP has no, or weak, lexical support. Recall in §3.2.4 that the same conclusions were made from the results of the experiment, except for unclear lexical support for laryngeal OCP.

Given that Korean palatalisation applying across morphemes is high in robustness whereas vowel harmony in verbal inflection and laryngeal OCP in compound tensing are low in robustness, the results of the current research summarised in (44) generally support the hypothesis in (1).

5. Discussion

The results of the current experiment and computational modelling, presented in the previous section, show that the robustness of an alternation correlates well with the strength of the corresponding phonotactics, supporting the claim that there is a tight link between alternations and phonotactics. We believe that human speakers’ mental grammar is equipped with a mechanism responsible for this link. These results and indications are consistent with the following previous studies and theories that assume a universal link between alternations and phonotactics: (i) previous studies (Hayes Reference Hayes, Kager, Pater and Zonneveld2004; Prince & Tesar Reference Prince, Tesar, Kager, Pater and Zonneveld2004; Jarosz Reference Jarosz2006, Reference Jarosz, Danis, Mesh and Sung2011; Pater & Tessier Reference Pater, Tessier, Slabakova, Montrul and Prévost2006; Chong Reference Chong2017, Reference Chong2021) showing that alternations with phonotactic support are easier to learn than those with no phonotactic support; (ii) studies on derived environment effects (Chong Reference Chong2017) arguing that true derived environment patterns with a perfect mismatch between phonotactics and alternations are elusive; (iii) phonological theories assuming a unified mechanism for alternations and phonotactics, such as Optimality Theory (Prince & Smolensky [Reference Prince and Smolensky1993] 2004); and (iv) modelling studies showing that a generalisation typically leaks from a tautomorphemic domain into a heteromorphemic domain (Martin Reference Martin2011). On the other hand, the current research results and findings are inconsistent with previous studies and theories that argue for no universal link between alternation and phonotactics, including early generative phonology (Chomsky & Halle Reference Chomsky and Halle1968) and some more recent studies (e.g., Paster Reference Paster2013; Do & Yeung Reference Do and Yeung2021).

This section explores alternative interpretations and analyses of the results from the current experiment and modelling. First, we will address whether the participants’ phonotactic judgements for a non-word vary based on the non-word’s lexical category. We will then consider the possibility that the lexical support originates from a sub-lexicon or stratum. Finally, we will discuss alternative explanations that attribute the results of the current study to diachronic changes or phonetic naturalness.

5.1. Lexical category of non-words

We conducted an acceptability rating experiment on non-words to measure lexical support for an alternation pattern. One might argue that participants’ ratings for a non-word may differ depending on its lexical category. In the experiment, the instruction to the participants included no indication of which lexical category the non-words belong to. Nonetheless, given that isolation forms of nouns, not verbs, in Korean are possible and frequent in natural speech, and nouns form a majority of the Korean lexicon, it is likely (or at least possible), as pointed out by an anonymous reviewer, that the participants viewed the non-words, presented in isolation, as nouns. Note that nouns are predominant in our Korean corpus: among native and Sino-Korean monomorphemes ( $n = 38,037$ ), nouns 92.49% ( $n = 35,179$ ), verbs and adjectives 3.54% ( $n = 1,345$ ) and other categories 3.98% ( $n = 1,513$ ). Then, it is possible that the current experiment measured the support from the noun lexicon, not from the entire Korean lexicon.

We have two responses to this argument. First, given that, as mentioned above, nouns form a majority of words in the vocabulary of Korean, phonotactic regularities of nouns are not significantly different from those of the Korean vocabulary as a whole. Accordingly, the Korean participants’ phonotactic judgements for non-words would not be significantly different, regardless of whether they consider the non-words to be nouns or any Korean word.

In addition, more importantly, we do not think that the participants’ phonotactic judgements for a non-word vary greatly depending on the assumed lexical category of the non-word. It seems that speakers’ phonotactic judgements for a non-word are mainly based on the phonotactic probability of its component sequences in the entire lexicon, regardless of which lexical category the non-word belongs to.Footnote 23 Previous modelling studies on English phonotactics (Hayes & Wilson Reference Hayes and Wilson2008; Albright Reference Albright2009; Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011) show that there are high correlations between speakers’ phonotactic judgements for non-words and the corresponding predictions of the phonotactic models trained on the English lexicon, indicating that English speakers’ phonotactic judgements are mainly determined by phonotactic probability in the entire English lexicon, rather than being affected by factors such as the assumed lexical category and the semantic meaning of the non-words. The speakers’ judgement data used in those modelling studies were from experiments in which non-words were presented in isolation (Daland et al. Reference Daland, Hayes, White, Garellek, Davis and Norrmann2011), as verbs (Albright & Hayes Reference Albright and Hayes2003), and as both verbs and nouns (Albright Reference Albright2009: 28, fn. 17). In particular, Albright compared ratings for the same set of non-word items by two different groups of native English-speaking subjects. For one group of subjects, non-word items were presented as verbs, and for the other group, they were presented as both verbs and nouns. The correlation between the mean ratings from the two groups is quite high, 0.853, which we think indicates that the speakers’ phonotactic well-formedness judgements are not significantly affected by the assumed lexical category of the non-word.

In summary, we believe that the results of the present experiment reflect the phonotactic well-formedness in the Korean lexicon, not the noun sub-lexicon, although the noun sub-lexicon is not substantially different in phonotactic regularities from the entire Korean lexicon, and thus Korean speakers’ well-formedness judgements would be more or less same for nouns and typical Korean words.

5.2. Support from a sub-lexicon or stratum

Based on the results of the experiment and modelling, we have concluded that Korean vowel harmony has no lexical support. However, one might argue, as suggested by an anonymous reviewer, that the current experiment and modelling are not appropriate for measuring the lexical support for Korean vowel harmony. This argument is based on the idea that since vowel harmony alternation in Korean is attested only in verbal inflection, the lexical support for it can come from a sub-lexicon consisting of verbs and adjectives.

We think that this potential problem is part of a more general issue, that is, whether the lexical support for an alternation can come from a sub-lexicon or stratum characterised by lexical category or etymological origin of words in which the alternation is attested. We have already rejected a possibility, in §4.2, in which the lexical support comes from a lexical stratum consisting of either native or Sino-Korean words, under the assumption that ordinary Korean speakers cannot reliably distinguish between native and Sino-Korean words. Let us now consider the possibility in which the lexical support for an alternation pattern can come from words of a particular lexical category in which the alternation is attested. More specifically, should we investigate phonotactics of the verbal lexicon to measure the lexical support for Korean vowel harmony? Recall that the present study tests the research hypothesis that alternations with strong lexical support are more robust than those with no, or weak, lexical support. In comparing the robustness of a pattern and its lexical support, the generality of an alternation (which defines its robustness) needs to be measured by checking all its potential target environments across different lexical categories, and, similarly, lexical support must be measured by checking alternation-provoking sequences in the entire lexicon, not in a sub-lexicon consisting of words belonging to a particular lexical category. Accordingly, measuring the phonotactic support for Korean vowel harmony from the verb lexicon is not directly relevant to our goal, that is, testing the hypothesis in (1).

Nonetheless, for the sake of curiosity, we decided to investigate whether Korean vowel harmony is supported by phonotactics of verbs and adjectives. Following the suggestion by the anonymous reviewer, we first address this issue by investigating the participants’ responses for non-words ending in /a/ or /ʌ/ which are similar in segmental composition to the frequently used verb forms inflected with the decl.ind (etc.) suffix /-A/. We then perform a simulation of learning a MaxEnt grammar using a verbal lexicon consisting of monomorphemic verbs and adjectives.

Twenty-five non-words in the vowel harmony set end in /a/ or /ʌ/. The participants ratings for them show that disharmonic non-words (3.80, $n = 12$ ) are slightly higher in mean ratings than harmonic ones (3.60, $n = 13$ ). The lack of vowel harmony effect is confirmed by the results of a Bayesian mixed effects linear regression model, not shown, in which the posterior probability of the proposition that disharmonic non-words are lower in ratings than harmonic ones is only 21.03%. These results are not significantly different from the results of the analysis for the entire vowel harmony set of non-words, reported in §3.2.2.

In the MaxEnt simulation using a verbal lexicon consisting of monomorphemic verbs ( $n = 1057$ ) and adjectives ( $n = 288$ ), all of which are native Korean, most vowel projection constraints learned were either trivial (such as *## ‘no word without a vowel’) or those involving vowels with highly limited distribution, [y ø ɨ], as in the results of the modelling based on the entire lexicon, presented in §4.2.2. Only three of the learned constraints with relatively low weights (0.643, 0.692, 1.658) seem to have vowel harmony effects. They require a dark vowel [u] to co-occur with some other dark vowels, but not with a light vowel [o]. Note that these constraints prohibit only one, that is, [uo], out of sixteen target vowel sequences tested in this study, shown in (21). This indicates that most of target vowel sequences are not underrepresented in the verbal lexicon, which is not significantly different from the results of the simulation using the entire lexicon, reported in §4.2.2.

In summary, the results of the present study show that Korean vowel harmony is not phonotactically supported, regardless of whether we limit ourselves to verbs or not.

5.3. A diachronic approach

The results of the current research, summarised in (44), show a tight link between phonotactics and alternations. These findings lead us to believe that the grammar itself and/or its learning algorithm includes a mechanism that integrates alternations and phonotactics. While we will not delve into the grammatical implementation of the observed alternation-phonotactics link in this study, it is notable that there are several relevant approaches from previous work that we may consider. First, in Standard Optimality Theory (Prince & Smolensky [Reference Prince and Smolensky1993] 2004), the same set of constraints is employed to explain both patterns of alternations and phonotactics, and the mechanism for alternations is in principle inseparable from that for phonotactics. In addition, Martin (Reference Martin2011) provides an analysis of ‘grammatical leakage’ (Martin’s term), in which a generalisation that holds for the tautomorphemic domain leaks into the heteromorphemic domain. Specifically, Martin uses a MaxEnt learning algorithm with a Gaussian prior to explain data from English and Navajo in which categorical phonotactic constraints holding in the monomorphemic lexicon are also active, though gradiently, in the heteromorphemic domain. Martin’s approach appears to be more promising for the data we discussed in this study, given that the effects of the relevant constraints are mostly gradient. Their specific formal analysis is beyond the scope of this article, and we leave it for future research.

One might attribute the correlations between alternations and phonotactics attested across languages, including those observed in this study, to the diachronic development of relevant segment sequences. For instance, as pointed out by an anonymous reviewer, Korean palatalisation used to be exceptionless both within and across morphemes in the early nineteenth century (Cho Reference Cho, Hanson and Inkelas2009, §19.6). Morpheme-internal exceptions to palatalisation then arose, which were mainly due to a subsequent historical sound change in which an underlying /ɨ̯i/ diphthong became a monophthong /i/ (e.g., mati < matɨi ‘joint’) or to foreign loans (e.g., tʰipi ‘TV’). If these historical changes introduced a relatively small number of exceptions to the Korean lexicon, without affecting Korean palatalisation across morphemes, our finding – that the palatalisation constraint, that is, *TI, is active both within and across morphemes in Contemporary Korean – would be a consequence of the historical changes involved. Similarly, the lack or weakening of vowel harmony and laryngeal OCP effects in both alternations and phonotactics might also be attributed to some relevant historical changes.

We consider the diachronic account to be at best tentative, and possibly implausible, for the following reasons. First, only after we are informed of the exact range and magnitude of the influence exerted by the historical changes will we be able to evaluate the success of the diachronic account. Thus, we cannot conclude that our findings can be explained solely based on historical changes until we know, rather than speculate, whether and exactly how much the historical changes affected alternating forms and lexical statistics. Second, as shown by Martin (Reference Martin2011), it is possible that the extent and impact of the historical changes may be subject to restrictions in the grammar and/or its learning algorithm. For instance, it is possible that the creation and acceptance of exceptional new words may be stochastically blocked by grammatical restrictions. In such cases, the effects of the historical changes can be considered grammatical. Finally, and most importantly, we believe that the diachronic account is too unconstrained to explain the relationship between alternations and phonotactics. According to the diachronic approach (e.g., Paster Reference Paster2013), phonotactic and alternation patterns may change freely and independently of each other. Therefore, the diachronic approach predicts that cases like true derived environment patterns with a perfect mismatch between alternations and phonotactics can be attested without any restriction. This prediction is not supported by the results of the present study. None of the three patterns tested in this study, which are known to exhibit ‘derived environment’ effects, show a perfect mismatch between alternations and phonotactics. Instead, our research results align with previous studies (Hayes Reference Hayes, Kager, Pater and Zonneveld2004; Prince & Tesar Reference Prince, Tesar, Kager, Pater and Zonneveld2004; Jarosz Reference Jarosz2006, Reference Jarosz, Danis, Mesh and Sung2011; Pater & Tessier Reference Pater, Tessier, Slabakova, Montrul and Prévost2006; Chong Reference Chong2017, Reference Chong2021) that argue that alternations with phonotactic support are easier to learn and predict that alternations with mismatching phonotactics are rare or dispreferred. Thus, it appears that the diachronic approach is not well-supported.

5.4. Phonetic naturalness

As suggested by an anonymous reviewer, the observed correlations between alternations and phonotactics in this study could be attributed to phonetic naturalness, which may shape speakers’ phonotactics. For instance, the high robustness of, and high phonotactic support for, palatalisation could both be explained by its phonetic naturalness. Similarly, the low robustness of, and low phonotactic support for, vowel harmony could also be elucidated by the absence of such naturalness in vowel harmony of the type attested in Korean, which is typologically uncommon. One possible way to test this, a potential future direction for this study, would be to compare phonetically similar patterns in two different languages that differ in the robustness of their derived environment patterns.

6. Conclusion

In this study, to test the hypothesis in (1), repeated below, we conducted an acceptability rating experiment for Korean non-words and performed computational modelling of Korean phonotactic grammar.

From the results of the experiment and modelling, we made the following findings:

First, sequences that violate the palatalisation constraint were judged to be ill-formed by the participants and are underrepresented in the lexicon.

Second, sequences that violate the vowel harmony constraint were not judged to be ill-formed by the participants and are not underrepresented in the lexicon.

Third, sequences that violate the laryngeal OCP are either not or weakly underrepresented in the lexicon.

These findings suggest that Korean palatalisation, which is highly robust, has strong lexical support, whereas vowel harmony and laryngeal OCP, both of which are not robust, have little or weak lexical support, confirming the hypothesis in (1). Consequently, none of the three derived environment effect phenomena in Korean shows a mismatch between alternations and phonotactics. We can conclude that the results of the present study support the claim of a tight link between alternations and phonotactics. This suggests the presence of a grammatical mechanism and/or a learning bias responsible for integrating alternations with phonotactics.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0952675725000016.

Acknowledgements

We would like to thank the associate editor and three anonymous reviewers for their helpful and detailed comments. Our thanks also extend to Bruce Hayes, Donca Steriade, Canaan Breiss, James Whang, Haruo Kubozono, Mira Oh, Soohyun Kwon, Nayoung Park, Hoju Cha, and the audiences at Phonology Festa 16, Japan (online, 8–9 March 2021), and at the 2022 Annual Meeting on Phonology, UCLA (21–23 October 2022). We are particularly indebted to Daniel Plesniak, whose comments were invaluable in improving this article. Any remaining errors are, of course, our own responsibility. Lastly, we extend our thanks to our experimental subjects for their time and participation.

Competing interests

The authors declare no competing interests.

Footnotes

1 Syllable-final coda obstruents in Korean are always unreleased, though we do not mark this explicitly in phonetic transcriptions elsewhere in this article.

2 To investigate the stem-final variation patterns of Korean nouns and loanwords, Jun & Lee (Reference Jun and Lee2007) performed production and acceptability ratings experiments on ten speakers of Kyungsang Korean (a variety spoken in the southeastern province of South Korea). The results of the production experiment for loanwords ending in a coronal stop occurring before a nominative case marker -i show that the participants produced mostly [s]-final (and, less frequently, [cʰ]-final) forms, and [tʰ]- or [t]-final forms were rarely produced: [s] (54), [cʰ] (5), [tʰ] (2), and [t] (1) (where the frequency of each attested stem-final consonant is shown in parentheses). The results of the acceptability-ratings experiment show that [cʰ]- and [s]-final forms received higher mean ratings than [tʰ]- and [t]-final forms: [mɛcʰ-i] (3.9) > [mɛs-i] (3.3) > [mɛtʰ-i] (2.9), [mɛt-i] (2.2); [kʰʌs-i] (4.5) > [kʰʌcʰ-i] (3.5) > [kʰʌtʰ-i] (2.7), [kʰʌt-i] (1.7) (where mean ratings shown in parentheses are based on a Likert scale of 1 (impossible) to 5 (perfect)). We, as native speakers of Korean, think that such a dominant use of palatalised and [s]-final forms for loanwords ending in a coronal stop is generally true in other dialects of Korean, including Seoul Korean.

3 The phonetic transcriptions in (5) reflect two automatic phonological processes in Korean: obstruent nasalisation (i.e., an obstruent becomes nasal before a nasal) and coda neutralisation (i.e., coda obstruents are neutralised into homorganic unreleased lenis stops). Coda neutralisation followed by resyllabification applies across prefix–stem and stem–stem boundaries.

4 According to the results of Jun’s (Reference Jun2021) survey on Seoul Korean n-insertion, only a minority of participants chose forms with TI sequences, as opposed to n-inserted forms, as their preferred pronunciations: 2, 3, 7 and 4 out of 22 Seoul Korean participants chose [pak’atjaŋpan], [notjokaŋ], [sutjaŋ] and [cʰʌtjʌlɨm], respectively.

5 Kang (Reference Kang2012: 3) reports that sixteen such alternating suffixes are attested in the National Institute of the Korean Language (NIKL) frequency data.

6 There exist nominal suffixes that begin with /ʌ/ or /a/, but they do not show vowel harmony alternations. For instance, a vocative case marker /-a/ is always realised as [a], regardless of whether the last stem vowel is dark or light: for instance, /kihoŋ-a/ [kihoŋa] ‘(name)-voc’ and /kihun-a/ [kihuna] ‘(name)-voc’.

7 The phonetic transcriptions presented in this sentence reflect post-obstruent tensing (i.e., a lenis obstruent becomes tense after an obstruent).

8 Kang’s experiment included 30 native Korean speakers (19 Seoul, 4 Chungcheong, 6 Kyungsang and 1 Jeolla) as participants.

9 In a scatter plot in Kang’s Figure 20 in which a point represents the percentage of disharmonic forms for each participant, there are only 29 points for 30 participants. Our supposition is that two participants’ points completely overlap, but if so, we do not know which two points those were. This is why we say that ‘at least’ nine, rather than exactly nine, participants produced only disharmonic forms for novel Korean stems with a light vowel /a o/ as the last stem vowel.

10 We are not convinced that the observed difference is statistically significant, since it seems that no random effects for item or subject were included in the statistical analysis.

11 Tensing rates presented in (18a.i) are for lenis onsets in the columns for existing Seoul Korean and novel Yanbian Korean compounds, whereas the rates are for lenis and sonorant onsets for existing Yanbian Korean compounds. Ito (Reference Ito2014: 367, Table XII) does not provide separate values for lenis and sonorant onsets.

12 See Park (Reference Park2020) for a recent review of Korean phonotactic restrictions found in previous studies.

13 Note that disyllabic words are the shortest ones with which we can examine vowel harmony and laryngeal OCP. It seems that disyllabic words are short enough for Korean participants to consider them as monomorphemic, given that disyllabic words in our Seoul Korean corpus (which consist of native and Sino-Korean words) are predominantly monomorphemes (91%, i.e., 33,954 out of 37,364).

14 This did not include stimuli designed for other purposes but considered in the analysis as part of the same set. Therefore, we did not perfectly avoid the consecutive presentation of stimuli belonging to the same set.

15 Loanwords show some prominent phonotactic differences from native and Sino-Korean words – for example, word-initial liquids are allowed only in loanwords.

16 The word-final [cɨ], the word-initial [l] and the sequence [t’j] do not occur in (at least the surface forms of) native and Sino-Korean words.

17 Responses to six fillers were excluded from the analysis reported in this section.

18 The abbreviations used are as follows: Est. error = estimation error, and l/u-95% CI = lower/upper bound of the 95% CI.

19 Note that previous modelling studies on phonotactics provided more reliable predictions based on type than token frequencies (Hayes & Wilson Reference Hayes and Wilson2008: 395; Albright Reference Albright2009: 20; and the references therein).

20 The O/E is the ratio of the number of observed violations of a constraint, to the expected number of violations. Low O/E values indicate powerful, effective constraints. For details, see Capodieci et al. (Reference Capodieci, Hayes and Wilson2009, §5.4) and Hayes & Wilson (Reference Hayes and Wilson2008, §4.2).

21 We used nonparametric Spearman correlations because many of the data points were concentrated at the ends of the scale.

22 We performed a Bayesian linear regression analysis for the relevant model ratings data. The sample distributions show that the posterior probability of the proposition that disharmonic non-words are lower in model ratings than harmonic non-words is only 1.75%.

23 Note that this does not rule out the possibility that speakers may have and use knowledge of phonotactic differences between lexical categories, such as nouns and verbs, as demonstrated in Smith’s (Reference Smith2016) nonce-word experiments, where English speakers were asked to identify whether non-words sound more like nouns or verbs.

References

Albright, Adam (2009). Feature-based generalization as a source of gradient acceptability. Phonology 26, 941.CrossRefGoogle Scholar
Albright, Adam & Hayes, Bruce (2003). Rules vs. analogy in English past tenses: a computational/experimental study. Cognition 90, 119161.CrossRefGoogle ScholarPubMed
Barr, Dale J., Levy, Roger, Scheepers, Christoph & Tily, Harry J. (2013). Random effects structure for confirmatory hypothesis testing: keep it maximal. Journal of Memory and Language 68, 255278.CrossRefGoogle ScholarPubMed
Becker, Michael, Ketrez, Nihan & Nevins, Andrew (2011). The surfeit of the stimulus: analytic biases filter lexical statistics in Turkish laryngeal alternations. Lg 87, 84125.Google Scholar
Boersma, Paul & Weenink, David (2020). Praat: doing phonetics by computer. Computer program. Available at http://praat.org.Google Scholar
Breiss, Canaan & Hayes, Bruce (2020). Phonological markedness effects in sentence formation. Lg 96, 338370.Google Scholar
Bürkner, Paul-Christian (2018). Advanced Bayesian multilevel modeling with the R package brms. The R Journal 10, 395411.CrossRefGoogle Scholar
Capodieci, Frank, Hayes, Bruce & Wilson, Colin (2009). Manual: phonotactic learning program. Ms., University of California, Los Angeles.Google Scholar
Cho, Hyesun (2012). Statistical learning of Korean phonotactics. Studies in Phonetics, Phonology and Morphology 18, 339370.Google Scholar
Cho, Young-mee Yu (2009). A historical perspective on nonderived environment blocking: the case of Korean palatalization. In Hanson, Kristin & Inkelas, Sharon (eds.) The nature of the word: essays in honor of Paul Kiparsky. Cambridge, MA: MIT Press, 461486.Google Scholar
Cho, Young-mee Yu (2016). Korean phonetics and phonology. In Aronoff, Mark (ed.) Oxford research encyclopedia of linguistics. Oxford: Oxford University Press. Available at https://doi.org/10.1093/acrefore/9780199384655.013.176.Google Scholar
Choi, Myungok (1985). Pyenchik tongsauy umwunhyensangey tayhaye–p-, s-, t-pyenchiktongsalul cwungsimulo [An explanation of the phonological phenomena of irregular verbs in Korean]. Kwukehak [Korean Linguistics] 14, 149188.Google Scholar
Chomsky, Noam & Halle, Morris (1968). The sound pattern of English. New York: Harper & Row.Google Scholar
Chong, Adam J. (2017). On the relation between phonotactic learning and alternation learning. PhD dissertation, University of California, Los Angeles.Google Scholar
Chong, Adam J. (2019). The exceptionality and derived environment effects: a comparison of Korean and Turkish. Phonology 36, 543572.CrossRefGoogle Scholar
Chong, Adam J. (2021). The effect of phonotactics on alternation learning. Lg 97, 213244.Google Scholar
Daland, Robert, Hayes, Bruce, White, James, Garellek, Marc, Davis, Andrea & Norrmann, Ingrid (2011). Explaining sonority projection effects. Phonology 28, 197234.CrossRefGoogle Scholar
Do, Youngah & Yeung, Ping Hei (2021). Evidence against a link between learning phonotactics and learning phonological alternations. Linguistics Vanguard 7, article no. 20200127.CrossRefGoogle Scholar
Franke, Michael & Roettger, Timo B. (2019). Bayesian regression modeling (for factorial designs): a tutorial. Ms., University of Tübingen and University of Oslo. Available online at https://doi.org/10.31234/osf.io/cdxv3.CrossRefGoogle Scholar
Frisch, Stefan, Pierrehumbert, Janet & Broe, Michael (2004). Similarity avoidance and the OCP. NLLT 22, 179228.Google Scholar
Goldwater, Sharon & Johnson, Mark (2003). Learning OT constraint rankings using a maximum entropy model. In Spenader, Jennifer, Eriksson, Anders & Dahl, Östen (eds.) Proceedings of the Stockholm Workshop on Variation Within Optimality Theory. Stockholm: Department of Linguistics, Stockholm University, 111120.Google Scholar
Hayes, Bruce (2004). Phonological acquisition in Optimality Theory: the early stages. In Kager, R., Pater, J. & Zonneveld, W. (eds.) Constraints in phonological acquisition. Cambridge: Cambridge University Press, 158203.CrossRefGoogle Scholar
Hayes, Bruce & White, James (2013). Phonological naturalness and phonotactic learning. LI 44, 4575.Google Scholar
Hayes, Bruce & Wilson, Colin (2008). A maximum entropy model of phonotactics and phonotactic learning. LI 39, 379440.Google Scholar
Hayes, Bruce, Zuraw, Kie, Siptar, Peter & Londe, Zsuzsa (2009). Natural and unnatural constraints in Hungarian vowel harmony. Lg 85, 822863.Google Scholar
Ito, Chiyuki (2014). Compound tensification and laryngeal co-occurrence restrictions in Yanbian Korean. Phonology 31, 349398.CrossRefGoogle Scholar
Jang, Hayeun (2017). Decay of ungrounded rules: vowel harmony of Middle Korean in Contemporary Korean. In Erlewine, Michael Yoshitaka (ed.) Proceedings of GLOW in Asia XI, volume 2. Cambridge, MA: MIT Working Papers in Linguistics, 87101.Google Scholar
Jarosz, Gaja (2006). Rich lexicons and restrictive grammars: maximum likelihood learning in Optimality Theory. PhD dissertation, Johns Hopkins University.Google Scholar
Jarosz, Gaja (2011). The roles of phonotactics and frequency in the learning of alternations. In Danis, Nick, Mesh, Kate & Sung, Hyunsuk (eds.) Proceedings of the 35th annual Boston University Conference on Language Development (BUCLD), volume 2. Somerville, MA: Cascadilla Press, 321333.Google Scholar
Jo, Jinyoung (2023). Korean vowel harmony has weak phonotactic support and has limited productivity. Phonology 40, 65100.CrossRefGoogle Scholar
Jun, Jongho (2010). Stem-final obstruent variation in Korean. Journal of East Asian Linguistics 19, 137179.CrossRefGoogle Scholar
Jun, Jongho (2018). Morpho-phonological processes in Korean. In Aronoff, Mark (ed.) Oxford research encyclopedia of linguistics. Oxford: Oxford University Press. Available at https://doi.org/10.1093/acrefore/9780199384655.013.241.Google Scholar
Jun, Jongho (2021). Morphophonological gradience in Korean n-insertion. Glossa 6, Article no. 40.Google Scholar
Jun, Jongho & Lee, Jeehyun (2007). Multiple stem-final variants in Korean native nouns and loanwords. Eoneohag [Linguistics] 47, 159187.Google Scholar
Kang, Hijo (2012). Diachrony in synchrony: Korean vowel harmony in verbal conjugation. PhD dissertation, Stony Brook University.Google Scholar
Kang, Hijo (2016). Hankuke yongenhwalyongeyse poinun moumcohwauy pyeniwa pyenhwa: hyencangsiwa silceysi yenkwu [Variation and change of Korean vowel harmony in verbal conjugation: an apparent time and real time study]. Eoneohag [Linguistics] 76, 2756.Google Scholar
Kang, Hijo & Mira, Oh (2016). Dynamic and static aspects of laryngeal co-occurrence restrictions in Korean. Studies in Phonetics, Phonology and Morphology 22, 334.Google Scholar
Kang, Hijo & Mira, Oh (2019). The asymmetric tense consonant effects n compound and word-initial tensifications in Korean. Studies in Phonetics, Phonology and Morphology 25, 330.Google Scholar
Kang, Yoonjung (2006). Neutralizations and variations in Korean verbal paradigms. Harvard Studies in Korean Linguistics 11, 183196.Google Scholar
Kim, Moosik (2007). 16-7 seyki kwuke hancaeuy picwungkwa ku thukcing: swunchenkimssi mich hyenphwungkwakssi hankulphyencilul taysangulo [A study on the proportion of loanword from Chinese, and its traits which was expressed at Korean of 16th century]. Eomunlonchong [Korean Language and Literature] 47, 251276.Google Scholar
Kim, Seoyoung (2016). Phonological trends in Seoul Korean compound tensification. Master’s thesis, Seoul National University, Korea.Google Scholar
Kiparsky, Paul (1973). Phonological representations: abstractness, opacity and global rules. In Fujimura, Osamu (ed.) Three dimensions in linguistic theory. Tokyo: TEC, 1136.Google Scholar
Lee, Hoyoung (1996). Kwukeumsenghak [Korean phonetics]. Seoul: Thaehaksa.Google Scholar
Martin, Andrew (2011). Grammars leak: modeling how phonotactic generalizations interact within the grammar. Lg 87, 751770.Google Scholar
National Institute of the Korean Language (1999). Phyocwun kwuke taysacen [Standard Korean dictionary]. Seoul: Doosan Dong-A.Google Scholar
Nicenboim, Bruno & Vasishth, Shravan (2016). Statistical methods for linguistic research: foundational ideas – part II. Language and Linguistics Compass 10, 591613.CrossRefGoogle Scholar
Park, Nayoung (2020). Hankwuke umsopayyelceyyakuy thongkyeycek haksupkwa cekhyengseng phantan [Stochastic learning and well-formedness judgement in Korean phonotactics]. PhD dissertation, Seoul National University.Google Scholar
Paster, Mary (2013). Rethinking the ‘duplication problem’. Lingua 126, 7891.CrossRefGoogle Scholar
Pater, Joe & Tessier, Anne-Michelle (2006). L1 phonotactic knowledge and the L2 acquisition of alternations. In Slabakova, Roumyana, Montrul, Silvina & Prévost, Philippe (eds.) Inquiries in linguistic development: in honor of Lydia White. Amsterdam: John Benjamins, 115131.CrossRefGoogle Scholar
Pierrehumbert, Janet (1993). Dissimilarity in the Arabic verbal roots. NELS 23, 367381.Google Scholar
Prince, Alan & Smolensky, Paul ([1993] 2004). Optimality Theory: constraint interaction in generative grammar. Oxford: Blackwell. Originally published in 1993 as technical report no. 2 of the Rutgers Center for Cognitive Science.Google Scholar
Prince, Alan & Tesar, Bruce (2004). Learning phonotactic distributions. In Kager, René, Pater, Joe & Zonneveld, Wim (eds.) Constraints in phonological acquisition. Cambridge: Cambridge University Press, 245291.CrossRefGoogle Scholar
R Core Team (2022). R: a language and environment for statistical computing. Computer software. Available at https://www.R-project.org/.Google Scholar
Shih, Stephanie (2017). Constraint conjunction in weighted probabilistic grammar. Phonology 34, 243268.CrossRefGoogle Scholar
Smith, Jennifer L. (2016). Segmental noun/verb phonotactic differences are productive too. Proceedings of the Linguistic Society of America 1, Article no. 16.CrossRefGoogle Scholar
Wilson, Colin & Obdeyn, Marieke (2009). Simplifying subsidiary theory: statistical evidence from Arabic, Muna, Shona, and Wargamay. Ms., Johns Hopkins University.Google Scholar
Zuraw, Kie (2011). Predicting sai-siot in Korean compound nouns: phonological and non-phonological factors. Paper presented at the 21st Japanese/Korean Linguistic Conference, Seoul National University, October 2011.Google Scholar
Figure 0

Figure 1 A plot of ratings of practice items.

Figure 1

Figure 2 Palatalisation: Mean Likert rating in the survey of Korean non-words, by CV sequence types. Smaller semitransparent points represent individual non-words, averaged across participants, and larger points represent overall averages. The number of non-words of the corresponding type is shown in parentheses.

Figure 2

Figure 3 Palatalisation: Mean Likert rating of target non-words with TI sequences and control non-words with tA and Pi.

Figure 3

Figure 4 Vowel harmony: mean Likert rating in the survey of Korean non-words, by vowel sequence type (disharmonic and harmonic).

Figure 4

Figure 5 Laryngeal OCP: Likert rating in the survey of Korean non-words, by number of laryngeally marked consonants.

Figure 5

Figure 6 Laryngeal OCP: Likert rating in the survey of Korean non-words, by onset type.

Figure 6

Figure 7 All test non-words: Correlation between human and model ratings (z-transformed; $n = 193$; dashed line = regression line).

Figure 7

Figure 8 Palatalisation: Model ratings of non-words, by CV sequence type. Crossbars represent mean ratings.

Figure 8

Figure 9 Palatalisation: Correlation between human and model ratings ($n = 66$; dashed line = regression line; filled circle = target with TI, empty triangle = control with TA, empty circle = control with PI).

Figure 9

Figure 10 Palatalisation: Mean model rating of target non-words with TI sequences and control non-words with tA and Pi. Smaller semitransparent points represent individual non-words, and larger points represent overall averages.

Figure 10

Figure 11 Vowel harmony: Model ratings of Korean non-words, by the type of vowel sequence (disharmonic vs. harmonic). Crossbars represent mean ratings.

Figure 11

Figure 12 Vowel harmony: Correlation between human and model ratings ($n = 80$; dashed line = regression line; filled circle = disharmonic, empty circle = harmonic).

Figure 12

Figure 13 Laryngeal OCP: Model rating in the MaxEnt simulation by number of laryngeally marked consonants.

Figure 13

Figure 14 Laryngeal OCP: Correlation between human and model ratings ($n = 75$; dashed line = regression line; empty circle = no-lar ($n = 8$); grey circle = single-lar ($n = 35$); black circle = double-lar ($n = 32$)).

Figure 14

Figure 15 Laryngeal OCP: Model rating in the MaxEnt simulation, by the onset type.

Supplementary material: File

Jun et al. supplementary material

Jun et al. supplementary material
Download Jun et al. supplementary material(File)
File 277.8 KB