The English conative alternation between complexity effects and lexical biases: a historical perspective

Eva Zehentner

doi:10.1017/langcog.2025.10007

The English conative alternation between complexity effects and lexical biases: a historical perspective

Published online by Cambridge University Press: 13 June 2025

Eva Zehentner

Show author details

Eva Zehentner*: Affiliation:
English Department, University of Zurich, Zürich, Switzerland
*: Email: eva.zehentner@es.uzh.ch

Article contents

Abstract
Introduction
Factors at play in the English conative alternation
Data and methods
Results
Discussion
Conclusions
Funding statement
Data availability statement
Footnotes
References

Rights & Permissions

Abstract

This paper presents an in-depth study of the English conative construction (hit the ball vs hit at the ball), a particular case of variation between NP- and PP-patterns. Approaching the phenomenon from a usage-based, cognitive perspective, the study is interested in the different factors driving this alternation, specifically in the interaction of lexical effects on the one hand and high-level predictors relating to processing complexity on the other hand. More specifically, the paper explores the diachronic dimension of these effects, using corpus data from Middle English to Late Modern English (Penn-Helsinki Corpora of Historical English; 1150–1914). To assess the relative impact of the variables, the study compares the traditionally used methods of collostructional analysis and logistic regression to a more recent tool, namely elastic net regression. The findings ultimately indicate that while complexity does seem to play a role in this alternation, there are also strong lexical effects.

Keywords

processing complexity conative alternation conative construction history of English lexical effects syntactic alternations

Information

Type: Article
Information: Language and Cognition , Volume 17 , 2025 , e50

DOI: https://doi.org/10.1017/langcog.2025.10007 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

1. Introduction

The present paper revisits the impact of different cognitive factors at play in argument structure variation and possible changes in such variation over time. It thereby further contributes to the extant body of cognitive, usage-based research on syntactic alternations and historical variationist studies. More specifically, the paper zooms into the choice between nominal (1–3a) and prepositional patterns (1–3b) in the English conative alternation (Levin, Reference Levin1993; Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010).

The emergence of this alternation, similar to other cases of systematic NP-PP variation like the dative or genitive alternation, is commonly set in early Middle English (ME) and can be assumed to reflect a general increase in the frequency of PP-expressions as part of the typological shift of English from more synthetic to more analytic (Baugh & Cable, Reference Baugh and Cable2002; Fennell, Reference Fennell2001; Szmrecsanyi, Reference Szmrecsanyi, Nevalainen and Traugott2012). However, the precise historical development of the conative alternation and the factors impacting the choice between the patterns have to date been much less explored and even less so based on quantitative data. The focus of this paper is then on providing an empirical analysis of the conative alternation and the factors driving it, by investigating its development from ME to Late Modern English (LME), meaning from 1150 to ca. 1900.

Taking a usage-based linguistics approach to the diachrony of the alternation, the main question to be addressed in the paper is the relative importance of (i) variables relating to processing difficulty compared to (ii) lexical-semantic effects regarding the verb lemmas involved and (iii) verb frequency, as well as diachronic shifts in the impact of these factors. That is, communicative efficiency may play a role in the conative alternation in that the more explicit PP-patterns should be preferred in more complex contexts demanding more processing effort, e.g. when there are long dependencies between a verb and its object or when there is ambiguity between subject and object (e.g. Rohdenburg, Reference Rohdenburg1996; Levshina, Reference Levshina2018, Reference Levshina2021a). By contrast to such higher-order predictors, research on alternation phenomena has also suggested that choices are often strongly lexically conditioned (e.g. Stefanowitsch & Gries, Reference Stefanowitsch and Gries2003), with specific lexical items being biased towards one or the other variant. This often points to semantic differences between the constructions (e.g. Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017; Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010). A further factor, namely lexical frequency, may be viewed as straddling both types: assuming that less frequently occurring elements are harder to retrieve from memory and therefore present more complex, difficult environments (e.g. Bloem, Reference Bloem2021), infrequent verbs may then be assumed to be biased towards PP-use. In general, following a usage-based approach to language, frequency in actual language use (as reflected in corpora) is presumed to impact cognitive representations and linguistic choices. The present paper tests the role of these factors in combination with each other in the diachrony of English, postulating that PPs – as the innovative variant – serve as a complexity-reducing tool, especially relevant at a time of systemic upheaval when the morphological case was disappearing. At the same time, lexical biases may change over time, indicating changes in constructional semantics.

The data for this study come from the Penn-Helsinki Parsed Corpora of Historical English (PPCME2, Kroch et al., Reference Kroch, Taylor and Santorini2000; PPCEME, Kroch et al., Reference Kroch, Santorini and Delfs2004; PPCMBE2, Kroch et al., Reference Kroch, Santorini and Diertani2016) and are analysed by means of three different methods. On the one hand, I use collostructional analyses (Stefanowitsch, Reference Stefanowitsch, Hoffmann and Trousdale2013; Stefanowitsch & Gries, Reference Stefanowitsch and Gries2003), typically employed to investigate lexical effects; on the other hand, I run a mixed-effects logistic regression which can readily reveal the impact of high-level predictors like complexity features, but is not as well equipped to handle lexical factors. To combine both in a unified approach, the study then follows Sevenants et al. (Reference Sevenants, Van de Velde and Speelman2024) in using elastic net regression, ‘a new methodology which is able to process predictors of various nature on an equal footing’ (p. 4). This comparison is expected to yield more robust insights into what drives this alternation in the history of English.

The paper is structured as follows: Section 2 is about background information. It first introduces the conative alternation in more detail, commenting on its use in present day English (PDE; Section 2.1) as well as its history (Section 2.2). Section 2.3 provides the motivation for the different explanatory accounts on the choice of nominal vs prepositional variant. Section 3 gives the data and methods of the study, while the main results are shown in Section 4. In Section 5, the implications of the results are discussed, and Section 6 concludes the paper.

2. Factors at play in the English conative alternation

2.1. The conative alternation in PDE

2.1.1. Main differences between the variants of the alternation

The conative alternation, with Levin (Reference Levin1993, p. 42), ‘is a transitivity alternation in which the object of the verb in the transitive variant turns up in the intransitive conative variant as the object of the preposition in a prepositional phrase headed by the preposition at.’ In other words, the members of this alternation are a transitive (nominal) pattern on the one hand (4–10a) and a prepositional variant with a post-verbal at-PP, or occasionally an on-PP, on the other hand (4–10b). The latter is also sometimes termed a type of ‘antipassive’ variant of the former (e.g. Guerrero-Medina, Reference Guerrero-Medina and Guerrero-Medina2011). As listed in Levin (Reference Levin1993, pp. 42–43), verbs used in the alternation include verbs of contact by impact such as hitting (4), but also verbs of poking (5), spraying/loading (6), pushing (7), pulling (8), cutting (9) and certain verbs of ingesting (10).

Most basically, both constructions are said to express a sense of contact and motion (Levin, Reference Levin1993, pp. 42–43). The main semantic differences between the constructions supposedly lie in the success or telicity of the action involved: while the transitive NP-construction is typically telic, meaning that it is associated with a sense of completion of the intended result, the intransitive, prepositional pattern is usually atelic, expressing an ‘“attempted” action without specifying whether the action was actually carried out’ (Huddleston & Pullum, Reference Huddleston and Pullum2002, p. 298; Levin, Reference Levin1993, p. 42; Pinker, Reference Pinker1989, p. 104; further Beavers, Reference Beavers2010, Reference Beavers2011; Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001, Reference Broccias2003; Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017; Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010; Sosa Acevedo, Reference Sosa-Acevedo2009; van der Leek, Reference Van der Leek, Dobrin, Singer and McNair1996). That is, poking the snake implies succeeding in touching the animal (11a), but the outcome is left open or can be unsuccessful with the PP-construction (11b).Footnote ¹

The contrast between the constructions is furthermore often also framed in terms of the degree of affectedness of the theme – by comparison with the NP-pattern, the theme participant of the conative PP-pattern is not necessarily impacted by the action (Beavers, Reference Beavers2010, p. 829; Riemer, Reference Riemer2012, p. 354). In addition to ‘missed contact’, ‘lack of completion’ and ‘lack of affectedness’, Perek (Reference Perek2015, pp. 90–94), following Broccias (Reference Broccias, Andronis, Ball, Elston and Neuvel2001, Reference Broccias2003) and others, lists ‘repetition’ under the distinct semantic properties of the PP-construction: the transitive NP-construction usually expresses a punctual, single event with a clear delimitation, whereas the PP-construction often implies a repeated action, with the number of repetitions not specified (Perek, Reference Perek2015, p. 92; also Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001; Guerrero-Medina, Reference Guerrero-Medina and Guerrero-Medina2011). This is illustrated in the sentence pair in (12a–b). Finally, the patterns differ in terms of volitionality or intentionality of the agent, which is not necessarily given with the PP-construction. The adverbial modifier in (13a–b) and the greater acceptability of inanimate agents in the prepositional but not the nominal pattern (14a–b) support this (Perek, Reference Perek2015, pp. 93–94; also Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001; Dixon, Reference Dixon1991; Guerrero-Medina, Reference Guerrero-Medina and Guerrero-Medina2011; van der Leek, Reference Van der Leek, Dobrin, Singer and McNair1996). In general, the prepositional pattern seems to background the theme and place a greater focus on the agent (Perek, Reference Perek2015, p. 94; also Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017).

The key semantic differences between the constructions are repeated in Table 1 and can be summarised as follows: the NP-construction denotes a successful effect on a theme, whereas the meaning of the PP-pattern is best described as ‘X directs action at Y’ (Goldberg, Reference Goldberg1995, p. 63) or ‘the subject is trying to affect the oblique object but may or may not be succeeding’ (Pinker, Reference Pinker1989, p. 104).

Table 1. Main semantic differences between the NP- and the PP-pattern in PDE

2.1.2. Semantics of the conative PP-pattern

Having said this, a number of studies have challenged the assumption of a more general, unified, central meaning of the prepositional construction as ‘directed action’. Instead, they have stressed stark semantic differences between different verb classes and have posited several sub-types of conative PP-patterns (Beavers, Reference Beavers2010, Reference Beavers2011; Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001, Reference Broccias2003; Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017; Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010).

The main distinction that has been put forward in previous literature here is that between ‘allatives’ on the one hand and ‘ablatives’ on the other hand (terminology introduced in Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001). The former corresponds more directly to the basic sense of ‘direction at’, containing impact verbs which entail contact and motion, such as hitting or kicking, and being described ‘in purely locative terms as involving translational motion towards a target with which contact is not necessarily made’ (Perek, Reference Perek, Glynn and Robinson2014, p. 64). Guerrero-Medina (Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017, p. 351) adds that while there may not be any observable physical change of state of the theme with allatives, the agent typically carries out some sort of characteristic movement. The ablative pattern, on the other hand, is instantiated by verbs of ingestion and verbs of pulling or cutting. Here, contact and a change of state of the theme are implied, but the intended result is not necessarily fully achieved; furthermore, repetition is foregrounded in these cases (Beavers, Reference Beavers2010, p. 829; Perek, Reference Perek, Glynn and Robinson2014, p. 64). For example, from eat at the cake, we can infer that only some (or indeed very little) of the food was consumed and thus affected. Likewise, to cut at the rope does not specify the extent of affectedness, but contact between the cutter and their knife and the rope is clearly implicated.Footnote ² Perek and Lemmens (Reference Perek and Lemmens2010) and more explicitly Guerrero-Medina (Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017) then furthermore add a third category of ‘attention’. This group includes verbs of visual perception (e.g. glance, stare, peek) or communication verbs/verbs of auditory perception (e.g. shout, curse, growl) and smiling/laughing verbs (Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017, p. 351). Neither contact nor an intended result and affected participant is necessarily present in these cases. Importantly, although some verbs allow paraphrasing (15), alternation with NP-patterns is not felicitous with many verbs in this group (16–17). The main differences between the types are summarised in Table 2.

Table 2. Three-way distinction of conative PP-types and main features

Even though this three-way categorisation has also been criticised as generalising too broadly over striking verb-class-dependent semantic features (cf. Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015), previous work seems to largely agree that the PP-construction expresses a range of distinct sub-senses, connected through involving some form of (more concrete or more metaphorical) motion, but differing in precise semantics as well as their ability to take part in the conative alternation. Despite the substantial body of research on the PP-pattern, however, little corpus-based research in fact exists on the relevance of lexical and semantic aspects for the alternation and possible other factors guiding it have not yet been explored at all. The present paper then addresses this gap. Before outlining the paper’s main assumptions in more detail, the following section briefly comments on what is known about the history of the conative alternation.

2.2. The conative alternation in the history of English

The history of the conative alternation in English is surprisingly little researched, with only some information found in Visser (Reference Visser1963, pp. 387–402), Sosa-Acevedo (Reference Sosa-Acevedo2009) and Esteban-Segura and Salles-Bernal (Reference Esteban-Segura and Salles-Bernal2017). What these sources suggest is that the alternation existed to some extent in Old English (OE) already and mainly involved senses relating to impact by motion. This is illustrated by the examples in (18), where a knocking event is instantiated by a transitive nominal pattern with a case-marked NP-theme (‘door’) in (18a), while in (18b), the theme is marked by the preposition at and (18c) involves on (examples taken from Sosa-Acevedo, Reference Sosa-Acevedo2009, p. 76). A cursory look at specific verbs in the online Bosworth-Toller Dictionary of Old English (Bosworth-Toller; Bosworth & Toller, Reference Bosworth, Toller, Sean and Tichy2014) and the online Middle English Dictionary (MED; University of Michigan Regents, 2013) further indicates that a variety of PP-constructions, including at and on, but also e.g. to or against were used with verbs of impact such as hitting or shooting in OE and ME (19a–c).Footnote ³

While the alternation thus appears to have had its precursors in the OE period already, the pattern presumably became increasingly restricted to at (as well as on) over time. Sosa-Acevedo (Reference Sosa-Acevedo2009, pp. 75–78) furthermore argues that among the different prepositional patterns in OE, only on-constructions constituted conatives proper in that they denoted a lack of contact, completion and affectedness; by contrast, at-PPs are taken to express reached locations and actual contact (Sosa-Acevedo, Reference Sosa-Acevedo2009, pp. 75–78). This may point towards some change in the semantics of at and at-constructions, potentially also reflected in an increasing openness of the subject slot to allow non-intentional and inanimate agents (Estaban-Segura & Salles-Bernal, Reference Esteban-Segura and Salles-Bernal2017). However, more extensive empirical data on the history of the alternation is clearly lacking to date.

The present study then aims to take a first step towards closing this gap. Drawing on evidence from other NP-PP alternations, I tentatively assume that the main changes and the emergence of the conative alternation in its present form should be located in early ME – a time when prepositional patterns in general came to be used more frequently, likely related to the loss of case marking around the same time, and a broad move away from synthetic means to more analytic patterns (Baugh & Cable, Reference Baugh and Cable2002; Fennell, Reference Fennell2001; Szmrecsanyi, Reference Szmrecsanyi, Nevalainen and Traugott2012). I therefore investigate data from this period onwards. The main focus is on the factors driving the alternation in earlier times; the precise features taken into account here are laid out in the following section.

2.3. Factors predicting the conative alternation

As mentioned, the present study aims to investigate the effects of two main groups of factors potentially at play in the conative alternation: (i) variables reflecting communicative and cognitive efficiency, specifically processing complexity, and (ii) lexical preferences, potentially reflecting semantic differences. A further factor, (iii) verb frequency, is taken to reflect both types.

2.3.1. Cognitive complexity

The first account builds on explorations of communicative efficiency affecting linguistic choices (e.g. Levshina, Reference Levshina2022). Among other things, processing complexity, meaning the cognitive costs incurred by producing or comprehending an utterance, is here taken to play a role in language use: for example, as formulated in Rohdenburg’s (Reference Rohdenburg1996) ‘complexity principle’, ‘[i]n case of more or less explicit grammatical options, the more explicit one(s) will tend to be favoured in cognitively more complex environments’ (Rohdenburg, Reference Rohdenburg1996, p. 151; also Rohdenburg, Reference Rohdenburg, Cuyckens and Radden2002, Reference Rohdenburg2020). Applying this principle to specific test cases necessitates definitions of (a) what constitutes more and less explicit grammatical (or lexico-grammatical) variants and (b) what constitutes cognitively more complex environments (Klavan & Schützler, Reference Klavan and Schützler2023, pp. 302–304). Regarding the former, Rohdenburg (Reference Rohdenburg1996, pp. 151–152; 2020, p. 771) points out that ‘the more explicit variant is generally represented by the bulkier element or construction,’ with the less explicit option typically involving less morphosyntactic material. This can be illustrated by e.g. less explicit/transparent NP-patterns vs more explicit PP-patterns in the case of the English dative alternation (Rohdenburg, Reference Rohdenburg1996, Reference Rohdenburg, Cuyckens and Radden2002), in the case of the home /at home-variation (Levshina, Reference Levshina2018) or in transitive alternations like search (for) (Pijpops et al., Reference Pijpops, Speelman, Grondelaers and Van de Velde2018). Other examples include morphological options, such as (less explicit) synthetic comparative adjectives vs (more explicit) analytic comparatives (Mondorf, Reference Mondorf2009) or also (less explicit) nominal case contrasted with (more explicit) postpositions (Klavan & Schützler, Reference Klavan and Schützler2023). Similarly, ‘complexity’ as a variable has been dealt with in a large range of ways, usually referring to environments requiring increased processing costs or higher working memory. For instance, Rohdenburg (Reference Rohdenburg2020, p. 771) includes greater clause length or greater morphosyntactic complexity of involved noun phrase complements/objects in his list, but also mentions factors such as passive voice, subordination and negation (which could be subsumed under ‘markedness’ features; cf. Klavan & Schützler, Reference Klavan and Schützler2023, p. 302). In addition, measures of predictability are often taken into account, with e.g. lower frequency of the lexical items included supposedly correlating with weaker entrenchment, lower accessibility, and therefore greater complexity, or low association strength between elements involved likewise indicating less predictability and thus increasing processing effort (e.g. Linzen & Jaeger, Reference Linzen and Jaeger2014). While many of these studies confirm the effect of complexity on the choice of variant (e.g. Levshina, Reference Levshina2018; Pijpops et al., Reference Pijpops, Speelman, Grondelaers and Van de Velde2018), other work demonstrates that complexity effects may ‘not generalise sufficiently to other languages to warrant the status of a “principle”’ (Klavan & Schützler, Reference Klavan and Schützler2023, p. 325). In the present paper, I test the assumptions made by the complexity principle on the conative alternation, assuming that the prepositional variant as the more explicit option should be more likely in more complex environments, such as with longer dependents, greater distance between verbs and their dependents, but also with more infrequent verbs. Since complexity effects are taken to be quasi-universal, I expect these variables to be rather stable in their impact; nevertheless, processing costs may be even higher in times of accelerated systemic change, and complexity effects may therefore become temporarily stronger compared to others.

A further aspect connected to cognitive complexity is the assumption that the use of prepositional marking may aid disambiguation between participants in a clause. This is driven by cross-linguistic research into phenomena of coding asymmetries and specifically the well-documented phenomenon of differential object marking (DOM): in languages such as Spanish, Maltese or Hebrew, prepositional marking is only used with a subset of transitive direct objects. This distribution is not random, but clearly guided by certain semantic-pragmatic features of the arguments (Levshina, Reference Levshina2021a, p. 1; Reference Levshina2021b; also Aissen, Reference Aissen2003; de Hoop & de Swart, Reference Hoop and de Swart2008; Fedzechkina et al., Reference Fedzechkina, Jaeger and Newport2012; Haspelmath, Reference Haspelmath2021a, Reference Haspelmath2021b; Iemmolo, Reference Iemmolo2010, Reference Iemmolo2013; Tal et al., Reference Tal, Smith, Culbertson, Grossman and Arnon2022; Witzlack-Makarevich & Seržant, Reference Witzlack-Makarevich, Seržant, Seržant and Witzlack-Makarevich2018). For instance, Spanish and Maltese DOM appears to be driven by animacy, with animate objects expressed by PPs opposed to inanimate NP-objects (21), while definiteness is the main determinant in Hebrew (Tal et al., Reference Tal, Smith, Culbertson, Grossman and Arnon2022, p. 3; further Haspelmath, Reference Haspelmath2021a, Reference Haspelmath2021b; Iemmolo, Reference Iemmolo2013).

The emergence and maintenance of DOM phenomena is typically linked to ambiguity avoidance mechanisms, more precisely to prepositional marking serving the function of disambiguating between subjects and objects in cases of overlap. This builds on observed hierarchies of semantic/grammatical roles, as subjects are prototypically animate and definite, among other properties. In the absence of other morphosyntactic cues such as word order, correctly distinguishing between subjects and likewise animate and definite objects would then be problematic. In such instances, ambiguity can be resolved by prepositional marking, presenting an efficient, selective strategy of argument disambiguation (Fedzechkina et al., Reference Fedzechkina, Jaeger and Newport2012, Reference Fedzechkina, Newport and Jaeger2017; Fedzechkina & Jaeger, Reference Fedzechkina and Jaeger2020; Levshina, Reference Levshina2022). Specifically relating to English, this argument has been applied to the history of the dative alternation in Zehentner (Reference Zehentner2022), arguing that the prepositional variant rose in ME as a response to increased ambiguity between subjects and indirect objects (recipients) due to the loss of case marking (Allen, Reference Allen1995). Building on this, we would expect the impact of these features to be strongest in ME (namely at a time of systemic change), with a potential decrease in relevance in later periods.

Ambiguity-related variables are here linked to complexity drawing on e.g. Tal et al. (Reference Tal, Smith, Culbertson, Grossman and Arnon2022, p. 5), who characterise DOM as a case of ‘predictability-based marking where languages mark less expected forms with more linguistic material’ (cf. also Haspelmath, Reference Haspelmath2019, Reference Haspelmath2021a, Reference Haspelmath2021b; Levshina, Reference Levshina2021a, Reference Levshina2021b). In this view, ambiguous, atypical contexts present cognitively costly environments, whose processing is facilitated by, for instance, the use of prepositional patterns.

2.3.2. Lexical factors

Besides the set of variables to do with complexity, previous work on PDE suggests that lexical factors – in this case, verb lemma – are also of relevance to the conative alternation (Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015). This is supported by research on a range of other alternation phenomena demonstrating strong lexical biases in the choice of variant (Heller et al., Reference Heller, Szmrecsanyi and Grafmiller2017; Röthlisberger et al., Reference Röthlisberger, Grafmiller and Szmrecsanyi2017; Stefanowitsch & Gries, Reference Stefanowitsch and Gries2003; also e.g. Pijpops et al., Reference Pijpops, Speelman, Grondelaers and Van de Velde2021, Reference Pijpops, Speelman and van den Bosch2022; Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024). Verb-specific effects may then also reflect semantic differences between the constructions, possibly connected to the distinction between allative, ablative and attention senses introduced above or more generally reflecting the spatial origins of the prepositions involved in the PP-pattern (Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001, Reference Broccias2003; Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017; Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010).

2.3.3. Lexical frequency

Finally, I add verb frequency as a factor combining lexical effects and processing complexity: as mentioned before, frequency plays a prominent part in usage-based approaches, positing that actual language use and in particular ‘frequency or repetition of experiences has an impact on cognitive representations’ (Bybee, Reference Bybee2007, p. 18; also Bybee & Hopper, Reference Bybee and Hopper2001; Divjak, Reference Divjak2019; Schmid, Reference Schmid2020). Most importantly for the present study, there is evidence that high-frequency words are activated and processed more easily, making them less complex environments than low-frequency words (cf. e.g. Bloem et al., Reference Bloem, Versloot and Weerman2017; Gibson, Reference Gibson1998; Röthlisberger, Reference Röthlisberger, Seoane and Biber2021). Following the hypotheses on complexity affecting syntactic choices laid out above, we can then assume verbs with a higher overall (corpus) frequency to tend towards NP-use, while PP-marking should be preferred with less frequent verbs, which present cognitively more demanding contexts.Footnote ⁴

In sum, the hypotheses to be tested in this study are as follows:

(i) the conative alternation is motivated by properties relating to complexity and ambiguity avoidance, treated under the umbrella of communicative efficiency,
(ii) the conative alternation is driven by lexical choices (verb lemma),
(iii) the conative alternation reflects the impact of lexical (verb) frequency, and
(iv) changes in the relative importance of these variables have to do with more general systemic shifts in the history of the English language.

The next section presents the data and methodology used to investigate these assumptions.

3. Data and methods

3.1. Data extraction and annotation

This paper uses data from three corpora of historical English: the Penn-Helsinki Parsed Corpus of Middle English (PPCME2; Kroch et al., Reference Kroch, Taylor and Santorini2000), which contains texts from 1150 to 1500 and has a size of approximately 1.2 million words; the Penn-Helsinki Parsed Corpus of Early Modern English (PPCEME; Kroch et al., Reference Kroch, Santorini and Delfs2004), which covers the timeframe of 1500 to 1700 and is about 1.8 million words large; and the Penn-Helsinki Parsed Corpus of Late Modern British English (PPCMBE2; Kroch et al., Reference Kroch, Santorini and Diertani2016), which goes from 1700 to 1914 and includes roughly 2.8 million words. For the study, I extracted all PPs and NPs which are annotated as standing in a ‘sister-relation’ to lexical verbs (excluding do, be, have and modals) in the three corpora, which yielded a total of about 406,000 combinations of verb+PP or verb+NP. In a first step, this was narrowed down to a subset of at-PPs on the one hand and NPs on the other hand. This means that other potentially interesting patterns such as collocations with on or against are not part of the study, and we can consequently not draw any conclusions on these further options for variation (as suggested by e.g. Sosa Acevedo’s Reference Sosa-Acevedo2009 discussion on- vs at-patterns in OE). In a second step, the two sets were compared and only instances of verbs attested in both samples were retained. Next, the resulting list was manually pruned to exclude irrelevant tokens, using Levin’s (Reference Levin1993) list of conative alternation verbs and their equivalents in earlier English provided in the BASICS toolkit (Percillier, Reference Percillier2016, Reference Percillier, Frank, Ivanovic, Mambrini, Passarotti and Sporleder2018) as a point of reference. For example, instances of at expressing time (21) and instances where NP- and PP-argument denote clearly different participants (e.g. instrument in 22a vs targets in 22b-c) were removed, and all instances of phrasal verbs (shoot so. down) and clauses with additional PP-arguments (shoot so. in the head) were discarded as non-alternating (Lee, Reference Lee2003, pp. 171–173). In order to nevertheless keep as many instances as possible, no frequency thresholds were imposed, meaning that any verb occurring at least once in both constructions across the corpora was included. This may evidently impact results and also requires a caveat regarding the question whether all instances in fact qualify as alternating contexts (Pijpops, Reference Pijpops2020). Likewise, I did not remove ditransitive clauses, that is, clauses with an additional (direct) NP-object besides the (indirect) alternating NP/at-PP (23), which could be debated.

The remaining final dataset contains 3,605 instances; in addition to basic meta-information and details on clause (a), these tokens were then each annotated for the groups of variables in (b–e), reflecting the issues presented above.

(a) Basic information
- text-ID
- period: Middle English (‘ME’), Early Modern English (‘EME’), Late Modern English (‘LME’)
- clause type: ‘transitive’ (V-NP/PP) vs ‘ditransitive’ (V-NP-NP/PP)
(b) Features pertaining to cognitive complexity
- length of the theme: log-transformed (base 10) length of the NP/PP-constituent, measured in words (not counting the preposition for PPs)
- distance between the verb and the theme: log-transformed, centred and scaled length of the number of words intervening between verb and NP/PP-argument
(c) Features relating to ambiguity avoidance
- animacy: ‘animate’ (including humans and animals) vs ‘inanimate’ themes (objects and concepts, but also institutions and body parts)
- definiteness: ‘definite’ (including pronouns and NPs with definite articles and determiners, as well as possessives) vs ‘indefinite’ themes (indefinite pronouns, unmarked NPs and NPs containing indefinite articles and pronouns)
(d) Lexical features: 47 distinct verb lemmasFootnote ⁵
(e) Verb lemma frequency

Regarding points (d) and (e), note that extracting lemma information and lemma frequencies from the corpora used here is difficult since no fully lemmatized versions of the corpora exist to date, and spelling variation is rampant, especially in earlier texts. For the PPCME2 (ME), I therefore employed the lemmatiser that is part of the BASICS toolkit (Percillier, Reference Percillier2016, Reference Percillier, Frank, Ivanovic, Mambrini, Passarotti and Sporleder2018). This tool – currently restricted to verbs – makes use of the MED, which lists all attested word forms for particular lemmas; verb forms in a specific corpus are then automatically matched with the appropriate lemma based on these correspondences and the lemma information is directly appended to the forms in the corpus. As detailed in Trips and Percillier (Reference Trips and Percillier2020, p. 7172), evaluation has shown that the lemmatiser has a precision and recall of 94.85% and 98.92%, respectively, as well as an accuracy score of 94%. In the current study, verb lemma assignment was manually checked again for the instances included in the dataset. For the PPCEME (EME) and PPCMBE (LME), I first VARDed the data to normalise spelling variation. VARD (Variable Detector; VARD 2; Baron & Rayson, Reference Baron and Rayson2008) is a Java-based tool which was specifically designed to deal with orthographic variation in EME texts. It can be used to automatically identify non-standard spellings in a text and to determine the most likely candidates for replacing them (cf. also Archer et al., Reference Archer, Kytö, Baron and Rayson2015). Since VARD is an interactive tool that, among other features, allows users to customise the balance between precision and recall to their liking (and for each word form individually), no one accuracy score for the tool can be provided. Once VARDed, I then used the lemmatising function of the R package ‘textstem’ (Rinker, Reference Rinker2022). For all corpora, the lemmatised versions were finally used to annotate the verbs in the extracted instances regarding lemma, as well as to derive (base 10 log-transformed) lemma frequencies in the respective periods.

3.2. Statistical analysis

As Sevenants et al. (Reference Sevenants, Van de Velde and Speelman2024, p. 2) observe, alternation research has tended to focus either on lexical effects or on ‘high-level, general predictors stemming from actual language use.’ These foci are also reflected in different methodological approaches. In studies mainly interested in lexical preferences, collostructional analyses, which ‘gauge[] associational strength between constructions and the lexical elements filling certain slots in these constructions,’ have become a standard tool (Schmid & Küchenhoff, Reference Schmid and Küchenhoff2013, p. 532; also Gries, Reference Gries2019; Gries & Stefanowitsch, Reference Gries and Stefanowitsch2004; Stefanowitsch, Reference Stefanowitsch, Hoffmann and Trousdale2013; Stefanowitsch & Gries, Reference Stefanowitsch and Gries2003, Stefanowitsch & Flach, Reference Stefanowitsch, Flach, Pastor and Colson2020). The impact of high-level variables, by contrast, is typically investigated by means of logistic regression analyses, with lexical biases often included as random factors in mixed-effects models, for lack of better options (Gelman & Hill, Reference Gelman and Hill2006; Levshina, Reference Levshina2015; Winter, Reference Winter2019; see Van de Velde & Pijpops, Reference Van de Velde and Pijpops2021 specifically for a discussion of lexical factors as random effects). However, neither of these methods is able to capture the full complexity of phenomena by providing a full and sound model of variation, and a unified analysis is clearly desirable. Such an approach is provided in Sevenants et al. (Reference Sevenants, Van de Velde and Speelman2024), ‘demonstrat[ing] elastic net regression, a new methodology which is able to process predictors of various nature on an equal footing’ (p. 4). As discussed in detail in their study, this method is superior to others in allowing to combine high-level predictors and lexical effects in one model and circumventing the inherent problem of being forced to ‘highlight[] only part of the phenomena under scrutiny’ (Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024, p. 2). The method, which combines ‘lasso regression’ and ‘ridge regression’, is furthermore useful in avoiding overfitting, in that it ‘adds a penalisation factor to the maximum likelihood procedure used to compute the model parameters’ (Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024, p. 4) and thereby effectively shrinks model coefficients. In addition, it is able to deal with high-level categorical predictors such as lexical items used in a particular slot by means of cross-validation and, since coefficients can also be shrunk to zero, can essentially ‘carry out variable selection within the regression analysis itself’ by disabling irrelevant predictors from the start (Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024, p. 4). Last, the model is more robust to multicollinearity than traditional methods (see Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024, pp. 4–7, for an overview of the components and specific features of the model).

In the present study, I then use all three methods, namely collostructional analysis, logistic regression and elastic net regression, with the aim of comparing and contrasting the respective results and in order to showcase the benefits of elastic net regression analysis, only very recently introduced to studies of linguistic variation. To conduct the analyses and for visualisation, I used R (version 4.4.2, R Core Team, 2024), specifically the packages ‘ggplot2’ (Wickham, Reference Wickham2016), and ‘collostructions’ (Flach, Reference Flach2021), as well as ‘lme4’ (Bates et al., Reference Bates, Mächler, Bolker and Walker2015), ‘visreg’ (Breheny & Burchett, Reference Breheny and Burchett2017) and ‘JGmermod’ (Grafmiller, Reference Grafmiller2019). For the elastic net regression, I used ‘ElasticToolsR’, a library by Sevenants (Reference Sevenants2023) based on ‘glmnet’ (Friedman et al., Reference Friedman, Tibshirani and Hastie2010). The results are given in the following section.

4. Results

4.1. Data distribution

Starting with a brief glance at the distribution of the variables in the dataset, Figure 1(a) first shows that between the two constructional variants, the PP-pattern is strikingly under-represented (taking up around 8% of all instances in total), meaning that the alternation is highly imbalanced. The small increase over time in the proportions accounted for by the at-variant (from approximately 4% in ME to roughly 10% in LME) is in line with the hypothesis that the conative (PP-)construction only really gained ground from ME onwards. However, this rise is rather subtle, and the conative alternation thereby stands in stark contrast to other cases like the dative alternation, where the PP-pattern temporarily even surpasses the NP-pattern in frequency during ME (McFadden, Reference McFadden and Lightfoot2002; Zehentner, Reference Zehentner2019). Parts (b) and (c) of Figure 1 indicate that the lengths of the theme arguments (namely the NP-object or NP-complement within the PP) and the distances between verb and theme, respectively, are generally very short, with mean lengths of 1.71 words, mean distances of 1.12 and medians of 1 word for both. While mean lengths very slightly increase between periods (ME: 1.42 words vs EME: 1.79 and LME: 1.8), mean distances decrease (if only marginally; ME: 1.18 words vs EME: 1.14 and LME: 1.07), with medians at 1 word throughout. With regard to animacy (d), we find that over two-thirds (70%) of all theme referents in total are animate, compared to less than one-third of inanimates – this discrepancy is higher in the earliest period (roughly 80/20) and lower in the latest period (roughly 60/40). For definiteness (e), there is a stable distribution of about 90% of instances including a definite theme throughout time. The frequency distribution of the 47 verb lemmas within the dataset is shown in (f): here, we find that the mean frequency of verb lemmas across the entire dataset is 76.7, with a median of 18; in the individual periods, mean verb lemma frequencies and medians are 27.8/4 for ME, 25.8/6 for EME and 37.9/11 for LME. The figure also indicates that there are strong outliers: specifically, the lemma ask is greatly overrepresented in the entire dataset (accounting for 1,215 tokens and thus almost 35% of instances overall), with the verbs next in frequency considerably lagging behind (beseech: 390/11%, touch: 209/6%, draw: 186/5%, smite: 152/4%). By contrast, there are two verbs (kick and spurn), which only occur once in each construction and accordingly only twice in the dataset. In terms of diachronic change, slight shifts can be seen, in particular a decrease in the frequency of the verb say and the gradual disappearance of e.g. beseech and smite, in contrast to growing frequencies of e.g. ask and shoot. Last, panel (g) shows verb lemma frequency in the whole corpora. Again, the distribution is clearly skewed here: while the mean corpus frequency of verb lemmas is about 652.5, the median is at 87. In ME, this is even more striking than in the later periods (mean/median ME: 1,390/96 vs EME: 352/70.5 and LME: 467/118). The main outlier in all periods is the verb say, which has a raw frequency of over 50,000 across all corpora (ME: over 33,000; EME and LME: over 9,000). The least frequent verbs in terms of corpus frequency include flout, spurn, cheap, tickle and lash (fewer than 25 in total).

Figure 1. Proportions and distributions of categorical and continuous variables in the dataset. (a) proportions of constructional variants, (b) distribution of theme lengths in words (log10 scale), (c) distribution of verb-theme distances in words (log10 scale), (d) proportions of animate vs inanimate theme referents, (e) proportions of definite vs indefinite themes, (f) distribution of verb lemma frequency within the dataset (log10 scale), and (g) distribution of corpus-wide verb lemma frequency (log10 scale).

4.2. Collostructional analysis

In order to determine which verbs, verb classes, and more abstract verb categories most strongly tend towards one or the other constructional variant and whether any shifts over time can be discerned, I then take a first exploratory look at lexical and semantic biases in the alternation by means of distinctive collostructional analysis (cf. Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010 for collostructional analyses of the PDE conative PP-construction). To this end, separate analyses were conducted for the three periods, shown in Figure 2 as well as Tables A1–A3 in the Appendix.

Figure 2. Verb lemma associations with either NP-patterns (purple) or PP-patterns (yellow) as indicated by collostructional analyses for ME (top), EME (mid), and LME (bottom). Collocation strength is given on the x-axis, with the vertical dashed line marking the significance threshold conventionally used in collostructional analysis.

What can be seen in the figure is that the verbs most strongly associated with the NP-construction in all periods are verbs of communication (expressing ‘attention’ senses): ask is among the highest-ranked verbs for NP-expression in all periods, and beseech moves from higher PP-use to NP-preference over time. The communication verb say loses attraction to the NP-pattern from ME onwards; however, it should be noted here that the verb greatly drops in frequency in the alternation in general (a change that is likely connected to the by-now largely ungrammatical use of say in the double object construction, as in *say me one thing; cf. e.g. Colleman & De Clerck, Reference Colleman and De Clerck2011). Apart from communication verbs, the NP-pattern is rather stably used with the verbs catch, draw and touch as well as smite in later texts; while the first two of these express more ablative senses, the latter have an allative meaning. By comparison, the PP-construction appears as consistently biased towards allatives, in particular verbs of ballistic motion that denote throwing and shooting events (e.g. cast, throw and shoot) – this preference seems to become even more pronounced over time (e.g. with new verbs like fire and fling being added to the group of PP-associated verbs). In addition, verbs to do with engaging with work tasks or games (e.g. work, labour or play) become more strongly drawn to the PP-pattern in later periods; the same goes for verbs to do with emotional reactions to external stimuli such as rejoice or wonder.

4.3. Mixed-effects logistic regression modelling

In the next step, the interplay between the high-level predictors, specifically between the complexity- or ambiguity-related factors, is tested by means of mixed-effects logistic regression modelling (Gelman & Hill, Reference Gelman and Hill2006; Levshina, Reference Levshina2015; Winter, Reference Winter2019). For the sake of comparability with the other approaches, three separate models are run here for the respective periods (but note that a single model with ‘period’ as an interaction term for the other predictors yields the same results). In the models presented below, the dependent variable is construction (NP vs PP), while complexity (length and distance), ambiguity features (animacy and definiteness), as well as verb lemma frequency, are used as fixed effects. Verb lemma is included as a random effect due to the high number of levels leading to convergence issues when included as a fixed effect (cf. Van de Velde & Pijpops, Reference Van de Velde and Pijpops2021). Insignificant predictors were not discarded from the model and no model comparison or selection was conducted (cf. Klavan & Schützler, Reference Klavan and Schützler2023; Tizón-Couto & Lorenz, Reference Tizón-Couto and Lorenz2021).

The results of the regressions are given in Figure 3 as well as Tables A4–A6 in the Appendix. As can be seen, in ME, greater distance is linked to greater odds for PP-use, while higher corpus frequency of a verb lemma correlates with higher NP-use, as does inanimacy of the theme in this period. The effects of all other predictors are non-significant in the ME model. Effects also seem to shift over time – while e.g. verb frequency still significantly and positively impacts NP-choice in EME, this is absent in the LME model. Distance between verb and theme does not significantly influence the alternation in EME, but re-emerges as a significant predictor in LME, with greater distances again triggering higher PP-use. A further change is the disappearance of animacy as a relevant factor after ME, whereas length only comes into play in the latest period, with NP-patterns being preferred with longer themes in LME. Finally, the probability of PP-patterns is higher with indefinite themes in EME, but not in the other periods.

Figure 3. Output of mixed-effects logistic regression models for NP- vs PP-choice in Middle English (left), Early Modern English (centre), and Late Modern English (right), showing the effects of (a) theme length in words (log10 scale), (b) distance between verb and theme in words (log10 scale), (c) theme animacy, (d) theme definiteness, and (e) corpus frequency of the verb lemmas (log10 scale). Significant effects are indicated by asterisks in the plots, following standard conventions (*p<0.05, **p<0.01, ***p<0.001).

While some of these results are nicely in line with the expectations posited above, others do not fit predictions too well. In addition, although model fits (ME: C = 0.919; Somer’s Dxy = 0.838; EME: C = 0.982; Dxy = 0.964; LME: C = 0.969, Dxy = 0.937) and accuracies (ME: 96.65%; EME: 96.47%; LME: 94.9%) are high, importantly, variances of the random effects are also very high in all models (ME: variance = 3.672, std.dev = 1.916; EME: variance = 8.52, std.dev = 2.919; LME: variance = 13.04, std.dev = 3.611). This suggests that verb lemma has a strong influence throughout time and that the results here neither present the full picture nor are entirely reliable and robust. A method that is able to incorporate both high-level predictors as well as lexical effects in a balanced and unified way should therefore clearly be preferred: such a technique is, as already outlined above and as demonstrated in Sevenants et al. (Reference Sevenants, Van de Velde and Speelman2024), provided by elastic net regression. The outcome of applying this method to the present data is presented in the following section.

4.4. Elastic net regression modelling

The set-up for the elastic net regressions more or less mirrors the logistic regression presented in the preceding section: separate models are again run for different periods, each including the predictors of theme length, verb-theme distance, theme animacy and theme definiteness, as well as verb lemma frequency. Different from the logistic regression, however, verb lemma is also included here as a main predictor, rather than as a random effect. Figure 4 (as well as Table A7 in the Appendix) presents the results of the three-period models with features (feature levels) and their coefficients: coefficients indicate whether a particular feature has an effect and which direction this effect has; for categorical factors, coefficients are given for each individual type or feature level (e.g. separate coefficients for the different verb lemmas). The best performance for the models is at alpha = 0.2 for ME and EME (among others) and alpha = 0.1 for LME, as automatically determined by ElasticToolsR (cf. Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024 for a detailed discussion of the specific components of elastic net regression).Footnote ⁶

Figure 4. Output of elastic net regression models for NP- vs PP-choice in Middle English (top), Early Modern English (mid) and Late Modern English (bottom); features and feature levels associated with NP-use are in purple, those associated with PP-use in yellow, and removed (non-significant) features in green. The x-axis shows the coefficients of the features (negative values indicate NP-preference, positive values signal PP-bias).

Starting with lexical effects, it is then first noticeable that NP-expression is again strongly tied to verbs expressing communication or verbalising; specifically, the verb ask is the only verb prone to NP-use in ME and also features as one of the most tightly associated verbs to NP-patterns in the later periods, alongside beseech in EME. However, in later texts, further verbs not connected to communication or perception are likely to appear with nominal themes as well, including the ablative verbs catch, snatch and draw, in addition to touch and the allative smite. By contrast, PP-expression is associated with more verb lemmas throughout time, and coefficients are generally higher. Among the verbs which are most inclined towards PP-use in ME are allative verbs of ballistic motion such as shooting and throwing (including cast, throw, shoot and strike) or hitting (lash) but also ablative verbs like fetch or pull, as well as the verb play. These preferences seem to be stable over time, with further allative verbs like fling, fire or kick being added to the group of PP-prone verbs in EME and LME. Interestingly, later texts also show verbs expressing feelings or attitude like wonder, muse, rejoice, admire or grudge as closely associated with PP-expression.

With respect to higher-level predictors, the elastic net regression on the one hand indicates that higher verb lemma frequency increases the odds of NP-patterns being chosen throughout time (even if less strongly so in LME). On the other hand, the PP-construction is more likely when there is a greater distance between verb and theme. By contrast, length does not play a role in any period, and animacy (as well as definiteness) shows a surprising effect: while this factor is not retained in the ME model and thus does not seem to significantly impact the alternation, it is present in EME and LME, even if relatively weakly so. Specifically, animate (and definite) themes are more closely linked to NP-expression in the later periods, while inanimates (and indefinites) are more often used in PP-patterns (e.g. kick the dog/ ask the woman [a question] vs kick at a ball/ ask [a question] at a book).

In terms of comparison with the methods presented in Section 4.2, the lexical effects seen here largely overlap with the results of the collostructional analyses, with the highest ranking verbs for each pattern also emerging as relevant in the elastic net regression and indications for some shifts in verb and verb class associations over time. Likewise, it is encouraging that most of the factors found to be significantly impacting the choice in the logistic regression model, such as distance as well as verb frequency, persist when verb lemma is taken into account as a main predictor within the same model. Nevertheless, there are some differences with respect to the latter, in particular the absence of an animacy (and definiteness) effect in the earliest period compared to an effect, and especially one in the opposite direction, in later periods. In addition, length is not retained in any of the period models, while the logistic regression for LME showed it as significant.

In sum, the findings of the elastic net regression confirm a strong predominance of lexical effects in guiding the conative alternation. Nevertheless, complexity is also found to play a role. In the following sections, I briefly discuss the theoretical and methodological implications of these results (Section 5) and point out some limitations of the study as well as directions for further research (Section 6).

5. Discussion

5.1. Complexity effects

The results of the models presented in Section 4, in particular those of the elastic net regression, largely provide support for the hypotheses laid out above, while also opening up room for further questions. That is, the findings (i) confirm a general impact of processing complexity, in that contexts that require less cognitive effort, meaning instances with shorter verb-theme dependencies, are drawn towards NP-patterns. By contrast, greater distance between verb and argument in the clause (caused by intervening material like e.g. adverbs), representing a cognitively more complex context, triggers the use of the more explicit grammatical option, namely the PP-pattern (Klavan & Schützler, Reference Klavan and Schützler2023; Levshina, Reference Levshina2018; Pijpops et al., Reference Pijpops, Speelman, Grondelaers and Van de Velde2018; Rohdenburg, Reference Rohdenburg1996, Reference Rohdenburg2020). Hypothesis (iii) on verb frequency as a factor can, as mentioned above, also be subsumed under complexity effects – high-frequency verbs are more predictable and thus present a cognitively less demanding environment, leading to higher uses of nominal patterns, whereas PP-marking facilitates processing in more complex environments with low-frequency verbs (e.g. Bloem, Reference Bloem2021).

At the same time, theme length does not seem to play a role in the conative alternation: despite constituting a more complex environment, it can be argued that in conative constructions, the theme is typically in clause-late position anyway and that heaviness of the constituent is thus less relevant in this case than in e.g. the dative alternation (Bresnan et al., Reference Bresnan, Cueni, Nikitina, Baayen, Bouma, Kraemer and Zwarts2007). Furthermore, animacy and definiteness show mixed effects: NP-patterns tend towards animate/definite themes, while inanimates/indefinites are more prone to prepositional patterns, which goes against the assumption of ambiguation avoidance as a factor in this choice in a process similar to that motivating DOM in other languages (Haspelmath, Reference Haspelmath2021a, Reference Haspelmath2021b; Iemmolo, Reference Iemmolo2013). Importantly, this effect is not found in the earliest period, but only present in later texts, indicating change over time (hypothesis iv). That is, there is no evidence supporting the idea that PP-marking was preferred with animate/definite themes in order to reduce the cognitive costs of potential ambiguity between the protoyypically animate/definite subject and a likewise animate/definite object, and no evidence that such an effect was particularly strong in ME, when the increasing loss of case marking would have amplified the demand for a different strategy of argument disambiguation. The conative alternation thus seems to differ from other alternations in the history of English, where prepositional patterns saw a striking increase in the course of ME especially in ambiguous contexts, only to drop in frequency again once the system had ‘recovered’ by fixing word order and thereby making another disambiguation cue quasi-categorical (cf. also Zehentner, Reference Zehentner2022).

5.2. Lexical effects

In addition to confirming some impact of processing complexity, the results clearly support hypothesis (ii) on lexical biases and semantic differences between the constructions. The strong predilection of shooting and throwing verbs for PP-use throughout time can be related to the basics semantics of such verbs as well as the spatial origins of the preposition, directly instantiating the PP-construction’s presumed dominant meaning of ‘X directs action at Y’ (Goldberg, Reference Goldberg1995). Still, in later periods the PP-pattern is also tightly associated with more abstract notions of emoting such as wonder or rejoice, in which cases NP-uses are in fact rare or obsolete today (OED, s.v. wonder, rejoice). This may be taken as a sign of the preposition grammaticalising further and the entire PP-construction expanding in semantics over time, coming to cover more metaphorical senses (Sato, Reference Sato2009) or can be viewed as indicating the emergence of new, semantically different (sub-)patterns (Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Traugott & Trousdale, Reference Traugott and Trousdale2013).

By comparison, the main verbs that emerge as strongly associated with the NP-pattern in the analysis are verbs like ask or beseech; that is, communication verbs emerge as clearly biased towards NP-expression rather than PP-marking. However, verbs from other classes such as catch and draw (both denoting an ablative sense of motion in the direction of the agent) as well as allative smite (motion away from the agent and towards a theme) also come to select for NP-use in later times, which may suggest some shifts in constructional semantics here as well. We could tentatively interpret this as constructional alignment – with alternating patterns becoming more similar to each other (De Smet et al., Reference De Smet, D’hoedt, Fonteyn and Goethem2018). Still, the associations shown for the NP-pattern, together with the fact that there are more verb lemmas with a strong preference for the PP-variant throughout time, may also reflect a constructional frequency effect (Bloem, Reference Bloem2021, p. 138; Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024): regular NP-objects are generally more frequent than PP-complements, which makes it harder for verbs to ‘stand out’ in nominal patterns. The prepositional conative is thus more lexically specific compared to the ‘default’ nominal transitive, and the changes in lexical preference seen with the PP-pattern are more conspicuous. This is also in line with PP-uses of some included verbs being more specialised than corresponding NP-patterns, cf. e.g. the combination play + at-PP now being linked specifically to participating in games as well as pretending as a game (as opposed to the rather diverse meaning of play + NP (OED, s.v. play).

The lexical differences between the PP- and NP-patterns highlighted here may also provide an explanation for the animacy/definiteness biases in the analysis as discussed above: rather than reflecting an impact of ambiguity (and accordingly complexity), the animacy/definiteness effects seen in EME and LME may be strongly linked to the preferred argument structure patterns of NP-biased verbs. For example, the communication verb ask is frequently used in ditransitive constructions of the type ask them a question, where the alternating object in fact expresses an addressee instead of a transitive theme. While such addressees can still vary between NP- and PP-marking in PDE, the former is by far the preferred option today (Zehentner, Reference Zehentner2019). The general predominance of animate (definite) objects in NP-patterns is then likely a consequence of addressees in communication events prototypically being animate and discourse-given, while both goals and targets of ballistic motion (as in shooting at a car) as well as stimuli of emotions (as in rejoice at the flowers) are often inanimate and new. In order to get a clearer picture of the conative alternation, it may therefore be necessary to restrict further investigations to transitive tokens only and to view variation between nominal ditransitives and two-place at-patterns as a sub-type of or phenomenon similar to the English dative alternation (Levin, Reference Levin1993).

Despite these differences, it should also be pointed out that a considerable proportion of verbs do not show any predilection for either of the variants; this may indicate that subtler contextual, verb-independent notions (e.g. success or failure of the action or degree of affectedness of the participant), not captured by the variables included in this study, are likely at play in this alternation as well. This is, of course, not surprising, but directly corresponds to the observations abundantly made in previous research on conatives (e.g. Beavers, Reference Beavers2010, Reference Beavers2011; Broccias, Reference Broccias, Andronis, Ball, Elston and Neuvel2001, Reference Broccias2003; Guerrero-Medina, Reference Guerrero-Medina, de Mendoza Ibáñez, Oyón and Pérez-Sobrino2017; Levin, Reference Levin1993; Perek, Reference Perek, Glynn and Robinson2014, Reference Perek2015; Perek & Lemmens, Reference Perek and Lemmens2010). Nevertheless, it also again confirms that the English conative alternation and its history differ considerably from other well-researched alternations such as the dative alternation, where high-level predictors like complexity and information structure have been shown to be equally influential as lexical and semantic factors, if not more so (e.g. Bresnan et al., Reference Bresnan, Cueni, Nikitina, Baayen, Bouma, Kraemer and Zwarts2007). A final caveat concerns verb frequency within the dataset: the fact that no frequency threshold was imposed in this study means that some verb lemmas with a low frequency in the dataset may not show significant attraction to either pattern merely due to insufficient data – although two low-frequency items (lash, muse) are retained in the present results, it cannot safely be inferred from these results whether other infrequent items really have no preference for either of the constructions.

5.3. Methodological implications

On a final methodological note, the output of the elastic net regressions presented in this study to a large extent overlaps with results gained from more standard methods such as collostructional analysis and mixed-effects logistic regression modelling, which is overall reassuring. Nevertheless, the elastic net regression models have also revealed important differences, especially compared to the logistic regression. Most strikingly, the animacy/definiteness effects shown in the latter – which would have confirmed one of the hypotheses posed in this paper – disappeared in the elastic net regression. The comparison has thus had a significant effect on the conclusions drawn about the history of the conative alternation, showcasing the benefits of being able to assess lexical effects and those of high-level predictors in a single model (Sevenants et al., Reference Sevenants, Van de Velde and Speelman2024) and to thereby obtain more refined and robust results. The combination of both lexical effects and higher-level predictors in one model – and evidence for both of these affecting syntactic variation – also has more general theoretical implications, in that it provides support for accounts arguing against a syntax-lexicon dichotomy, such as construction grammar (e.g. Goldberg, Reference Goldberg1995).

6. Conclusions

This paper has investigated the impact of different cognitive factors which are involved in a particular case of argument structure variation between a nominal pattern and a prepositional construction with at, namely the English conative alternation observable with verbs of impact such as hitting. More precisely, the study has zoomed in on the history of the alternation from ME to LME, using data from the standard corpora of historical English. Tracking the development of the alternation over time, the study has investigated different factors impacting the choice of prepositional over nominal patterns. On the one hand, these include variables grounded in Rohdenburg’s complexity principle (Reference Rohdenburg1996), which suggests that the more explicit option (in this case the PP-pattern) should be used in more complex, cognitively demanding environments, such as in instances with longer constituents or low-frequency verbs, or in potentially ambiguous contexts. On the other hand, the paper has assessed the role of lexical biases, in particular verb-related preferences. In order to explore the relative impact of these two groups of factors, the more standard methods of collostructional analysis and mixed-effects logistic regression modelling have been compared to a new model demonstrated in Sevenants et al. (Reference Sevenants, Van de Velde and Speelman2024), namely elastic net regression. The latter is particularly beneficial as it allows, among other things, to combine both lexical effects and high-level predictors such as argument length in a single, unified model.

Ultimately, the elastic net regression model has confirmed the important role of lexical effects throughout time and has suggested that the PP-pattern in particular may have lexically and semantically expanded over time. At the same time, results also show an impact of complexity, specifically of distance and verb frequency. Other effects are more mixed: in particular, ambiguity seems to have the opposite impact than what was expected. Overall, it thus seems that the conative alternation in general may be more lexically driven than other alternations like the English dative alternation. The influence of semantic-pragmatic factors likely at play in the alternation (such as success or failure of the action) as well as their historical development, however, remains unexplored.

Further limitations of the study and points for future investigations include the exclusion of additional potentially alternating patterns (with prepositions other than at). For example, PP-instances verbs of ingestion or the prototypical impact verbs hit and knock may be attested with different prepositions. On the other hand, the study has been more inclusive than others, mostly due to including ditransitive patterns in addition to transitive clauses. As additional objects in the clause also provide more options for order variation, which may, in turn, impact the factors at play and the direction of their influence, this factor needs to be revisited in more detail. Likewise, the paper has been very (potentially too) broad regarding the verbs included (also covering e.g. admire or rejoice, not typically counted as part of the conative alternation). In general, manual decisions on the interchangeability of the patterns are always subjective to some extent (especially when dealing with historical data), which may evidently skew results.

Funding statement

This study was funded by the Swiss National Science Foundation (Grant nr. 175987; Prepositions in English Argument Structure across Time and Space).

Data availability statement

The data and code for the present study can be accessed via OSF: https://osf.io/uzvtd/?view_only=adc1af91de6f49c4a2fbb335cb506d70.

Appendix

Table A1. Distinctive collexeme analysis of verb lemmas, verb classes, and verb categories associated with NP vs PP patterns in Middle English

Table A2. Distinctive collexeme analysis of verb lemmas, verb classes, and verb categories associated with NP vs PP patterns in Early Modern English

Table A3. Distinctive collexeme analysis of verb lemmas, verb classes, and verb categories associated with NP vs PP patterns in Late Modern English

Table A4. Output of a mixed-effects logistic regression model for NP/PP-choice in Middle English

Table A5. Output of a mixed-effects logistic regression model for NP/PP-choice in Early Modern English

Table A6. Output of a mixed-effects logistic regression model for NP/PP-choice in Late Modern English

Table A7. Output of an elastic net regression model for NP vs PP-choice (features and coefficients) in Middle English, Early Modern English, and Late Modern English

Footnotes

¹ This ‘trying’ aspect is also what gives the prepositional construction, typically referred to as the ‘conative’ construction and ultimately the alternation, its name (Lat. conor/conari ‘to try’, Perek & Lemmens, Reference Perek and Lemmens2010, §17; Vincent, Reference Vincent2013).

² Note that Beavers (Reference Beavers2010) further distinguishes between incremental theme verbs such as eating and other verbs like chewing or gnawing, arguing that with the latter, no semantic contrast to the NP-version is given.

³ In (19b) and (19c), a transitive NP-object (‘arrows’ and ‘spear’, respectively) is also given, meaning that these instances do not fully qualify as tokens of the conative construction in the definition adhered to in most present-day accounts (Levin, Reference Levin1993, pp. 42–43).

⁴ Note that frequency of the theme head nouns might be of relevance as well; this factor is not included in the present study due to issues with operationalisation (see also Section 3 on lemmatisation).

⁵ The verb lemmas are furthermore grouped into verb classes (loosely based on Perek, Reference Perek2015, p. 122) and classified according to the allative/ablative/attention distinction introduced before; however, this is merely done to aid post-hoc interpretation of the results.

⁶ The low values for alpha in all models mean that Ridge penalty is emphasised.

References

Aissen, J. (2003). Differential object marking: Iconicity vs. economy. Natural Language & Linguistic Theory, 21(3), 435–483. https://doi.org/10.1023/a:1024109008573.CrossRef Google Scholar

Allen, C. (1995). Case marking and reanalysis: Grammatical relations from old to early modern English. OUP.10.1093/oso/9780198240969.001.0001CrossRef Google Scholar

Archer, D., Kytö, M., Baron, A., & Rayson, P. (2015). Guidelines for normalising early modern English corpora: Decisions and justifications. ICAME Journal, 39(1), 5–24.10.1515/icame-2015-0001CrossRef Google Scholar

Baron, A., & Rayson, P. (2008). VARD2: A tool for dealing with spelling variation in historical corpora. Postgraduate Conference in Corpus Linguistics, 2008-05-22, Birmingham, UK: Aston University. http://ucrel.lancs.ac.uk/people/paul/publications/BaronRaysonAston2008.pdf.Google Scholar

Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01.CrossRef Google Scholar

Baugh, A., & Cable, T. (2002). A history of the English language (5th ed.). Taylor & Francis. https://doi.org/10.4324/9780203994634.Google Scholar

Beavers, J. (2010). The structure of lexical meaning. Language, 86(4), 821–864. https://doi.org/10.1353/lan.2010.0040.CrossRef Google Scholar

Beavers, J. (2011). On affectedness. Natural Language & Linguistic Theory, 29(2), 335–370. https://doi.org/10.1007/s11049-011-9124-6.CrossRef Google Scholar

Bloem, J. (2021). Processing verb clusters. LOT. https://doi.org/10.48273/LOT0586.Google Scholar

Bloem, J., Versloot, A., & Weerman, F. (2017). Verbal cluster order and processing complexity. Language Sciences, 60, 94–119. https://doi.org/10.1016/j.langsci.2016.10.009.CrossRef Google Scholar

Bosworth, J., & Toller, T. N. (Eds.) (2014). An Anglo–Saxon dictionary online. Edited and maintained by Sean, C. & Tichy, O.. Faculty of Arts, Charles University. https://bosworthtoller.com/.Google Scholar

Breheny, P., & Burchett, W. (2017). Visualization of regression models using visreg. The R Journal, 9, 56–71.10.32614/RJ-2017-046CrossRef Google Scholar

Bresnan, J., Cueni, A., Nikitina, T. & Baayen, H. (2007). Predicting the dative alternation. In Bouma, G., Kraemer, I. & Zwarts, J. (Eds.), Cognitive foundations of interpretation (pp. 69–94). Royal Netherlands Academy of Science. https://web.stanford.edu/~bresnan/qs-submit.pdf.Google Scholar

Broccias, C. (2001). Allative and ablative at-constructions. In Andronis, M., Ball, C., Elston, H., & Neuvel, S. (Eds.), CLS 37: The main session. Volume 1 (pp. 67–82). Chicago Linguistic Society.Google Scholar

Broccias, C. (2003). The English change network: Forcing changes into categories. De Gruyter Mouton. https://doi.org/10.1515/9783110901207.CrossRef Google Scholar

Bybee, J. (2007). Frequency of use and the organization of language. OUP. https://doi.org/10.1093/acprof:oso/9780195301571.001.0001.CrossRef Google Scholar

Bybee, J., & Hopper, P. (2001). Frequency and the emergence of linguistic structure. Benjamins. https://doi.org/10.1075/tsl.45.CrossRef Google Scholar

Colleman, T., & De Clerck, B. (2011). Constructional semantics on the move: On semantic specialization in the English double object construction. Cognitive Linguistics, 22(1), 183-209. https://doi.org/10.1515/cogl.2011.008CrossRef Google Scholar

De Smet, H., D’hoedt, F., Fonteyn, L., & Goethem, K. (2018). The changing functions of competing forms: Attraction and differentiation. Cognitive Linguistics, 29(2), 197–234. https://doi.org/10.1515/cog-2016-0025.CrossRef Google Scholar

Divjak, D. (2019). Frequency in language: Context, memory and attention. CUP. https://doi.org/10.1017/9781316084410.CrossRef Google Scholar

Dixon, R. M. W. (1991). A new approach to English grammar, on semantic principles. OUP.10.1093/oso/9780198242727.001.0001CrossRef Google Scholar

Esteban-Segura, L., & Salles-Bernal, S. (2017). A diachronic study of the conative alternation construction in English. Unpublished conference abstract. http://hdl.handle.net/10630/13794.Google Scholar

Fedzechkina, M., Jaeger, T. F., & Newport, E. L. (2012). Language learners restructure their input to facilitate efficient communication. Proceedings of the National Academy of Sciences of the United States of America, 109(44), 17897–17902. https://doi.org/10.1073/pnas.1215776109.CrossRef Google Scholar PubMed

Fedzechkina, M., Newport, E. L., & Jaeger, T. F. (2017). Balancing effort and information transmission during language acquisition. Cognitive Science, 41(2), 416–446. https://doi.org/10.1111/cogs.12346.CrossRef Google Scholar PubMed

Fedzechkina, M., & Jaeger, T. F. (2020). Production efficiency can cause grammatical change. Cognition, 196(104115), 104115. https://doi.org/10.1016/j.cognition.2019.104115.CrossRef Google Scholar PubMed

Fennell, B. A. (2001). A history of English: A sociolinguistic approach. Blackwell.Google Scholar

Flach, S. (2021). Collostructions: An R implementation for the family of collostructional methods. version v.0.2.0, https://sfla.ch/collostructions/.Google Scholar

Friedman, J., Tibshirani, R., & Hastie, T. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33(1), 1–22. https://doi.org/10.18637/jss.v033.i01.CrossRef Google Scholar PubMed

Gelman, A., & Hill, J. (2006). Data analysis using regression and multilevel/hierarchical models. CUP.10.1017/CBO9780511790942CrossRef Google Scholar

Gibson, E. (1998). Linguistic complexity: Locality of syntactic dependencies. Cognition, 68(1), 1–76. https://doi.org/10.1016/S0010-0277(98)00034-1.CrossRef Google Scholar

Goldberg, A. (1995). Constructions: A construction grammar approach to argument structure. University of Chicago Press.Google Scholar

Grafmiller, J. (2019). jasongraf1/JGmermod: Custom functions for mixed-effects regression models. R package. https://rdrr.io/github/jasongraf1/JGmermod/.Google Scholar

Gries, S. T. (2019). 15 years of collostructions. International Journal of Corpus Linguistics, 24(3), 385–412. https://doi.org/10.1075/ijcl.00011.gri.CrossRef Google Scholar

Gries, S. T., & Stefanowitsch, A. (2004). Extending collostructional analysis. International Journal of Corpus Linguistics, 9(1), 97–129. https://doi.org/10.1075/ijcl.9.1.06gri.CrossRef Google Scholar

Guerrero-Medina, P. (2011). An antipassive interpretation of the English “conative alternation”. In Guerrero-Medina, P. (Ed.), Morphosyntactic alternations in English (pp. 182–203). Equinox.Google Scholar

Guerrero-Medina, P. (2017). The English conative as a family of constructions. In de Mendoza Ibáñez, F. J. R., Oyón, A. L., & Pérez-Sobrino, P. (Eds.), Constructing families of constructions (pp. 277–299). Benjamins. https://doi.org/10.1075/hcp.58.10gue.CrossRef Google Scholar

Haspelmath, M. (2019). Differential place marking and differential object marking. Language Typology and Universals, 72(3), 313–334. https://doi.org/10.1515/stuf-2019-0013.CrossRef Google Scholar

Haspelmath, M. (2021a). Explaining grammatical coding asymmetries. Journal of Linguistics, 57(3), 605–633. https://doi.org/10.1017/s0022226720000535.CrossRef Google Scholar

Haspelmath, M. (2021b). Role-reference associations and the explanation of argument coding splits. Linguistics, 59(1), 123–174. https://doi.org/10.1515/ling-2020-0252.CrossRef Google Scholar

Heller, B., Szmrecsanyi, B., & Grafmiller, J. (2017). Stability and fluidity in syntactic variation world-wide. Journal of English Linguistics, 45(1), 3–27. https://doi.org/10.1177/0075424216685405.CrossRef Google Scholar

Hoop, H., & de Swart, P. (2008). Differential subject marking. Springer. https://doi.org/10.1007/978-1-4020-6497-5.Google Scholar

Huddleston, R. D., & Pullum, G. K. (2002). The Cambridge grammar of the English language. CUP. https://doi.org/10.1017/9781316423530.CrossRef Google Scholar

Iemmolo, G. (2010). Topicality and differential object marking. Studies in Language, 34(2), 239–272. https://doi.org/10.1075/sl.34.2.01iem.CrossRef Google Scholar

Iemmolo, G. (2013). Symmetric and asymmetric alternations in direct object encoding. Language Typology and Universals, 66(4), 378–403. https://doi.org/10.1524/stuf.2013.0019.CrossRef Google Scholar

Klavan, J., & Schützler, O. (2023). The complexity principle and the morphosyntactic alternation between case affixes and postpositions in Estonian. Cognitive Linguistics, 34(2), 297–331. https://doi.org/10.1515/cog-2021-0114.CrossRef Google Scholar

Kroch, A., Santorini, B. & Delfs, L. (2004). The Penn-Helsinki Parsed Corpus of Early Modern English. Department of Linguistics, University of Pennsylvania, first edition, release 3. http://www.ling.upenn.edu/ppche/ppche-release-2016/PPCEME-RELEASE-3.Google Scholar

Kroch, A., Santorini, B., & Diertani, A. (2016). The Penn Parsed Corpus of Modern British English. Department of Linguistics, University of Pennsylvania, second edition, release 1. http://www.ling.upenn.edu/ppche/ppche-release-2016/PPCMBE2-RELEASE-1.Google Scholar

Kroch, A., Taylor, A. & Santorini, B. (2000). The Penn-Helsinki Parsed Corpus of Middle English. Department of Linguistics, University of Pennsylvania, second edition, release 4. http://www.ling.upenn.edu/ppche/ppche-release-2016/PPCME2-RELEASE-4.Google Scholar

Lee, K. (2003). The conative construction in English. Discourse and Cognition, 10(3), 165–186.Google Scholar

Levin, B. (1993). English verb classes and alternations: A preliminary investigation. Chicago UP.Google Scholar

Levshina, N. (2015). How to do linguistics with R. Benjamins. https://doi.org/10.1075/z.195.CrossRef Google Scholar

Levshina, N. (2018). Anybody (at) home? Communicative efficiency knocking on the construction grammar door. Yearbook of the German Cognitive Linguistics Association, 6(1), 71–90. https://doi.org/10.1515/gcla-2018-0004.CrossRef Google Scholar

Levshina, N. (2021a). Communicative efficiency and differential case marking. Linguistics Vanguard: Multimodal Online Journal, 7(Suppl 3), 20190087. https://doi.org/10.1515/lingvan-2019-0087.CrossRef Google Scholar

Levshina, N. (2021b). Cross-linguistic trade-offs and causal relationships between cues to grammatical subject and object, and the problem of efficiency-related explanations. Frontiers in Psychology, 12, 648200. https://doi.org/10.3389/fpsyg.2021.648200.CrossRef Google Scholar

Levshina, N. (2022). Communicative efficiency. CUP. https://doi.org/10.1017/9781108887809.CrossRef Google Scholar

Linzen, T., & Jaeger, F. (2014). Investigating the role of entropy in sentence processing. In Proceedings of the fifth workshop on cognitive Modeling and computational linguistics (pp. 10–18). Association for Computational Linguistics.10.3115/v1/W14-2002CrossRef Google Scholar

McFadden, T. (2002). The rise of the to−dative in middle English. In Lightfoot, D. (Ed.), Syntactic effects of morphological change (pp. 107–123). OUP. https://doi.org/10.1093/acprof:oso/9780199250691.003.0006.CrossRef Google Scholar

Mondorf, B. (2009). More support for more-support. Benjamins. https://doi.org/10.1075/silv.4.CrossRef Google Scholar

Percillier, M. (2016). Verb lemmatization and semantic verb classes in a Middle English corpus. In Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016) (pp. 209–214). Ruhr-Universität Bochum. https://www.linguistics.rub.de/konvens16/pub/26_konvensproc.pdf.Google Scholar

Percillier, M. (2018). A toolkit for lemmatising, analysing, and visualising middle English data. In Frank, A., Ivanovic, C., Mambrini, F., Passarotti, M., & Sporleder, C. (Eds.), Proceedings of the second workshop on corpus-based research in the humanities (pp. 153–160). Dept. of Geoinformation, TU Wien. https://www.oeaw.ac.at/fileadmin/subsites/academiaecorpora/PDF/CRH2.pdf.Google Scholar

Perek, F. (2014). Rethinking constructional polysemy. In Glynn, D. & Robinson, J. (Eds.), Human cognitive processing (pp. 61–85). Benjamins. https://doi.org/10.1075/hcp.43.03per.Google Scholar

Perek, F. (2015). Argument structure in usage-based construction grammar. Benjamins. https://doi.org/10.1075/cal.17.CrossRef Google Scholar

Perek, F. & Lemmens, M. (2010). Getting at the meaning of the English at-construction. CogniTextes, 5, 1–28. http://cognitextes.revues.org/331.10.4000/cognitextes.331CrossRef Google Scholar

Pijpops, D. (2020). What is an alternation? Belgian Journal of Linguistics, 34, 283–294. https://doi.org/10.1075/bjl.00053.pij.CrossRef Google Scholar

Pijpops, D., Speelman, D., Grondelaers, S., & Van de Velde, F. (2018). Comparing explanations for the complexity principle. Language and Cognition, 10(3), 514–543. https://doi.org/10.1017/langcog.2018.13.CrossRef Google Scholar

Pijpops, D., Speelman, D., Grondelaers, S., & Van de Velde, F. (2021). Incorporating the multi-level nature of the constructicon into hypothesis testing. Cognitive Linguistics, 32(3), 487–528. https://doi.org/10.1515/cog-2020-0039.CrossRef Google Scholar

Pijpops, D., Speelman, D., & van den Bosch, A. (2022). Generating hypotheses for alternations at low and intermediate levels of categoryticity: The use of memory-based learning. Linguistics Vanguard, 8(1), 305–319. https://doi.org/10.1515/lingvan-2021-0081.CrossRef Google Scholar

Pinker, S. (1989). Learnability and cognition: The acquisition of argument structure. MIT Press.Google Scholar

R Core Team (2024). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. www.R-project.org/.Google Scholar

Riemer, N. (2012). Introducing semantics. CUP. https://doi.org/10.1017/cbo9780511808883.Google Scholar

Rinker, T. (2022). textstem: Tools for stemming and lemmatizing text, Version 0.1. 4. http://github.com/trinker/textstem.Google Scholar

Rohdenburg, G. (1996). Cognitive complexity and increased grammatical explicitness in English. Cognitive Linguistics, 7(2), 149–182. https://doi.org/10.1515/cogl.1996.7.2.149.CrossRef Google Scholar

Rohdenburg, G. (2002). Processing complexity and the variable use of prepositions in English. In Cuyckens, H. & Radden, G. (Eds.), Perspectives on prepositions (pp. 79–100). De Gruyter Mouton. https://doi.org/10.1515/9783110924787.79.CrossRef Google Scholar

Rohdenburg, G. (2020). The complexity principle at work with rival prepositions. English Language and Linguistics, 24(4), 769–800. https://doi.org/10.1017/s1360674319000327.CrossRef Google Scholar

Röthlisberger, M., Grafmiller, J., & Szmrecsanyi, B. (2017). Cognitive indigenization effects in the English dative alternation. Cognitive Linguistics, 28(4), 673–710. https://doi.org/10.5167/uzh-170210.CrossRef Google Scholar

Röthlisberger, M. (2021). Between context and community: Regional variation in register effects in the English dative alternation. In Seoane, E. & Biber, D. (Eds.), Corpus-based approaches to register variation (pp. 111–142). Benjamins. https://doi.org/10.1075/scl.103.05rot.CrossRef Google Scholar

Sato, K. (2009). The development from case-forms to prepositional constructions in old English prose. Peter Lang.Google Scholar

Schmid, H.-J. (2020). The dynamics of the linguistic system: Usage, conventionalization, and entrenchment. OUP. https://doi.org/10.1093/oso/9780198814771.001.0001.CrossRef Google Scholar

Schmid, H.-J., & Küchenhoff, H. (2013). Collostructional analysis and other ways of measuring lexicogrammatical attraction. Cognitive Linguistics, 24(3), 531–577. https://doi.org/10.1515/cog-2013-0018.CrossRef Google Scholar

Sevenants, A. (2023). ElasticToolsR. Version 1.3. Leuven. https://github.com/AntheSevenants/ElasticToolsR/tree/v1.3.Google Scholar

Sevenants, A., Van de Velde, F., & Speelman, D. (2024). Investigating lexical-semantic effects on morphosyntactic variation using elastic net regression. Corpus Linguistics and Linguistic Theory, 1–43. https://doi.org/10.1515/cllt-2024-0068.Google Scholar

Sosa-Acevedo, E. (2009). Lexical classes and the conative construction in old English. Studia Anglica Posnaniensia, 45(1), 69–90. https://doi.org/10.2478/v10121-009-0005-6.CrossRef Google Scholar

Stefanowitsch, A., & Gries, S. T. (2003). Collostructions: Investigating the interaction of words and constructions. International Journal of Corpus Linguistics, 8(2), 209–243. https://doi.org/10.1075/ijcl.8.2.03ste.CrossRef Google Scholar

Stefanowitsch, A. (2013). Collostructional analysis. In Hoffmann, T. & Trousdale, G. (Eds.), The Oxford handbook of construction grammar (pp. 290–306). OUP. https://doi.org/10.1093/oxfordhb/9780195396683.013.0016.Google Scholar

Stefanowitsch, A., & Flach, S. (2020). Too big to fail but big enough to pay for their mistakes. In Pastor, G. C. & Colson, J.-P. (Eds.), Computational phraseology (pp. 248–272). Benjamins. https://doi.org/10.1075/ivitra.24.13ste.Google Scholar

Szmrecsanyi, B. (2012). Analyticity and syntheticity in the history of English. In Nevalainen, T. & Traugott, E. (Eds.), The Oxford handbook of the history of English (pp. 654–665). OUP. https://doi.org/10.1093/oxfordhb/9780199922765.013.0056.CrossRef Google Scholar

Tal, S., Smith, K., Culbertson, J., Grossman, E., & Arnon, I. (2022). The impact of information structure on the emergence of differential object marking. Cognitive Science, 46(3), e13119. https://doi.org/10.1111/cogs.13119.CrossRef Google Scholar

Tizón-Couto, D., & Lorenz, D. (2021). Variables are valuable: Making a case for deductive modeling. Linguistics, 59(5), 1279–1309. https://doi.org/10.1515/ling-2019-0050.CrossRef Google Scholar

Traugott, E. C., & Trousdale, G. (2013). Constructionalization and constructional changes. Oxford: OUP.10.1093/acprof:oso/9780199679898.001.0001CrossRef Google Scholar

Trips, C., & Percillier, M. (2020). Lemmatising verbs in middle English corpora: The benefit of enriching the Penn-Helsinki parsed corpus of middle English 2 (PPCME2), the parsed corpus of middle English poetry (PCMEP), and a parsed linguistic atlas of early middle English (PLAEME). In Proceedings of the twelfth language resources and evaluation conference (pp. 7170–7178). European Language Resources Association. https://aclanthology.org/2020.lrec-1.886.Google Scholar

University of Michigan Regents. (2013). The electronic Middle English dictionary. http://quod.lib.umich.edu/m/med/.Google Scholar

Van de Velde, F., & Pijpops, D. (2021). Investigating lexical effects in syntax with regularized regression (lasso). Journal of Research Design and Statistics in Linguistics and Communication Science, 6(2), 166–199. https://doi.org/10.1558/jrds.189.CrossRef Google Scholar

Van der Leek, F. (1996). The English conative construction. In Dobrin, L. M., Singer, K., & McNair, L. (Eds.), CLS 32: The main session (pp. 363–378). Chicago Linguistic Society.Google Scholar

Vincent, N. (2013). Conative. Linguistic Typology, 17(2), 269–289. https://doi.org/10.1515/lity-2013-0012.CrossRef Google Scholar

Visser, F. (1963). An historical syntax of the English language. Brill.Google Scholar

Wickham, H. (2016). Ggplot2: Elegant graphics for data analysis (2nd ed.). Springer. https://doi.org/10.1007/978-3-319-24277-4CrossRef Google Scholar

Winter, B. (2019). Statistics for linguists: An introduction using R. Routledge.10.4324/9781315165547CrossRef Google Scholar

Witzlack-Makarevich, A., & Seržant, I. (2018). Differential argument marking: Patterns of variation. In Seržant, I. & Witzlack-Makarevich, A. (Eds.), Diachrony of differential argument marking (pp. 1–140). Language Science Press. https://doi.org/10.5281/zenodo.1228243.Google Scholar

Zehentner, E. (2019). Competition in language change: The rise of the English dative alternation. Mouton de Gruyter. https://doi.org/10.1515/9783110633856.CrossRef Google Scholar

Zehentner, E. (2022). Ambiguity avoidance as a factor in the rise of the English dative alternation. Cognitive Linguistics, 33(1), 3–33. https://doi.org/10.1515/cog-2021-0018.CrossRef Google Scholar

Table 1. Main semantic differences between the NP- and the PP-pattern in PDE

Table 2. Three-way distinction of conative PP-types and main features

Table A1. Distinctive collexeme analysis of verb lemmas, verb classes, and verb categories associated with NP vs PP patterns in Middle English

Table A2. Distinctive collexeme analysis of verb lemmas, verb classes, and verb categories associated with NP vs PP patterns in Early Modern English

Table A3. Distinctive collexeme analysis of verb lemmas, verb classes, and verb categories associated with NP vs PP patterns in Late Modern English

Table A4. Output of a mixed-effects logistic regression model for NP/PP-choice in Middle English

Table A5. Output of a mixed-effects logistic regression model for NP/PP-choice in Early Modern English

Table A6. Output of a mixed-effects logistic regression model for NP/PP-choice in Late Modern English

Table A7. Output of an elastic net regression model for NP vs PP-choice (features and coefficients) in Middle English, Early Modern English, and Late Modern English

Article contents

The English conative alternation between complexity effects and lexical biases: a historical perspective

Abstract

Keywords

Information

1. Introduction

2. Factors at play in the English conative alternation

2.1. The conative alternation in PDE

2.1.1. Main differences between the variants of the alternation

2.1.2. Semantics of the conative PP-pattern

2.2. The conative alternation in the history of English

2.3. Factors predicting the conative alternation

2.3.1. Cognitive complexity

2.3.2. Lexical factors

2.3.3. Lexical frequency

3. Data and methods

3.1. Data extraction and annotation

3.2. Statistical analysis

4. Results

4.1. Data distribution

4.2. Collostructional analysis

4.3. Mixed-effects logistic regression modelling

4.4. Elastic net regression modelling

5. Discussion

5.1. Complexity effects

5.2. Lexical effects

5.3. Methodological implications

6. Conclusions

Funding statement

Data availability statement

Appendix

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests