The Santa Cruz Sluicing Data Set

Published online by Cambridge University Press:  01 January 2026

Pranav Anand*
Affiliation:
University of California, Santa Cruz
Daniel Hardt*
Affiliation:
Copenhagen Business School
James McCloskey*
Affiliation:
University of California, Santa Cruz

Abstract

This report describes a new research resource: a searchable database of 4,700 naturally occurring instances of sluicing in English, annotated so as to shed light on the questions that have shaped research on ellipsis since the 1960s. The paper describes the data set and how it can be obtained, how it was constructed, how it is organized, and how it can be queried. It also highlights some initial empirical findings, first describing general characteristics of the data, then focusing more closely on issues concerning antecedents and possible mismatches between antecedents and ellipsis sites.

Information

Type: Research Report
Copyright: © 2021 Linguistic Society of America

Footnotes

*

The research reported here was supported by funding from the Academic Senate of UC Santa Cruz, from The Humanities Institute of UC Santa Cruz, and from the National Science Foundation via Award Number 1451819: ‘The Implicit Content of Sluicing’ (PI Pranav Anand, co-PIs James McCloskey and Daniel Hardt). The project would have been impossible without the perceptiveness and commitment of our undergraduate annotators: Brooks Blair, Jacob Chemnick, Charlotte Daciolas, Jasmine Embry, Jack Haskins, Anny Huang, Zach Lebowski, Lily Ng, Lyndsey Olsen, Reuben Raff, and Serene Tseng. Particularly important contributions were made by our lead annotators—Rachelle Boyson, Mansi Desai, Chelsea Miller, Lydia Werthen, and Anissa Zaitsu. Our graduate student research assistants also made crucial contributions: Kelsey Kraus, Margaret Kroll, Deniz Rudin, and Bern Samko. Beyond the project itself, many colleagues have provided advice and support that we appreciate—Sandy Chung, Vera Gribanova, Kyle Johnson, Jason Merchant, and Tim Stowell in particular. We are also grateful to two referees and to the editorial team at Language (Lisa Travis and John Beavers) for a review process that was critical, constructive, and helpful.
