Appendix B: Top Five Bigrams with Target Item on the Left

Note. Numbers in brackets give raw frequency and range. Items are sorted by frequency; where two or more items have the same frequency, the one with the higher range is given first (all else being equal, items are listed in alphabetical order).
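The ordering rule in the note can be sketched as a three-part sort key: frequency descending, then range descending, then the collocate in alphabetical order. The following Python is only an illustration. The names `Bigram`, `range_count`, and `sort_bigrams` are hypothetical, and the five sample entries are collocates of marked copied from the long description, on the assumption that they all belong to the same corpus column.

```python
from typing import List, NamedTuple


class Bigram(NamedTuple):
    """One collocate of a target item (hypothetical container)."""
    collocate: str
    frequency: int    # raw frequency (first number in brackets)
    range_count: int  # range (second number in brackets)


def sort_bigrams(items: List[Bigram]) -> List[Bigram]:
    """Order by frequency (high to low), then range (high to low),
    then collocate (A to Z), as described in the note."""
    return sorted(
        items,
        key=lambda b: (-b.frequency, -b.range_count, b.collocate),
    )


# Sample entries for "marked", deliberately given out of order.
sample = [
    Bigram("in", 21, 16),
    Bigram("for", 34, 7),
    Bigram("with", 15, 7),
    Bigram("by", 34, 27),
    Bigram("as", 19, 11),
]

top_five = sort_bigrams(sample)[:5]
for b in top_five:
    print(f"{b.collocate} ({b.frequency}/{b.range_count})")
```

Here "by" and "for" share a frequency of 34, so "by" comes first because its range is higher (27 against 7). Items that tie on both numbers fall back to alphabetical order.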
Table 2.6 Long description
The table is divided into 6 columns with the labels Cor, JEngl, J G L, J H L, JoL, and Syn. There are several rows in which the data are filled from left to right. The details are as follows:
For canonical, the data are sequence, 6 by 2, antonyms, 9 by 2, verbs, 26 by 1, subject, 27 by 4, gender, 60 by 1, subject, 11 by 7, antonyms, 5 by 1, oppositions, 8 by 1, transitive, 17 by 1, grammatical, 5 by 1, morphosyntactic, 37 by 2, perfective, 9 by 1, agreement or clause-initial or emphatic or p si or sequences or use, 4 by 1, oppositional, 6 by 2, E O A C C, 17 by 1, transitive, 3 by 2, position, 8 by 4, and, 5 by 2, E O, 6 by 1, complement, 3 by 1, case, 24 by 7, order, 7 by 6, status, 5 by 1, and, 5 by 1, utterance, 3 by 1, agreement, 24 by 3, null-subject or typology, 23 by 6, positions, 3 by 2.
For non-canonical, the data are oppositions, 2 by 1, oppositions, 17 by 3, order, 6 by 1, subjects, 14 by 3, case, 21 by 2, plural, 3 by 1, examples, 5 by 1, word, 2 by 1, subject, 8 by 2, agreement, 9 by 6, agreement, 2 by 2, opposition, 3 by 2, clauses, 2 by 1, marking, 2 by 2, behaviour, 7 by 1, positions, 2 by 2, textual, 3 by 1, marking, 2 by 1, morphosyntactic, 6 by 1, position, 2 by 1, ones, 2 by 1, sentences, 2 by 1, passives, 5 by 4.
For marked, the data are as, 30 by 20, by, 34 by 27, by, 33 by 23, by, 94 by 37, by, 114 by 54, with, 100 by 38, with, 29 by 20, for, 34 by 7, for, 30 by 13, with, 78 by 22, with, 102 by 40, by, 43 by 22, by, 26 by 15, in, 21 by 16, as, 27 by 17, in, 48 by 19, and, 52 by 15, as, 30 by 13, for, 22 by 8, as, 19 by 11, in, 18 by 12, for, 29 by 16, for, 51 by 22, for, 20 by 12, plural, 21 by 1, with, 15 by 7, on, 12 by 8, as, 23 by 6, as, 41 by 30, in, 15 by 9.
For unmarked, the data are forms, 21 by 2, in, 6 by 3, verb, 8 by 2, transitive, 20 by 2, subject, 28 by 1, case, 32 by 8, and, 13 by 3, form, 4 by 2, for, 7 by 4, construction, 11 by 2, nominal, 15 by 3, argument, 10 by 1, form, 9 by 1, speakers, 4 by 2, form, 6 by 4, in, 9 by 6, causative, 14 by 1, and, 7 by 3, marked, 5 by 1, or, 3 by 2, gender or in or order, 4 by 2, for, 5 by 4, with, 13 by 2, objects, 7 by 2, ministro, 4 by 1, order, 3 by 2, p-prominent, 5 by 2, form, 12 by 2, option, 6 by 4.
For standard, the data are deviation, 26 by 10, English, 143 by 37, Dutch, 246 by 23, negation, 73 by 3, Polish, 63 by 4, assumptions, 22 by 17, deviations, 16 by 11, deviation, 37 by 14, German, 230 by 24, Finnish, 26 by 1, English, 62 by 11, English, 21 by 7, and, 12 by 5, and, 29 by 13, language, 84 by 17, language, 10 by 9, Arabic, 51 by 13, of, 21 by 1, English, 11 by 8, language, 24 by 12, and, 32 by 6, ModGr, 10 by 1, O T, 39 by 9, error, 14 by 8, reference, 7 by 6, error, 18 by 5, variety, 27 by 9, English, 6 by 4, Russian, 24 by 6, analysis, 13 by 8.
For non-standard, the data are capitalisation, 25 by 1, varieties, 25 by 11, language, 17 by 4, varieties, 8 by 2, English, 4 by 3, varieties, 4 by 2, spellings, 5 by 4, and, 16 by 7, varieties, 11 by 7, European, 2 by 1, use, 4 by 1, assumption, 3 by 2, language, 3 by 3, forms, 13 by 7, features, 9 by 3, Ibero-Romance, 2 by 1, varieties, 3 by 3, dialects, 2 by 2, varieties, 3 by 3, sentences, 9 by 1, gender, 6 by 2, uses, 3 by 2, forms, 3 by 2, form, 6 by 4, form, 5 by 5, Polish, 2 by 1, Lucas or in or never, 3 by 1, primarily, 2 by 1.