Peter Petré, Lynn Anthonissen, Sara Budts, Enrique Manjavacas, Emma-Louise Silva, William Standing and Odile A.O. Strik
The present article provides a detailed description of the corpus of Early Modern Multiloquent Authors (EMMA), as well as two small case studies that illustrate its benefits. As a large-scale specialized corpus, EMMA aims to strike a balance between big data and sociolinguistic coverage. It comprises the writings of 50 carefully selected authors across five generations, mostly drawn from 17th-century London society. EMMA enables the study of language as both a social and a cognitive phenomenon and allows us to explore the interaction between the individual and aggregate levels.
The first part of the article describes EMMA’s first release in detail, along with the sociolinguistic and methodological principles that underlie its design and compilation. We cover the conceptual decisions and practical implementations at various stages of the compilation process: from text markup, encoding and data preprocessing to metadata enrichment and verification.
In the second part, we present two small case studies to illustrate how rich contextualization can guide the interpretation of quantitative corpus-linguistic findings. The first case study compares the past tense formation of strong verbs in writers without access to higher education with that of writers with extensive training in Latin. The second case study relates s/th-variation in the language of a single writer, Margaret Cavendish, to major shifts in her personal life.
Irma Taavitsainen, Turo Hiltunen, Anu Lehto, Ville Marttila, Päivi Pahta, Maura Ratia, Carla Suhr and Jukka Tyrkkö
Benedikt Heller, Tobias Bernaisch and Stefan Th. Gries