Blogging around the world

Kopaczyk, J. and Tyrkkö, J. (2018) Blogging around the world. In: Kopaczyk, J. and Tyrkkö, J. (eds.) Applications of Pattern-driven Methods in Corpus Linguistics. Series: Studies in corpus linguistics (82). John Benjamins Publishing Company: Amsterdam ; Philadelphia, pp. 277-310. ISBN 9789027200136 (doi: 10.1075/scl.82.11kop)

Full text not currently available from Enlighten.


The borderless nature of blogging raises the question of whether the traditional, regionally defined varieties of English continue to hold true (see Crystal 2011). To establish how similar language published online without external intervention is around the world, this chapter investigates repetitive patterns, or 3-grams, found in blogs in the 583-million-word GloWbE corpus (Davies 2013). The data shows two types of repetitive word sequences: universal, or those that are frequent in all or most of the nineteen geographic locations represented in the corpus, and localised, or those unique to specific regions. We explore multiple ways of approaching the regional distribution of universal and localised 3-grams, such as statistical similarity measures (Jaccard coefficient and hierarchical clustering) and network visualisations. This study addresses three correlated research issues: (1) the ratio of 3-grams in blogs from various World Englishes, which will shed light on the degree of formulaicity in Web Englishes around the world; (2) the overlaps between various locations in terms of preferred sequences, which may point to local or global standardisation hubs at the level of sentence and text construction; (3) the status of model-providing varieties for internet communication, especially American English, in view of the most frequent 3-grams from other locations (cf. Mair 2013).
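The similarity measure named in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the whitespace tokenisation, the function names (`trigrams`, `top_trigrams`, `jaccard`, `pairwise_jaccard`) and the top-n cut-off are all assumptions made for illustration. Each location is reduced to the set of its most frequent 3-grams, and two locations are compared by the Jaccard coefficient, i.e. the size of the intersection of their sets divided by the size of the union.

```python
from collections import Counter
from itertools import combinations


def trigrams(text):
    """Return the word 3-grams of a text (assumed: lowercase, whitespace tokens)."""
    tokens = text.lower().split()
    return [tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)]


def top_trigrams(texts, n):
    """Return the set of the n most frequent 3-grams across a list of texts."""
    counts = Counter()
    for text in texts:
        counts.update(trigrams(text))
    return {gram for gram, _ in counts.most_common(n)}


def jaccard(a, b):
    """Jaccard coefficient |A & B| / |A | B|; defined here as 0.0 for two empty sets."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def pairwise_jaccard(profiles):
    """Map each pair of location labels to the Jaccard score of their 3-gram sets."""
    return {
        (x, y): jaccard(profiles[x], profiles[y])
        for x, y in combinations(sorted(profiles), 2)
    }
```

Under these assumptions, one would build a dictionary from location label to `top_trigrams(...)` and pass it to `pairwise_jaccard`; the resulting pairwise scores, converted to distances as 1 minus the score, are the kind of input a hierarchical clustering routine expects.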

Item Type: Book Sections
Glasgow Author(s) Enlighten ID: Kopaczyk, Professor Joanna
Authors: Kopaczyk, J. and Tyrkkö, J.
College/School: College of Arts & Humanities > School of Critical Studies > English Language and Linguistics
Journal Name: Studies in Corpus Linguistics
Publisher: John Benjamins Publishing Company
