The Glasgow Norms: Ratings of 5,500 words on 9 scales

The Glasgow Norms are a set of normative ratings for 5,553 English words on 9 psycholinguistic dimensions: arousal, valence, dominance, concreteness, imageability, familiarity, age of acquisition, semantic size, and gender association. The Glasgow Norms are unique in several respects. First, the corpus, itself, is relatively large while simultaneously providing norms across a substantial number of lexical dimensions. Second, for any given subset of words, the same participants provided ratings across all 9 dimensions (33 participants/word, on average). Third, two novel dimensions of semantic size and gender association are included. Finally, the corpus contains a set of 379 ambiguous words that are presented alone (e.g., toast ) or with information that selects an alternative sense (e.g., toast (bread) , toast (speech) ). Relationships between the dimensions of the Glasgow Norms were initially investigated by assessing their correlations. In addition, a principal component analysis revealed four main factors accounting for 82% of the variance (“visualization,” “emotion,” “salience,” and “exposure”). The validity of the Glasgow Norms was established via comparisons of our ratings to 18 different sets of current psycholinguistic norms. The dimension of size was tested with megastudy data, confirming past studies that have explicitly examined this variable. Alternative senses of ambiguous words (i.e., disambiguated forms), when discordant on a given dimension, seemingly led to appropriately distinct ratings. Informal comparisons between ratings of ambiguous words and their alternative senses showed different patterns that likely depended on several factors (the number of senses, their relative strengths, and the rating scales, themselves). Overall, the Glasgow Norms provide a valuable resource, in particular, for researchers investigating the role of word recognition in language comprehension.

The Glasgow Norms provide a set of normative ratings for 5,553 English words on 9 psycholinguistic dimensions.Each word was rated on the dimensions of arousal, valence, dominance, concreteness, imageability, familiarity, age of acquisition, semantic size, and gender association.The aim was to develop a substantial set of standardized, freely available, psycholinguistic materials.The norms provide researchers with a considerable collection of materials that are not only reliably evaluated on specific dimensions of interest, but also on other potentially confounding dimensions.Accordingly, the norms allow for the creation and analysis of carefully controlled stimuli, facilitating continued investigations into these lexical dimensions as well as their interactions.
In comparison to previous word norms, the Glasgow Norms offer several significant, novel features.First, a relatively large number of lexical dimensions (9) was examined.Other norms typically assess only one to three dimensions.Second, the same participant provided ratings across all 9 dimensions for any given word.Currently, researchers interested in investigating more than a few lexical dimensions need to access different sets of norms which are tested on different populations of participants.Additionally, as different norms test nonoverlapping word sets, it is often difficult to obtain ratings on all stimuli on all dimensions of interest.Third, two of the dimensions, semantic size and gender association, have not been investigated to date via an extensive set of norms.Finally, many words in the English lexicon are ambiguous, having more than one meaning (e.g., bank, having a 'money' or 'river' sense).
The Glasgow Norms include ambiguous words presented in different forms (to different participants)as isolated words (e.g., bank), and as words presented with disambiguating information (e.g., bank (money) or bank (river)).These key aspects of our approach make the Glasgow Norms a unique and valuable methodological contribution.
There are currently several sets of psycholinguistic norms that report ratings of words on particular psycholinguistic dimensions.Typically, such norms comprise ratings of either 1,000 or so words on a few dimensions or more than 10,000 words on a single dimension.
Table 1 summarizes such norms, limited to those based on more than 500 words.For each set of norms, information is provided about the lexical dimensions examined, the number of words used, and the number of participants tested.
All remaining dimensions of the Glasgow Norms are based on 7-point rating scales, a practice consistent with most existing norms.CNC represents the degree to which something can be experienced by our senses (concrete, abstract).Concrete words are typically recognized faster than abstract words (e.g., Juhasz & Rayner, 2003;Schwanenflugel et al., 1988;Whaley, 1978;Yao et al., 2013Yao et al., , 2018)).Kousta, Vigliocco, Vinson, Andrews, and Del Campo (2011), however, proposed that abstract words tend to be more emotionally valenced than concrete words which gives rise to a residual processing advantage of abstract over concrete words, critically, once opposing effects of context availability and imageability are controlled.IMAG represents the degree of effort involved in generating a mental image of something (imageable, unimageable).In general, imageable words are facilitated in processing compared to less imageable words (e.g., Balota, Cortese, Sergent-Marshall, Spieler, & Yap, 2004;Cortese & Schock, 2013;Yao et al., 2018).CNC and IMAG, although highly correlated (see, e.g., Paivio, Yuille, & Madigan, 1968), are nevertheless considered to capture distinct semantic aspects of a word (Kousta et al., 2011;Richardson, 1976).
The measures of FAM and AOA are related in different subjective ways to the objective measure of word frequency, in which the relative number of occurrences of individual words within a substantial corpus (more often written than spoken) are calculated (e.g., the British National Corpus, 2007;Davies, 2004).FAM is a measure of a word's subjective experience (familiar, unfamiliar), and can be partially contrasted with subjective frequency estimates (Balota, Pilotti, & Cortese, 2001), which are considered to be less dependent on other meaninglevel variables.Words that are more familiar are recognized faster than those that are less familiar (e.g., Connine, Mullennix, Shernoff, & Yelen, 1990).AOA is a measure of the age at which a word was initially acquired.Although there are alternative ways of measuring AOA (e.g., Juhasz, 2005;Morrison et al., 1997), it is often assessed by adults providing an estimate of when they first learned a word, in spoken or written form, on a 7-point scale (a series of 2year periods from 0-12 years and a final 13+ period).Zevin andSeidenberg (2002, 2004) suggested that our developmental experience with words may be better captured by measures of their cumulative frequency (summed lifetime usage) and frequency trajectory (how usage changes over time).More recently, however, Brysbaert (2017) demonstrated that the best predictor of objective AOA is rated AOA.Behaviorally, words acquired earlier in life are recognized faster than those acquired later (e.g., Cortese & Khanna, 2007;Johnston & Barry, 2006;Juhasz & Rayner, 2006;Sereno & O'Donnell, 2009).
The final dimensions of SIZE and GEND have only been the subject of more recent psycholinguistic investigations (e.g., Sereno & O'Donnell, 2009;Sereno et al., 2009;Yao et al., 2013).SIZE is a measure of magnitude (big, small) expressed in either concrete or abstract terms.That is, words can refer to objects or concepts that are considered bigger (e.g., castle, wealth) or smaller (e.g., pocket, unique).It has been demonstrated that words referring to bigger things are recognized faster than those referring to smaller ones (e.g., Sereno et al., 2009;Yao et al., 2013).GEND is a measure of the degree to which words are considered to be associated with male or female behavior (masculine, feminine).Recent norms have specifically examined gender perception of role nouns across languages (Garnham, Doehren, & Gygax, 2016;Misersky et al., 2014).Although reading studies have investigated gender role stereotypes (e.g., electrician, secretary) and their gendered mis/matching pronouns (e.g., Duffy & Keir, 2004;Kreiner, Sturt, & Garrod, 2008), there has been little if any research into gender associations to a much broader spectrum of content words.Sereno and O'Donnell (2009) investigated words rated as either male-or female-oriented (e.g., frog, cigar, guitar or duck, flute, heaven, respectively) in a lexical decision task (AOA was also manipulated).They found that while female participants demonstrated an advantage to same-gendered words (e.g., responses were faster to tights than pliers), male participants showed no such comparable bias (i.e., pliers was no faster than tights).
The current corpus of 5,553 words includes a range of content words (nouns, verbs, adjectives, adverbs) as well as 379 semantically ambiguous words (homographs) whose alternative meanings were additionally rated.To our knowledge, only four existing norms have explicitly included ambiguous words.Clark and Paivio (2004), in their extension of the Paivio et al. (1968;N=925) norms, included number of meanings as an additional measure, but did not collect ratings on the alternative meanings, themselves.Bird, Franklin, and Howard (2001) did acquire IMAG and AOA ratings on a subset of their items (N=110) of noun-verb homographs (disambiguated by preceding the ambiguous word with a or to, respectively).Gilhooly and Logie (1980a) had participants rate whether or not their words had multiple meanings, resulting in a set of ambiguous words (N=649) that were then further rated for the relative dominance of alternative senses.Gilhooly and Logie (1980b) collected ratings on a set of 387 ambiguous words having a total of 905 separate meanings on the scales of CNC, IMAG, FAM, and AOA.Khanna and Cortese (2011) collected AOA ratings of 1,208 ambiguous words having a total of 3,460 senses.While most ambiguous words are "biased," having a strongly dominant meaning and one or more subordinate meanings, some are "balanced," having two more salient meanings with other possible subordinate senses (Sereno, O'Donnell, & Rayner, 2006).With the notable exceptions of the studies described above, ratings associated with ambiguous words from previous norming studies which have not explicitly disambiguated such words probably reflect participants' interpretation of the dominant meaning, although this is not a certainty.The ambiguous words identified in the Glasgow Norms were presented alone (e.g., ball), or in disambiguated form (e.g., ball (sphere) or ball (dance)), critically, to different participants.
The Glasgow Norms were collected by presenting our list of 5,553 words to participants in lists of either 101 or 150 words.For each list, participants rated words separately on all 9 dimensions described above.The relations among dimensions are explored and comparisons to other norms are provided.

Method Participants
A profile detailing the number, age, and gender of participants is presented in Table 2.
A total of 829 individuals ("unique participants") took part in the rating studies, with some completing more than one word list.When participants were tallied on the basis of completing a single list ("all participants"), regardless of whether they had completed other lists, the total came to 1,368.Overall, ages ranged from 16 to 73 years and there was slightly more than twice the number of females than males who took part.Participants were native English speakers from the University of Glasgow community and were recruited opportunistically via an experiment advertisement link on the home page of Psychology at the University of Glasgow.
They were either paid at a rate of £6/hr or were given course credit for their participation.The study conformed to British Psychological Society ethical guidelines and protocols.

Materials
A corpus of 5,553 words was assembled from an initial set of 808 words and a subsequent, larger set of 4,800 words (with 55 words included in both lists).The data acquired from these two sets were merged into a single corpus for subsequent analyses.Words ranged in length from 2 to 16 letters, with an average length of 6.10 letters (SD=1.99).
The corpus included 379 ambiguous words.Each was presented in isolation or with disambiguating information following the word in parentheses (e.g., solution, or solution (answer) and solution (chemical), respectively).The average number of disambiguated forms presented was 2.30 (SD=.58).The number of words having two, three, four, and five alternative meanings was 289, 69, 19, and 2, respectively.Thus, there was a total of 871 items in the corpus that were presented with disambiguation.

Procedure
The experiment was run online via an in-house, experimental platform (http://experiments.psy.gla.ac.uk).Each participant rated a list of either 101 (8 possible lists of the 808 word set) or 150 words (32 lists of the 4,800 word set).Lists of 101 or 150 words were generated by taking every 8 th or 32 nd item from alphabetized versions of either the 808 or 4,800 sets, respectively.This way, each list was representative of the set in terms of its distribution of word-initial letters and no list contained more than one instance of any given ambiguous word in any of its forms.
The general instructions for the experiment as well as the specific instructions for each of the 9 different rating tasks are presented in the Appendix.The same participant provided ratings across all 9 dimensions for any given word.Participants rated all words of a list on one scale, then all words on the next scale, and so on.The order of words within each scale was randomized as was the order of scales across participants.The approximate time to complete the experiment was 40 or 60 minutes for 101-or 150-item lists, respectively.

Results
Data were eliminated from further analyses if the response time (RT) to any word on any scale was less than 600 ms or if participants reported not knowing a word.For RT, examination of the trial-by-trial data revealed the presence of infrequent episodes of rapid responding by some participants, typically repeating a given rating value.In such cases, RTs tended to be less than 400 ms.In two recent, large-scale lexical decision experiments performed locally, average RTs to words were just under 600 ms (Sereno et al. (2015) used 240 words with 144 participants; Yao et al. (2018) used 270 words with 127 participants).A conservative lower cutoff of 600 ms was therefore implemented in the present study.No upper cutoff was imposed as there was no emphasis on speed of response.That is, participants were not instructed, for example, to respond to each item as quickly as possible.An identical procedure of not implementing an upper cutoff has been employed by several sets of norms (e.g., Clark & Paivio, 2004;Cortese & Khanna, 2008;Khanna & Cortese, 2011;Schock, Cortese, & Khanna, 2012).Of the total number of responses recorded (N=1,732,607), the RT distribution was as follows: 3.07% were shorter than 600 ms, 77.68% were 600-3000 ms (with 36.03% 1500-2000 ms), 12.98% were 3000-5000 ms, and 6.27% were longer than 5000 ms.
In terms of word knowledge, for all scales except for FAM, if participants did not know the meaning of a word, they could select the "unfamiliar word" button instead of rating it (see instructions in the Appendix).This option accounted for 0.33% of all responses.On average, there were 33.29 responses per word (SD=3.76).A detailed profile of the number of responses across all 9 rating scales is presented in Table 3.

Insert Table 3 about here
The descriptive statistics for the 9 rated dimensions are presented in Table 4.The Glasgow Norms are available as part of the Supplementary Materials to this article and are provided in .csvformat.The Glasgow Norms present an alphabetized list of 5,553 words.The columns, from left to right, are as follows: Word, Length (which excludes possible disambiguating information), and, for each of the 9 dimensions, the mean rating (M), standard deviation (SD), and number of ratings (N) for each word.Ratings for the 55 words that were included in both lists were highly correlated for all scales, ranging from r=.88 for DOM to r=.97 for VAL.

Relations between the nine dimensions of the Glasgow Norms
To provide an initial overview of the relations between all 9 Glasgow Norms scales, we performed Spearman correlations and these are presented in Table 5.As Spearman correlations are rank-based, this method takes into account linear and non-linear relations between dimensions.We used the Bonferroni method to correct p-values for multiple tests and applied a significance threshold of p=.01.Due to the large number of items (N=5,553), almost all correlations were significant.However, considering only large effects (i.e., with rs>.5;Cohen, 1988), the following correlations were particularly strong: CNC × IMAG (r=.91; the more concrete a word is, the easier it is to imagine); VAL × DOM (r=.68; the more positive a word is, the more it provokes feelings of dominance); FAM × AOA (r=-.67; the more familiar a word is, the earlier that word was learned); and SIZE × AROU (r=.51; the bigger the object or concept is to which a word refers, the more arousing it is).
Insert Table 5 about here For a more detailed analysis of relations between scales, we fit linear and quadratic models to the data, using the MATLAB function fitlm (The MathWorks, Inc.).To account for outliers, the fits were computed using a robust least-squares method (bisquare weighting function).Reported R 2 s were adjusted for the number of coefficients.The results for all combinations of dimensions using linear and quadratic fits are included in the Supplementary Materials to this article as Table S1 and Table S2, respectively.
We will highlight effects of SIZE and GEND, as these two dimensions are relatively new and less well understood.Figure 1 shows quadratic fits for all combinations with either SIZE or GEND that account for more than 18% of variance (see Table S2): SIZE × AROU (R 2 =.27); SIZE × CNC (R 2 =.19); VAL × SIZE (R 2 =.19); and VAL × GEND (R 2 =.18).For three of these, linear fits accounted for comparable (but numerically slightly less) amounts of variance (see Table S1) and are straightforward to interpret: SIZE × AROU (the semantically bigger a word is, the more arousing it is); SIZE × CNC (the semantically bigger a word is, the less concrete it is); and VAL × GEND (the more positive a word is, the more feminine it is).
In contrast, VAL × SIZE was better explained by a quadratic (R 2 =.19) than linear (R 2 <.01) fit (the more extremely valencednegative or positivea word is, the semantically bigger it is).
Insert Figure 1 about here

Factor analysis of dimensions
To summarize and interpret the correlation results and to explore the relative alignment of the newer scales of SIZE and GEND, we performed a factor analysis.Data for all 9 scales were submitted to a principal component analysis (PCA) with an oblique rotation (direct oblimin, Harman, 1976;Jennrich & Sampson, 1966).Note that using an orthogonal rotation (e.g., varimax, quartimax, or equamax) yielded comparable results.Factors with eigenvalues greater than one were included in the factor solution (Kaiser, 1960).
The factor analysis is presented in Table 6 and yielded a solution with four factors.The first factor "visualization" accounted for 30% of the variance in the data and included the scales CNC and IMAG.The second factor "emotion" accounted for an additional 26% of the variance and included VAL and DOM.The third and fourth factors each accounted for 13% of the variance: a "salience" factor, including SIZE, GEND, and AROU, and an "exposure" factor, including FAM and AOA.Together, the four factors explained 82% of the common variance.
The communality for each scale was above .6,indicating that the amount of variance accounted for by the retained factors was sufficient.In other words, the scales' variance was useful in delineating the extracted factors.

Insert Table 6 about here
It is noteworthy that most scales loaded relatively high (i.e., above an absolute value of .5) on one factor.However, AROU (loading highest on factor "salience") also loaded on the factor "emotion," and GEND (also loading highest on factor "salience") additionally loaded on the factor "visualization," indicating that these variables cannot be explained in terms of a single factor.

Correlations with other psycholinguistic norms
To confirm the validity of our ratings, we correlated the dimensions of the Glasgow Norms with 18 of the 20 different sets of English norms listed in Table 1.Norms that were excluded were those that were not easily accessible.Between one and 10 norms were available for all dimensions except SIZE.As linear relations between norms were expected, we performed Pearson correlations for all shared words.These correlations are presented in Table 7.All correlations were highly significant (ps<.0001,Bonferroni corrected) and the vast majority showed a Pearson coefficient of greater than .5,indicating a large effect (Cohen, 1988) and, therefore, sufficient validity.We do not have an explanation for why two of the 36 correlations reported in Table 7, although significant, had coefficients of less than .5.The highest and most consistent correlations were achieved by VAL, CNC, AOA, and GEND (with nearly all rs>.9).
Insert Table 7 about here

SIZE and GEND
One unique strength of our norms is the inclusion of SIZE and GEND variables which allows us to test these effects on a much larger set of words which was not previously possible.
To assess the effects of SIZE, we attempted to replicate the semantic size effect reported in Sereno et al. (2009) and Yao et al. (2013).We combined our ratings and the lexical decision RT data from the English Lexicon Project (ELP; Balota et al., 2007).To avoid multiple entries of the same word, we removed items corresponding to alternative meanings of homographs.A total of 4,568 words were entered into the analysis.We examined the effects of SIZE on RTs, with all the other variables as covariates (word frequency, word length, CNC, IMAG, AROU, VAL, FAM, AOA, DOM, and GEND).To address collinearity between covariates (e.g., CNC × IMAG, AROU × VAL), we reduced the dimensions of covariates via a PCA using a varimax rotation.We extracted seven principal components, accounting for 93.8% of the variance, and their factor loadings are shown in Table 8A.We fit a linear model of RT with SIZE and the extracted principal components as predictors, and the results are shown in Table 8B.Having controlled for a wide range of lexical and semantic variables, SIZE negatively predicted word recognition timessemantically bigger words were recognized significantly faster than semantically smaller words, replicating the findings of Sereno et al. (2009) and Yao et al. (2013).

Insert Table 8 about here
The effects of word GEND, however, are more difficult to test.RTs in megastudies are aggregated across participant gender.Moreover, the relative proportion of male versus female participants in megastudies is typically not specified.Sereno and O'Donnell (2009) examined the effects of word GEND and AOA on lexical decision times across male and female participants.All participants demonstrated AOA effects.However, females took longer to respond to male-oriented words, particularly late AOA ones, while males, in contrast, demonstrated no effect of word GEND.Confirming this pattern of results with megastudy data (in combination with our norms) would entail finding a GEND × AOA interaction across participant gender.Without access to data that separately presents responses from male and female participants, we are unable to directly test effects of word GEND.

Ambiguous words
We did not perform any formal analyses on the ratings of ambiguous words in the corpus, whether they occurred in isolation (e.g., pen) or in disambiguated form (e.g., pen (ink), pen (cage)).Informal examination of the ratings, however, indicated certain patterns.First, when disambiguating information was provided, the alternative senses of ambiguous words received distinct ratings where relevant to the dimension in question.Figure 2A illustrates the ratings that alternative senses of several ambiguous words received across the 9 dimensions, in particular, where the alternative senses were expected to lead to disparate judgments.The second aspect of ambiguous word ratings concerned the relationship between a word's ambiguous and disambiguated forms.While ambiguous words typically have a dominant sense and one or more subordinate senses (Sereno et al., 2006), the relative strengths of these alternative senses can vary substantially, not only across items, but across individuals.It is also possible that the dimensions, themselves, may have served as 'contexts' for ambiguous words presented in isolation (i.e., given the scale AROU, a participant may have rated plot in its 'story' sense, but later, given the scale CNC, rated plot in its 'land' sense, because these were the most accessible meanings).Figure2B shows the ratings across all dimensions of the ambiguous word shell and two of its alternative senses ('sea' and 'military').Although ratings for shell tended to be closer to its dominant 'sea' sense than its subordinate 'military' sense, they did not always overlap as might be expected.Moreover, we observed several different patterns of ratings across ambiguous items.Oftentimes, it did not appear that ambiguous words were rated according to only one of their senses.Without an independent measure of the dominance relationships among alternative senses of ambiguous words in our corpus, however, we are at present unable to characterize this data.Anecdotally, factors such as number of meanings, their relative strength, and the rating scale, itself, seem to play a role.
Insert Figure 2 about here Two sets of prior norms contained a fairly large number of ambiguous words with clearly defined alternative meanings that were rated on a subset of dimensions (Gilhooly & Logie, 1980b;Khanna & Cortese, 2011).We correlated the shared dimensions of disambiguated words from these samples with disambiguated items from our norms whose meanings matched.These correlations were significant and are included in Table 7 (Norms 8   and 17).

Discussion
The Glasgow Norms examined 9 semantic dimensions of words in a corpus of 5,553 words, with an average of 33 participants contributing ratings to each word on each scale.
Seven of the dimensions (AROU, VAL, DOM, CNC, IMAG, FAM, and AOA) are wellestablished and have been extensively investigated, while two dimensions (SIZE and GEND) are relatively novel and have not been investigated in a comprehensive way.In comparison to past norms, the Glasgow Norms provide an internally consistent set of ratings, not only across a sizeable corpus, but also across a considerable set of psycholinguistic dimensions.Moreover, the Glasgow Norms provide ratings for a significant number of ambiguous words (N=379), presented in isolation (e.g., figure) as well as in disambiguated forms (e.g., figure (body shape), figure (graph), figure (number), and figure (reckon)).

Analyses comprised evaluating the relations among the dimensions in the Glasgow
Norms and comparing its results to those of other norms.Correlations between the 9 dimensions of the Glasgow Norms were generally significant (see Table 5) due to the large number of items.Particularly strong relationships (with rs>.5;Cohen, 1988) included the following: CNC × IMAG (concrete words are easier to imagine); VAL × DOM (positive words provoke greater feelings of dominance); FAM × AOA (familiar words are acquired earlier); and SIZE × AROU (words referring to bigger things are more arousing).The first three relationships are established in the literature (e.g., Bradley & Lang, 1999;Friendly et al., 1982;Gilhooly & Logie, 1980a;Paivio et al., 1968;Stadthagen-Gonzalez & Davis, 2006;Toglia & Battig, 1978;Warriner et al., 2013).The latter is a novel finding, although it has already obtained behavioral support (Yao et al., 2013).In further analyses, we fit linear and quadratic models for all combinations of the dimensions (see Tables S1 and S2, Supplementary Materials).We focussed on effects related to the relatively new dimensions of SIZE and GEND (see Figure 1).For SIZE, words referring to bigger things were more arousing, more extremely (positively or negatively) valenced, and more abstract.For GEND, feminine words were more positive.It should be noted that although all participants were native English speakers, we did not record whether they were fluent in any other languages.Knowledge of a grammatically gendered language may have potentially impacted GEND ratings of English words in some participants (Boroditsky, Schmidt, & Philips, 2003).
Factor analysis of all dimensions of the Glasgow Norms yielded a four-factor solution accounting for 82% of the variance (see Table 6).The factors and their associated high-loading dimensions were as follows: "visualization" (CNC, IMAG), "emotion" (VAL, DOM), "salience" (SIZE, GEND, AROU), and "exposure" (FAM, AOA).Notably, both AROU and GEND also loaded moderately on "emotion" and "visualization," respectively.The lack of a one-to-one mapping between factors and dimensions highlights both the complexity of these semantic relationships as well as the need to recognize their potential influence in the design and analysis of psycholinguistic research.
The validity of the Glasgow Norms was assessed by a series of correlations with 18 different sets of English norms (see Table 7).All dimensions of the Glasgow Norms were tested with the exception of SIZE (to our knowledge, we are the first to obtain extensive ratings for this dimension).For any given dimension, between one and seven comparisons were made to previous sets of norms.Correlations with the prior norms were highly significant across the eight dimensions of AROU, VAL, DOM, CNC, IMAG, FAM, AOA, and GEND.
The newer dimensions of SIZE and GEND were tested against megastudy data from the ELP (Balota et al., 2007).Analyses revealed that both SIZE and GEND effects were obtained, confirming the findings from prior studies which have specifically examined these factors.
Finally, the Glasgow Norms included a set of 379 ambiguous wordspresented alone or with disambiguating information.This is the first time an appreciable number of ambiguous words as well as their alternative senses have been normed across an extensive number of lexical dimensions.Informal examination of the data demonstrated that alternative senses of ambiguous words having contrasting meanings were rated appropriately (see Figure 2A).Ambiguous words presented in isolation were sometimes rated according to their highly dominant sense across the different dimensions (see Figure 2B).In general, however, the rating patterns we observed for ambiguous words and their disambiguated senses varied.We believe that these different configurations most likely depend on several factors, including the number of alternative senses, the dominance relationship among these senses, as well as the rating scales, themselves.
In conclusion, the Glasgow Norms represent a valuable resource, providing a substantial set of words normed across a large number of psycholinguistic dimensions.Key features of the norms include the evaluation of the relations between dimensions, the validation of established dimensions, the assessment of novel dimensions, and the examination of ambiguous words and their meanings.Use of the Glasgow Norms will allow both the manipulation and control of lexical variables, in particular, in studies that investigate word recognition processes, whether in the experimental context of word-based tasks or during the course of fluent reading.Establishing the semantic contingencies and interactions of these variables will inform models of language processing.Note: The number of participants, age, and/or average number of lists completed are provided by grouping and gender."Unique participants" comprise individuals, some of whom provided ratings for more than one list of words; "all participants" represent the total number of participants responding to all lists, and does not take into account whether any given participant took part in more than one list.The majority of participants, 69%, completed a single list of words.The remaining percentages of individual participants completing more than one list are as follows: 17% did 2 lists, 3% did 3 lists, 8% did 4 lists, and 3% did 8 lists.Note: Pearson coefficients for 18 sets of norms reporting scales corresponding to the Glasgow Norms (note that no norms were available for semantic size).For each of the norms, the number of total items (Nsource) and the number of identical items within the Glasgow Norms (Noverlap) which were used for the correlations are indicated.Norms 8 and 17 examine different senses of ambiguous words.All correlations were highly significant (ps<.0001,Bonferroni corrected).Correlations with a large effect (r>.5, see Cohen, 1988) Davies, 2004) and Length in number of letters.CNC=concreteness; IMAG=imageability; AROU=arousal; |VAL|=absolute valence (i.e., the 1-9 scale was collapsed around its midpoint so that higher values would reflect more extreme valence, regardless of whether they were rated as positive or negative); FAM=familiarity; AOA=age of acquisition (negative loading); DOM=dominance; GEND=gender association; SIZE=semantic size.Note: Quadratic fits with SIZE or GEND that explain more than 18% of variance.R 2 s, Fvalues, and significance for linear and quadratic fits for all combinations of dimensions with can be found in Tables S1 and S2 of the Supplementary Materials.

Figure 1 .
Figure 1.Quadratic relations between semantic size or gender association and other dimensions.

Figure 2 .
Figure 2. A. Ratings of alternative senses of ambiguous words across dimensions.B.Ratings of the ambiguous word shell and its alternative senses across dimensions.

Table 3 Profile of the Number of Responses across Dimensions
Profile of the number of responses to the overall corpus (N=5,553), the subset of words (N=55) repeated across the 808-and 4,800-word lists, and the majority of words (N=5,498) presented in only one of the two lists.

Table 5 Correlations between Dimensions of the Glasgow Norms AROU VAL DOM CNC IMAG FAM AOA SIZE GEND
(Cohen, 1988)n coefficients for all combinations of scales.All correlations are significant (p<.01;Bonferroni corrected) except those listed in parentheses.Tests printed in bold are those considered large, with rs > |±.50|(Cohen, 1988).

Table 6 Factor Loadings for all Dimensions of the Glasgow Norms
Note: Reported are loadings of an oblique rotation matrix (direct oblimin) on four factors.Loadings > |±.50| are highlighted in bold.Explained common variance (%Var) is given for individual factors as well as the cumulative variance (%CumVar).

Table 8 SIZE Effects: Factor Loadings for the Extracted Principal Components (A) and Corresponding Multiple Linear Regression Results (B)
In A, only factor loadings ≥ |±.10| are shown and factor loadings > |±.50| are highlighted in bold.Frequency was measured in written occurrences per million as per the British National Corpus (2007;