PLOS One. 2023 Jul 26;18(7):e0288662. doi: 10.1371/journal.pone.0288662

“The algorithm will screw you”: Blame, social actors and the 2020 A Level results algorithm on Twitter

Dan Heaton 1,*, Elena Nichele 1,2, Jeremie Clos 1, Joel E Fischer 1
Editor: Michal Ptaszynski 3
PMCID: PMC10370707  PMID: 37494323

Abstract

In August 2020, the UK government and regulation body Ofqual replaced school examinations with automatically computed A Level grades in England and Wales. This algorithm factored in school attainment in each subject over the previous three years. Government officials initially stated that the algorithm was used to combat grade inflation. After public outcry, teacher assessment grades were used instead. Views concerning who was to blame for this scandal were expressed on the social media website Twitter. While previous work used NLP-based opinion mining computational linguistic tools to analyse this discourse, shortcomings included accuracy issues, difficulties in interpretation and limited conclusions on who authors blamed. Thus, we chose to complement this research by analysing 18,239 tweets relating to the A Level algorithm using Corpus Linguistics (CL) and Critical Discourse Analysis (CDA), underpinned by social actor representation. We examined how blame was attributed to different entities who were presented as social actors or having social agency. Through analysing transitivity in this discourse, we found the algorithm itself, the UK government and Ofqual were all implicated as potentially responsible as social actors through active agency, agency metaphor possession and instances of passive constructions. According to our results, students were found to have limited blame through the same analysis. We discuss how this builds upon existing research where the algorithm is implicated and how such a wide range of constructions obscures blame. Methodologically, we demonstrated that CL and CDA complement existing NLP-based computational linguistic tools in researching the 2020 A Level algorithm; however, there is further scope for how these approaches can be used in an iterative manner.

1 Introduction

Blame and agency in relation to automated decision-making is an emerging topic in academia [1]. Although currently under-explored, studying it has been shown to be important when designing interventions for cases where decision-making algorithms do not perform as intended [2]. A recent example of this is the case of the 2020 A Level algorithm in England and Wales, where examinations during the Covid-19 pandemic were replaced by automatically calculated grades. Although initially defended, the algorithm-decided grades were abolished and teacher assessment grades were used instead due to an outpouring of public dismay.

Although work has been done on collecting public perspectives about the A Level algorithm, there is a research gap regarding public views expressed on Twitter, which could be a valuable source of data as it hosts a plethora of views relating to current affairs [3, 4]. Therefore, addressing this research gap could provide a fuller and more detailed picture of the wider public’s response to the event. To date, one contribution by Heaton et al. [5] has examined Twitter discourses relating to decision-making algorithms, including the 2020 A Level algorithm, through the use of computational linguistic approaches, including sentiment analysis, due to their popular use in analysing trending topic discussions on Twitter [6–8]. Their analysis found that sentiment fluctuated throughout the discourse, though it was predominantly negative. In particular, fear and anger were the most prominent emotions, whilst discussions around the government, teachers and statistics were taking place.

However, due to the limitations of this approach—such as interpreting results and inconsistencies when comparing sentiment scores to human review [9, 10]—we build on this through the use of Corpus Linguistics (CL) and Critical Discourse Analysis (CDA). This qualitative analysis is underpinned by Social Actor Representation (SAR), a branch of Social Action Theory (SAT), where grammatical and transitivity structures play a crucial role in the representation of social actors [11]. Transitivity analysis—the examination of active and passive agents in texts—may uncover who is acting as the agent over whom and whether passive verbal constructions delete or mask social actors. There are various SAR techniques that indicate whether an agent in a text is a social actor, including exclusion, backgrounding, individualism, assimilation, personalisation and impersonalisation, which will all be explored. Thus, using SAR is helpful when examining blame and responsibility in discourse.

Our contribution is based on the belief that applying CL and CDA to Twitter discourses can mitigate some of the potential shortcomings of NLP-based computational linguistic tools [12]. This is due to the high emphasis on context and how language is used, underpinned by SAR. In fact, studies into Twitter discourses using these methods have yielded insightful and meaningful results on women driving in Saudi Arabia [13], refugees [14] and the dislike for hyperfeminized items being marketed to women and girls [15]. These examples showcase how this approach can be used in the wider context of social media research, which will be examined in this contribution.

This paper intends to add to the original findings from Heaton et al., which were affected by the shortcomings of the approaches discussed above. Opportunities offered by CL and CDA will be explored in this work, which shares the same dataset as Heaton et al., digging deeper into the discourse and finding out who Twitter users blame for the disruption to A Levels. Ultimately, using this combined approach will add to the current discourse regarding which entities have been blamed for the algorithm’s failure, particularly illuminating ideas about how social media users reacted to the scandal.

Summarising, this paper will use CL and CDA to examine how blame is implied in relation to automated decision-making, through agency and transitivity, in Twitter discourses regarding the A Level algorithm. From a practical perspective, the entities will be identified through the aid of SAR. From a theoretical perspective, complementing NLP-based computational linguistics with CL and CDA will illustrate a hybrid language analysis approach.

1.1 Context of the 2020 A Level algorithm

On August 13th 2020, Ofqual (The Office of Qualifications and Examinations Regulation), the UK examinations regulations body, used a decision-making algorithm to replace the standard A Level qualifications, which had been cancelled that year due to the Covid-19 pandemic. The algorithm (defined here as the processing of data to produce a score through classification and filtering [16]) used prior centre attainment and teacher assessments to generate a grade for each qualification [17]. In comparison to the predicted outcomes submitted by their teachers, 35.6 per cent of students had qualification results lowered by one grade, 3.3 per cent by two grades, and 0.2 per cent by three grades [18]. As a result, the conditions required by their university offers or employment opportunities were unmet. Therefore, their career plans were irreparably compromised.

This became a highly contested issue for schools, regulators and the wider public [19]. The key aspect criticised was that prior assessment data and teacher-assessed grades had been submitted but not used in their sole form [20]. Instead, they were combined with previous assessment data. This rendered the calculation especially unfair to students and educators from high-deprivation communities.

The UK government defended the use of the algorithm initially, as it helped combat grade inflation. However, due to public outcry, it retracted the algorithm-generated grades on August 17th 2020. Instead, all qualifications were awarded the teacher-submitted grades [21]. The Education Secretary of State at the time, Gavin Williamson, appeared to place blame on Ofqual and emphasised he was not aware of the scale of the problem [22]. The public reaction also saw the resignations of Sally Collier, CEO and Chief Regulator of Ofqual, and Jonathan Slater, the most senior civil servant in the Department for Education. Therefore, the social impact of the choice went well beyond the class of 2020.

Ofqual reported there was no grading bias [23]. However, it was found that the algorithm favoured students from more economically privileged backgrounds while others suffered more [24]. This was due to each school’s historic results being a significant factor in the algorithm’s grade calculation. This led to the algorithm being labelled as ‘mutant’ by UK Prime Minister Boris Johnson [25]. Ofqual officials were quick to blame ‘overly generous teachers’, but not the algorithm itself [19].

Several studies examined the impact of the algorithm. Bhopal and Myers surveyed 583 students and interviewed a further 53 students who were eligible to take A Level examinations, between April and August 2020 [26]. Their aims were to examine the impact (mental and academic) of predicted grades on A Level students, explore support systems in place for such students, and analyse differences by race, class, gender and school type. Through quantitative and qualitative analysis, it was found that students had identified the significance of unfairness within their individual experience. Students from all types of school and background felt the deployment of the algorithm placed little or no value on individual students’ experiences. Consequently, many students received results they perceived to be unfair (21% of those surveyed said they were happy with their results), which was in contrast to the official investigation report that concluded that there was no grading bias [23].

Additionally, Kolkman noted that the incident shone a light on algorithmic bias [27]. However, he also noted that greater knowledge of algorithm-driven decisions requires a better understanding of how the algorithms function. More specifically, the author foregrounded the importance of critical reflection within the process of algorithm design and noted that, without intervention, there will be further unrest and distrust in algorithms that impact daily lives. Hecht further examined the social impact of using the algorithm [28]. They stated that public awareness, scrutiny, and transparency are critical first steps to eliminate perceived bias from the algorithm but far from a guarantee. Therefore, these are important factors to consider when examining views expressed about the algorithm. Ultimately, the current literature demonstrates that different entities have been blamed for the algorithm’s failure, yet there is limited research into how social media users reacted to the scandal, thus providing motivation for our research.

As indicated previously, only one study has taken social media responses into account when considering the public reaction to the algorithm [5]. However, the NLP-based approach, as well as issues with accuracy and interpretation, meant that the contribution did not explore social actors and who was therefore blamed. As a result, we propose using CL and CDA, underpinned by SAR, to examine who users portrayed as social actors and who was blamed. The following section will look in more detail at the shortcomings of NLP-based computational linguistic tools. It will also examine how CL and CDA, underpinned by SAR, can achieve more detailed insights into who users presented as social actors, and therefore blamed, through the exploration of grammatical agency.

2 Related work

To demonstrate the need to combine the aforementioned analytical approaches, an overview of limitations affecting sentiment analysis, among other approaches, will follow. This is set up in the context of the previous Heaton et al. study [5]. Additionally, an outline of CL and CDA—the chosen approaches—will be used to review existing contributions, which used similar methods to investigate Twitter discourses. Using CL and CDA, underpinned by SAR, it will be possible to ultimately contribute to filling the gap previously identified in the literature by identifying social actors in the Twitter discourse, providing an indication of who social media users blamed for the assignment of A Level grades in 2020.

2.1 NLP-Based computational linguistics to examine social media

Popular NLP-based computational linguistic tools can support the identification of the viewpoints expressed in large social media datasets. Sentiment analysis can offer such insights through predictive algorithms, which work on a binary polarity scale [29, 30]. For example, Park et al. used VADER, a sentiment analysis tool, to investigate fashion trends on Instagram and proved that the strong social media presence of a model was more effective than a contract with a top agency [31]. Similarly, Sivalakshmi et al. explored the sentiment towards the Covid-19 vaccine using TextBlob, another sentiment tool, and concluded that the discourse was neutral-to-negative in polarity [32].

As previously mentioned, Heaton et al. used sentiment analysis and other NLP-based computational linguistic tools to examine the views expressed on Twitter regarding the Ofqual algorithm [5], critically evaluating sentiment analysis, topic modelling and emotion detection tools for textual analysis purposes.

Their TextBlob sentiment analysis (Fig 1) indicated that overall sentiment ranged from 0.088 to -0.052 and was therefore neutral. By contrast, their VADER sentiment analysis (Fig 1) showed that overall sentiment ranged from 0.03 to -0.5, indicating that overall sentiment was negative.

Fig 1. Sentiment analysis of tweets relating to the A Level algorithm in 2020 by Heaton et al. [5], licensed under CC BY 4.0.

They also found that the sentiment analysis in Fig 1 showed a negative change in sentiment on 14th August, the day after the results were shared with students. On 17th August, when the government reversed the decision, there was a rise in positive sentiment. However, on 26th August, when then-UK Prime Minister Boris Johnson told students that their results had been affected by a ‘mutant algorithm’ (mentioned in the introduction), a sharp negative change occurred, potentially caused by the negative word ‘mutant’ and associated negative terms used in response to this. On 3rd September, when Ofqual Chair Roger Taylor apologised to students while appearing at the Education Select Committee at the House of Commons, another positive rise could be seen.

Other findings reported were that the most featured word of the most prominent topic was ‘government’, foregrounding their role in using and then withdrawing the algorithm. ‘Trust’ was the emotion detected most frequently, but the direction of trust was not clear. Although this was a good starting point to capture general trends, these results struggle to explain why changes occurred or who tweet authors blamed.

As previously mentioned, sentiment analysis struggles to detect nuanced opinions. Notably, Heaton et al. [5] found difficulties in aiding their interpretation and potential applications in their study. This is echoed in similar studies [9, 10, 33]. Therefore, these findings might benefit from more rigorous qualitative analysis to unearth nuance and detail, especially when it comes to blame.

More generally, experts have sought to combine computational linguistic tools with other methods to mitigate these shortcomings. These combinations have ranged from manual inspections [34] to comparing human and algorithm classification [35, 36]. Human analysis provided the most accurate results, illustrating the need to combine these computational linguistic tools with other approaches for validation. However, these tools still do not clearly identify grammatical and social actors or who is blamed. Therefore, we aim to mitigate this challenge by incorporating CL and CDA.

2.2 Using Corpus Linguistics to examine social media

One suitable approach to provide insight is Corpus Linguistics (CL). A corpus is defined as a body of written text or transcribed speech, which can be linguistically or descriptively analysed [37]. CL builds on this idea by investigating the corpus through a multitude of different analytical tasks; it is the study of language data on a large scale [38]. CL allows for the comparison of multiple corpora (more than one dataset) to identify trends and patterns in texts, which is particularly helpful when comparing data from different time periods, such as in this study.

Once data is tagged according to part-of-speech (noun, verb, adjective, etc.), analysis can begin. One of these analytical methods is collocation. Collocation is defined as the co-occurrence of two or more words within a defined word span [39]. When using frequency as the sole measure, Baker states that it might not be possible to verify whether a co-occurrence is a true reflection of a semantic relationship or whether chance played a part [40]. Instead, statistical significance measures, such as LogDice (or Log Likelihood), become a useful indicator of lexical and grammatical associations between textual elements, as well as themes [41]. In this sense, concordances help identify collocations as they can show how close together the related words are. Therefore, concordance lines can display the context surrounding a word of interest [42].
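To make the collocation measure concrete, the following is a minimal sketch of a LogDice calculation over a tokenised text. It assumes the commonly published form of the score, 14 + log2(2·f_xy / (f_x + f_y)); the function names, the word span and the sample sentence are hypothetical and are not taken from the study or from any particular CL tool.

```python
import math
from collections import Counter


def logdice(f_xy, f_x, f_y):
    """logDice = 14 + log2(2 * f_xy / (f_x + f_y)); the maximum value is 14."""
    if f_xy == 0:
        return float("-inf")
    return 14 + math.log2(2 * f_xy / (f_x + f_y))


def collocates(tokens, node, span=3):
    """Score each word found within +/- `span` tokens of `node`.

    Each token position is counted at most once, even if it falls inside
    the windows of two different occurrences of the node word.
    """
    freq = Counter(tokens)
    positions = set()
    for i, tok in enumerate(tokens):
        if tok == node:
            lo, hi = max(0, i - span), min(len(tokens), i + span + 1)
            positions.update(j for j in range(lo, hi) if tokens[j] != node)
    co_freq = Counter(tokens[j] for j in positions)
    return {w: logdice(c, freq[node], freq[w]) for w, c in co_freq.items()}


# Hypothetical toy text, not drawn from the corpus.
tokens = "the algorithm screwed students the algorithm failed".split()
scores = collocates(tokens, "algorithm", span=1)
```

Because the score depends only on the frequencies of the two words and of their co-occurrence, it is not affected by overall corpus size, which is one reason it is often preferred to raw co-occurrence counts.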

There are advantages to using CL to analyse social media datasets. According to Jaworska, CL offers an ease in how large amounts of data can be automatically scanned to uncover patterns in frequency and keywords [39]. This is echoed by Tognini-Bonelli, who states that CL allows access to real-world, authentic texts and a high processing speed [43]. Given its efficiency and capacity to process large datasets, CL facilitates diachronic comparisons across corpora through lexical usage [44]. Because of its capacity to point out language patterns in large datasets, CL has been frequently deployed to carry out analyses on social media.

Jaworska also categorises media research involving CL into two strands: the first focuses on structural, pragmatic and rhetorical features of text, and the second on how language shapes representation [39]. Similarly, Nugraha et al. concentrated on both whilst investigating a Twitter corpus about the 2020 Charlie Hebdo shootings, the terrorist attacks on the headquarters of the French satirical magazine [45]. While ‘#JeSuisCharlie’ was most frequently used to express sympathy, ‘#CharlieHebdo’ featured in messages dealing with a wider variety of topics and emotions. Through using keyword and concordance analysis, and building on the previous CL findings of Kopf and Nichele [46], they found that there were 13 categories of keyword, such as place, the weapon, and the attacker. These categories are connected to each other: for example, many tweets linked the attacker to Islam, his religion, and discussed Pakistan and Islamic culture generally, framed by this incident. These studies all constitute examples of using CL to analyse Twitter discourses of social interest or having an impact on society.

Despite its key advantages, CL can pose analytical challenges with social media data. For instance, Baker found that CL, used in isolation, provides a focus on collocation and word frequencies, which is descriptive in functionality, and thus focus is drawn away from interpretation or critique [47]. Rose also criticises the restricted explainability of CL-derived results, despite the large amount of evidence these could provide [48]. In this sense, the author calls for an integration of CL with other qualitative approaches to ensure more meaningful insights. These recommendations appear to be supported by Sulalah, who investigated the semantic prosody of ‘increase’ in Covid-19 discourses [49]. Additionally, Liimatta states that CL analysis can be problematic when dealing with short texts because of its normalised counts, which are usually calculated on a base of either 1,000 or 10,000 [50]. These calculations could generate unreliable values when applied to very short texts, such as tweets, because the lexical samples they offer are so small. As a result, very short texts, which are especially common on certain social media platforms, should be interpreted carefully when compared.
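The normalisation issue can be illustrated with a short sketch. It assumes the usual definition of a normalised count (raw count scaled to a fixed base of words); the function name and the example figures are hypothetical and chosen only to show the arithmetic.

```python
def per_n_words(count, text_length, base=1000):
    """Scale a raw count to occurrences per `base` words."""
    if text_length <= 0:
        raise ValueError("text_length must be positive")
    return count * base / text_length


# One occurrence in a hypothetical 20-word tweet:
tweet_rate = per_n_words(1, 20)           # 50.0 per 1,000 words
# Fifty occurrences in a hypothetical 20,000-word article:
article_rate = per_n_words(50, 20_000)    # 2.5 per 1,000 words
```

A single token in a 20-word tweet already yields 50 per 1,000 words, so adding or removing one word changes the normalised value dramatically. This is the instability that makes comparisons between very short texts hard to interpret.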

2.3 Using critical discourse analysis to examine social media

Considering the challenges posed by CL, discussed in the previous section, we chose Discourse Analysis (DA) as a complementary approach. Whilst CL analysis tools struggle to pinpoint different perspectives and meaning shades, DA examines texts for nuance and pragmatic opinion (here meaning an examination of implied meanings of language). Therefore, these approaches were deemed especially effective together to explore blame in the A Level algorithm Twitter discourse.

Discourse surpasses sentence boundaries [51] and comprises language stretches that are interlinked and create meaning; thus, they carry an inscribed sociolinguistic value [52]. In this sense, questioning the social significance of language can uncover how it influences, and is influenced by, the world around us [53]. Therefore, DA is an interpretative qualitative approach to text analysis that draws upon related theoretical frameworks.

In fact, there are several foci that can be adopted when approaching DA and Hodges et al. label them as descriptive, empirical and critical [54]. While descriptive addresses solely how language and grammar work together to create meaning in isolation, empirical and critical variations account for context and even include it as part of the data collected from discourses. Empirical analysis has been used successfully in studies where there is still a microanalytical focus on language. However, critical analysis places even greater emphasis on contextual information through macroanalysis, which focuses on the power and perspectives of individuals and institutions. As this is relevant to our aim, we will apply Critical Discourse Analysis (CDA) in this paper.

CDA can be used as a tool to better understand meanings implied by the context of a text or series of texts [55]. Fairclough identifies three CDA layers: micro, meso and macro [56]. Micro analysis examines syntax (sentence construction), metaphorical meanings and rhetoric. Meso analysis looks at the interpretation of the relationship between discursive processes and the text. Macro analysis examines the explanation of the relationship between the discourse and the socio-cultural reality that is external to the text.

Another significant contribution with regard to contextual meanings of the discourse is put forward by Van Dijk, who offers a socio-cognitive perspective [57]. Accordingly, discourse can be viewed as socially shared representations of societal arrangements, as well as interpreting, thinking, arguing, inferencing and learning. Although different, the two contributions are similar with regard to transitivity [58]. For example, an examination of transitivity patterns may uncover who is acting as the agent (thus, performing the action) over whom and whether passive verbal constructions exclude and background social actors. Therefore, existing studies employing CDA show that a specific focus on agency can unveil blame and responsibility.

More specifically, Leslie defines an agent as an entity with an internal source of energy through which it exerts force supposedly to carry out the action referred to in the text [59]. Expanding on this, Richardson et al. state that agency in linguistics is often explored by examining how it is emphasized, manipulated, or concealed [60]. As such, transitivity analysis, the examination of agency in text, looks at the use of active and passive voice or nominalization, where verbs are converted to nouns. Here, choices reveal the attitude and ideology of the language user.

Additionally, research shows that passive constructions tend to remove agency from the subject. Especially when the subject is absent from the clause, there are shifts in blame [61]. Alternatively, agency can be implied through lexical choices. For instance, Morris et al. suggest that “ascending trajectory evokes impression of high animacy, which would be caused by enduring internal property, i.e. the volitional action” (e.g., “the NASDAQ fought its way upward”) [62]. On the other hand, “the descending trajectory suggests inanimacy, as a result of lack of external forces.” (e.g., “stocks drifted higher”).

Metaphors have also been used to personify inanimate entities and increase the dramatic effect and intensity of a statement [63]. Additionally, vocabulary can be examined to unearth how words are used to show ideology, including the use of euphemisms and metaphors. It is also important to factor in how implicit information can be inferred and deduced through the examination of these aspects of language. Given its relevance, this work will use transitivity and agency as a focus of our analysis.

Similar studies have used CDA to examine Twitter data, whilst addressing other social aspects such as gender and origins. Among them, Aljarallah et al. investigated perspectives on women driving in Saudi Arabia, finding specific hashtags that either supported or opposed women driving [13]. Their results showed, among others, that tweets with the hashtag #Womencardriving showed significant support for the movement. However, opposing reactions emerged from the hashtags #Iwilldrivemycar and #Iwillentermykitchen. In another study by Sveinson et al., representations of gender and stereotyping have also been explored, including overwhelming dislike for hyperfeminized items marketed to women and girls through detailed linguistic analysis [15]. This study demonstrated that fan clothing serves as more than just a reflection of consumer preferences, as it can also embody the cultural identity of an organisation. Also, Kreis investigated the hashtag #refugeesnotwelcome, unearthing that users deployed a rhetoric of inclusion and exclusion to depict refugees as unwanted, criminal outsiders [14]. Her findings showed that this discourse reflected a prevailing political climate in Europe, where nationalist-conservative and xenophobic right-wing groups were gaining influence and promoting a discourse that is prominent on social media. Overall, these studies demonstrate the benefits of using CDA on Twitter discourses specifically, highlighting the depth of understanding that it can uncover.

Notably, CDA brings several advantages as it can reveal unacknowledged aspects of human behaviour and support new or alternative positions on social subjects [12, 64]. In this sense, CDA is naturally interdisciplinary [65] and requires an abductive approach, where a symbiotic relationship between theory and empirical data is necessary [12]. As CDA examines the intricate relationships between text, social opinion, power, society and culture, it provides a lens to better understand urgent social implications [55]. Additionally, the incorporation of an epistemological aspect into CDA means that, while the researcher brings their own beliefs and perspective, reflection upon findings has its place within the approach. Bucholtz claims this to be reflexivity with a heightened self-consciousness [66]. Therefore, CDA is an appropriate choice to explore social action, blame and agency, as in this study.

As with any methodological approach, CDA has shortcomings, too. Firstly, performing CDA on a large dataset requires considerable effort and time [67]. Additionally, the subjective nature of CDA, approaching data with a personal perspective and lens, may limit its validity and decrease the objectivity and applications of the findings [68]. Both shortcomings provide a case for combining computational linguistic analysis with CDA. Also, Morgan notes CDA is not fixed and is always open to interpretation and negotiation [64]. The lack of objective measures available to analysts may result in inaccurate or misrepresentative findings. This complements the view of Olson that it is not a ‘hard science’ and more of an insight through examination and discussion [69].

These shortcomings provide a rationale for using CL with CDA to increase processing efficiency. This also aids the mitigation of the potential subjectivity of CDA: using a semi-automated approach first means comparisons can be organised according to the research focus. Although combining CL and CDA does not grant ultimate objectivity, it is less prone to exclusive subjective analysis.

2.4 Research gap

As previously discussed, combining all three of the approaches (sentiment analysis, CL and CDA) is uncommon. Nevertheless, existing contributions have demonstrated the individual efficacy of each to analyse features and characteristics of specific social media discourses. Thus, this work sets out to test whether their combination could provide a more complete account of the agency of potential social actors within the A Level grade calculation discourse. We aim to fill this current research gap by underpinning our analysis with Social Actor Representation (SAR), drawn from Social Action Theory (SAT).

SAT states that “people create society, institutions and structures” [70]. According to Engestrom, examining social actions can provide an explanation for human behaviour and societal change [71]. SAR is a branch of SAT that examines how grammatical structures convey social agency. For example, active or passive constructions and transitivity structures can be employed to communicate who social actors are in discourse [11].

Moreover, references to grammatical agents do not necessarily need to be present in discourse at all. Omitting them is called exclusion, whereas, in backgrounding, clues to the actor can be left in. Other strategies include individualism, which refers to actors as individuals, and assimilation, which refers to actors as groups. Also, actors can be personalised through word choices pertaining to the semantic nature of being ‘human’, or impersonalised. All of these representation structures play a role in indicating the social and power dynamics within discourse, as shown in other Twitter case studies that used CL and CDA [72–74]. For example, McGlashan explored the language patterns of followers of the Football Lads Alliance, revealing correlations between follower profile descriptions and their tweets, indicating a construction of identity tied to radical right-wing and populist discourse regarding Islam where Islam is attributed agency [72]. Moreover, Fadanelli et al. found that social actors in former Brazilian president Jair Bolsonaro’s pre-campaign and government tweets served to publicise the president’s enemies, promote polarization, and align with his ideology, ultimately impacting his popularity among supporters both positively and negatively [73]. Finally, Bernard studied the construction of social actors in the reports of two South African mining companies, revealing how linguistic representations of higher- and lower-wage employees contribute to power dynamics and social inequality in the industry, which emphasised the agency of these companies in shaping relationships and maintaining dominance. These studies indicate the potential value of combining SAR with CL and CDA.

Finally, it is important to note that the application of SAR illuminates insights through a novel perspective that would not have been possible using popular NLP-based computational linguistic tools alone. Therefore, using SAR with CL and CDA will allow further unpacking of the social implications discovered in this Twitter discourse.

3 Method

As previously mentioned, using Twitter has allowed the collection of a large, readily available dataset. Twitter data can be processed before analysis [75], lending itself well to exploratory analyses [76].

For convenience, data were collected using the Twitter for Academic Purposes Application Programming Interface (API) and Tweepy [77]. We ensured that the collection and analysis method complied with the terms and conditions for the source of the data and the API. The data were sourced from the United Kingdom and only tweets in English were selected, meaning the analysis investigated views expressed in English only. Since retweets indicated agreement or support, duplicate tweets were expected; these were eliminated from the corpus so as not to bias counts.
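The removal of duplicates described above can be illustrated with a minimal sketch. The source does not specify how duplicates were detected, so the approach here (stripping a leading `RT @user:` marker, lowercasing and collapsing whitespace before comparing) is an assumption, as are the function names and the dictionary layout of each tweet.

```python
import re


def normalise(text: str) -> str:
    """Build a comparison key for a tweet (assumed rule, not the authors' method):
    drop a leading 'RT @user:' marker, collapse whitespace, and lowercase."""
    text = re.sub(r"^RT @\w+:\s*", "", text)
    return re.sub(r"\s+", " ", text).strip().lower()


def deduplicate(tweets):
    """Keep the first tweet seen for each normalised text, preserving order."""
    seen = set()
    unique = []
    for tweet in tweets:
        key = normalise(tweet["text"])
        if key not in seen:
            seen.add(key)
            unique.append(tweet)
    return unique


# Hypothetical records; only the 'text' field is used for comparison.
sample = [
    {"id": 1, "text": "The algorithm used by Ofqual"},
    {"id": 2, "text": "RT @someone: The algorithm used by Ofqual"},
    {"id": 3, "text": "A different tweet"},
]
unique = deduplicate(sample)
```

Keeping the first occurrence means that a retweet is dropped only if its original is also in the collection; a stricter or looser rule could change the corpus size.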

The 18,239 tweets composing the dataset were published from 12th August 2020, the day before A Level results were released to students, until 3rd September 2020, after Ofqual’s chair appeared at the Education Select Committee. Tweets containing ‘Ofqual algorithm’, ‘ofqualalgorithm’, ‘A level algorithm’, ‘alevelalgorithm’, ‘a levels algorithm’, ‘a-level algorithm’ or ‘a-levels algorithm’ were gathered. These search terms were chosen on the basis of their relevance to the algorithm, rather than the A Level results in general. The tweet IDs, and other associated information, can be found in S1 Dataset.
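As an illustration only, the seven search phrases listed above can be expressed as a simple case-insensitive filter. This is a sketch under the assumption of plain substring matching; the Twitter API applies its own query matching rules, which may differ, and the function name is hypothetical.

```python
# The seven search phrases reported in the text, lowercased for comparison.
SEARCH_TERMS = [
    "ofqual algorithm",
    "ofqualalgorithm",
    "a level algorithm",
    "alevelalgorithm",
    "a levels algorithm",
    "a-level algorithm",
    "a-levels algorithm",
]


def matches_search_terms(text: str) -> bool:
    """Return True if any search phrase occurs in the text (assumed substring rule)."""
    lowered = text.lower()
    return any(term in lowered for term in SEARCH_TERMS)
```

A tweet about A Level results in general, with no mention of the algorithm, would not pass this filter, which mirrors the stated aim of targeting the algorithm rather than the results.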

The next step concerned CL. Using the CL software The Sketch Engine [78], a keyword analysis was conducted to investigate frequently featuring social actors. The reference corpus used was the English Web 2020 (enTenTen20) [79], which comprises 36 billion words of internet texts. Since it contains texts from social media, this was believed to be a suitable reference corpus for this study.

Firstly, our corpus was compared to the reference corpus to generate a keyness score for each word, calculated by comparing the frequency of the word in the target corpus to its frequency in the reference corpus. Secondly, concordance lines featuring potential social actors were examined to prompt the collocation analysis. This included using LogDice as a statistical measure of collocational strength. Thirdly, CDA was used to examine agency and blame as expressed in the concordance lines, where the selected keywords appeared in context.
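The text does not state which keyness formula was applied. A minimal sketch, assuming Sketch Engine's 'simple maths' score (the smoothed ratio of the two relative frequencies per million), is given below. The smoothing value of 1000 is an assumption, chosen because it reproduces most of the scores reported in Table 1; the function and parameter names are illustrative.

```python
def keyness(focus_fpm: float, ref_fpm: float, smoothing: float = 1000.0) -> float:
    """Smoothed frequency ratio ('simple maths' keyness).

    focus_fpm and ref_fpm are relative frequencies per million words in the
    focus and reference corpora. The smoothing default is an assumption.
    """
    return (focus_fpm + smoothing) / (ref_fpm + smoothing)


# Relative frequencies for 'algorithm' as reported in Table 1.
algorithm_score = keyness(28339.45, 0.51)
```

A larger smoothing value favours frequent words over rare ones, which may explain why a common word such as by can appear among the keywords.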

Additionally, the focus was placed on transitivity, through the examination of social actors in sentence structures, vocabulary choice and the use of metaphor and possession. Specifically, principles of Van Leeuwen’s SAR were employed to provide insight into these social representations. Therefore, we looked at items of interest that could be related to blame, agency and social action through the collocation analysis of their concordance lines.
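For readers unfamiliar with concordance lines, the following sketch shows a basic keyword-in-context (KWIC) view of the kind that concordancing software produces. It is not the Sketch Engine implementation; the tokenisation rule and function name are assumptions, and the two example texts are excerpts quoted in the results section.

```python
import re


def concordance(texts, node, width=5):
    """Return (left context, node, right context) for every hit of `node`.

    Tokenisation is a simple assumed rule: runs of word characters,
    apostrophes and hyphens, lowercased.
    """
    lines = []
    for text in texts:
        tokens = re.findall(r"[\w'-]+", text.lower())
        for i, token in enumerate(tokens):
            if token == node:
                left = " ".join(tokens[max(0, i - width):i])
                right = " ".join(tokens[i + 1:i + 1 + width])
                lines.append((left, token, right))
    return lines


examples = [
    "that algorithm is going to screw you",
    "the algorithm used by ofqual can't be applied to small cohorts",
]
lines = concordance(examples, "algorithm", width=3)
```

Reading the right-hand context shows whether the node word is followed by an active verb phrase (‘is going to screw’) or sits inside a passive or reduced construction (‘used by ofqual’), which is the distinction the transitivity analysis relies on.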

Despite the advantages just discussed, using tweets as a reflection of specific social media discourses carries risks [80]. Firstly, complex ethical considerations have to be made when scraping data for analysis from Twitter. For instance, a prominent ethical issue is that, although tweets are public by default, Twitter ‘data’ is not actively provided by users for research purposes, yet gaining explicit consent to use tweets from their authors is practically unfeasible [81]. Therefore, we decided not to attribute the tweet excerpts quoted in the results section to their authors, trusting that the excerpts could hardly be attributed to specific users, as approved by the university department’s ethics committee. As an extra precaution, data were pseudonymised during the extraction process, with a unique number generated by the Twitter Academic API referring to each tweet. This is why only tweet IDs are available in S1 Dataset, rather than the tweets in their entirety.

4 Results

This section first presents the CL keyword analysis, which led us to identify potential social actors for investigation. Based on this first list, four potential social actors (the algorithm, Ofqual, the government and students) were investigated through the examination of collocational strength and CDA.

4.1 Keyword analysis of potential social actors

Table 1 shows the top ten words with the highest keyness score when compared to enTenTen20. From this analysis, four potential entities were identified: the algorithm itself, Ofqual, students and the government, as they all appeared as keywords. These were selected because they were all nouns that had the potential to be presented actively in a grammatical construction and thus could be social actors. The following sections detail how blame is placed or not placed on the entity of concern through the main events of the discourse.

Table 1. The top ten words with the highest keyness score.

Item | Focus corpus relative frequency (per million) | Reference corpus relative frequency (per million) | Score
--- | --- | --- | ---
algorithm | 28,339.45 | 0.51 | 29.3
a-level | 9,881.38 | 1.27 | 10.9
ofqual | 8,598.08 | 0.08 | 9.6
results | 6,261.83 | 14.79 | 7.2
grades | 5,717.25 | 7.94 | 6.7
students | 4,730.1 | 94.14 | 5.2
a-levels | 4,175.65 | 1.61 | 5.2
by | 7,584.61 | 471.41 | 4.9
exam | 2,826.54 | 30.25 | 3.7
government | 2,518.88 | 45.21 | 3.4

4.2 The algorithm

Collocational strength of the top ten words associated with algorithm is shown in Table 2 (after stopword associations were removed). The trajectory of the collocations over time can be seen in Figs 2 and 3. Both a level and ofqual appeared as adjectival modifiers to algorithm. Flaws collocates strongly with algorithm at the start of the discourse, pertaining to one particular tweet that had been retweeted many times about a father (hence the strong collocation with this word too) that points out ‘algorithm flaws’. This returned towards the end of the discourse, where there were many tweets discussing how Education Secretary Gavin Williamson ‘knew of the flaws of the algorithm’. Words with high collocational strength that are in the semantic field of education, such as results, grades and exam, were also present, but could not tell us much about how the algorithm was presented. Therefore, from this analysis alone, it is not clear whether the algorithm itself had grammatical agency or perceived social agency from tweet authors.

Table 2. Collocational strength of algorithm.

Collocate | Co-occurrence frequency | Collocate frequency | logDice
--- | --- | --- | ---
a-level | 3405 | 6006 | 12.2297
ofqual | 2393 | 5226 | 11.7701
results | 1324 | 3806 | 11.0105
flaws | 1090 | 1149 | 10.9247
a-levels | 1110 | 2538 | 10.8458
foresaw | 997 | 1015 | 10.8066
father | 991 | 1025 | 10.7971
exam | 1004 | 1718 | 10.7622
grades | 1053 | 3475 | 10.703
level | 903 | 1305 | 10.641
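The logDice values in Table 2 can be checked against the standard logDice formula, 14 + log2(2·f_xy / (f_x + f_y)), where f_xy is the co-occurrence frequency and f_x and f_y are the frequencies of the node and the collocate. The sketch below is illustrative; the node frequency of 17,225 for algorithm is taken from the algorithm row of Table 3 and is therefore an inference rather than a figure stated for this table.

```python
import math


def log_dice(cooccurrence: int, node_freq: int, collocate_freq: int) -> float:
    """logDice association score: 14 + log2(2 * f_xy / (f_x + f_y))."""
    return 14 + math.log2(2 * cooccurrence / (node_freq + collocate_freq))


ALGORITHM_FREQ = 17225  # assumed node frequency, read from Table 3

# 'a-level' as a collocate of 'algorithm' (values from Table 2).
a_level_score = log_dice(3405, ALGORITHM_FREQ, 6006)
```

Because the score depends only on the three frequencies and not on corpus size, values can be compared across the subcorpora used for the temporal trajectories.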

Fig 2. Temporal trajectory of LogDice scores of collocates of algorithm—Part A.


Fig 3. Temporal trajectory of LogDice scores of collocates of algorithm—Part B.


However, manual examination of other concordances shows that the algorithm itself is presented as having agency and as potentially being blamed for the events that occurred. In this section, the key findings relate to the active presentation of the algorithm, its metaphorical agency and personalisation, and how this changes through the timeline as tweets show an undetermined responsibility for the actions.

On 12th August 2020, tweets show the algorithm performing a task as the social actor in grammatical constructions. Tweets that contain structures such as ‘that algorithm is going to screw you’ and ‘this algorithm appears to be cementing that bias towards the wealthy’ received a total of 235 engagements (combined likes and retweets). The active syntactical structures imply that social agency is with the algorithm. On 13th August, the day results were released to students, there were also many tweets that gave the algorithm social agency, presented in a similar way, illustrated by the active statements that the algorithm ‘caused today’s chaos’ (5795 engagements). Here, personalisation is seen. This is in addition to a tweet that contained ‘the algorithm used by ofqual can’t be applied to small cohorts’ (5517 engagements), here foregrounding the importance of the algorithm, despite a lack of agency, through this passive construction. This could be seen as the backgrounding of Ofqual and a foregrounding of the algorithm.

Prior to the government’s reversal of the decision, transitivity analysis shows more cases of the algorithm being presented in an unfavourable way. Regarding pathways to university, one tweet says that it is ‘intolerable that an algorithm is denying this to others’ (7774 engagements), a clear active grammatical construction that places agency with the algorithm. Another tweet states that ‘this racist, discriminatory and downright evil algorithm is ruining lives’ (2595 engagements), overtly stating that the algorithm has the power to create significant impact on humans, thus being personalised. Additionally, a tweet on 16th August stated that ‘97% of gcse results fully decided by an algorithm’ (1490 engagements). This implies that the algorithm has the capacity to make decisions on the outcome of the GCSE qualifications of students. Another well-engaged tweet on 16th August stated that the ‘algorithm has given them Us and fails’ (13256 engagements), placing agency with the algorithm through personalisation.

This sentiment continued into the date of the reversed decision, 17th August 2020. One tweet with 2136 engagements included the clause ‘your future should be based on your abilities not an algorithm’, continuing the notion that the algorithm has the potential to change lives. Another tweet with 7126 engagements said that ‘private schools had done better with the ofqual algorithm’. Despite being part of a prepositional phrase in this context, the algorithm is still mentioned when the foregrounded part of the tweet is concerned with inequality of results. However, the algorithm is nominally labelled as ‘the ofqual algorithm’. Despite the active presentation of the algorithm, it is therefore owned by Ofqual, potentially blurring the boundaries of blame and accountability.

There are occasions when the algorithm is referred to as being ‘used’ by an unknown actor. This is first seen in the most engaged-with tweet on 12th August, the day before results were released to students, which stated ‘the algorithm used to grade a-level results is incredibly sophisticated’ (4513 engagements). The fact that a transitive verb ‘used’ is chosen here without a named active social actor creates the impression that authors believe the algorithm is not to blame for the results, but the anonymous ‘user’ is. There are further instances where this occurs, such as ‘algorithm used for a-level grades’ on 17th August (1695 engagements).

The algorithm is also presented passively, implying removed agency. One tweet with 1329 engagements states that people ‘benefited from [the] algorithm’ on 13th August. Additionally, the most engaged-with tweet on 15th August (10311 engagements) discussed the importance of rectifying the situation prior to the release of GCSE results the following week, stating that the qualifications would also be ‘assigned *solely* by another ofqual algorithm’. While this presents Ofqual as the possessor of the algorithm and could imply blame, the algorithm itself is performing the task of ‘assigning’ despite being an inactive entity. This is in addition to a tweet on the same day that explains ‘1/4 state school students were downgraded by the algorithm versus 1/10 private school students’ (2931 engagements). Here, again, while a passive construction is used, the algorithm is not the focus of the construction; instead, the focus is shifted to the inequality of the ‘decisions’ that the algorithm made. Thus, while blame is not attributed to the algorithm through syntactical structures here, the subject matter of the tweet places blame on it through the foregrounding of this comparison. This backgrounding limits the agency that the algorithm has as a social actor but still implies blame.

Passive constructions continue on 18th August, where a UK university tweeted about supporting students ‘who have been disproportionately affected by the a-level algorithm’ (298 engagements). Again, while this is a passive construction, agency may still be attributed to the algorithm as it has performed an action that affected a human. However, it must be noted that the construction of the sentence foregrounds the students in this case.

Further on in the discourse, on 25th August, there are tweets that imply the algorithm is doing a ‘job’, an activity usually performed by a human. One author wrote ‘Ofqual guidance doesn’t require them to moderate—that was the job of the algorithm’. This personification and personalisation of the algorithm could place further blame and agency on it as a distinct social actor. This is in addition to a user who details that the algorithm had ‘failed [their] daughter’, thus implying that the algorithm had agency to perform such an action.

To summarise, the algorithm is mostly seen in active constructions that indicate agency is with it as a social actor. The personalisation and agency metaphor strategies seen in tweets also add to the indication that people see the algorithm as a social actor too. There are, however, instances where the algorithm is portrayed in passive constructions, although blame could still be interpreted. In the final dates of the dataset explored, more tweets directed blame through agency at Ofqual and the UK government. There are some active constructions that involve the algorithm, but the majority are centred around the organisations or individuals. These social actors will now be explored in more detail.

4.3 Ofqual

This section explores Ofqual as a potential social actor, with a specific focus on active and passive agency, agency metaphor and individualism of a defined entity within Ofqual, Roger Taylor. Collocational strength of the top ten words associated with Ofqual is shown in Table 3. The trajectory of the collocations over time can be seen in Figs 4 and 5. Once again, lexicon associated with education was present. Collocations of interest included ignored. This was seen throughout the discourse, such as on 14th August (‘ofqual ignored offers of expert help with its algorithm’) and 20th August (‘ofqual ignored exams warning a month ago’). The use of the word ‘ignored’ here could be seen as significant as it places Ofqual as the active social actor in the tweet. Have was also collocationally strong, often performing as an auxiliary verb where Ofqual is the social actor (‘ofqual have created an algorithm which just doesn’t work’, ‘ofqual have downgraded’, ‘ofqual who have ruined young lives’ and ‘ofqual have favoured the unadjusted small cohorts’). Used is seen in constructions that are active (‘ofqual has used an unequal algorithm’) and passive (‘the algorithm used by ofqual’) throughout the discourse. There was a great deal of engagement with a tweet that stated ‘ofqual exam results algorithm was unlawful, says labour’. Although not an examination of agency, the use of the adjective unlawful might be an indicator of blame.

Table 3. Collocational strength of Ofqual.

Collocate | Co-occurrence frequency | Collocate frequency | logDice
--- | --- | --- | ---
algorithm | 2396 | 17225 | 11.7719
exam | 299 | 1718 | 10.4625
results | 330 | 3806 | 10.2255
exams | 227 | 1110 | 10.1972
have | 299 | 3726 | 10.096
ignored | 182 | 308 | 10.0737
regulator | 182 | 347 | 10.0636
used | 206 | 1339 | 10.0059
unlawful | 169 | 387 | 9.94632
not | 226 | 2961 | 9.82106

Fig 4. Temporal trajectory of LogDice scores of collocates of Ofqual—Part A.


Fig 5. Temporal trajectory of LogDice scores of collocates of Ofqual—Part B.


Through further concordance examination, users showed other ways in which they blamed Ofqual. Immediately, it is clear that the process of assimilation is present in tweets pertaining to Ofqual due to it being a group. One of the most common ways in which this occurred was through attributing ownership of the algorithm to Ofqual, as seen in tweets that contained the phrase ‘its algorithm’, found throughout the discourse.

Upon the revision of results, Ofqual was mentioned more in the discourse as a social actor. This is seen in tweets that involve the possession of the algorithm and some that talk about Ofqual as a separate social actor. In tweets that do discuss Ofqual as owners of the algorithm, such as ‘experts question how their algorithm could so blatantly favour private schools’, seen on 17th August with 4274 engagements, this possession is clear. However, the algorithm here still has some sort of agency as it is the social actor doing the ‘favouring’. This blurs the lines between who the social actor is and, therefore, who is to blame. This implication of multiple entities, with the algorithm as the social actor but Ofqual as the possessor, continues the following day. This is seen in a tweet with 270 engagements that states ‘the government knew ofqual’s algorithm would disadvantage the disadvantaged’. This may result in blurred blame.

As previously alluded to, there are tweets that foreground Ofqual as the social actor, rather than as the owners of the algorithm. For example, one tweet with 2029 engagements on 20th August contains ‘it’s their faith in these one-dimensional metrics that bedevills education’, with the possessive pronoun ‘their’ referring to Ofqual. This hyperbolic use of language to heighten emotion and impact intensifies the focus on Ofqual as a blameworthy social actor. This is exemplified further in a tweet with 135 engagements on 22nd August, stating ‘ofqual […] applied the algorithm’.

In later parts of the corpus, this continues. One tweet with 106 engagements on 2nd September expresses exasperation with Ofqual by stating ‘how did the ofqual people not realise that what they did with the algorithm would not be acceptable’. Ofqual is clearly presented as an implicated social actor here, with the algorithm part of the prepositional subject phrase. This emphasises Ofqual’s agency and, thus, attributes blame to them. These tweets coincide with Ofqual Chair, Roger Taylor, speaking directly to the Education Select Committee.

Users also placed agency and blame on Taylor himself through individualism. This is seen especially in early September 2020, when Taylor spoke to the Education Select Committee. As early as 13th August, the day results were released to students, Taylor is actively implicated. In the same tweet that stated that the ‘algorithm caused today’s chaos’, the tweet author goes on to state that ‘ofqual chair roger taylor also chairs the centre for data ethics innovation’, which is heavily linked with Dominic Cummings, former advisor to Boris Johnson. This active construction, and use of the verb ‘chairs’, which is indicative of status and power, could implicate Taylor, especially with the high engagement with the tweet (3,349 likes and 2,446 retweets). There are other tweets from around a similar time that could place blame on Taylor through agency. For example, one tweet on 16th August states ‘roger taylor, […] responsible for the algorithm, flunked his own a levels but was given a “second chance” after passing the entrance exam’ (117 engagements). Several verbal phrases in this tweet are attributed to Taylor, including that he is ‘responsible’ for the algorithm, and, potentially, the failure of the process. Additionally, blame is further implied through the idea that Taylor ‘flunked’ his exams and ‘was given’ (a passive construction) a second chance. Similarly to Ofqual, there are times throughout the discourse when the algorithm is attributed to his possession, such as ‘benefit from grade inflation under his algorithm’ (4164 engagements).

On the 24th August, Taylor is presented in both an active and passive way. For example, a tweet with 518 engagements states ‘roger taylor’s company was criticised’ for failures concerning algorithms in the past. This passive construction removes the social actor from the construction and foregrounds the importance of Taylor. This is further emphasised by the active role he is given later in the same tweet, when the author writes that ‘he’s chair of the body charged with overseeing algorithms’, and in another tweet that states ‘roger taylor chairs both centre for data ethics and innovation (cdei) ofqual’. As well as overtly critiquing Taylor’s conflicts of interest by holding multiple senior roles, the use of the lexical item ‘chair’ (in both noun and verb word classes) reinforces the status, power and responsibility that Taylor has.

On 2nd September, Taylor appeared at the Education Select Committee to discuss the algorithm’s impact. Tweets placed agency and blame with Taylor. An example includes ‘roger taylor […] admits the decision to use an algorithm to award results was a “fundamental mistake”’ (105 engagements). Taylor is clearly the focal social actor in the construction, with intensity heightened through the use of ‘admits’. However, there are other tweets on this date that do implicate Taylor as a blameworthy social actor, but do so by using the word ‘tells’ in place of ‘admits’, thus softening the potential blame on Taylor.

To summarise, Ofqual is seen to be presented as a key social actor in this discourse, attracting blame from Twitter users by using active agency and possession. Taylor, here, is seen to be blameworthy through repeated individualism.

4.4 The UK government

In this section, the UK government is explored as a potential social actor, focusing on assimilation and individualism for senior government figures. Collocational strength of the top ten words associated with government is shown in Table 4. The trajectory of the collocations over time can be seen in Figs 6 and 7. There are words that might be expected to be related to the government (uk, tory) and also words that are particularly associated with this specific discourse (ofqual, algorithm, a-level). U-turn, the word with the highest collocational strength, appears as a noun (‘should the government perform a u-turn’), a verb (‘ofqual want the government to u-turn’) and, later in the discourse, part of a noun phrase (‘even with the government algorithm u-turn’). The majority attributed the action of the ‘u-turn’ to the government, as seen in excerpts such as ‘the government has u-turned’, ‘government u turn on exam results’ and ‘we welcome the government’s u-turn’. After is frequently used as a conjunction preceding clauses such as these, discussing the need for teacher assessed grades. Unlike the first two entities, this collocation analysis implies the government could be blameworthy.

Table 4. Collocational strength of government.

Collocate | Co-occurrence frequency | Collocate frequency | logDice
--- | --- | --- | ---
u-turn | 147 | 582 | 11.1546
after | 80 | 764 | 10.1577
must | 56 | 401 | 9.89148
uk | 58 | 548 | 9.83631
ofqual | 176 | 5226 | 9.73726
tory | 45 | 281 | 9.66849
algorithm | 396 | 17225 | 9.43429
a-level | 139 | 6006 | 9.23917
not | 80 | 2961 | 9.18879
have | 93 | 3726 | 9.17913

Fig 6. Temporal trajectory of LogDice scores of collocates of government—Part A.


Fig 7. Temporal trajectory of LogDice scores of collocates of government—Part B.


Must is used as a modal verb in a variety of constructions that call on the government to address the situation, such as ‘the government must u-turn’, ‘the government must apply cags’ and ‘the government must learn from the shambolic handling of a-level results’. All of these constructions place the government as blameworthy social actors.

Further concordance examination places blame on the UK government as a collective entity, as well as some individual figures. Once again, assimilation is found in many constructions. Tweets throughout the discourse refer to the algorithm as ‘the government’s algorithm’, which is expanded upon as a noun phrase by different tweet authors, such as referring to it as the ‘hastily-built government algorithm’ (665 engagements).

In a direct address to A Level students on 13th August, one author said ‘i am sorry this government has failed you’ (1599 engagements). Blame is placed with the government as the implicated social actor. Further implications of blame could come from the active statements ‘government refusing to learn from a level fiasco’ (619 engagements) and ‘this government really don’t like teachers’ (1490 engagements). Another tweet stated that the choice of using the algorithm was ‘devastating by the uk government’ (512 engagements). Although passive, this construction might attribute blame to the government through the foregrounding of the particularly emotive word ‘devastating’. This is again seen in ‘negatively hurt by the tory algorithm’ (434 engagements), where emphasis is on the emotion (the ‘hurting’) rather than government. Although this is backgrounding, the implication of blame remains.

There are some instances of support, rather than blame, early on in the discourse, too. A tweet with 362 engagements contains ‘the government never trusts teachers but in this v unusual situation it is the fairest way’. The author implies the government is a social actor but in a positive way, despite the verbal phrase ‘never trusts’ usually being associated with negativity.

Upon the revoking of the use of the algorithm, tweets imply blame is with the government, including one example with 429 engagements that states ‘time for the government to hold up their hands’. The implied imperative, the government as the subject of the clause and the colloquialism ‘hold up […] hands’ may imply blame. A tweet with 1684 engagements from 18th August says ‘the government will blame ofqual’, with the active construction perhaps showing that the government is attempting to deflect blame from themselves. This is coupled with tweets that expand the possessive noun phrase, such as ‘their rigged algorithm’ (4105 engagements). Later, on 25th August, active constructions further implicate the government, such as ‘the government ignored red flags’ (35 engagements). This links to the idea that ministers put their ‘faith’ in the algorithm.

On 26th August 2020, the day that UK Prime Minister Johnson announced that results had been jeopardised by a ‘mutant algorithm’, Twitter users placed blame with the government. The most engaged-with tweet on this day, which had 5884 likes and 1762 retweets, used a series of rhetorical questions to imply that the government was to blame for the results scandal. Part of the tweet reads, ‘who set the parameters for ofqual’s algorithm? ministers! who didn’t ask the right questions? ministers! who didn’t ask for a simulation of the impact? ministers!! so who should resign?’ This tweet’s use of effective tripling as a rhetorical device is noteworthy, but it also has aspects of agency to explore. The interrogative pronoun ‘who’ could be substituted for the government (or ‘ministers’ in this case), making them an implied active social actor in the fault of the algorithm. Although the responses to the tweet were not part of the original dataset, there were other tweets within the dataset that linked the same BBC article, thus acting as a springboard for conversation and framed contextually around this specific piece of information. These tweets presented the government as implicated social actors.

Once again, there are individual social actors within this body, explored as individualism. Firstly, there are specific instances where blame is attributed to UK Prime Minister Boris Johnson. Upon the release of results, structures in tweets indicated that he had ownership of the algorithm, such as ‘clever boris’ algorithm’ (96471 engagements), implying blame is with Johnson. Additional tweets also indicate blame with Johnson, specifically on 26th August. One user tweeted about Johnson that ‘he can’t wriggle out of responsibility with bluster and distortion’ (32 engagements). This presents Johnson as the active social actor and the verb phrase ‘wriggle out’ may indicate he is to blame.

There are also a number of tweets that discuss Gavin Williamson, UK Education Secretary of State at the time of the A Level results in 2020. On the day of the government u-turn, one tweet stated that Williamson had ‘signed off on’ the algorithm (700 engagements), showcasing him as a blameworthy social actor and decision-maker. On 18th August, after the reversal, constructions included ‘williamson is trying to blame ofqual’ and ‘he admits he didn’t even bother checking it’ (224 engagements). These constructions show his active agency. However, Williamson is also presented in passive constructions, with one tweet with 524 engagements saying that he ‘was badly advised’. This reduces blame towards Williamson, especially through the obscuring of an unknown social actor in the construction through exclusion.

In summary, the findings here indicate that elements of blame through active agency and social action for the government can be derived from the tweets. Passive constructions use emotive language that still implies blame is with the government. There are times when assimilation occurs and, as the discourse continues, individualism is more apparent for Johnson and Williamson.

4.5 Students

Collocational strength of the top ten words associated with students is shown in Table 5. Again, there are anticipated semantically-related words present (a-level, grades, gcse, england). Many of the occurrences of their relate to how well teachers know their students (seemingly in retaliation to the decision to use an algorithm to calculate grades, rather than teachers) and to discussions about their futures in the wake of the decisions made.

Table 5. Collocational strength of students.

Collocate | Co-occurrence frequency | Collocate frequency | logDice
--- | --- | --- | ---
a-level | 651 | 6006 | 11.23
their | 395 | 2757 | 11.1663
grades | 373 | 3475 | 10.9105
gcse | 238 | 1344 | 10.8521
have | 336 | 3726 | 10.7039
downgraded | 155 | 914 | 10.3885
england | 132 | 767 | 10.2139
many | 115 | 613 | 10.0773
all | 139 | 1425 | 10.0488
given | 108 | 473 | 10.0458

The strength of the relationship of students and downgraded can also be examined. These are a mix of passive (‘students getting downgraded results by some algorithm’) and active (‘algorithm that downgraded many disadvantaged students’) constructions, where students were the object in either. There were instances where the verb ‘downgraded’ was intransitive and the social actor performing the action was not included in the tweet (‘40% of a-level students being downgraded’). While this reduces potential blame for students, it does not implicate another social actor. It is also important to note that this is another example of assimilation. Upon further CDA examination, it appeared that students were presented as passive in the majority of constructions, regardless of the verb used, including given when the decision was reversed (‘students in england will be given grades estimated by their teachers’—a tweet with many retweets). This may suggest that students are not as heavily implicated.

5 Discussion

In the following, we discuss the implications of blame being attributed to the algorithm itself, Ofqual and the UK government through the combination of collocation, transitivity and social action analysis. Although these are three different aspects, in this study they are explored in an intertwined way. We relate this to previous research into the algorithm and the A Level results of 2020 to contribute to existing analysis concerning blame and responsibility for the issuing of results. Afterwards, we consider how the results work in a complementary way to NLP-based computational linguistic findings, building on our previously identified research gap.

5.1 Blame for the A Level results

Through the analysis of transitivity in concordance lines, collocation and CDA, underpinned by SAR, it was possible to see how blame is attributed to social actors throughout this Twitter discourse. The algorithm itself is most commonly presented as having active agency. The tweets seen that support this seem to imply that the algorithm is a social actor, despite its inanimate state, and so blame is shifted to the algorithm. Tweets imply that the algorithm is able to make decisions independently. This is in line with expectations of agency and blame that are outlined by Richardson et al. [60] and personalisation by Van Leeuwen [11].

Through personification and agency metaphor, the algorithm is depicted as carrying out human-like actions. This appears to support the idea of Goatly [63] that this is done for increased dramatic effect and implies the algorithm has the capacity to make independent decisions, such as removing pathways to university.

Although this is less frequent, there are also times where the algorithm is included in passive constructions. This is especially true when the algorithm is referred to as being used by an unknown social actor, thus shielding the ‘user’, which may take agency away from the algorithm and obscure blame. There are times when more intense verbs are used in passive constructions, still implicating the algorithm. This relates to the notions of agency specified by Clark [61] and could be seen to be obscuring agency through backgrounding, according to principles of SAR [11].

However, considering verb choices, there are passive constructions that contain the verbs ‘assigned’ and ‘graded’. Thus, a small portion of tweets using passive constructions appear to imply that the algorithm can still be blamed. This can be categorised as agency metaphor according to Morris et al. [62].

This builds upon existing research in which Bhopal and Myers found that students thought that the algorithm’s result generation was unfair, thus implicating the algorithm [26], and ties into the potential backlash against algorithms that was reported to have occurred (and predicted to intensify) by Kolkman [27] and Hecht [28]. This, in turn, supports one of our other findings: that students were not blamed through agency and transitivity in this Twitter discourse due to their passive presentation.

The UK government and the regulation body Ofqual were also presented as responsible social actors by Twitter users. For both social actors, active statements were seen that could implicate them as agents of blame. Such statements were less frequent than those implicating the algorithm at the start of the sampled discourse and became more frequent towards the end. Assimilation and individualism were both seen here.

Some tweets show how blame is attributed to social actors through the possession of another. For example, Ofqual and the UK government were, in many tweets, seen to be the owners of the algorithm, which implies that they are to blame for the failures of the algorithm. This occurs throughout the discourse, especially on dates of significant events, such as the algorithm belonging to Roger Taylor on the date he appeared at the Education Select Committee, the algorithm belonging to Boris Johnson on the date he called it a ‘mutant algorithm’, and the algorithm belonging to Gavin Williamson on the date of the u-turn. The idea of another entity possessing the implicated entity of the algorithm also blurs blame. The examination of how context affects language plays a crucial role in finding how blame is expressed through transitivity and, also, possession [53].

5.2 Use of corpus linguistics and critical discourse analysis to complement NLP-based computational methods like sentiment analysis

One of the aims of this study was to see how the qualitative findings of CL and CDA, in addition to statistical collocation measures, provided further nuance to the quantitative findings from using sentiment analysis.

Overall, using the sentiment trajectory from the study by Heaton et al. [5] provided a sound starting point for analysis. An example of this is the analysis conducted on 26th August, which the VADER sentiment analysis pinpoints as the date with the largest sentiment change and the lowest sentiment value in the discourse. Through using CL and CDA, it was clear that the majority of blame on this date (through active agency, agency metaphors, hyperbole, possession, assimilation and individualism) was directed towards the UK government and Boris Johnson. This was the date he declared the algorithm to be ‘mutant’. The combination of analyses may suggest that Johnson’s actions implicated him as responsible for the failure of the algorithm’s deployment, given that the previously reported sentiment scores were low and tweet authors portrayed him as an implicated social actor.
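To make concrete how a sentiment trajectory can point qualitative analysis towards particular dates, the following is a minimal sketch. It assumes each tweet already carries a date and a compound sentiment score (for example, one produced by VADER); the function names and the scores below are invented placeholders for illustration and are not values from the study.

```python
from collections import defaultdict
from statistics import mean

def daily_trajectory(scored_tweets):
    """Map each date to the mean compound score of its tweets, in date order."""
    by_day = defaultdict(list)
    for day, compound in scored_tweets:
        by_day[day].append(compound)
    return {day: mean(by_day[day]) for day in sorted(by_day)}

def dates_of_interest(trajectory):
    """Return (date of lowest mean score, date of largest day-to-day change).

    Requires a trajectory covering at least two dates.
    """
    days = list(trajectory)
    lowest = min(days, key=trajectory.get)
    changes = {
        later: abs(trajectory[later] - trajectory[earlier])
        for earlier, later in zip(days, days[1:])
    }
    largest_change = max(changes, key=changes.get)
    return lowest, largest_change

# Hypothetical (date, compound score) pairs; ISO dates sort chronologically.
scored = [
    ("2020-08-24", 0.10), ("2020-08-24", 0.00),
    ("2020-08-25", 0.05), ("2020-08-25", -0.05),
    ("2020-08-26", -0.40), ("2020-08-26", -0.30),
]

trajectory = daily_trajectory(scored)
lowest, largest_change = dates_of_interest(trajectory)
```

Dates flagged in this way only indicate where to look; identifying who is blamed on those dates, and through which constructions, still depends on the CL and CDA reading of the tweets.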

CL was used primarily to identify potential social actors of blame and uncover patterns of transitivity [39]. Combining these analytical perspectives enhances the findings beyond sentiment analysis.

There were, however, some issues with the data collection process. Upon reviewing tweets, it was clear that there were many replies to tweets that form part of the discourse. But, due to the specific parameters of the search criteria used to collect this data, these replies were not part of the dataset. This potentially limits findings, especially as CDA is underpinned by the analysis of interaction between others [53]. However, other tweets used the same news articles to provide context to their tweets. This is still a response to a main source and connects tweets to one another, therefore mitigating some of these shortcomings.

Above all, this demonstrates that the combination of CL and CDA continues to be a suitable mechanism to be deployed on Twitter discourses surrounding social and topical issues [13–15]. It also demonstrates value for a combination of qualitative and quantitative measures being used to analyse social media [65]. This echoes the findings of previous studies that have done this successfully with different qualitative methods [35, 36] and showcases that this combination can be applied to Twitter discourses too. Ultimately, using CL and CDA provided a better lens to explore urgent social ideas and, in our case, blame and social actors [55].

5.3 Limitations and future work

Several factors limit the study. As there were over 18,000 tweets, it was not possible to examine all of them in great detail [67]. Although the use of CL may have mitigated this somewhat, even more insight may be waiting to be unearthed in this dataset. As previously expressed, the search criteria used to form the initial dataset may have missed important aspects of the discourse due to their strict lexical conditions. Finally, using CDA means that we approached the analysis with our own biases and subjective perspectives, potentially calling into question the validity of the insights [64, 68].

When considering future work, there is potential to use CL and CDA to investigate related threads or themes. For example, we could enhance this exploration by investigating thematisation (which would link to the latent topics found using computational linguistics) [82] and the use of structural-functional linguistics and social-semiotics. This allows greater depth of research into the views expressed about the algorithm and could be done by multiple researchers to mitigate subjective biases. On a related note, a further suggestion may be to continue to use SAR to examine how the different social actors interact with one another.

Another suggestion is to improve the approach of ‘quantitative first, qualitative second’ into a more iterative cycle. Considering principles of iterative data science, such as the ‘epicycles of data analysis’ [83], a process could focus on the cyclical development of expectations, analysis of data, and matching of expectations to data, which repeats. This might mitigate not being able to analyse the replies excluded from the original dataset. In this model, the discourse becomes a ‘moving feast’, where NLP-based tools can then be re-deployed to capture replies to key tweets, which are further analysed using CL and CDA. Similarly, Social Network Analysis could be used with NLP and CL approaches to explore language patterns in this discourse, in a similar way to McGlashan and Hardaker [84].

6 Conclusion

The sociolinguistic findings reported and discussed in this contribution show that, through using CL and CDA, many Twitter users blamed the algorithm as a standalone social actor for the A Level results. This reaction was expressed through active agency, including agency metaphor (such as ‘that algorithm is going to screw you’) and personalisation of the algorithm (such as ‘the job of the algorithm’).

Additionally, the UK government and Ofqual, and devolved social actors within these organisations like Taylor and Johnson, were also blamed by Twitter users through similar constructions and elements of possession (such as ‘benefit from grade inflation under his algorithm’). This was seen less frequently at the start of the discourse and more frequently towards the end. This was mainly done through assimilation in earlier tweets and individualism in later tweets.

Furthermore, passive constructions could be seen for all of these social actors, with some indicating more blame than others (such as ‘the algorithm used by ofqual’). Techniques to obscure and shift blame were also seen, like backgrounding (such as ‘devastating by the uk government’) and exclusion (such as ‘he was badly advised’).

Ultimately, although it could not be determined which social actor out of the algorithm, Ofqual and the government was blamed the most, we conclude that these entities were presented as blameworthy social actors throughout the discourse. As well as providing insights into the online response to this particular event, there is potential for broader impact too. Despite the disruption of the pandemic coming to an end in the UK, this contribution provides insights into how members of the public may react to future decision-making algorithm interventions.

In addition, the methodological conclusions illustrate how CL and CDA can be used in a complementary way to NLP-based computational linguistic tools like sentiment analysis. More specifically, using quantitative data as starting points allows for more focused qualitative analysis. For example, the previously reported significant negative shifts in sentiment coincided with more authors suggesting blame was with the UK government and Boris Johnson. To ensure the application of ‘epicycles of data analysis’ creates an iterative computational and discursive methodological process, a more in-depth investigation of blame attribution and expression is needed.

Supporting information

S1 Dataset. Dataset used in the study.

Including search term, tweet ID, timestamp, number of favourites and number of retweets.

(CSV)

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

All authors are supported by the UKRI Trustworthy Autonomous Systems Hub (UKRI Grant No. EP/V00784X/1) (https://gow.epsrc.ukri.org/NGBOViewGrant.aspx?GrantRef=EP/V00784X/1). Dan Heaton is supported by the Horizon Centre for Doctoral Training at the University of Nottingham (UKRI Grant No. EP/S023305/1) (https://gow.epsrc.ukri.org/NGBOViewGrant.aspx?GrantRef=EP/S023305/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Wagner B. Liable, but not in control? Ensuring meaningful human agency in automated decision‐making systems. Policy & Internet. 2019. Mar;11(1):104–22. doi: 10.1002/poi3.198 [DOI] [Google Scholar]
  • 2. Olhede S, Wolfe PJ. Blame the algorithm?. Significance. 2020. Oct;17(5):12. doi: 10.1111/1740-9713.01441 [DOI] [Google Scholar]
  • 3. Weller K, Bruns A, Burgess J, Mahrt M, Puschmann C. Twitter and society: An introduction. Twitter and society [Digital Formations, Volume 89]. 2014:xxix–xviii. [Google Scholar]
  • 4. McCormick TH, Lee H, Cesare N, Shojaie A, Spiro ES. Using Twitter for demographic and social science research: Tools for data collection and processing. Sociological methods & research. 2017. Aug;46(3):390–421. doi: 10.1177/0049124115605339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Heaton D, Clos J, Nichele E, Fischer J. Critical reflections on three popular computational linguistic approaches to examine Twitter discourses. PeerJ Computer Science. 2023. Jan 30;9:e1211. doi: 10.7717/peerj-cs.1211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Srinivasan B, Mohan Kumar K. Flock the similar users of twitter by using latent Dirichlet allocation. Int. J. Sci. Technol. Res. 2019;8:1421–5. [Google Scholar]
  • 7. Mustaqim T, Umam K, Muslim MA. Twitter text mining for sentiment analysis on government’s response to forest fires with vader lexicon polarity detection and k-nearest neighbor algorithm. In Journal of Physics: Conference Series 2020 Jun 1 (Vol. 1567, No. 3, p. 032024). IOP Publishing.
  • 8. Aribowo AS, Khomsah S. Implementation Of Text Mining For Emotion Detection Using The Lexicon Method (Case Study: Tweets About Covid-19). Telematika: Jurnal Informatika dan Teknologi Informasi. 2021. Mar 16;18(1):49–60. doi: 10.31315/telematika.v18i1.4341 [DOI] [Google Scholar]
  • 9. Stine RA. Sentiment analysis. Annual review of statistics and its application. 2019. Mar 7;6:287–308. doi: 10.1146/annurev-statistics-030718-105242 [DOI] [Google Scholar]
  • 10. Jiang JA, Brubaker JR, Fiesler C. Understanding diverse interpretations of animated gifs. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems 2017 May 6 (pp. 1726-1732).
  • 11. Van Leeuwen T. Discourse and practice: New tools for critical discourse analysis. Oxford University Press; 2008. [Google Scholar]
  • 12. Mogashoa T. Understanding critical discourse analysis in qualitative research. International Journal of Humanities Social Sciences and Education. 2014. Jul;1(7):104–13. [Google Scholar]
  • 13.Aljarallah RS. A critical discourse analysis of twitter posts on the perspectives of women driving in Saudi Arabia. Arizona State University; 2017.
  • 14. Kreis R. refugeesnotwelcome: Anti-refugee discourse on Twitter. Discourse & Communication. 2017. Oct;11(5):498–514. doi: 10.1177/1750481317714121 [DOI] [Google Scholar]
  • 15. Sveinson K, Allison R. “Something Seriously Wrong With US Soccer”: A Critical Discourse Analysis of Consumers’ Twitter Responses to US Soccer’s Girls’ Apparel Promotion. Journal of Sport Management. 2021. Dec 21;36(5):446–58. doi: 10.1123/jsm.2021-0127 [DOI] [Google Scholar]
  • 16. Diakopoulos N. Accountability in algorithmic decision making. Communications of the ACM. 2016. Jan 25;59(2):56–62. doi: 10.1145/2844110 [DOI] [Google Scholar]
  • 17. Rosamond E. What Was to Have Happened? Tenses for a Cancelled Future. Metropolis M. 2020. Oct 12. [Google Scholar]
  • 18.Whittaker F. A-level results 2020: 8 key trends in England’s data [Internet]. Schools Week, editor. Schools Week. 13AD [cited 11AD Feb]. Available from: https://schoolsweek.co.uk/a-level-results-2020-8-key-trends-in-englands-data/
  • 19. Kelly A. A tale of two algorithms: The appeal and repeal of calculated grades systems in England and Ireland in 2020. British Educational Research Journal. 2021. Jun;47(3):725–41. doi: 10.1002/berj.3705 [DOI] [Google Scholar]
  • 20. Edwards C. Let the algorithm decide?. Communications of the ACM. 2021. May 24;64(6):21–2. doi: 10.1145/3460216 [DOI] [Google Scholar]
  • 21.BBC. A-levels and GCSEs: U-turn as teacher estimates to be used for exam results. BBC News [Internet]. 2020 Aug 17; Available from: https://www.bbc.co.uk/news/uk-53810655
  • 22.Timmins N. Schools and coronavirus [Internet]. www.instituteforgovernment.org.uk. 2021. Available from: https://www.instituteforgovernment.org.uk/sites/default/files/publications/schools-and-coronavirus.pdf
  • 23.Ofqual. Awarding GCSE, AS & A levels in summer 2020: interim report [Internet]. GOV.UK. Available from: https://www.gov.uk/government/publications/awarding-gcse-as-a-levels-in-summer-2020-interim-report
  • 24. Smith H. Algorithmic bias: should students pay the price?. AI & society. 2020. Dec;35(4):1077–8. doi: 10.1007/s00146-020-01054-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Coughlan S. A-levels and GCSEs: Boris Johnson blames “mutant algorithm” for exam fiasco. BBC News [Internet]. 2020 Aug 26; Available from: https://www.bbc.co.uk/news/education-53923279
  • 26.Bhopal K, Myers M. The impact of COVID-19 on A level students in England [Internet]. SocArXiv; 2020. Available from: osf.io/preprints/socarxiv/j2nqb
  • 27. Kolkman D. F**k the algorithm?: what the world can learn from the UK’s A-level grading fiasco. Impact of Social Sciences Blog. 2020 Aug 26.
  • 28.Hecht Y. UK’s Failed Attempt to Grade Students by an Algorithm [Internet]. Medium. 2020 [cited 2023 Mar 22]. Available from: https://pub.towardsai.net/ofqual-algorithm-5ecbe950c264?gi=6c83f561e35a
  • 29. Liu B. Sentiment analysis and subjectivity. Handbook of natural language processing. 2010. Feb;2(2010):627–66. [Google Scholar]
  • 30. Vyas V, Uma VJ. An extensive study of sentiment analysis tools and binary classification of tweets using rapid miner. Procedia Computer Science. 2018. Jan 1;125:329–35. doi: 10.1016/j.procs.2017.12.044 [DOI] [Google Scholar]
  • 31. Park J, Ciampaglia GL, Ferrara E. Style in the age of Instagram: Predicting success within the fashion industry using social media. In Proceedings of the 19th ACM Conference on computer-supported cooperative work & social computing 2016 Feb 27 (pp. 64-73).
  • 32. Sivalakshmi P, Udhaya Kumar P, Vasanth M, Srinath R, Yokesh M. “COVID-19 Vaccine–Public Sentiment Analysis Using Python’s Textblob Approach”. International journal of current research and review 2021:166–172. doi: 10.31782/IJCRR.2021.SP218 [DOI] [Google Scholar]
  • 33. Maier D, Waldherr A, Miltner P, Wiedemann G, Niekler A, Keinert A, et al. Applying LDA topic modeling in communication research: Toward a valid and reliable methodology. Communication Methods and Measures. 2018. Apr 3;12(2-3):93–118. doi: 10.1080/19312458.2018.1430754 [DOI] [Google Scholar]
  • 34. Sengupta S. What are Academic Subreddits Talking About? A Comparative Analysis of r/academia and r/gradschool. In Conference Companion Publication of the 2019 on Computer Supported Cooperative Work and Social Computing 2019 Nov 9 (pp. 357-361).
  • 35. González-Ibánez R, Muresan S, Wacholder N. Identifying sarcasm in twitter: a closer look. In Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies 2011 Jun (pp. 581-586).
  • 36. Van Atteveldt W, Van der Velden MA, Boukes M. The validity of sentiment analysis: Comparing manual annotation, crowd-coding, dictionary approaches, and machine learning algorithms. Communication Methods and Measures. 2021. Apr 3;15(2):121–40. doi: 10.1080/19312458.2020.1869198 [DOI] [Google Scholar]
  • 37. Kennedy G. An introduction to corpus linguistics. Routledge; 2014. Sep 19. [Google Scholar]
  • 38. McEnery T, Hardie A. Corpus linguistics: Method, theory and practice. Cambridge University Press; 2011. Oct 6. [Google Scholar]
  • 39. Jaworska S. Corpus approaches: Investigating linguistic patterns and meanings. In The Routledge handbook of language and media 2017. Aug 4 (pp. 93–108). Routledge. [Google Scholar]
  • 40. Baker P. Using corpora in discourse analysis. A&C Black; 2006. Jun 23. [Google Scholar]
  • 41. Mautner G. Mining large corpora for social information: The case of elderly. Language in Society. 2007. Jan;36(1):51–72. doi: 10.1017/S0047404507070030 [DOI] [Google Scholar]
  • 42. Hoey M. Grammatical creativity: A corpus perspective. Text, discourse and corpora: Theory and analysis. 2007. Nov 28:31–56. [Google Scholar]
  • 43. Tognini-Bonelli E. Theoretical overview of the evolution of corpus linguistics. The Routledge handbook of corpus linguistics. 2010. Apr 5:14–28. [Google Scholar]
  • 44. Baker P. Sociolinguistics and corpus linguistics. Edinburgh University Press; 2010. Feb 28. [Google Scholar]
  • 45. Nugraha IS, Sujatna ET, Mahdi S. CORPUS LINGUISTIC STUDY OF TWEETS USING CHARLIEHEBDO HASHTAGS. JALL (Journal of Applied Linguistics and Literacy). 2021. Feb 27;5(1):54–70. [Google Scholar]
  • 46. Kopf S, Nichele E. Es-tu Charlie?. Doing Politics: Discursivity, performativity and mediation in political discourse. 2018. Dec 15;80:211. doi: 10.1075/dapsac.80.09kop [DOI] [Google Scholar]
  • 47. Baker P, Levon E. Picking the right cherries? A comparison of corpus-based and qualitative analyses of news articles about masculinity. Discourse & Communication. 2015. Apr;9(2):221–36. doi: 10.1177/1750481314568542 [DOI] [Google Scholar]
  • 48. Rose ER. A Month of Climate Change in Australia: A Corpus-Driven Analysis of Media Discourse. Text-Based Research and Teaching: A Social Semiotic Perspective on Language in Use. 2017:37–53. doi: 10.1057/978-1-137-59849-3_3 [DOI] [Google Scholar]
  • 49. Sulalah A. The Semantic Prosody analysis of ‘increase’in Covid-19: a Corpus-Based Study. Lire Journal (Journal of Linguistics and Literature). 2020. Oct 12;4(2):237–46. doi: 10.33019/lire.v4i2.92 [DOI] [Google Scholar]
  • 50. Liimatta A. Using lengthwise scaling to compare feature frequencies across text lengths on Reddit. Corpus approaches to social media. 2020. Nov 15:111–30. doi: 10.1075/scl.98.05lii [DOI] [Google Scholar]
  • 51. Schiffrin D. Discourse markers: Language, meaning, and context. The handbook of discourse analysis. 2005. Jan 1:54–75. [Google Scholar]
  • 52. Cook G. Discourse. Oxford University Press; 1989. Jun 29. [Google Scholar]
  • 53. Johnson M, Mclean E. Discourse analysis. In: International encyclopedia of human geography. Elsevier; 2019. [Google Scholar]
  • 54. Hodges BD, Kuper A, Reeves S. Discourse analysis. Bmj. 2008. Aug 7;337. [DOI] [PubMed] [Google Scholar]
  • 55. Van Dijk TA. What is political discourse analysis. Belgian journal of linguistics. 1997. Jan 1;11(1):11–52. doi: 10.1075/bjl.11.03dij [DOI] [Google Scholar]
  • 56. Fairclough N. Critical discourse analysis and the marketization of public discourse: The universities. Discourse & society. 1993. Apr;4(2):133–68. doi: 10.1177/0957926593004002002 [DOI] [Google Scholar]
  • 57. Van Dijk TA. Discourse, Ideology and Context. Folia Linguistica. 2002;35. [Google Scholar]
  • 58. Amoussou F, Allagbe AA. Principles, theories and approaches to critical discourse analysis. International Journal on Studies in English Language and Literature. 2018. Jan;6(1):11–8. [Google Scholar]
  • 59.Leslie AM. A theory of agency. Rutgers Univ. Center for Cognitive Science; 1993.
  • 60. Richardson P, Mueller CM, Pihlaja S. Cognitive Linguistics and religious language: An introduction. Routledge; 2021. Mar 28. [Google Scholar]
  • 61. Clark WR. Agents and structures: Two views of preferences, two views of institutions. International Studies Quarterly. 1998. Jun 1;42(2):245–70. doi: 10.1111/1468-2478.00081 [DOI] [Google Scholar]
  • 62. Morris MW, Sheldon OJ, Ames DR, Young MJ. Metaphors and the market: Consequences and preconditions of agent and object metaphors in stock market commentary. Organizational behavior and human decision processes. 2007. Mar 1;102(2):174–92. doi: 10.1016/j.obhdp.2006.03.001 [DOI] [Google Scholar]
  • 63. Goatly A. Washing the brain: Metaphor and hidden ideology. John Benjamins Publishing; 2007. [Google Scholar]
  • 64. Morgan A. Discourse analysis: An overview for the neophyte researcher. Journal of Health and Social Care Improvement. 2010. May;1(1):1–7. [Google Scholar]
  • 65. Wodak R. Pragmatics and critical discourse analysis: A cross-disciplinary inquiry. Pragmatics & cognition. 2007. Jan 1;15(1):203–25. doi: 10.1075/pc.15.1.13wod [DOI] [Google Scholar]
  • 66. Bucholtz M. Reflexivity and critique in discourse analysis. Critique of anthropology. 2001. Jun;21(2):165–83. doi: 10.1177/0308275X0102100203 [DOI] [Google Scholar]
  • 67. Wetherell M, Potter J. Discourse analysis and the identification of interpretative repertoires. Analysing everyday explanation: A casebook of methods. 1988;1688183. [Google Scholar]
  • 68. Gill R. Discourse analysis. Qualitative researching with text, image and sound. 2000. Jun 22;1:172–90. [Google Scholar]
  • 69. Olson H. Quantitative “versus” qualitative research: The wrong question. In Proceedings of the Annual Conference of CAIS/Actes du congrès annuel de l’ACSI 1995.
  • 70. Weber M. Max Weber: selections in translation. Cambridge University Press; 1978. Mar 30. [Google Scholar]
  • 71. Engeström Y. Activity theory and individual and social transformation. Perspectives on activity theory. 1999. Jan 13;19(38):19–30. [Google Scholar]
  • 72. McGlashan M. Collective identity and discourse practice in the followership of the Football Lads Alliance on Twitter. Discourse & Society. 2020. May;31(3):307–28. doi: 10.1177/0957926519889128 [DOI] [Google Scholar]
  • 73. Fadanelli SB, Dal Pozzo DF, Fin CC. The representation of social actors in the tweets of Jair Messias Bolsonaro. Antares. 2020;12:74–99. doi: 10.18226/19844921.v12.n25.04 [DOI] [Google Scholar]
  • 74.Bernard T. The Discursive Representation of Social Actors in the Corporate Social Responsibility (CSR) and Integrated Annual (IA) Reports of Two South African Mining Companies. Critical Approaches to Discourse Analysis across Disciplines. 2018 Jan 2;10(1).
  • 75. Jianqiang Z. Pre-processing boosting Twitter sentiment analysis?. In 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity) 2015 Dec 19 (pp. 748-753). IEEE.
  • 76. Chong WY, Selvaretnam B, Soon LK. Natural language processing for sentiment analysis: an exploratory analysis on tweets. In 2014 4th international conference on artificial intelligence with applications in engineering and technology 2014 Dec 3 (pp. 212-217). IEEE.
  • 77.Roesslein J. tweepy Documentation. [Online] http://tweepy.readthedocs.io/en/v3. 2009;5:724.
  • 78. Kilgarriff A. ITRI-04-08 The Sketch Engine. Information Technology. 2004;105:116. [Google Scholar]
  • 79.Suchomel V. Better Web Corpora For Corpus Linguistics And NLP (Doctoral dissertation, PhD thesis, Masaryk University).
  • 80. Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau RJ. Sentiment analysis of twitter data. In Proceedings of the workshop on language in social media (LSM 2011) 2011 Jun (pp. 30-38).
  • 81.Woodfield K, Morrell G, Metzler K, Blank G, Salmons J, Finnegan J, et al. Blurring the Boundaries? New social media, new social research: Developing a network to explore the issues faced by researchers negotiating the new research landscape of online social media platforms.
  • 82. Halliday MA. Spoken and written modes of meaning. Media texts: Authors and readers. 1994;7:51–73. [Google Scholar]
  • 83.Peng RD, Matsui E. The Art of Data Science: A guide for anyone who works with Data. Skybrude consulting LLC; 2016.
  • 84.McGlashan M, Hardaker C. Twitter rape threats and the discourse of online misogyny (DOOM): using corpus-assisted community analysis (COCOA) to detect abusive online discourse communities. 2015:234-5.

Decision Letter 0

Michal Ptaszynski

9 May 2023

PONE-D-23-08581

"The Algorithm Will Screw You": Blame, Social Actors and the 2020 A Level Results Algorithm on Twitter

PLOS ONE

Dear Dr. Heaton,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jun 23 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Michal Ptaszynski, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. 

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

3. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

4. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thanks for presenting an interesting study.

The dataset of tweets is almost three years old. What is the significance of analyzing this old dataset? Do your conclusions still hold in 2023?

A better explanation and subsequent implications are needed for the drop in the VADER line around August 2020 in Figure 1.

I appreciate the use of discourse analysis.

Please explain what you mean by the agency. Do social actors always act as a social agency? A literature review on agency and situating your study against past research will be useful.

How do different social actors interact with each other, and what are the implications of their interaction for your study?

Are you sure about the text in the Acknowledgments section?

Reviewer #2: This is an interesting manuscript that touches upon a very important topic. In the following, I would like to share some thoughts the authors might consider to possibly further improve the quality of the paper.

Introduction

The authors state “there is a research gap regarding public views expressed on Twitter […]”. I agree, but why does this need to be addressed? The following text does not explain this.

The rest of the introduction is very much about the chosen methodological approach. This is understandable. However, to underline the practical relevance of the authors’ approach, I would suggest spending some more time on contextualizing the methods in the wider, practical discourse. I understand that this is what the next section does. I think it would be an alternative option to already mention more about this earlier.

Context of the 2020 A Level Algorithm

I generally like this section. However, towards the end I think the authors should consider referring back to why and how their methods are going to add more information about the discourse. Moreover, the authors state “yet limited research into how social media users reacted to the scandal, thus providing motivation for our research”. Again, I agree. But for the purpose of the paper, I think the authors should spend more time explaining why this is important and what additional insights it can provide. The next section does this; I just think that the transitions could be improved and made a bit more fluent.

Related Work

“Using CL and CDA, underpinned by SAR, it will be possible to ultimately contribute to filling the gap previously identified in the literature.” Why and how?

NLP-Based Computational Linguistics to Examine Social Media

It seems that this section already provides results from the collected data. However, this is really not clear to me. Overall, this section is difficult to read and follow.

Using Corpus Linguistics to Examine Social Media

I like this section.

Using Critical Discourse Analysis to Examine Social Media

I like this section and how the authors describe how this approach can adhere to the gaps of the other approach. Yet, the link to social media and Twitter is rather short and I would like to suggest that the authors spend more time on expanding the paragraph on page 7, lines 277-287.

Research Gap

I think Social Actor Representation (SAR) and Social Action Theory (SAT) need to be mentioned earlier in the manuscript. They seem to be crucial for the paper, but they only surface just before the Methods section. Right now, the link to the method and the explanation of why it is important are too short and sometimes consist of just one sentence.

Method

Good. I just think that Table 1 is not positioned well in this section.

Results

“Based on this first list, four potential social actors (the algorithm, Ofqual, the government and students) were investigated through the examination of collocational strength and CDA.” I am struggling a bit to consider “the algorithm” as a social actor. I think I know what the authors are referring to, but maybe they could spend some more time making an argument that this link can be made. The discussion touches upon this. But, in my opinion, it might be a bit late then.

The Algorithm

This is an interesting section. However, I think it would be really beneficial to add another Figure that shows the frequency of the collocations across time. I think it would add another very valuable layer to the description in the text. Similarly, while the next two sections are equally interesting, I wonder whether the authors could consider a more visual representation of their findings as they have a clear timeline going through the analyses.

Discussion

Good

Limitations and Future Work

I was a bit surprised not to see any reference to Social Network Analyses, which particularly in combination with CL is becoming more common in research. Additionally, while I understand the authors’ criticism of sentiment analyses, I think that it would have been interesting to more carefully combine this with their CL approach of collocations and POS.

Conclusion

The methodological conclusion is understandable. The overall conclusion of “blaming a social actor” is of course more difficult, but also less clear in the description.

**********

6. PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Jul 26;18(7):e0288662. doi: 10.1371/journal.pone.0288662.r002

Author response to Decision Letter 0


20 May 2023

Comment: The dataset of tweets is almost three years old. What is the significance of analyzing this old dataset? Do your conclusions still hold in 2023?
Page(s): 20
Action: We thank the reviewer for this comment. Whilst the disruption of the pandemic is coming to an end, this paper gives insight into how members of the public may react to future decision-making algorithm interventions. We have included this as a statement in the conclusion.

Comment: A better explanation and subsequent implications are needed for the drop in the VADER line around August 2020 in Figure 1.
Page(s): 4
Action: More detail has been added here: the drop is most likely caused by ‘mutant’ holding a negative sentiment score, combined with an increase in the number of negative responses.

Comment: Please explain what you mean by the agency. Do social actors always act as a social agency? A literature review on agency and situating your study against past research will be useful.
Page(s): 7-8
Action: Thank you for this suggestion. We define agency on page 7 of the manuscript. We have expanded on the referenced literature on page 9 relating to using agency alongside CL and CDA to explore social media discourses.

Comment: How do different social actors interact with each other, and what are the implications of their interaction for your study?
Page(s): 20
Action: We have considered this carefully and have concluded that the aim of the paper is to look at how the social actors are represented, rather than how they interact with one another. We have, instead, offered a future work suggestion for exploring this.

Comment: The authors state “there is a research gap regarding public views expressed on Twitter […]”. I agree, but why does this need to be addressed? The following text does not explain this.
Page(s): 1-2
Action: Further explanation and detail have been added after this sentence to explain how using social media data can add to the bigger picture of the public’s response to the algorithm.

Comment: The rest of the introduction is very much about the chosen methodological approach. This is understandable. However, to underline the practical relevance of the authors’ approach, I would suggest spending some more time on contextualizing the methods in the wider, practical discourse. I understand that this is what the next section does. I think it would be an alternative option to already mention more about this earlier.
Page(s): 2-3
Action: We have added a sentence to make reference to contextualising the methods in the wider, practical discourse and alluded to how these will be explored later in the paper.

Comment: However, towards the end of the ‘Context of the Algorithm’ section, I think the authors should consider referring back to why and how their methods are going to add more information about the discourse.
Page(s): 3
Action: We agree and have added a short paragraph explaining this.

Comment: Moreover, the authors state “yet limited research into how social media users reacted to the scandal, thus providing motivation for our research”. Again, I agree. But for the purpose of the paper, I think the authors should spend more time explaining why this is important and what additional insights it can provide. The next section does this; I just think that the transitions could be improved and made a bit more fluent.
Page(s): 3
Action: This has also been addressed in the additional paragraph.

Comment: “Using CL and CDA, underpinned by SAR, it will be possible to ultimately contribute to filling the gap previously identified in the literature.” Why and how?
Page(s): 4
Action: This sentence has been extended to add further clarity.

Comment: NLP-Based Computational Linguistics to Examine Social Media: It seems that this section already provides results from the collected data. However, this is really not clear to me. Overall, this section is difficult to read and follow.
Page(s): 4-5
Action: More discourse markers have been added to ensure clarity in this section.

Comment: Using Critical Discourse Analysis to Examine Social Media: I like this section and how the authors describe how this approach can adhere to the gaps of the other approach. Yet, the link to social media and Twitter is rather short and I would like to suggest that the authors spend more time on expanding the paragraph on page 7, lines 277-287.
Page(s): 7
Action: Thank you for this comment. We have expanded this section with more specific findings from these case studies to illustrate the depth of understanding that CDA can uncover.

Comment: Research Gap: I think Social Actor Representation (SAR) and Social Action Theory (SAT) need to be mentioned earlier in the manuscript. They seem to be crucial for the paper, but they only surface just before the Methods section. Right now, the link to the method and the explanation of why it is important are too short and sometimes consist of just one sentence.
Page(s): 8
Action: We thank the reviewer for this observation and suggestion. Upon reflection, we have kept the main part of SAR and SAT in this section. However, we have introduced and provided information about these concepts earlier in order to foreground their importance in our work.

Comment: Method: Good. I just think that Table 1 is not positioned well in this section.
Page(s): 10
Action: Agreed. This has been adjusted so it is in section 4.

Comment: “Based on this first list, four potential social actors (the algorithm, Ofqual, the government and students) were investigated through the examination of collocational strength and CDA.” I am struggling a bit to consider “the algorithm” as a social actor. I think I know what the authors are referring to, but maybe they could spend some more time making an argument that this link can be made. The discussion touches upon this. But, in my opinion, it might be a bit late then.
Page(s): 10
Action: Additional rationale has been included here: as all of these words are nouns that can be presented actively in a grammatical construction, they are all capable of being a social actor.

Comment: The Algorithm: This is an interesting section. However, I think it would be really beneficial to add another Figure that shows the frequency of the collocations across time. I think it would add another very valuable layer to the description in the text. Similarly, while the next two sections are equally interesting, I wonder whether the authors could consider a more visual representation of their findings as they have a clear timeline going through the analyses.
Page(s): Figures
Action: Linked scatter diagrams (grouped in three-day intervals) have been inserted to show the trajectories of the LogDice scores over time. These have been split into part A and part B to prevent cluttered figures.

Comment: Limitations and Future Work: I was a bit surprised not to see any reference to Social Network Analyses, which particularly in combination with CL is becoming more common in research. Additionally, while I understand the authors’ criticism of sentiment analyses, I think that it would have been interesting to more carefully combine this with their CL approach of collocations and POS.
Page(s): 18-19
Action: We have considered this and feel that Social Network Analyses would be an example of future work. Thus, we have included reference to this in this section.

Comment: Conclusion: The methodological conclusion is understandable. The overall conclusion of “blaming a social actor” is of course more difficult, but also less clear in the description.
Page(s): 20
Action: Discourse markers have been added to make the conclusion clearer: all three of the social actors explored are blameworthy, although it is not possible to confirm which one was blamed the most.

Attachment

Submitted filename: PLOS One Rebuttal Letter May 23.docx

Decision Letter 1

Michal Ptaszynski

2 Jul 2023

“The Algorithm Will Screw You”: Blame, Social Actors and the 2020 A Level Results Algorithm on Twitter

PONE-D-23-08581R1

Dear Dr. Heaton,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Michal Ptaszynski, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thanks for incorporating reviewer feedback. I am satisfied with the revised version of your manuscript.

Reviewer #2: I would like to thank the authors for carefully considering the feedback and making applicable adjustments where suggested. There remain some small issues (e.g. seemingly missing references, line 196). Other than that, this is a nice piece of research.

**********

7. PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

Acceptance letter

Michal Ptaszynski

5 Jul 2023

PONE-D-23-08581R1

“The Algorithm Will Screw You”: Blame, Social Actors and the 2020 A Level Results Algorithm on Twitter

Dear Dr. Heaton:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Michal Ptaszynski

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Dataset. Dataset used in the study.

    Including search term, tweet ID, timestamp, number of favourites and number of retweets.

    (CSV)

    Attachment

    Submitted filename: PLOS One Rebuttal Letter May 23.docx

    Data Availability Statement

    All relevant data are within the paper and its Supporting information files.


    Articles from PLOS ONE are provided here courtesy of PLOS
