AI in Formative Feedback: Current Challenges and Reflections Matthew
12 Slides375.52 KB
AI in Formative Feedback: Current Challenges and Reflections Matthew Voice [email protected]
The Task A 1,500 word book review (Voice, 2021) was fed to three popular large language models (LLMs): – ChatGPT 3.5 – Google Bard – Microsoft Bing Prompt sequencing designed to test and compare each LLM’s ability to provide formative feedback, e.g. on structure and language over content
Prompt Protocols 1. Initial identical prompt: ‘Giving examples from the text, how could I improve this book review? It is written for publication in an academic journal’ 2. Push to elicit examples: ‘What do you mean when you say [suggestion]? Can you provide an example of an improved version?’ 3. Ask for explanation: ‘You gave me the following feedback: [copied from earlier answer]. What makes [your rewrite] better than [my original]?’
Issue 1: Inaccuracies From Google Bard: The author could improve the neutral tone of the review by avoiding expressing personal opinions or beliefs. For example, the following sentence: "I think Murphy's book is well-written and informative." Could be improved to: "Murphy's book is well-written and informative” 75% of all Bard’s quotations from the original review were false! All LLMs got James Murphy’s first name wrong at some point, despite it appearing in the source review
Issue 2: Inconsistencies From ChatGPT 3.5: Me: You said that "it might be helpful to provide a brief conclusion to your review that summarizes your overall assessment of the book and its contribution to the field." That's something I was trying to do with the last paragraph. Was I not successful? ChatGPT: Yes, your last paragraph does summarize your overall assessment of the book and its contribution to the field. It provides a concise and informative conclusion that highlights the book's strengths and its potential impact on future research. Well done!
Issue 2: Inconsistencies From ChatGPT 3.5: ChatGPT: I apologize for the confusion. You are correct that your final paragraph does provide a concise conclusion and summary of your overall assessment of the book. I must have missed it in my previous feedback. Please disregard my earlier recommendation.
Issue 3: Quality and Relevance From Microsoft Bing: ‘The Discursive Construction of Blame is a valuable resource for scholars interested in the linguistic analysis of blame in public inquiry. It offers a comprehensive and systematic framework for identifying and describing various strategies of blame, blame avoidance, and apology in this genre of discourse. However, the book is mainly descriptive and does not engage much with critical perspectives or social implications of blame. Moreover, the book focuses on written texts and does not explore how blame is constructed in spoken or multimodal interactions.’
Issue 4: Pragmatic Nuance From ChatGPT 3.5: Original: Given this express avoidance of social or political commentary from the outset, it is somewhat surprising that the book’s final conclusions provide explicit recommendations for changes to the organisation of future public inquiries. Improved: Despite initially avoiding social or political commentary, the book's final conclusions offer explicit recommendations for enhancing the organization of future public inquiries. Explanation: The improved version conveys the same message as the original but presents it in a more straightforward and concise manner. It eliminates unnecessary phrases and clarifies the main point by directly stating that the book provides recommendations for improving future public inquiries.
An Improvement? To Google Bard: You gave me the following feedback: "Use of formal language. The author could improve the formal style of the review by avoiding informal language, such as the following: "Murphy's linguistic framework evolves throughout the book." This could be improved to: "Murphy's linguistic framework develops over the course of the book."" What makes 'evolves' a less formal language choice than 'develops'?
An Improvement? From Google Bard: ‘The word "evolves" can be interpreted in a number of different ways. It can mean that the linguistic framework changes over time, or it can mean that the linguistic framework becomes more complex over time. The word "develops" is more specific and has a more straightforward meaning. This makes it easier for the reader to understand the author's argument. Overall, the word "develops" is a more formal language choice than "evolves" because it is more literal, specific, and objective.’
Reflections Is this task reasonable? – The book review has already been edited and published. It meets the criteria for quality so what could a LLM be expected to contribute? Is this task realistic? – Are students likely to feed complete texts like this to LLMs as a process of gaining formative feedback?
Going Forward Bringing LLMs into the classroom – Teaching students to be critical of LLM output, and to understand the technology’s (present) limitations – While simultaneously acknowledging its value: addressing what they can do, and helping students to get the most from them Considering our position as teachers/researchers: – How do we understand student use of generative AI? – What can we teach them? What can they teach us?