Machine learning technology is transforming how institutions make sense of student feedback

Livia Scott and Debbie McVitty explore how AI can support rapid interpretation of students' qualitative comments in feedback surveys

Livia Scott is Partnerships Coordinator and Associate Editor at Wonkhe


Debbie McVitty is Editor of Wonkhe

Institutions spend a lot of time surveying students for their feedback on their learning experience, but once you have crunched the numbers the hard bit is working out the “why.”

The qualitative information institutions collect is a goldmine of insight about the sentiments and specific experiences that are driving the headline feedback numbers. When students are especially positive, it helps to know why, to spread that good practice and apply it in different learning contexts. When students score some aspect of their experience negatively, it’s critical to know the exact nature of the perceived gap, omission or injustice so that it can be fixed.

Any conscientious module leader will run their eye down the student comments in a module feedback survey – but once you start looking across modules to programme or cohort level, or to large-scale surveys like NSS, PRES or PTES, the scale of the qualitative data becomes overwhelming for the naked eye. Even the most conscientious reader will find that bias sets in, as comments that are interesting or unexpected tend to be foregrounded as having greater explanatory power over those that seem run of the mill.

Traditional coding methods for qualitative data require someone – or ideally more than one person – to manually break down comments into clauses or statements that can be coded for theme and sentiment. It’s robust, but incredibly laborious. For student survey work, where the goal might be to respond to feedback and make improvements at pace, institutions are open about the fact that this kind of robust analysis is rarely, if ever, standard practice. Especially as resources become more constrained, devoting hours to this kind of detailed methodological work is rarely a priority.

Let me blow your mind

That is where machine learning technology can genuinely change the game. Student Voice AI was founded by Stuart Grey, an academic at the University of Strathclyde (now working at the University of Glasgow), initially to help analyse student comments for large engineering courses. Working with Advance HE he was able to train the machine learning model on national PTES and PRES datasets. Now, with the algorithm further trained on NSS data, Student Voice AI offers same-day analysis of students’ NSS comments to subscribing institutions.

Put the words “AI” and “student feedback” in the same sentence and some people’s hackles will immediately rise. So Stuart spends quite a lot of time explaining how the analysis works. The term he uses to describe the version of machine learning Student Voice AI deploys is “supervised learning” – humans manually label categories in datasets and “teach” the machine about sentiment and topic. The larger the available dataset, the more examples the machine is exposed to and the more sophisticated it becomes. Through this process Student Voice AI has landed on a discrete set of comment themes and categories for taught students, and a parallel set for postgraduate research students, into which the majority of student comments consistently fall – trained on, and distinctive to, UK higher education student data. Stuart adds that the categories can and do evolve:

The categories are based on what students are saying, not what we think they might be talking about – or what we’d like them to be talking about. There could be more categories if we wanted them, but it’s about what’s digestible for a normal person.
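To make the idea of supervised learning concrete, here is a minimal, purely illustrative sketch of a supervised text classifier for theme and sentiment, using scikit-learn. It is not Student Voice AI’s actual model – the labelled examples, category names and choice of algorithm below are assumptions for the purposes of illustration.

```python
# Illustrative only: a generic supervised classifier that learns theme and
# sentiment from human-labelled comments, then codes new comments automatically.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical human-labelled training examples: (comment, theme, sentiment).
labelled = [
    ("Feedback on my essays was detailed and prompt", "assessment_feedback", "positive"),
    ("I never knew when marks would be returned", "assessment_feedback", "negative"),
    ("The lab sessions really brought the theory to life", "learning_opportunities", "positive"),
    ("Timetable changes were announced at the last minute", "organisation", "negative"),
]
comments = [c for c, _, _ in labelled]

# One model per output: theme and sentiment are learned as separate labels here.
theme_model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                            LogisticRegression(max_iter=1000))
theme_model.fit(comments, [theme for _, theme, _ in labelled])

sentiment_model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                                LogisticRegression(max_iter=1000))
sentiment_model.fit(comments, [sent for _, _, sent in labelled])

# New, unlabelled survey comments can then be coded at scale.
new_comments = ["Marking criteria were never explained"]
print(theme_model.predict(new_comments), sentiment_model.predict(new_comments))
```

The key point of the approach is that the categories come from humans labelling real comments, and the model simply generalises those judgements to new comments – the larger the labelled dataset, the better the generalisation.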

In practice that means that institutions can see a quantitative representation of their student comments, sorted by category and sentiment. You can look at student views of feedback, for example, and see the balance of positive, neutral and negative sentiment overall, segment it by department or subject area, or by year of study, then click through to the relevant comments to see what’s driving that feedback. That’s significantly different from, say, dumping your student comments into a third party generative AI platform (sharing confidential data with a third party while you’re at it) and asking it to summarise. There’s value in the time and effort saved, but also in the removal of individual personal bias, and in the potential for aggregation and segmentation for different stakeholders in the system. And it also becomes possible to compare student qualitative feedback across institutions.
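Once every comment carries a coded theme and sentiment, the quantitative view described above is essentially an aggregation exercise. The short pandas sketch below shows the general idea; the column names and sample rows are invented for illustration, not drawn from any institution’s data.

```python
# Illustrative only: counting coded comments by theme and sentiment, per department.
import pandas as pd

coded = pd.DataFrame([
    {"department": "Engineering", "theme": "assessment_feedback", "sentiment": "negative"},
    {"department": "Engineering", "theme": "assessment_feedback", "sentiment": "positive"},
    {"department": "History",     "theme": "organisation",        "sentiment": "negative"},
    {"department": "History",     "theme": "assessment_feedback", "sentiment": "neutral"},
])

# Balance of positive/neutral/negative sentiment for each theme, segmented by department.
summary = (coded
           .groupby(["department", "theme", "sentiment"])
           .size()
           .unstack("sentiment", fill_value=0))
print(summary)
```

The same table can be re-cut for different audiences – by module, programme, year of study or professional service area – which is what makes the aggregated view useful to stakeholders beyond the module leader.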

Now, Student Voice AI is partnering with student insight platform evasys to bring machine learning technology to qualitative data collected via the evasys platform. And evasys and Student Voice AI have been commissioned by Advance HE to code and analyse open comments from the 2025 PRES and PTES surveys – creating opportunities to drill down into a national dataset that can be segmented by subject discipline and theme as well as by institution.

Bruce Johnson, managing director at evasys, is enthused about the potential for the technology to drive culture change in how student feedback is used to inform insight and action across institutions:

When you’re thinking about how to create actionable insight from survey data the key question is, to whom? Is it to a module leader? Is it to a programme director of a collection of modules? Is it to a head of department or a pro vice chancellor or the planning or quality teams? All of these are completely different stakeholders who need different ways of looking at the data. And it’s also about how the data is presented – most of my customers want, not only quality of insight, but the ability to harvest that in a visually engaging way.

“Coming from higher education it seems obvious to me that different stakeholders have very different uses for student feedback data,” says Stuart Grey. “Those teaching at the coalface are interested in student engagement; at the strategic level the interest is in trends and sentiment analysis; and there are also various stakeholder groups in professional services who never normally get to see this stuff, but we can generate the reports that show them what students are saying about their area. Frequently the data tells them something they knew anyway, but it gives them the ammunition to be able to make change.”

The results are in

Duncan Berryman, student surveys officer at Queen’s University Belfast, sums up the value of AI analysis for his small team: “It makes our life a lot easier, and the schools get the data and trends quicker.” Previously schools had been supplied with Excel spreadsheets – and his team were spending a lot of time explaining and working through with colleagues how to make sense of the data in those spreadsheets. Being able to see a straightforward visualisation of student sentiment on the various themes means that, as Duncan observes rather wryly, “if change isn’t happening it’s not just because people don’t know what student surveys are saying.”

Parama Chaudhury, professor of economics and pro vice provost education (student academic experience) at University College London, explains where qualitative data analysis sits in the wider ecosystem for quality enhancement of teaching and learning. In her view, for enhancement purposes, comparing your quantitative student feedback scores to those of another department is not particularly useful – essentially it’s comparing apples with oranges. Yet the apparent ease of comparability of quantitative data, compared with the sense of overwhelm at the volume and complexity of student comments, can mean that people spend time trying to explain the numerical differences, rather than mining the qualitative data for more robust and actionable explanations that can give context to their own scores.

It’s not that people weren’t working hard on enhancement, in other words, but they didn’t always have the best possible information to guide that work. “When I came into this role quite a lot of people were saying ‘we don’t understand why the qualitative data is telling us this, we’ve done all these things,’” says Parama. “I’ve been in the sector a long time and have received my share of summaries of module evaluations and have always questioned those summaries because it’s just someone’s ‘read.’ Having that really objective view, from a well-trained algorithm, makes a difference.”

UCL has piloted two-page summaries of student comments for specific departments this academic year, and plans to roll out a version for every department this summer. The data is not assessed in a vacuum; it forms part of the wider institutional quality assurance and enhancement processes, which include data from a range of different perspectives on areas for development. Encouragingly, so far the data from students is consistent with what has emerged from internal reviews, giving the departments that have had the opportunity to engage with it greater confidence in their processes and action plans.

None of this stops anyone from going and looking at specific student comments, sense-checking the algorithm’s analysis and/or triangulating against other data. At the University of Edinburgh, head of academic planning Marianne Brown says that the value of the AI analysis is in the speed of turnaround – the institution carries out a manual review process to make sure that any unexpected comments are picked up. But being able to share the headline insight at pace (in this case via a Power BI interface) means that leaders receive the feedback while the information is still fresh, and the window to effect change is longer than it would be if time had been lost to manual coding.

The University of Edinburgh is known for its cutting-edge AI research, and boasts the Edinburgh (access to) Language Models (ELM), a platform that gives staff and students access to generative AI tools without sharing data with third parties, keeping all user data onsite and secured. Marianne is clear that even a closed system like ELM is not appropriate for unfettered student comment analysis. Generative AI platforms offer the illusion of a thematic analysis, but the result is far from robust, because generative AI operates through sophisticated guesswork rather than analysis of the implications of actual data. “Being able to put responses from NSS or our internal student survey into ELM to give summaries was great, until you started to interrogate those summaries. Robust validation of any output is still required,” says Marianne. Similarly, Duncan Berryman observes: “If you asked a gen-AI tool to show you the comments related to the themes it had picked out, it would not refer back to actual comments. Or it would have pulled this supposed common theme from just one comment.”

The holy grail of student survey practice is creating a virtuous circle: student engagement in feedback creates actionable data, which leads to education enhancement, and students gain confidence that the process is authentic and are further motivated to share their feedback. In that quest, AI, deployed appropriately, can be an institutional ally and resource-multiplier, giving fast and robust access to aggregated student views and opinions. “The end result should be to make teaching and learning better,” says Stuart Grey. “And hopefully what we’re doing is saving time on the manual boring part, and freeing up time to make real change.”

This article is published in association with evasys.
