This article is more than 4 years old

by Andy Youell

7/10/21

Understanding the differences and similarities between HE providers is an art

Andy Youell reflects on and responds to Wonkhe's run at a standard approach to describing and grouping providers of higher education.

This article is more than 4 years old

by Andy Youell

Comment

7/10/21

shutterstock_99177344 — Image: Shutterstock

2021-09 AY mug shot — Rob Lacey Photographer Editorial, PR & Corporate Photography Headshots, Conferences & Corporate Events Based in Cheltenham, Gloucestershire covering the Midlands, South West & London www.roblaceyphotographer.co.uk T: 01242 861118 M: 07802 542598

Andy Youell

HE data and systems specialist

by Mark Leach

staff

12/03/15

Andy Youell is a HE data and systems specialist

What is data?

At a time when everything seems to be data-driven, it’s worth taking a moment to reflect on what data is. We use data to describe the world – to help us understand and analyse reality. In this instance the reality is a really complex and diverse sector of HE providers who offer life-changing opportunities for millions of students every year. They work in different ways across the full spectrum of academic disciplines and they push the boundaries of knowledge through research and innovation.

The data structures and definitions that are used to describe these incredible organisations are, by comparison, very simple and they frequently fail to capture the nuances and complexities of the sector. In striving for consistency and comparability the sector-level datasets necessarily impose a simple, rigid data model onto a complex and dynamic reality.

What is analysis?

Having squeezed and shoe-horned this reality into these rigid data definitions, this endeavour attempts to make sense of this diversity through the use of algorithms. Good analysis is based on a foundation of algorithms – the “Data Science” bit – but it must go beyond that to tells us something meaningful about the real world.

Good data analysis is an art and in this case DKs choice of data, algorithms, the description of the groups and, yes those awkward manual adjustments, are the work of a Data Artisan not a mere Data Scientist.

What is quality?

Data is deceptive because its rigid, numerical tendencies lure us into thinking about quality in rigid and numerical ways. Of course those Data Science foundations can be assessed in terms of right or wrong but the quality of this analysis – and the meaning that is derived from it – cannot be quantified any more than the quality of the Mona Lisa or A Love Supreme.

When considering this way of grouping HE providers we can admire the technical approach taken, the use of (value-free) colours to describe the groups. But to reach a view as to whether these groupings are good we need to bring our collective knowledge of this rich and complex sector to the fore. Are these groupings credible and fair? Will they add value in real-life situations? Can they avoid the trap of becoming some kind of ranking-by-proxy (as if the league tables themselves are anything other than a ranking-by-proxy).

Off the fence

There are significant weaknesses in this proposal. By counting the things that are measured (and measurable) we are not necessarily measuring the things that count. Furthermore the (admirable) approach of using only publicly available data (UCAS – I’m looking at you) further restricts the richness of the analysis that is possible. I like what’s in here, but have a nagging feeling about what could have been.

And then there are the algorithms. Not only is the data blunt and often clumsy but this analysis applies rigid algorithms to allocate the data subjects into different categories. I seem to recall that some exam grades were awarded that way in 2020 and that didn’t end well. Fatuous comments about mutant algorithms aside, anybody attempting to hard-wire outcomes on the basis of rigid algorithmic rules applied to data like this is playing a very dangerous game indeed.

So maybe the question is not as simple as good or bad but a more focused question around the idea of good enough.

The standard way of grouping institutions?

This is perhaps the most complex question of all since the quality – or fitness for purpose – of this classification is ultimately dependent on the use to which it is put. If it is not used widely and consistently then it is not a standard since, contrary to the views of many ‘data standards’ initiatives, a specification only becomes a standard when it achieves widespread and consistent adoption.

This perhaps is the real nub of the issue. DKs work is thorough and technically admirable: The presentation of the groups and the detail that is set out in the data dictionary tells me that this has been done to a very high standard; The groups themselves feel right and relevant for the sector today – so I would suggest that this is good Data Art. The consultation process should further strengthen the model. There is a lot to like here.

But the question of whether this project will deliver value through creating a standard approach to grouping institutions lies not only in the hands of Wonkhe but with all those who use data-driven analysis to explore and understand the sector.

DK has set this ball rolling – it is up to all of us to run with it….or not.

post list Latest articles

Robot,Teacher,Explains,Modern,Theory.,Classroom,Interior,With,Empty,Black — Image: Shutterstock

High quality learning means developing and upskilling educators on the pedagogy of AI

by Debbie McVitty

Comment

1/12/25

ghjeiruoghjuioerhgiu — Image: Midjourney

The end of pretend – AI and the case for universities of formation

by Jim Dickinson

Long read

1/12/25

Student engagement does not work if institutions are stuck in survival mode

by Jonathan Eaton

Comment

28/11/25

Wonkhe-Scaffold-Framework — Image: Shutterstock

Skills England has a new way to talk about skills, and the sector needs to listen

by David Kernohan

Analysis

28/11/25

Higher education postcard: Peterhouse, Cambridge

by Hugh Jones

Comment

28/11/25

Wonkhe_WonkheShow_Social_Blue@2x — Image: Wonkhe

Podcast: Budget, R&D, Scotland’s tertiary bill

by Team Wonkhe

Podcasts

27/11/25

Universities now need to be much clearer about the total cost of a course

by Jim Dickinson

Analysis

27/11/25

Red,And,Blue,Pill,Choice,As,A,Person,At,A — Image: Shutterstock

The post-matrix university – trust, relevance, and the politics of plugging back in

by Amanda Broderick

Comment

27/11/25

Budget 2025 for universities and students

by Team Wonkhe

Policy Watch

26/11/25

Commuter students at station — Image: Shutterstock

The future of financial hardship support needs to be flexible

by Peter Gray

Comment

26/11/25

1 Comment

Oldest

Newest

Inline Feedbacks

View all comments

Albert Wright

4 years ago

No one said it would be easy.

What DK has done so far is to highlight the differences that already exist between institutions.

For me, the light bulb moment is the extent that we are not comparing like with like and the danger of thinking we can have “criteria for all”.

Grouping together helps greater understanding. The ability to see overlaps of one group with another at the same institution is also useful and illustrates the individuality of each institution.

I agree that how the data is used will ultimately prove its worth

What is data?

What is analysis?

What is quality?

Off the fence

The standard way of grouping institutions?

Share

Share

post list Latest articles