As recently as 11 February, a HESA blog assured us that Data Futures was on track – indeed, “very much data present” was the title. Today, with the postponement of the programme, it seems another change of tense is required – HESA Future imperfect, if you will.
Student data for 2019-20 will now be collected using the existing HESA Data Collection system on a retrospective basis. Modified versions of the 2018/19 Student and AP Student specifications will be made available in due course – as we are already fast approaching the 2019-20 academic year, we hope this will be sooner rather than later.
The future sound of HESA
Data Futures aims to address two main concerns about current data – combining three similar but distinct returns (the main student return, the alternative provider student return, and the initial teacher training (ITT) return) into one, and allowing data to be submitted much closer to the date it was collected, rather than in the current year-in-retrospect model. This latter attribute has been the focus of much excitement at the Office for Students, as it would allow for the dashboard-style “real-time” regulatory data that underpins the regulatory framework.
Well, it’s not quite dashboard-like – we’re still talking about submissions in each of three reference periods through an academic year. Though data submissions could occur as close as needed to the business events that generate the data, there would be three sign-off points where the institution would certify that the data they held and the data HESA held were in concordance. (The tricky first year, 2019-20, would not have seen final sign-off until the summer, to allow for further testing to take place.)
This short description does not in any way get across the huge changes to working and liaison practices that such a shift requires. The effect on student records teams is clear, but there is also an impact on the data infrastructure required to support in-year returns. Institutional IT departments and software vendors have been scrambling to update platforms, and will be grateful for this breathing space.
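For the technically minded, here is a minimal sketch of the reference-period model described above, written in Python purely for illustration – the period names, dates and field names are invented for this example, not taken from the HESA specification. It shows the two ideas at the heart of the design: records submitted in-year, close to the business event, and a sign-off check at the end of each period confirming that the data the provider holds matches the data HESA holds.

```python
from dataclasses import dataclass, field
from datetime import date

# Hypothetical reference periods for 2019-20 - the real specification defines
# its own dates and entity model; these are purely illustrative.
REFERENCE_PERIODS = [
    ("RP1", date(2019, 8, 1), date(2019, 12, 31)),
    ("RP2", date(2020, 1, 1), date(2020, 4, 30)),
    ("RP3", date(2020, 5, 1), date(2020, 7, 31)),
]

@dataclass
class ReferencePeriod:
    name: str
    start: date
    end: date
    provider_records: list = field(default_factory=list)  # what the institution holds
    hesa_records: list = field(default_factory=list)      # what HESA holds

    def submit(self, record: dict) -> None:
        """Send a record in-year, as close to the business event as the provider likes."""
        self.provider_records.append(record)
        self.hesa_records.append(record)  # in reality: validation, queries, resubmission

    def sign_off(self) -> bool:
        """At the end of the period, certify that provider-held and HESA-held data agree."""
        return self.provider_records == self.hesa_records

periods = [ReferencePeriod(name, start, end) for name, start, end in REFERENCE_PERIODS]
periods[0].submit({"student": "ABC123", "event": "enrolment", "date": "2019-09-23"})
print(all(p.sign_off() for p in periods))  # True - both parties hold the same data
```

The sketch glosses over the hard part – validation, queries and resubmission between provider and HESA – which is exactly where the data quality problems described below have surfaced.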
In essence, this is an old problem. Any change to data collection practices or systems will necessarily include a temporary reduction in data quality – HESA had been assuming that this temporary reduction would be addressed during the alpha (last year) and beta (this year) test phases. Clearly this has not happened – the key question now is whether an extra year of development will be able to bring quality back up, or whether there is a more fundamental issue.
Back to the futures
The Office for Students’ Richard Puttock is sanguine about the delay:
The Data Futures programme board, which includes a broad range of sector representatives, unanimously recommended that the HESA board delay full implementation of this project. This is a complex project, and it became clear that it was not practical to proceed on the original timeline. While we recognise this decision will cause some concern in the sector, it is crucial that our regulatory processes are underpinned by high quality data. We can’t rush this project, or compromise on the quality of data which we use. We want to understand how best to simplify sector-wide data collection in a way which ensures we have access to high quality data and minimises burden on individual providers.
He’s right that good data, like good food, can’t be rushed – but the decision by the programme board (on which he sits) meshes poorly with the way OfS has been talking about their use of data over the last year.
The Board paper setting out the decision to approve HESA as the Designated Data Body (DDB) notes:
Our judgement is that Data Futures is a suitable operational design for data collection to meet the OfS’s requirements in terms of technology, capacity and capability.
It does not make a similar claim for HESA’s current processes.
The OfS’s approach to institutional monitoring relies on “lead indicators”. These, in the words of the framework (s128), can be:
Indicators constructed from data and information flows, in as near real time as possible, that will assist the OfS to identify trends and anticipate future events.
Stating the obvious, the current HESA system does not offer data in anything like real time. So these components will not be available for monitoring, meaning that alternative approaches will need to be identified. In fairness, Data Futures provided for submissions during three “reference periods”, so this was hardly “real-time” either. But annual data will not offer OfS the ability to respond as rapidly, meaning that expectations at OfS (and DfE, for that matter) need to be managed.
Data Futures data was also intended for use in funding and performance indicator development by OfS, directly replacing current student datasets.
OfS involvement in the decision, and the absence of a suitable alternative DDB (nobody else even applied to be considered for the role), suggest that HESA will not suffer as a result. I should be clear that the decision was very obviously made for sound data quality reasons, and at the recommendation of an independent board.
Days of futures past
So, what will happen next? Clearly HESA (and Civica Digital, as Data Futures delivery partner) will be working closely with institutional contacts and the programme board to ensure that the data quality issues raised can be addressed in the extra year they have been granted.
In practice this will mean re-opening some of the discussions about data definitions that had been considered mostly complete following the release of version 2.0.0 of the guidance. We now sit at version 2.2.0 (as of 7 February) – the middle digit reflects two fairly major, non-backwards-compatible changes since the launch of 2.0.0 in late 2018.
As with all such large data projects, you can only really identify the problems with the system when actual data starts being fed in. Providers participating in the alpha and beta test phases have been doing just that, starting in alpha with data relating to mainstream students and moving on to the edge cases as HESA moved towards beta.
Beta only kicked off, very much delayed, earlier this month – I’d be fascinated to know why the programme board signed off the transition to beta (a stop/go point, as I’d understood it) before pulling the plug.
It’s been a busy time for everyone involved, and emotions will be mixed as the pressure of readying processes and systems for September has been replaced by another year of development and testing. It’s a different kind of pressure – and the need to patch up often-creaking student records systems for an extra year of the old model, having already got some way down the procurement path for a new Futures-compliant system (which, admittedly, would not have been entirely ready in time anyway), will not exactly delight institutional IT teams.
A culture often develops around such mammoth data-related change processes – there is a matched set of goodwill and well-meaning sarcasm towards Data Futures among the HESA people (one of the most consistently delightful subcultures in UK HE). The work to ensure the data is accurate extends far beyond Cheltenham, and we should not forget that future improvements to data quality will come from the efforts of those in institutions working on the pre-release versions – repeatedly adding data, trying to get it to validate, and liaising with HESA.
One complaint that has shown up on occasion is that Data Futures was being bent to the will of OfS – that the needs of the regulator had begun to overshadow the wider needs of the sector. As it is the sector that collects the data and the sector that uses it, it should be the sector, not the regulator, that continues to shape the process.
I’m not sure HESA’s expectations were that the reduction in data quality would be resolved in the alpha and beta phases. In all the discussions I’ve been involved in, it was very much acknowledged that there would be a reduction in data quality in the first year, but that this was a necessary sacrifice for the greater good. OfS, however, are not willing to accept this, nor to work with the sector to identify how best to mitigate it.
Spot on, Mick. I am extremely dubious that institutions can drive data quality up to the levels of the current return. The maturity/processes/culture/focus required to clean data at the point of capture, and to maintain data quality through the many paths it takes through a complex university, are not there yet. Another year will help – if resources are not shifted to other projects – but it’s hard to see how quality will be comparable when we move from a six-month QA tail to a six-week one.
Anyway, that’s an excellent article, David. I share the concern that the HEDIIP vision has been – at best – massively diluted. It feels as if the needs of one customer are more important than those of the wider sector. Maybe that was inevitable. I hope this extra time allows bodies such as the Data Landscape Steering Group (DLSG) to maintain at least some part of that vision.
I think that, as we have been discussing on Twitter, Alex’s view that the HEDIIP vision has been overtaken by events – primarily the creation of the OfS – is spot on. It is clear that the original vision of Data Futures is not one shared by the OfS. There is certainly the desire for more real-time data, but they seem to be finding it difficult to move away from the old HEFCE approach, based around funding (why are we still returning FUNDCOMP?).
If this is to progress then I think there needs to be a clear brief from the OfS as to what they want and why they want it. Even with the current specification it is hard to see the purpose of some of what is being collected, or why it is being collected in the way it is – fee information is a case in point. It would be far better if we were clear what they wanted to use it for, as the sector could then advise on the best ways of collecting it.
Data Futures was always going to involve institutions in significant changes to business processes. For these to be put in place, even for 2020/21, a clear and stable specification is needed by June 2019 if it is to be delivered.
A good article, thanks David. One of the things perhaps understated is that the move to Data Futures was not simply about the timeliness of data. The data model is enormously more complicated than under the current HESA Student return, both in terms of the logical model and the amount of data collected. Institutions have been forced to change business processes in order to deliver data in the way HESA required, and the extent of that work has been a significant factor in the struggle to be ready for 2019/20.