Skip to content

Connect 4

Learn how the Connect 4 project aims to improve data access between the four UK nations

About Connect 4

Connect 4 is a one-year project investigating the federation of data services across the UK’s four national Trusted Research Environments (TRE), focusing on the Integrated Data Service (IDS), which is a major cross-government project for which the Office for National Statistics (ONS) is the delivery lead, and the Scottish National Safe Haven. An element of the project is developing rich metadata for the discovery of federated sensitive data.

The project started in April 2024 and runs until March 2025. Connect 4 is funded by the Economic and Social Research Council (ESRC), under the ‘Future data services: pilots to enhance data services for the future’ programme (ES/Z502972/1). It is a multi-partner project led by EPCC at the University of Edinburgh with Research Data Scotland (RDS), the ONS, Public Health Scotland and National Records of Scotland.

Our work and impact

Much data kept by public organisations such as Government departments contain sensitive data about UK citizens and businesses. Since the Digital Economy Act 2017, progress has been made to enable accredited researchers to access this data to perform studies that are in the public benefit. Accredited researchers can submit proposals to a Trusted Research Environment (TRE) to gain access to data held by that organisation. This approach ensures that sensitive data is kept safe and secure.

The challenge for researchers is to answer research questions where datasets must be combined before analysis, and where two or more datasets are each owned by different TREs. The following two barriers were identified:

  • A researcher cannot see these datasets and - based on the available metadata for these datasets - cannot assess whether a combination is feasible or will lead to a sensible analysis
  • The policies that govern access are specific between the data owners and the TRE

Connect 4 aims to overcome these barriers, and there are three work packages involved to explore and develop solutions. RDS is leading on one of these work packages, and elements of another, in partnership and close collaboration with ONS. Through the first work package, we are working to agree  on a roadmap of the improvements and arrangements required to allow researchers to discover, apply for and analyse data held at UK national TREs through a single front door.

The second work package looks at information governance and understanding the standards, policies and procedures already in place to identify areas for alignment, and a shared service model that shapes the delivery of the changes outlined in the roadmap.

Other project work, led by EPCC, involves research requiring data access from ONS and the Scottish National Safe Haven, to carry out metadata enhancement work. This element of the second work package aims to develop software that automatically creates rich metadata, which will help researchers understand whether they could perform an analysis if they had access to the data, and therefore enable researchers to decide if the investment to combine datasets is worthwhile because they can determine beforehand if the necessary data are present.

There will also be a trial use case carried out in the third work package, aiming to inform the other Connect 4 work packages and lay the foundations for ongoing federation across UK national TREs. RDS is not directly involved in this workstream, but we work closely with EPCC and will provide suggestions and input as appropriate.

Since April 2024, work has begun on all three work packages. Work on the roadmap is underway with engagement from partners and stakeholders. A questionnaire has been developed and sent out to an audience of researchers to gather input to help understand researcher’s requirements with regards to metadata, and applications have been started for data access for the metadata and study tasks.

Jen Muir, Senior Data Analyst at RDS, said: “The roadmap will be the first step in delivering federated data access across the UK’s four nations. RDS is leading on describing a shared service model that the UK national TREs could adopt to provide a sustainable route for federated data access for research, and an information governance (IG) framework to support and enable the model with recommendations for policies across the TREs. These work packages will help provide clarity for researchers working across the system and requesting safe and secure access to data.”

Kostas Kavoussanakis, EPCC TRE Service Manager, said: "Connect 4 is an ambitious endeavour towards frictionless, UK-wide, impactful data-driven research and innovation. It is the seed for what needs to become co-ordinated action across the UK.”

Carmen Amador, Senior Assistant Statistician at NRS, said: “We’re delighted to be involved with the Connect 4 project. Making Scottish data available for research is really important, and this project is a step towards being able to connect that data to the rest of the UK.”

PHS Head of Data and Modelling Services, Carole Morris, said: “The creation of a single front door for accessing data held in TREs across the four nations would enable researchers to generate deeper insights to benefit people living in Scotland. The Connect 4 project is a valuable milestone towards this, and I’m excited to see how the project progresses.”

Emily Symmons, Head of Relationships, IDS, said: “I’m proud to be working on the Connect 4 project towards a standardised approach to accessing data across the UK. We will work across each of the project’s workstreams, and we’re pleased to be collaborating with the other organisations towards this shared goal.”

Related content