What did researchers have to say?
A total of 28 responses were received from researchers across the UK, (16 from England, 11 from Scotland and 1 from Wales) indicating preferences around how they currently interact with data dictionaries.
Most researchers responded that when deciding what datasets to use for a project, it is important to know:
- which variables are available
- the amount of missing data in any given variable
- the range of data in any given variable
- summary statistics
- geographic spread of data
Respondents generally preferred detailed and comprehensive catalogues but a few reported their frustration with the lack of consistency in cataloguing and would prefer better descriptions and guides along with a consistent or standard approach.
Many respondents reported that metadata catalogues they currently use are insufficient and some pointed to the lack of, or inaccuracy of, metadata, and unclear indications of where linkages are possible.
The responses highlighted potential negative impacts that poor metadata can have on a research project:
- time costs for researchers, primarily through back-and-forth with TREs to find the answers to questions left open by the metadata.
- researchers may submit vaguer proposals for their projects, as uncertainty about the data means there is uncertainty around what can be achieved.
- researchers might ask for more data/variables than they need, which runs counter to data minimisation principles (which ensure that organisations collect only the data they need, for specific, legitimate purposes, and retain it for no longer than necessary).
Respondents also highlighted the importance of avoiding static and dormant records, which is a good reminder for data catalogue providers to work with those responsible for managing the data (Data Asset Owners) to ensure that existing records are being updated whilst also expanding the range of datasets listed.
Overall, the responses to this survey validate assumptions about useful functionalities for the Connect 4 project to consider as it progresses. These researcher views will also guide RDS to consider how we might connect with other providers to widen our offering and provide researchers with a more comprehensive description of available datasets from a wide range of sources.
We would like to thank all the researchers who responded to this survey. We intend to get further feedback on the metadata we produce and will be planning more user engagement around metadata in the future. If you would be interested in helping RDS shape our services through activities like this, please sign up to our Engagement Contact List.