Connected Yorkshire

Our Data

Connected West Yorkshire: whole-system, lifecourse data to promote good health and wellbeing

Understanding the databases curated by Connected West Yorkshire 

It is important to distinguish between the two main datasets available, as they have different approval requirements and data coverage. 

Connected Yorkshire (CY) provides linked records for the full population of service users accessing primary care, secondary care, community care, and other services (excluding education). The time coverage varies by data provider but typically spans from the earliest digitally held record to the most recent extract. This dataset contains records from 1.2 million service users. Connected Yorkshire has Research Ethics Committee (REC) approval to provide de-identified data for research purposes.

Connected Bradford (CB) links education records with records from service users accessing primary care, secondary care, community care, and other services. This dataset contains education records from 357,317 linked service users born after January 1988. In CB, some sensitive health data (for example, sexual health codes) have been removed; a codelist is available on request. The current data extract includes education records up to December 2025. Connected Bradford has both REC approval and Confidentiality Advisory Group (CAG) approval. Data linkage between health and education data is undertaken with ‘s251 support’ from the HRA following CAG advice.

Section  Dataset  Description  Population Count  Last Updated   Date Range 
Primary care  Primary care  The Primary care dataset contains General Practice data from all GP practices within the Bradford and Airedale geography; it also partially covers some practices operating in Calderdale (limited). All data captured at a practice is included: clinical and administrative events, appointments, visits, medications, immunisations, etc. The data is coded in CTV3 and SNOMED format. Typically, cohorts can be identified using a demographic attribute linked to a clinical condition.  1,252,943  24/04/2026  Start of record to 07/04/2026
Secondary care  Data for Calderdale and Airedale is in Secondary Uses Service (SUS+) format. The format can be found here. For Airedale and Calderdale we have ECDS (Emergency Care Data Set), APC (Admitted Patient Care) and OP (Outpatients). These are in ICD10 format and Emergency Care format. For Bradford Royal Infirmary we receive a direct feed from the hospital, which includes a number of tables (including three in SUS+ format) plus maternity, lab results and other data sources (contact us for details).
Bradford  1,215,114  05/03/2026  01/04/2021 to 30/07/2025 
Calderdale  476,167  26/11/2025  01/??/2017 to 30/09/2023 
 Airedale  439,910  19/02/2025  01/04/2015 to 09/02/2025 
  Accident and Emergency  For the three hospitals referenced, we receive ECDS (Emergency Care Data Set) in SUS+ format.
Bradford  270,003  05/03/2026  01/04/2021 to 30/07/2025 
Calderdale  332,034  26/11/2025  01/17/2017 to 30/09/2023 
Airedale  300,694  19/02/2025  01/04/2015 to 09/02/2025 
  Adult Social Care  This contains Assessments, Contacts and Services offered, at a very high level. None of this is coded. Additional information on this can be found on GitHub.  93,457  17/09/2025  01/01/2017 to 30/09/2024
  Children’s social care  This data comes from historical council data and new data from the Children’s Trust. It includes Children in need, Children   4,5867  27/08/2025  03/04/2017 to 09/06/2021
  Yorkshire Ambulance Service  This wide-ranging dataset includes 111, 999, patient transfer service and data from the YAS patient record system, covering the whole of Yorkshire and the Humber.  5,121,282  07/05/2026  01/04/2014 to 08/02/2026
  National Child Measurement Programme  This dataset contains the height and weight of children measured over a number of years.  168,351  01/01/2024  01/02/2022 to 31/10/2023
  Dental activity  Captures data in the FP17 data format. It covers the full population of Leeds and Bradford postcodes.  902,843  06/06/2024  17/07/2006 to 22/03/2023
  Death Certificates  Currently restricted to BTHFT projects only. Contains death certificate data, in ICD10 format and as text, for the whole of Yorkshire.  493,665  10/03/2026  11/08/1969 to 10/02/2026
Education data    This dataset contains education records (school registration, attendance and attainment data) from linked service users born after January 1988. Contact the research team for the detailed data structure. Please note there are additional restrictions on reporting school-level and individual-level education data in Connected Bradford projects.  357,317  14/04/2026  01/09/1995 to 31/07/2025
Property level data  There is a series of datasets related to property level data. A single detailed dataset contains green space data, air quality, property attributes, distance to fast food outlets, etc. within a 300 m circular buffer around the home address, using 2018 data from the Department for Environment, Food and Rural Affairs. There are also specific datasets relating to the items below.  No. of properties (approx.)  Last updated 06/01/2023  As at 06/01/2023
  Green space  Contains environmental data related to distances to green spaces and open areas from properties (pseudonymised UPRNs).  238,504  06/01/2023
  Air quality  Contains environmental data related to air quality and other factors linked to pseudonymised UPRNs.  n/a  06/01/2023
    There is a separate dataset for air quality monitors within Bradford District, covering monitors at:

  1. TongStreet
  2. MannighamLane
  3. ThorntonRoad
  4. RooleyLane
  5. Keighley
  6. TreadwellMills

    2018 to 2022
    A further dataset details the distance from a pseudonymised UPRN to the clean air zone in Bradford.  1,539,099 properties  12/11/2025
  Housing attributes  As above but related to housing attributes such as property type, glazing type, energy rating, etc., linked to pseudonymised UPRNs.  n/a  06/01/2023
Area level data           
  Crime  Crime data related to the LSOA and postcode of a crime. This describes the actual offence and the actions taken. It is very high level and individuals cannot be identified.  374 LSOAs  14/03/2023  2020/04 to 2022/03
  Benefits  Lists, by postcode/LSOA, the number of children and adults in a property and whether any disability income is received. Presence in this dataset indicates that other (unspecified) benefits are being received.  52,897 households  14/03/2024
  LSOA  This is a dataset derived from the Primary Care Address History table that links a property to its LSOA. Please note there is NO property identification (number, street, town, etc.); we have partial postcode and LSOA, linked to pseudonymised UPRNs.
  Deprivation  There is a dataset with the Index of Multiple Deprivation (IMD) that can be linked to LSOAs. Note a new 2025 release has not yet been    2019  Find out more here
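
As an illustration of the cohort approach mentioned for Primary care (a demographic attribute linked to a clinical condition), and of linking a cohort to IMD via LSOA, a minimal Python sketch follows. The record layouts, field names, codes and IMD values are all hypothetical placeholders, not the actual Connected Bradford schema; contact the research team for the real data structure.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Person:
    # Hypothetical demographic record; field names are assumptions.
    person_id: str
    year_of_birth: int
    lsoa: str


@dataclass(frozen=True)
class ClinicalEvent:
    # Hypothetical coded primary care event; field names are assumptions.
    person_id: str
    code: str
    event_date: date


def identify_cohort(people, events, codelist, born_from, born_to):
    """Return sorted IDs of people born in [born_from, born_to]
    who have at least one event whose code is in the codelist."""
    coded_ids = {e.person_id for e in events if e.code in codelist}
    return sorted(
        p.person_id
        for p in people
        if born_from <= p.year_of_birth <= born_to and p.person_id in coded_ids
    )


def attach_imd_decile(cohort_ids, people, imd_by_lsoa):
    """Map each cohort member to the IMD decile of their LSOA
    (None when the LSOA has no entry in the lookup)."""
    lsoa_by_id = {p.person_id: p.lsoa for p in people}
    return {pid: imd_by_lsoa.get(lsoa_by_id[pid]) for pid in cohort_ids}


# Placeholder data: the codes and LSOA names below are invented for illustration.
people = [
    Person("p1", 1990, "LSOA_A"),
    Person("p2", 1975, "LSOA_B"),
    Person("p3", 1995, "LSOA_C"),
]
events = [
    ClinicalEvent("p1", "CODE_X", date(2020, 1, 5)),
    ClinicalEvent("p2", "CODE_X", date(2019, 3, 2)),
    ClinicalEvent("p3", "CODE_Z", date(2021, 6, 1)),
]
codelist = {"CODE_X", "CODE_Y"}
imd_by_lsoa = {"LSOA_A": 2, "LSOA_B": 7}

cohort = identify_cohort(people, events, codelist, born_from=1988, born_to=2000)
deciles = attach_imd_decile(cohort, people, imd_by_lsoa)
print(cohort, deciles)
```

In this sketch the demographic filter and the clinical codelist are applied independently and then intersected, so either criterion can be swapped (for example, a different codelist) without changing the other.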