Cleaning and Manipulating CPRD Data

Clinical Practice Research Datalink (CPRD) collects anonymised patient data from a network of GP practices across the UK. Primary care data are linked to a range of other health related data to provide a longitudinal, representative UK population health dataset.

This session will describe and illustrate the structure and complexity of the data, discuss the strengths and limitations of the data, describe how to manipulate the data, and explain how the NDPH CPRD Team can work with researchers on potential studies.

Topics to be covered
Structure of the CPRD databases
Indicators of data quality
Identifying and deriving exposures, covariates and outcomes of interest

Learning Objectives
Understand the structure of the CPRD databases
Aware of data cleaning measures to ensure data quality
Aware of ways to identify exposures, covariates and outcomes of interest

Pre requisite – Attendance to the “An introduction to Clinical Practice Research Datalink (CPRD) data” session is recommended.

NDPH staff and students who are intending to submit or have submitted a protocol for a study using CPRD data.