Ethnicity data resource in population-wide health records: completeness, coverage and granularity of diversity.
Author
Pineda-Moncusí, MartaAllery, Freya
Delmestri, Antonella
Bolton, Thomas
Nolan, John
Thygesen, Johan H
Handy, Alex
Banerjee, Amitava
Denaxas, Spiros
Tomlinson, Christopher
Denniston, Alastair K
Sudlow, Cathie
Akbari, Ashley
Wood, Angela
Collins, Gary S
Petersen, Irene
Coates, Laura C
Khunti, Kamlesh
Prieto-sAlhambra, Daniel
Khalid, Sara
Publication date
2024-02-22Subject
Health services. Management
Metadata
Show full item recordAbstract
Intersectional social determinants including ethnicity are vital in health research. We curated a population-wide data resource of self-identified ethnicity data from over 60 million individuals in England primary care, linking it to hospital records. We assessed ethnicity data in terms of completeness, consistency, and granularity and found one in ten individuals do not have ethnicity information recorded in primary care. By linking to hospital records, ethnicity data were completed for 94% of individuals. By reconciling SNOMED-CT concepts and census-level categories into a consistent hierarchy, we organised more than 250 ethnicity sub-groups including and beyond "White", "Black", "Asian", "Mixed" and "Other, and found them to be distributed in proportions similar to the general population. This large observational dataset presents an algorithmic hierarchy to represent self-identified ethnicity data collected across heterogeneous healthcare settings. Accurate and easily accessible ethnicity data can lead to a better understanding of population diversity, which is important to address disparities and influence policy recommendations that can translate into better, fairer health for all.Citation
Pineda-Moncusí M, Allery F, Delmestri A, Bolton T, Nolan J, Thygesen JH, Handy A, Banerjee A, Denaxas S, Tomlinson C, Denniston AK, Sudlow C, Akbari A, Wood A, Collins GS, Petersen I, Coates LC, Khunti K, Prieto-sAlhambra D, Khalid S; CVD-COVID-UK/COVID-IMPACT Consortium. Ethnicity data resource in population-wide health records: completeness, coverage and granularity of diversity. Sci Data. 2024 Feb 22;11(1):221. doi: 10.1038/s41597-024-02958-1.Type
ArticleAdditional Links
https://www.nature.com/sdata/PMID
38388690Journal
Scientific DataPublisher
Nature Publishing Groupae974a485f413a2113503eed53cd6c53
10.1038/s41597-024-02958-1