Concordance in the recording of stroke across UK primary and secondary care datasets: a population-based cohort study

Ann Morgan; Sarah-Jo Sinnott; Liam Smeeth; Caroline Minassian; Jennifer Quint

doi:10.3399/BJGPO.2020.0117

Abstract

Background Previous work has demonstrated that the recording of acute health outcomes, such as myocardial infarction (MI), may be suboptimal in primary healthcare databases.

Aim To assess the completeness and accuracy of the recording of stroke in UK primary care.

Design & setting A population-based longitudinal cohort study.

Method Cases of stroke were identified separately in Clinical Practice Research Datalink (CPRD) primary care records and linked Hospital Episode Statistics (HES). The recording of events in the same patient across the two datasets was compared. The reliability of strategies to identify fatal strokes in primary care and hospital records was also assessed.

Results Of the 75 674 stroke events that were identified in either CPRD or HES data during the period of the study, 54 929 (72.6%) were recorded in CPRD and 51 013 (67.4%) were recorded in HES. Two-fifths (n = 30 268) of all recorded strokes were found in both datasets (allowing for a time window of 120 days). Among these 'matched' strokes the subtype was recorded accurately in approximately 75% of CPRD records (compared with coding in HES); however, 43.5% of ischaemic strokes in HES were coded as 'non-specific' strokes in CPRD data. Furthermore, 48.2% had same-day recordings, and 56.2% were date-matched within ±1 day.

Conclusion The completeness and accuracy of stroke recording is improved by the use of linked hospital and primary care records. For studies that have a time-sensitive research question, the use of linked, as opposed to stand-alone, CPRD data is strongly recommended.

How this fits in

There is an increasing focus on the use of data from routine healthcare settings to support not only clinical risk prediction, but also pragmatic clinical trials and regulatory decision making. However, in any of these research scenarios, the successful use of such data hinges on the ability to accurately identify key outcomes and prevalent comorbidities, such as stroke. This study demonstrates that reliance on a single dataset to identify stroke is likely to underestimate cases of stroke, and, for this reason, the use of linked health data is advocated, especially for research in which the timing of stroke is critical. Linkage to stroke audit data, as a means of improving knowledge of stroke epidemiology in the UK, is also recommended as a desirable long-term goal.

Introduction

Stroke is the UK’s fourth most common cause of death¹ and a major cause of disability.² Furthermore, with costs to society totalling some £23 billion per year,³ stroke remains a major focus of cardiovascular research as academics, clinicians, and policymakers endeavour to better understand its epidemiology and aetiology, and so reduce its burden.

Routinely-collected data, including electronic health records (EHRs) from primary care and administrative data from hospitals, are frequently used to study stroke.^4,5 Indeed, such data are becoming increasingly important for regulatory decision making concerning the effectiveness and cardiovascular safety of drugs, especially since traditional clinical trials are expensive, limited in their generalisability, and require long follow-up times to accrue major events such as stroke.^6,7 Other uses of EHR data extend to clinical risk prediction^8,9 and interventional research such as pragmatic trials.¹⁰ The validity of any research based on real-world data is, however, dependent on how well researchers can identify outcomes such as stroke. Several studies have revealed discrepancies between data sources in the recording of certain health outcomes, in particular acute outcomes, and have noted that reliance on just one data source risks missing a substantial proportion of cases.^11,12

Only two studies — one on ischaemic and the other on haemorrhagic stroke — have examined the reliability of stroke recording in UK primary care databases.^13,14 Both were conducted in the same dataset (The Health Improvement Network [THIN]) and both were limited in that, first, they recognised only hospitalised strokes, and second, they validated the diagnostic accuracy of Read-coded primary care data from within the same data source (using different data fields) rather than against another data source (for example, hospital data). Furthermore, this previous work predates the introduction of the Quality and Outcomes Framework (QOF), an incentivisation system that will have impacted on the quality of the recording of stroke in primary care data post-2004.¹⁵

The aim of this study was to determine how well strokes are being recorded in primary care data by comparing the recording of stroke events in the same patient across their linked primary and secondary care records. The accuracy of that recording was assessed in terms of: completeness (is the event recorded in both databases?); timing (do the event dates match?); and diagnostic accuracy (is the stroke subtype correctly specified?). How reliably mortality associated with stroke can be determined in primary care and hospital data was also examined by cross-referencing against Office for National Statistics (ONS) cause-specific mortality data.

Method

Data sources

The CPRD is a repository of de-identified electronic medical records from a nationally representative set of UK general practices. It holds research-quality data on demographics, health-related behaviours, test results, diagnoses, referrals, and prescriptions for >11 million people.¹⁶ It is one of the largest databases of longitudinal medical records from primary care globally and has been extensively used in epidemiological research.¹⁷

For this study, CPRD data linked to both HES and ONS mortality data were used, a linkage that is possible for approximately 50% of practices contributing to CPRD, all located in England. The HES database provides data on the primary reason for a hospital admission, as well as other diagnoses and procedures carried out during that admission. For the purposes of this study, the HES database was accorded a 'gold standard' status for identifying strokes under the assumption that the majority of strokes are identified and treated in hospitals.¹⁸ The ONS mortality data contain the date and cause of death for deaths registered in England.

Study design and population

A cohort study design was used. In order to be eligible for inclusion in the study patients had to be aged ≥18 years; registered for at least 1 day at a HES or ONS-linked GP practice that contributed 'up-to-standard' data to CPRD; and have at least one record denoting a stroke in either CPRD, HES, and/or ONS during the study period, 1 January 2004 to 31 December 2016.

Identification of stroke events

Strokes were identified in CPRD using Read codes, and in HES and ONS using International Classification of Diseases (ICD)-10 codes (see Supplementary Appendix S1). All stroke events were included, including multiple events recorded in the same individual. Further details of the strategies used to identify stroke events in each dataset, including additional exclusion criteria applied, are provided in the Supplementary material (see Supplementary Appendix S2).

Analysis

For each data source, the number of recorded stroke events was counted, both overall and by stroke subtype. The extent to which: 1) strokes in HES occurred in CPRD; and 2) strokes in CPRD occurred in HES was assessed. Stroke recording was described as concordant (a 'match') if the CPRD stroke was ≤30 days before or ≤90 days after the HES stroke. The rationale for using a +90-day recording window was to allow for the fact that some stroke patients may remain in hospital for an extended period after their initial stroke. Consequently, their CPRD record may be significantly delayed if, for instance, the date of stroke is erroneously recorded as the date of discharge letter or only recorded when a post-hospitalisation visit to primary care occurs. Allowing a minus 30-day recording facilitated capture of stroke referrals from primary care. The degree of completeness of recording between the two data sources was reported using a Venn diagram. Sensitivity analyses explored the effect on concordance of restricting the analysis to: 1) non-fatal stroke (survival to 30 days); and 2) first-ever stroke only.

For matched strokes, the accuracy in timing was described in terms of the number of days between a recording in HES and a recording in CPRD. The level of diagnostic accuracy was assessed across the two datasets by estimating the proportion of matched strokes that were assigned the same stroke subtype.

Finally, a separate analysis was conducted in which various strategies and definitions were used to identify fatal strokes in CPRD and HES data . The ONS data were then used as a gold standard to ascertain the positive predictive value of these strategies for defining a fatal stroke.

Results

Within the study period, a total of 72 298 adults experienced at least one stroke event that was recorded in ≥1 of the three databases: CPRD (n = 54 929 events), HES (n = 51 013 events), or ONS (n = 17 977 deaths) (Figure 1). In both CPRD and HES data, approximately one-fifth of all strokes were coded as haemorrhagic (16.7% in CPRD and 22.0% in HES). In contrast, 59.9% of strokes in HES were recorded as ischaemic, while only 31.0% of CPRD strokes were coded as such (Table 1). Atrial fibrillation was the most common risk factor in those who suffered a fatal stroke, while diabetes and hypertension exhibited similar prevalence across all three datasets (Table 1).

Figure 1. Identification of eligible stroke events in each data source. In CPRD, individual stroke records were combined into a single record, which represented a single, discrete event using the 90-day rule (see Supplementary Appendix S2). Similar criteria were used to identify separate stroke events in the same patient in HES data. Moreover, events were restricted to those that occurred in the same (that is, concurrent) periods of follow-up across the three linked data sets within the study period, 1 January 2004 to 31 December 2016. CPRD = Clinical Practice Research Datalink. HES = Hospital Episode Statistics. ONS = Office for National Statistics (mortality data).

View this table:

Table 1. Number of recorded strokes and risk factor prevalence in the three cohorts identified in CPRD, HES, and ONS data (N = 72 298)

In CPRD data, just 10 individual codes accounted for 82.6% of all recorded strokes (see Supplementary Appendix S3: Table S3.1). Two non-specific codes ('cerebrovascular accident unspecified' and 'stroke and cerebrovascular attack unspecified') comprised almost 50% of all coded events. In HES data, over 90% of strokes identified were described by a set of only 10 ICD-10 codes (see Supplementary Appendix S3: Table S3.2).

Agreement between CPRD and HES data

Of 75 674 stroke events identified in either CPRD or HES data, 54 929 (72.6%) were recorded in CPRD and 51 013 (67.4%) were recorded in HES (Figure 2, Table 1). Two-fifths (n = 30 268) of coded strokes were 'matched strokes', that is were present in both datasets (Figure 2). Of all HES strokes, 59.3% were found in CPRD data. Of all CPRD strokes, 55.1% were found in HES data.

Figure 2. Number and percentage of all strokes (fatal and non-fatal) recorded in primary care (CPRD) and in hospital (HES) data sources (total number of recorded stroke events = 75 674). These data are based on a 120-day recording window, such that 30 268 HES-recorded stroke events had a 'matching' record in CPRD that was dated within 120 days of the date of hospital admission for stroke, either 30 days before or up to 90 days after. CPRD = Clinical Practice Research Datalink. HES = Hospital Episode Statistics.

When the analysis was restricted to non-fatal strokes, the proportion of events reported in both datasets increased slightly to 43.3%. The proportion of hospitalisations for non-fatal strokes that were reflected in the primary care record increased to 66.4% (see Supplementary Appendix S3: Figure S3.1). However, no improvement in concordance was observed when the analysis was limited to first strokes; the proportion of 'matched' events remained at around 39.7% (see Supplementary Appendix S3: Figure S3.2).

Agreement in subtyping for matched strokes

Nearly three-quarters of 'matched strokes' coded as haemorrhagic in HES were also coded as haemorrhagic in CPRD (Table 2a). Likewise, 74.1% of strokes identified as haemorrhagic in CPRD were coded as such in HES data (Table 2b). In contrast, only 43.5% of ischaemic strokes in HES data were also coded as ischaemic in CPRD data. Strokes coded as ischaemic in CPRD data were confirmed as such in 85.7% of cases in HES data. A large proportion (71.0%) of strokes coded with non-specific codes in CPRD were coded as ischaemic in HES data (Table 2b).

View this table:

Table 2. Degree of concordance in the recording of strokes by subtype across primary and secondary care data sources

Timeliness of matched strokes

Of the 30 268 CPRD–HES 'matched strokes', 48.2% (n = 14 587) had concordant event dates. This percentage increased to 56.2% (n = 17 006 strokes) when the criterion for an 'exact match' was extended to 1 day either side of a HES stroke (see Supplementary Appendix S3: Table S3.3) and increased to over 90% within 60 days of the date of hospital admission (see Supplementary Appendix S3: Figure S3.3).

Identification of fatal strokes

A total of 17 977 individuals were identified in ONS data as having died as a result of a stroke during the study period. A quarter (23.9%) of those deaths were attributed to haemorrhagic stroke, and 12.5% to ischaemic stroke. Over half were coded using ICD-10 code I64 (stroke not otherwise specified) (Table 3).

View this table:

Table 3 . Number of study-eligible fatal strokes recorded in ONS, CPRD, and HES data sources

A total of 5849 of 54 929 CPRD-recorded strokes were categorised as fatal strokes (defined as death within 30 days after stroke), giving a stroke mortality of 10.6% in CPRD (Table 3). Extending the definition of fatal stroke to include deaths within a year increased the stroke mortality to 19.4%. In HES data, 11 236 of 51 013 strokes were categorised as being fatal, giving a stroke mortality of 22.0%.

The strategies for identifying fatal strokes in CPRD and HES data captured relatively few ONS-recorded events, 3968 (22.1%) and 8314 (46.2%), respectively (Figure 3). However, almost 70% of CPRD-identified fatal strokes were confirmed as such in ONS data (see Supplementary Appendix S3: Table S3.4). For HES-identified stroke deaths, the positive predictive value was better still, at around 76% (see Supplementary Appendix S3: Table S3.5).

Figure 3. Number of fatal strokes recorded in primary care (CPRD: 30-day definition), in hospital care (HES: discharge status), and ONS (underlying cause of death). Total CPRD n = 54 929; total HES n = 51 013. CPRD = Clinical Practice Research Datalink. HES = Hospital Episode Statistics. ONS = Office for National Statistics.

Discussion

Summary

Overall, this study found a disappointingly low level of concordance in the recording of stroke events between primary care and hospital statistics. Only 40% of all identified strokes (n = 75 674) were captured in a timely fashion in both datasets (that is, within a timeframe of 120 days).

Strengths and limitations

To the authors’ knowledge, this is the first UK study to cross-reference the Read-coding of stroke in UK primary care data against other data sources, at least since the introduction of QOF. It is also the first attempt to examine the reliability of strategies to identify fatal strokes in primary care and hospital data by using linked ONS data as a gold standard.

A previous study examining the concordance of MI recording in the same data sources had the advantage of being able to draw on an additional linked dataset, the national MI register (the Myocardial Ischaemia National Audit Project [MINAP]).^12,19 Linkage to the Sentinel Stroke National Audit Programme, the national stroke registry, which contains data on around 90% of all stroke hospitalisations in England and Wales,²⁰ would likewise have added to the scope of this study, in particular in terms of confirming not only the type of stroke suffered, but also the date on which it occurred.

Other study limitations stem from the inherent nature of stroke itself. Relative to other acute cardiovascular diseases, a diagnosis of stroke is more uncertain. Moreover, the experience of stroke can vary from an acute event of a few days duration to a protracted illness, with multiple sequelae and permanent disability. These factors necessitated making certain assumptions and compromises when defining appropriate time scales for distinguishing multiple events in the same patient and the CPRD–HES recording window. The choice of +90 days for both was based on clinical experience, but it is acknowledged that this may have compromised the ability to count strokes that occur in rapid succession.

Comparison with existing literature

The findings are not dissimilar to those of a parallel study conducted for MI. The earlier MI study also identified a higher number of recorded events in CRPD (relative to HES), but the proportion of 'matched' events was higher, at around 60%.¹²

There are likely multiple reasons for the observed poor concordance in stroke recording between primary and secondary care. Strokes that occur in the community (for example, in nursing homes and never get coded in hospital data) may account for some of the discrepancy. In light of evidence that as many as 10%–16% of strokes occur in the community,^18,21 it is certainly plausible that some strokes, in particular milder strokes and transient ischaemic attacks that are treated in the community and/or in hospital outpatient clinics, might only ever be recorded in CPRD. The possible contribution of fatal community strokes to the discordance is less certain. This is because while the primary care record will almost invariably reflect the fact that a person has died, it is less likely to document the cause of death with the result that the stroke is neither documented in CPRD nor in HES. Indeed, the general lack of coding for cause of death in CPRD, even though death occurred in hospital, may help account for the 41% of HES-recorded strokes that did not materialise in CPRD (Figure 2). Some credence to this hypothesis is provided by the results of the sensitivity analysis. Restricting the analysis to non-fatal strokes produced a small improvement in concordance, implying those who survived to 30 days post-stroke were more likely to have a corresponding CPRD record than those who did not.

While it is highly likely that a proportion of the CPRD-recorded strokes represent prevalent events (that is, are repeat codings of an earlier, as opposed to a new, event), the extent to which prevalent coding is contributing to the poor CPRD–HES overlap is also uncertain. If prevalent coding was a significant factor, the authors would have expected to see an improvement in the level of concordance when the analysis was restricted to first strokes. However, this was not the case.

Other possible reasons for the discrepancy in stroke recording between CPRD and HES relate to GP-coding practices, which are known to vary between practices.²² These include a failure to code the reason for a recent hospitalisation and increasing use of monitoring codes for follow-up consultations for stroke in primary care over time.²³

This study also provided insight into the diagnostic accuracy of stroke recording in UK health databases. Of all strokes that occur in the UK, approximately 85% are ischaemic and 15% are haemorrhagic.¹ In this analysis, 17% of CPRD strokes and 22% of HES strokes were coded as haemorrhagic, indicating some slight overrepresentation in both datasets of the latter. Conversely, ischaemic strokes were underrepresented in CPRD data, with a prevalence of just 31%. This is likely owing to widespread use of non-specific codes in primary care; 53% of matched strokes coded as ischaemic in HES data were described using non-specific codes in CPRD data. However, of the CPRD strokes that were assigned a subtype, the subtyping was mostly accurate when compared with HES strokes. For example, 80% of ischaemic strokes in CPRD were also classified as ischaemic in HES.

Estimated case-fatality rates (10.6% in CPRD data) are broadly consistent with the one-in-eight 30-day fatality estimate derived from Sentinel Stroke National Audit Programme data.²⁰ The higher fatality rate in HES data in the present study (22.7%) likely reflects differences in the definition of a fatal stroke, coupled with a bias towards more severe cases of stroke in the hospital setting.

Implications for practice

This study has raised some concerns regarding the use of simple algorithms comprised of diagnostic clinical codes for the identification of stroke in primary care records. Only 60% of all hospitalisations for stroke were reflected in the primary care record in a timely fashion (within 90 days post-stroke). Diagnostic accuracy in CPRD is also questionable, given that a high proportion of ischaemic strokes are recorded using non-specific codes. However, when a stroke is subtyped as being ischaemic or haemorrhagic in origin, the subtyping is accurate in approximately 75% of cases (relative to HES recording). Thus, while reliance on primary care data alone may be adequate for the purposes of identifying people who have had a stroke, use of HES-linked data provides greater completeness, and better information on the timing and type of stroke experienced.

Notes

Funding

This work was undertaken as part of a British Lung Foundation funded PhD studentship (Ann Dorothy Morgan: grant number RG14/07). Sarah-Jo Sinnott, Liam Smeeth, Caroline Minassian, and Jennifer Quint have not declared specific grant funding for this piece of research.

Ethical approval

This study was approved by the Medicines and Healthcare products Regulatory Agency’s Independent Scientific Advisory Committee (ISAC): protocol number 16_095R. This study is based in part on data from the Clinical Practice Research Datalink (CPRD) obtained under licence from the UK Medicines and Healthcare products Regulatory Agency. The data is provided by patients and collected by the NHS as part of their care and support. The interpretation and conclusions contained in this study are those of the author/s alone. Linked pseudonymised data was provided for this study by CPRD. Data is linked by NHS Digital, the statutory trusted third party for linking data, using identifiable data held only by NHS Digital. Select general practices consent to this process at a practice level with individual patients having the right to opt-out.

Provenance

Freely submitted; externally peer reviewed.

Acknowledgements

Competing interests

The authors declare that no competing interests exist.

Author contributions

All authors were involved in the study design and reviewed draft versions of the manuscript. AM, SJS and CM performed the data analyses; AM and SJS contributed equally to the writing of the manuscript.

Received July 6, 2020.
Accepted August 27, 2020.

This article is Open Access: CC BY license (https://creativecommons.org/licenses/by/4.0/)

References

1.↵
1. Stroke Association UK
(2018) Stroke statistics. https://www.stroke.org.uk/resources/state-nation-stroke-statistics. 3 Feb 2021.
2.↵
1. Adamson J,
2. Beswick A,
3. Ebrahim S
(2004) Is stroke the most common cause of disability? J Stroke Cerebrovasc Dis 13(4):171–177, doi:10.1016/j.jstrokecerebrovasdis.2004.06.003, pmid:17903971.
OpenUrl CrossRef PubMed
3.↵
1. Patel A,
2. Berdunov V,
3. King D
(2017) Executive summary part 2: burden of stroke in the next 20 years and potential returns from increased spending on research (Stroke Association, London).
4.↵
1. Lee S,
2. Shafe ACE,
3. Cowie MR
(2011) UK stroke incidence, mortality and cardiovascular risk management 1999–2008: time-trend analysis from the general practice research database. BMJ Open 1(2), doi:10.1136/bmjopen-2011-000269, pmid:22021893. e000269.
OpenUrl Abstract/FREE Full Text
5.↵
1. Scowcroft ACE,
2. Cowie MR
(2014) Atrial fibrillation: improvement in identification and stroke preventive therapy — data from the UK Clinical Practice Research Datalink, 2000–2012. Int J Cardiol 171(2):169–173, doi:10.1016/j.ijcard.2013.11.086, pmid:24387894.
OpenUrl CrossRef PubMed
6.↵
1. Sherman RE,
2. Anderson SA,
3. Dal Pan GJ,
4. et al.
(2016) Real-world evidence — what is it and what can it tell us? N Engl J Med 375(23):2293–2297, doi:10.1056/NEJMsb1609216, pmid:27959688.
OpenUrl CrossRef PubMed
7.↵
1. Corrigan-Curay J,
2. Sacks L,
3. Woodcock J
(2018) Real-world evidence and real-world data for evaluating drug safety and effectiveness. JAMA 320(9):867–868, doi:10.1001/jama.2018.10136, pmid:30105359.
OpenUrl CrossRef PubMed
8.↵
1. Pylypchuk R,
2. Wells S,
3. Kerr A,
4. et al.
(2018) Cardiovascular disease risk prediction equations in 400 000 primary care patients in New Zealand: a derivation and validation study. Lancet 391(10133):1897–1907, doi:10.1016/S0140-6736(18)30664-0, pmid:29735391.
OpenUrl CrossRef PubMed
9.↵
1. van Staa T-P,
2. Gulliford M,
3. Ng ES-W,
4. et al.
(2014) Prediction of cardiovascular risk using Framingham, ASSIGN and QRISK2: how well do they predict individual rather than population risk? PLoS One 9(10), doi:10.1371/journal.pone.0106455, pmid:25271417. e106455.
OpenUrl CrossRef PubMed
10.↵
1. Staa T-Pvan,
2. Goldacre B,
3. Gulliford M,
4. et al.
(2012) Pragmatic randomised trials using routine electronic health records: putting them to the test. BMJ 344, doi:10.1136/bmj.e55, pmid:22315246. e55.
OpenUrl FREE Full Text
11.↵
1. Baker R,
2. Tata LJ,
3. Kendrick D,
4. Orton E
(2016) Identification of incident poisoning, fracture and burn events using linked primary care, secondary care and mortality data from England: implications for research and surveillance. Inj Prev 22(1):59–67, doi:10.1136/injuryprev-2015-041561, pmid:26136460.
OpenUrl Abstract/FREE Full Text
12.↵
1. Herrett E,
2. Shah AD,
3. Boggon R,
4. et al.
(2013) Completeness and diagnostic validity of recording acute myocardial infarction events in primary care, hospital care, disease registry, and national mortality records: cohort study. BMJ 346, doi:10.1136/bmj.f2350, pmid:23692896. f2350.
OpenUrl Abstract/FREE Full Text
13.↵
1. Ruigómez A,
2. Martín-Merino E,
3. Rodríguez LAG
(2010) Validation of ischemic cerebrovascular diagnoses in the health improvement network (THIN). Pharmacoepidemiol Drug Saf 19(6):579–585, doi:10.1002/pds.1919, pmid:20131328.
OpenUrl CrossRef PubMed
14.↵
1. Gaist D,
2. Wallander M-A,
3. González-Pérez A,
4. García-Rodríguez LA
(2013) Incidence of hemorrhagic stroke in the general population: validation of data from the Health Improvement Network. Pharmacoepidemiol Drug Saf 22(2):176–182, doi:10.1002/pds.3391, pmid:23229888.
OpenUrl CrossRef PubMed
15.↵
1. Gillam SJ,
2. Siriwardena AN,
3. Steel N
(2012) Pay-for-performance in the United Kingdom: impact of the quality and outcomes framework: a systematic review. Ann Fam Med 10(5):461–468, doi:10.1370/afm.1377, pmid:22966110.
OpenUrl Abstract/FREE Full Text
16.↵
1. Herrett E,
2. Gallagher AM,
3. Bhaskaran K,
4. et al.
(2015) Data resource profile: Clinical Practice Research Datalink (CPRD). Int J Epidemiol 44(3):827–836, doi:10.1093/ije/dyv098, pmid:26050254.
OpenUrl CrossRef PubMed
17.↵
1. Herrett E,
2. Thomas SL,
3. Schoonen WM,
4. et al.
(2010) Validation and validity of diagnoses in the general practice research database: a systematic review. Br J Clin Pharmacol 69(1):4–14, doi:10.1111/j.1365-2125.2009.03537.x, pmid:20078607.
OpenUrl CrossRef PubMed
18.↵
1. Kelly PJ,
2. Crispino G,
3. Sheehan O,
4. et al.
(2012) Incidence, event rates, and early outcome of stroke in Dublin, Ireland: the North Dublin population stroke study. Stroke 43(8):2042–2047, doi:10.1161/STROKEAHA.111.645721, pmid:22693134.
OpenUrl Abstract/FREE Full Text
19.↵
1. Denaxas SC,
2. George J,
3. Herrett E,
4. et al.
(2012) Data resource profile: cardiovascular disease research using linked bespoke studies and electronic health records (CALIBER). Int J Epidemiol 41(6):1625–1638, doi:10.1093/ije/dys188, pmid:23220717.
OpenUrl CrossRef PubMed
20.↵
1. Bray BD,
2. Cloud GC,
3. James MA,
4. et al.
(2016) Weekly variation in health-care quality by day and time of admission: a nationwide, registry-based, prospective cohort study of acute stroke care. Lancet 388(10040):170–177, doi:10.1016/S0140-6736(16)30443-3, pmid:27178477.
OpenUrl CrossRef PubMed
21.↵
1. Asplund K,
2. Bonita R,
3. Kuulasmaa K,
4. et al.
(1995) Multinational comparisons of stroke epidemiology. Evaluation of case ascertainment in the WHO MONICA stroke study. World Health Organization monitoring trends and determinants in cardiovascular disease. Stroke 26(3):355–360, doi:10.1161/01.str.26.3.355, pmid:7886706.
OpenUrl Abstract/FREE Full Text
22.↵
1. Mant J,
2. McManus RJ,
3. Hare R,
4. Mayer P
(2003) Identification of stroke in the community: a comparison of three methods. Br J Gen Pract 53(492):520–524, pmid:14694663.
OpenUrl Abstract/FREE Full Text
23.↵
1. Gulliford MC,
2. Charlton J,
3. Ashworth M,
4. et al.
(2009) Selection of medical diagnostic codes for analysis of electronic patient records. Application to stroke in a primary care database. PLoS One 4(9), doi:10.1371/journal.pone.0007168, pmid:19777060. e7168.
OpenUrl CrossRef PubMed

In this issue

Download PDF

Download PowerPoint

Email Article

Citation Tools

Keywords

More in this TOC Section

Show more Research

Cited By...

Intended for Healthcare Professionals

Tweets by @BJGPOpen

[1] 1.↵
Stroke Association UK
(2018) Stroke statistics. https://www.stroke.org.uk/resources/state-nation-stroke-statistics. 3 Feb 2021.

[2] Stroke Association UK

[3] 2.↵
Adamson J,
Beswick A,
Ebrahim S
(2004) Is stroke the most common cause of disability? J Stroke Cerebrovasc Dis 13(4):171–177, doi:10.1016/j.jstrokecerebrovasdis.2004.06.003, pmid:17903971.
OpenUrl CrossRef PubMed

[4] Adamson J,

[5] Beswick A,

[6] Ebrahim S

[7] 3.↵
Patel A,
Berdunov V,
King D
(2017) Executive summary part 2: burden of stroke in the next 20 years and potential returns from increased spending on research (Stroke Association, London).

[8] Patel A,

[9] Berdunov V,

[10] King D

[11] 4.↵
Lee S,
Shafe ACE,
Cowie MR
(2011) UK stroke incidence, mortality and cardiovascular risk management 1999–2008: time-trend analysis from the general practice research database. BMJ Open 1(2), doi:10.1136/bmjopen-2011-000269, pmid:22021893. e000269.
OpenUrl Abstract/FREE Full Text

[12] Lee S,

[13] Shafe ACE,

[14] Cowie MR

[15] 5.↵
Scowcroft ACE,
Cowie MR
(2014) Atrial fibrillation: improvement in identification and stroke preventive therapy — data from the UK Clinical Practice Research Datalink, 2000–2012. Int J Cardiol 171(2):169–173, doi:10.1016/j.ijcard.2013.11.086, pmid:24387894.
OpenUrl CrossRef PubMed

[16] Scowcroft ACE,

[17] Cowie MR

[18] 6.↵
Sherman RE,
Anderson SA,
Dal Pan GJ,
et al.
(2016) Real-world evidence — what is it and what can it tell us? N Engl J Med 375(23):2293–2297, doi:10.1056/NEJMsb1609216, pmid:27959688.
OpenUrl CrossRef PubMed

[19] Sherman RE,

[20] Anderson SA,

[21] Dal Pan GJ,

[22] et al.

[23] 7.↵
Corrigan-Curay J,
Sacks L,
Woodcock J
(2018) Real-world evidence and real-world data for evaluating drug safety and effectiveness. JAMA 320(9):867–868, doi:10.1001/jama.2018.10136, pmid:30105359.
OpenUrl CrossRef PubMed

[24] Corrigan-Curay J,

[25] Sacks L,

[26] Woodcock J

[27] 8.↵
Pylypchuk R,
Wells S,
Kerr A,
et al.
(2018) Cardiovascular disease risk prediction equations in 400 000 primary care patients in New Zealand: a derivation and validation study. Lancet 391(10133):1897–1907, doi:10.1016/S0140-6736(18)30664-0, pmid:29735391.
OpenUrl CrossRef PubMed

[28] Pylypchuk R,

[29] Wells S,

[30] Kerr A,

[31] et al.

[32] 9.↵
van Staa T-P,
Gulliford M,
Ng ES-W,
et al.
(2014) Prediction of cardiovascular risk using Framingham, ASSIGN and QRISK2: how well do they predict individual rather than population risk? PLoS One 9(10), doi:10.1371/journal.pone.0106455, pmid:25271417. e106455.
OpenUrl CrossRef PubMed

[33] van Staa T-P,

[34] Gulliford M,

[35] Ng ES-W,

[36] et al.

[37] 10.↵
Staa T-Pvan,
Goldacre B,
Gulliford M,
et al.
(2012) Pragmatic randomised trials using routine electronic health records: putting them to the test. BMJ 344, doi:10.1136/bmj.e55, pmid:22315246. e55.
OpenUrl FREE Full Text

[38] Staa T-Pvan,

[39] Goldacre B,

[40] Gulliford M,

[41] et al.

[42] 11.↵
Baker R,
Tata LJ,
Kendrick D,
Orton E
(2016) Identification of incident poisoning, fracture and burn events using linked primary care, secondary care and mortality data from England: implications for research and surveillance. Inj Prev 22(1):59–67, doi:10.1136/injuryprev-2015-041561, pmid:26136460.
OpenUrl Abstract/FREE Full Text

[43] Baker R,

[44] Tata LJ,

[45] Kendrick D,

[46] Orton E

[47] 12.↵
Herrett E,
Shah AD,
Boggon R,
et al.
(2013) Completeness and diagnostic validity of recording acute myocardial infarction events in primary care, hospital care, disease registry, and national mortality records: cohort study. BMJ 346, doi:10.1136/bmj.f2350, pmid:23692896. f2350.
OpenUrl Abstract/FREE Full Text

[48] Herrett E,

[49] Shah AD,

[50] Boggon R,

[51] et al.

[52] 13.↵
Ruigómez A,
Martín-Merino E,
Rodríguez LAG
(2010) Validation of ischemic cerebrovascular diagnoses in the health improvement network (THIN). Pharmacoepidemiol Drug Saf 19(6):579–585, doi:10.1002/pds.1919, pmid:20131328.
OpenUrl CrossRef PubMed

[53] Ruigómez A,

[54] Martín-Merino E,

[55] Rodríguez LAG

[56] 14.↵
Gaist D,
Wallander M-A,
González-Pérez A,
García-Rodríguez LA
(2013) Incidence of hemorrhagic stroke in the general population: validation of data from the Health Improvement Network. Pharmacoepidemiol Drug Saf 22(2):176–182, doi:10.1002/pds.3391, pmid:23229888.
OpenUrl CrossRef PubMed

[57] Gaist D,

[58] Wallander M-A,

[59] González-Pérez A,

[60] García-Rodríguez LA

[61] 15.↵
Gillam SJ,
Siriwardena AN,
Steel N
(2012) Pay-for-performance in the United Kingdom: impact of the quality and outcomes framework: a systematic review. Ann Fam Med 10(5):461–468, doi:10.1370/afm.1377, pmid:22966110.
OpenUrl Abstract/FREE Full Text

[62] Gillam SJ,

[63] Siriwardena AN,

[64] Steel N

[65] 16.↵
Herrett E,
Gallagher AM,
Bhaskaran K,
et al.
(2015) Data resource profile: Clinical Practice Research Datalink (CPRD). Int J Epidemiol 44(3):827–836, doi:10.1093/ije/dyv098, pmid:26050254.
OpenUrl CrossRef PubMed

[66] Herrett E,

[67] Gallagher AM,

[68] Bhaskaran K,

[69] et al.

[70] 17.↵
Herrett E,
Thomas SL,
Schoonen WM,
et al.
(2010) Validation and validity of diagnoses in the general practice research database: a systematic review. Br J Clin Pharmacol 69(1):4–14, doi:10.1111/j.1365-2125.2009.03537.x, pmid:20078607.
OpenUrl CrossRef PubMed

[71] Herrett E,

[72] Thomas SL,

[73] Schoonen WM,

[74] et al.

[75] 18.↵
Kelly PJ,
Crispino G,
Sheehan O,
et al.
(2012) Incidence, event rates, and early outcome of stroke in Dublin, Ireland: the North Dublin population stroke study. Stroke 43(8):2042–2047, doi:10.1161/STROKEAHA.111.645721, pmid:22693134.
OpenUrl Abstract/FREE Full Text

[76] Kelly PJ,

[77] Crispino G,

[78] Sheehan O,

[79] et al.

[80] 19.↵
Denaxas SC,
George J,
Herrett E,
et al.
(2012) Data resource profile: cardiovascular disease research using linked bespoke studies and electronic health records (CALIBER). Int J Epidemiol 41(6):1625–1638, doi:10.1093/ije/dys188, pmid:23220717.
OpenUrl CrossRef PubMed

[81] Denaxas SC,

[82] George J,

[83] Herrett E,

[84] et al.

[85] 20.↵
Bray BD,
Cloud GC,
James MA,
et al.
(2016) Weekly variation in health-care quality by day and time of admission: a nationwide, registry-based, prospective cohort study of acute stroke care. Lancet 388(10040):170–177, doi:10.1016/S0140-6736(16)30443-3, pmid:27178477.
OpenUrl CrossRef PubMed

[86] Bray BD,

[87] Cloud GC,

[88] James MA,

[89] et al.

[90] 21.↵
Asplund K,
Bonita R,
Kuulasmaa K,
et al.
(1995) Multinational comparisons of stroke epidemiology. Evaluation of case ascertainment in the WHO MONICA stroke study. World Health Organization monitoring trends and determinants in cardiovascular disease. Stroke 26(3):355–360, doi:10.1161/01.str.26.3.355, pmid:7886706.
OpenUrl Abstract/FREE Full Text

[91] Asplund K,

[92] Bonita R,

[93] Kuulasmaa K,

[94] et al.

[95] 22.↵
Mant J,
McManus RJ,
Hare R,
Mayer P
(2003) Identification of stroke in the community: a comparison of three methods. Br J Gen Pract 53(492):520–524, pmid:14694663.
OpenUrl Abstract/FREE Full Text

[96] Mant J,

[97] McManus RJ,

[98] Hare R,

[99] Mayer P

[100] 23.↵
Gulliford MC,
Charlton J,
Ashworth M,
et al.
(2009) Selection of medical diagnostic codes for analysis of electronic patient records. Application to stroke in a primary care database. PLoS One 4(9), doi:10.1371/journal.pone.0007168, pmid:19777060. e7168.
OpenUrl CrossRef PubMed

[101] Gulliford MC,

[102] Charlton J,

[103] Ashworth M,

[104] et al.

Main menu

User menu

Search

Concordance in the recording of stroke across UK primary and secondary care datasets: a population-based cohort study

Abstract

How this fits in

Introduction

Method

Data sources

Study design and population

Identification of stroke events

Analysis

Results

Agreement between CPRD and HES data

Agreement in subtyping for matched strokes

Timeliness of matched strokes

Identification of fatal strokes

Discussion

Summary

Strengths and limitations

Comparison with existing literature

Implications for practice

Notes

Funding

Ethical approval

Provenance

Acknowledgements

Competing interests

Author contributions

References

In this issue

Citation Manager Formats

Jump to section

Keywords

More in this TOC Section

Related Articles

Cited By...

NAVIGATE

RCGP

MY ACCOUNT

NEWS AND UPDATES

AUTHORS & REVIEWERS

CUSTOMER SERVICES

CONTRIBUTE

CONTACT US