Development, validation and comparison of multivariable risk scores for prediction of total stroke and stroke types in Chinese adults: a prospective study of 0.5 million adults

Matthew Chun; Robert Clarke; Tingting Zhu; David Clifton; Derrick A Bennett; Yiping Chen; Yu Guo; Pei Pei; Jun Lv; Canqing Yu; Ling Yang; Liming Li; Zhengming Chen; Benjamin J Cairns

doi:10.1136/svn-2021-001251

Article Text

Original research

Development, validation and comparison of multivariable risk scores for prediction of total stroke and stroke types in Chinese adults: a prospective study of 0.5 million adults

http://orcid.org/0000-0002-8819-2072Matthew Chun1,2,
http://orcid.org/0000-0002-9802-8241Robert Clarke1,
Tingting Zhu2,
David Clifton2,3,
Derrick A Bennett1,
Yiping Chen1,4,
Yu Guo5,
Pei Pei6,
Jun Lv7,8,
Canqing Yu7,8,
Ling Yang1,
Liming Li7,8,
Zhengming Chen4,
Benjamin J Cairns1
On behalf of the China Kadoorie Biobank Collaborative Group

¹ Clinical Trial Service Unit and Epidemiological Studies, Nuffield Department of Population Health, University of Oxford, Oxford, UK
² Department of Engineering Science, University of Oxford, Oxford, UK
³ Department of Biomedical Engineering, Oxford-Suzhou Centre for Advanced Research, Suzhou, China
⁴ Medical Research Council Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
⁵ CKB Project Department, Fuwai Hospital Chinese Academy of Medical Sciences, National Center for Cardiovascular Diseases, Beijing, China
⁶ CKB Project Department, Chinese Academy of Medical Sciences, Beijing, China
⁷ Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
⁸ Department of Epidemiology, Peking University Center for Public Health and Epidemic Preparedness and Response, Beijing, China

Correspondence to Professor Robert Clarke; robert.clarke{at}ndph.ox.ac.uk

Abstract

Background and purpose Low-income and middle-income countries have the greatest stroke burden, yet remain understudied. This study compared the utility of Framingham versus novel risk scores for prediction of total stroke and stroke types in Chinese adults.

Methods China Kadoorie Biobank (CKB) is a prospective study of 512 726 adults, aged 30–79 years, recruited from 10 areas in China in 2004–2008. By 1 January 2018, 43 234 incident first stroke cases (36 310 ischaemic stroke (IS); 8865 haemorrhagic stroke (HS)) were recorded in 503 842 participants with no history of stroke at baseline. We compared the predictive utility of the Framingham Stroke Risk Profile (FSRP) with novel CKB stroke risk scores and included recalibration, refitting, stratifying by study area and addition of other risk factors. Discrimination was assessed using area under the receiver operating characteristic curve (AUC) and calibration was assessed using Greenwood-Nam-D’Agostino χ² statistics.

Results Incidence of total stroke varied fivefold by area in China. The FSRP had good discrimination for total stroke (AUC (95% CI); men: 0.78 (0.77 to 0.79), women: 0.77 (95% CI 0.76 to 0.78)), but poor calibration (χ²; men: 1,825, women: 3,053), substantially underestimating absolute risks. Recalibration reduced χ² by >80%, but did not improve discrimination. Refitting the FSRP did not materially improve discrimination, but further improved calibration. Stratification by area improved discrimination (AUC; men: 0.82 (0.82 to 0.83); women: 0.82 (0.82 to 0.83)), but not calibration. Adding other risk factors yielded modest, but statistically significant, improvements in the AUCs. The findings for IS and HS were similar to those for total stroke.

Conclusions The FSRP reliably differentiated Chinese adults with incident stroke, but substantially underestimated the absolute risks of stroke. Novel local risk prediction equations that took account of differences in stroke incidence within China enhanced risk prediction of total stroke and major stroke pathological types.

stroke
risk factors
prospective studies
standard of care

Data availability statement

Data are available on reasonable request. Access details to a stroke risk calculator are provided in a workbook in the online supplemental materials to enable researchers to calculate risk scores for stroke using their own data. Researchers who are interested in obtaining the raw data from the China Kadoorie Biobank study that underlines this paper should contact ckbaccess@ndph.ox.ac.uk. A research proposal will be requested to ensure that any analysis is performed by bona fide researchers and - where data is not currently available to open access researchers - is restricted to the topic covered in this paper.

https://creativecommons.org/licenses/by/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.

https://doi.org/10.1136/svn-2021-001251

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Introduction

Stroke is a leading cause of death and disability worldwide, and about three-quarters of all stroke cases now occur in low-income and middle-income countries (LMICs), including China.1 Stroke accounted for 34 million prevalent cases and 2 million deaths in China in 2017.2 Cost-effective primary prevention of stroke requires both population-based lifestyle strategies (eg, salt reduction) and blood pressure-lowering and lipid-lowering medication in high-risk individuals.3 Risk prediction equations are required to identify those who would derive maximum benefit from such preventive treatments.4 5

The Framingham Stroke Risk Profile (FSRP), derived from a multigeneration prospective cohort study in Framingham, Massachusetts, USA, is a widely used risk score for prediction of stroke.5–8 It provides sex-specific predictions of the absolute risks of total stroke within a specified interval (typically in the next 10 years), based on age, current smoking, history of coronary heart disease (CHD), atrial fibrillation (AF), diabetes, systolic blood pressure and use of antihypertensive treatment.6–8 Recently updated in 2017, the FSRP has been validated in many high-income countries to predict risk of total stroke,8 9 but its clinical utility in LMICs, such as China, is uncertain.

The incidence rates of total stroke are higher in China than in Western populations, as are the proportions with haemorrhagic stroke (HS).10 Within China there are well-documented large, although unexplained differences in the incidence of stroke between geographical areas.11 Previous studies that estimated absolute risk of total stroke in Chinese populations were constrained by insufficient numbers of stroke cases, involvement of single rather than multiple areas, lack of reliable information on stroke types (eg, ischaemic stroke (IS) vs HS) and lack of contemporary evidence.12–14 Consequently, there is a need for more reliable prediction of absolute risks of total stroke and stroke types in Chinese individuals to guide targeted use of evidence-based cost-effective treatments including lipid-lowering and antiplatelet therapy.15

Using data from a large prospective study of 0.5M adults recruited into the China Kadoorie Biobank (CKB) in 2004–2008, we compared the performance of the established FSRP with newly developed and internally validated local risk equations to predict the absolute risks of total stroke and stroke types in Chinese adults. The aims of the present report were to develop and validate multivariable risk scores for prediction of total stroke, IS and HS in men and women living in China, and to compare the predictive value of (1) the 2017 FSRP; (2) a recalibrated FSRP; (3) a local recalibrated and refitted FSRP; (4) a recalibrated and refitted FSRP after stratifying by geographical area and (5) area-stratified, recalibrated and refitted models with additional risk factors. A risk calculator for total stroke and stroke pathological types is provided to enable other investigators to validate these stroke risk scores in independent populations.

Methods

Study population

The data included in the present analyses are available from the corresponding author on reasonable request. Details of the design and methods used in the CKB have been previously reported.16 17 Briefly, the CKB is a prospective cohort study of 512 726 participants, aged 30–79 years, enrolled from 10 geographically diverse areas (5 urban, 5 rural) of China in 2004–2008. An interviewer-administered electronic questionnaire was used to collect data on sociodemographic factors, lifestyle factors (eg, smoking, alcohol, diet), medical history and current medication and physical activity. Physical measurements included height, weight, hip and waist circumference, bioimpedance, blood pressure and heart rate. All participants provided a blood sample, and random plasma glucose levels were estimated to screen for diabetes. All participants provided written informed consent.

Follow-up for stroke outcomes

The vital status of participants was monitored through death registries supplemented by annual checks with local residential records and active confirmation by contacting local street committees or village administrators.17 All hospitalised cases of stroke were identified by electronic linkage to established registries of major diseases and health insurance records (covering >97% of participants), supplemented by annual home visits for uninsured participants. All fatal and non-fatal stroke cases were coded by trained medical staff using the International Classification of Diseases 10th revision. The major pathological types of stroke were IS (I63), HS (I60 and I61) and unspecified stroke (I64) (online supplemental eMethods II).18

Supplemental material

[svn-2021-001251supp001.pdf]

Statistical analyses

The present analyses were restricted to individuals with no prior history of stroke or transient ischaemic attack (205 293 men, 298 549 women) at the date of recruitment. The participants were followed up to detect stroke and death until 1 January 2018, and all incident cases of first stroke (19 587 strokes in men; 23 647 strokes in women) that were recorded for up to 9 years after the baseline survey were included.

For consistency with the sex-specific FSRP and current clinical practice, the present analyses were performed separately in men and women, using time-in-study as the time scale of interest. First, CKB individuals were randomly divided into a training set (85%) and test set (15%). The FSRP was then applied in the test set to predict the risk of total stroke for each individual within 9 years of the baseline survey. Since AF was not recorded in CKB, the FSRP predictions were calculated assuming AF was absent at baseline. No major violations of the proportional hazards assumption for the traditional FSRP covariates were identified (online supplemental eMethods III).

A recalibrated model (‘+Recalibration’) was subsequently developed, using the Breslow estimator to derive a baseline survival function that adjusted for the mean values of risk factors in CKB,19 20 while retaining the 2017 FSRP HRs.8 A recalibrated and refitted model (‘+Refitting’) was then constructed using Cox regression to derive new HRs for the FSRP risk factors in CKB. For recalibration and refitting, model parameters were derived from the training set, and all models were evaluated using the test set.

To adjust for differences in baseline hazards across the 10 CKB areas, we next developed a model (‘+Area stratification’) with separate area-specific baselines estimated at the sex-specific mean risk factor values for the overall CKB. In this model, area-stratified Cox regression was used to estimate new HRs for the FSRP risk factors. The model was constructed from the training set and evaluated in the test set.

After estimating separate area-specific baselines, we finally developed an expanded model (‘+Additional risk factors’) using 133 additional risk indicators recorded at baseline in CKB (online supplemental eWorkbook I), including sociodemographic factors, diet, alcohol consumption, personal and family medical history, physical activity, and physical measurements.17 The 133 additional risk factors were selected based on their suspected relationship with stroke, while excluding laboratory-based tests, genetic information, and brain imaging that are not widely available in lower-resource clinical settings in China. A subset of these risk factors was then selected automatically using 10-fold cross-validated, least absolute shrinkage and selection operator (LASSO) regularisation (a technique that penalises the inclusion of additional risk factors to prevent overfitting) within the training set,21 22 and the selected risk factors were used to fit an area-stratified Cox model using the complete training set. Evaluation of the fitted model was performed using the test set.

Supplemental material

[svn-2021-001251supp002.xlsx]

Since the associations of individual risk factors with stroke pathological types differ,15 18 23–25 we also hypothesised that developing separate risk equations for IS and HS could further improve predictive performance compared with a single model for total stroke. Hence, we repeated the analyses separately for IS and for HS pathological types.

To compare predictive performance across models, each model was assessed for discrimination and calibration of 9-year stroke risk predictions using the test set. Risk discrimination refers to the ability to correctly discriminate between individuals with and without stroke, and was evaluated using the area under the receiver operating characteristic curve (AUC). The AUCs for each model were compared with the FSRP using Delong’s test.26 Calibration refers to the similarity between observed and predicted absolute risks and was evaluated using calibration plots. The Greenwood-Nam-D’Agostino χ² test statistic was used to compare the observed incidence (calculated as 1 − Kaplan-Meier survival probability) and predicted risks by deciles of predicted risk (with lower χ² values indicating better model calibration).27 28 The 95% CIs were constructed for AUCs using 1000 bootstrapped samples from the test set. For models with area-specific baselines, AUCs were evaluated for both the overall study population and separately within each CKB area.

Sensitivity analyses included restricting the age range to those ≥55 years (for fair comparison with the Framingham study), adding risk factors to the FSRP prior to stratification by study area (to assess the reordering of incremental modelling improvements), and implementing cumulative incidence functions and Fine-Gray models (to account for the competing risk of death from causes other than stroke).29

LASSO variable selection and Fine-Gray analyses were performed in R V.3.6.1 using the glmnet package V.3.0–2 and riskRegression package version 8 December 2020, respectively.21 30 All other statistical analyses were performed using Python V.3.7.0. Cox proportional hazards models were implemented using the lifelines package version 0.21.1.31 AUC analyses were performed using the scikit-learn toolkit V.0.19.2.32 Additional details of the methods used for the statistical analyses are provided in online supplemental eMethods IV.

Results

Among the 503 842 CKB study participants in the present analyses, the mean (SD) age was 51.9 (10.6) years and 59% were women. During 9 years of follow-up, a total of 43 234 individuals had a first incident stroke irrespective of type (total stroke); 36 310 had a first IS and 8865 had a first HS (table 1). The incidence of first total stroke was higher in men than in women (9.5% vs 7.9%) and varied over fivefold across the 10 study areas. Compared with those who had no stroke, individuals who had a first stroke were older and more likely to have prior history of CHD, diabetes or hypertension. Individuals who had HS were more likely to be current smokers and have higher mean levels of systolic blood pressure than those who had IS. Overall, men and women had similar rates of prior history of CHD (2.5% vs 3.0%), diabetes (5.3% vs 6.0%), and use of blood pressure-lowering medication (9.9% vs 11.4%), but current smoking was much more common in men than in women (67.7% vs 3.2%) (table 1).

View this table:

Table 1

Distribution of established risk factors for total stroke and stroke pathological types in men and women in CKB

Assessment and update of FSRP for prediction of total stroke

The 2017 FSRP yielded moderate discrimination for total stroke in CKB (AUC (95% CI): 0.78 (0.77 to 0.79) in men, 0.77 (0.76 to 0.78) in women) (table 2). However, calibration was very poor, and the 2017 FSRP substantially underestimated the absolute risk of total stroke (χ²: 1825 in men, 3053 in women) (table 2; figure 1).

View this table:

Table 2

Comparison of performance of different models for prediction of total stroke and stroke pathological types in men and women in China Kadoorie Biobank

Figure 1

Calibration of the 2017 FSRP, and recalibrated and refitted models from China Kadoorie Biobank, for total stroke in men and women. The dashed line in each subplot represents the line of equality between observed risk and predicted risk. Models with better calibration have points lying closer to the line of equality. Observed 9-year incidence calculated as 1 − Kaplan-Meier estimate. AUC, area under the curve; FSRP, Framingham Stroke Risk Profile.

Recalibration did not alter the AUCs, but substantially corrected the calibration of the model (χ²: 156 in men, 506 in women). Refitting the HRs for the calibrated equations yielded little material improvement in discrimination (AUC: 0.79 (95% CI 0.78 to 0.80) in men, 0.78 (95% CI 0.77 to 0.78) in women), but further improved calibration (χ²: 51 in men, 148 in women). Refitted HRs and additional details of these models are provided in online supplemental eWorkbook I.

Prediction of total stroke after adjusting for areas in China

Stroke incidence rates varied markedly by geographical region within China and online supplememental eFigure 1 demonstrates the baseline survival curves for total stroke, IS and HS for each of the 10 study regions in CKB. Modelling separate area-specific baseline survival functions for total stroke yielded modest, but statistically significant improvement (p<0.001) in risk discrimination among all study participants (AUC: 0.82 (95% CI 0.82 to 0.83) in men; 0.82 (95% CI 0.82 to 0.83) in women), while maintaining good calibration (χ²: 124 in men; 178 in women) (table 2). The discrimination performance within each of the 10 areas is reported in online supplemental eTable I. HRs for individual risk factors obtained from these models differed from the 2017 FSRP (figure 2) and demonstrated substantially greater consistency between men and women and had much greater precision (as reflected by the narrower CIs since the CKB population was 100-fold larger than the Framingham cohort). A sensitivity analysis including age at which ever-regular smokers started smoking had larger HRs associated with ever-regular smoking in men (1.37) and women (1.17), but showed no material improvement in risk prediction for stroke (online supplemental eFigure II).

Figure 2

Multivariable HR and 95% CI for total stroke in men and women, for Framingham and for China Kadoorie Biobank. *The 2017 Framingham Stroke Risk Profile (FSRP) coefficients were refitted to China Kadoorie Biobank in a model including stratification by geographical area. BP, blood pressure; SBP, systolic BP.

Expanded risk equations with additional risk factors

In addition to controlling for area-specific differences, the addition of other risk indicators recorded in CKB was assessed for risk prediction of total stroke. The expanded models for total stroke, determined using LASSO regularisation for variable selection, included 66 risk factor indicators for men and 70 in women, including measures of diet, personal and family medical history and socioeconomic status (online supplemental eWorkbook I). These models did not yield any further material improvements in either risk discrimination (AUC: 0.83 (95% CI 0.82 to 0.84) in men, 0.83 (95% CI 0.82 to 0.84) in women) or calibration (χ²: 101 in men; 177 in women) (table 2). Discrimination performance within each area is reported in the online supplement (online supplemental eTable II).

Risk equations for different stroke pathological types

Analysis of separate risk equations for IS and HS demonstrated comparable results from recalibration, refitting, accounting for geographical area, and addition of other risk factors. The best-performing IS model yielded AUCs (95% CI) of 0.83 (0.82 to 0.84) in men and 0.83 (0.82 to 0.84) in women with χ² values of 55 and 90, respectively. The best-performing HS model yielded AUCs of 0.82 (95% CI 0.81 to 0.84) in men and 0.82 (95% CI 0.80 to 0.84) in women with χ² values of 14 and 9, respectively (table 2).

The individual risk equations for IS and HS demonstrated substantial differences between the two stroke pathological types. Overall, the absolute risk of IS was 4–5 fold greater than HS, and the ratio of IS to HS risks differed substantially between areas. Modelling area-specific baseline survival curves (ie, predicted survival rates for an individual with mean risk factor values) for IS and HS demonstrated striking differences in stroke risk between areas, consistent with geographical differences in observed stroke incidence during the 9-year follow-up period (figure 3, online supplemental eFigure III). For example, residents in Harbin had threefold higher 9-year incidence of IS compared with those in Hunan (19.68% vs 6.19%), but had half the incidence of HS (1.70% vs 3.19%). Furthermore, by training separate models for IS and HS, different HRs were determined for the same risk factors, including CHD (HR for IS/HS; men: 1.18/1.01, women: 1.32/0.97), diabetes ((age<65 years) men: 1.77/1.44, women: 1.60/1.25; (age 65+years) men: 1.34/1.11, women: 1.30/1.02) and blood pressure-related risk factors (online supplemental eWorkbook I).

Figure 3

Area-specific baseline survival curves and 9-year incidence for ischaemic stroke and haemorrhagic stroke in China Kadoorie Biobank. Study area-specific baseline survival curves are averaged across sex. Urban study areas are shown in blue while rural areas are shown in red. Note the different scales of the y-axes in subplots (A, B). Dot sizes on maps correspond to observed 9-year incidence (1 − Kaplan-Meier estimate).

Sensitivity analyses

Restriction to a subset of participants aged ≥55 years yielded comparable results from recalibration, refitting, accounting for area and including additional risk factors (online supplemental eTable III). However, due to increased homogeneity among included individuals, the AUCs were lower after excluding the younger adults (age 30–54 years). The age restriction also yielded more extreme differences in the AUCs for the best-performing models for IS and HS compared with the AUCs for the best-performing models for total stroke. Including additional risk factors prior to area-stratification demonstrated that extra risk factors improved AUCs for total stroke, IS and HS, with area-stratification contributing to further improved discrimination for total stroke and IS only (online supplemental eTable IV). After adjusting for competing causes of death, the incidence rates for total stroke and stroke pathological types were similar to the Kaplan-Meier derived estimates (figure 3, online supplemental eFigure IV), and likewise, the predicted risks from the Fine-Gray and Cox models were also similar (online supplemental eFigure V).

Opportunities for validation in independent populations

A risk calculator is provided in online supplemetnal eWorkbook I to enable validation of the CKB risk scores for total stroke and stroke pathological types in independent local populations. Details of the methods on how to use the risk calculators are provided in online supplemental eMethods V.

Discussion

This study, involving a 100-fold larger population than the original Framingham Study, demonstrated that the 2017 FSRP was effective at distinguishing between individuals with and without stroke (good discrimination), but greatly underestimated the absolute risks of total stroke in Chinese adults (poor calibration) due to higher incidence rates of both IS and HS in China compared with Western populations. Absolute risk prediction of total stroke was substantially improved by recalibrating the baseline survival function, with modest additional benefit from refitting HRs (online supplemental eFigure VI). Adjusting for 10 areas in China yielded modest, but statistically significant, improvements in risk discrimination, but there were no further material improvements achieved by adding 38–60 additional risk indicators available in CKB. There was also good performance of separate models for IS and HS, and evidence that the relative importance of predictors differed between these pathological types.

A few population-based prospective studies had previously assessed the utility of FSRP in Chinese adults.12–14 Overall, they found modest risk discrimination of FSRP for total stroke, but poor prediction of absolute risks of stroke, consistent with the findings of this study. For example, application of FSRP in the China-PAR study, involving 106 281 adults recruited from 4 cohorts in China with a few thousand recorded stroke events, yielded AUCs of 0.65–0.73, but greatly underestimated absolute risks of total stroke.12 These, and other studies, have highlighted the need for recalibration of Framingham-based equations for prediction of cardiovascular disease in LMICs like China.33 34 While this study yielded similar findings, it provides several advantages including contemporary risks with much greater precision and reliability due to the very large numbers of well-characterised stroke cases (20-fold greater than the China-PAR study); evaluation of differences by 10 widely distributed geographical areas within China; and separate risk prediction of total stroke, IS and HS.

First, the novel models developed in this study successfully controlled for area-specific differences that were unexplained by analysis of the FSRP risk factors alone. While previous studies such as the China-PAR study have focused on developing a single risk prediction model for the whole country,12 the results of this study highlight the importance of tailoring risk predictions for specific areas of China, which have substantial differences in incidence of total stroke. The present report provides novel local models for risk prediction of total stroke in 10 diverse areas of China, which have greater predictive utility than a single nationwide model for clinicians in the individual regions.

Second, the separate analysis by study area and stroke pathological types affords insight into the substantial differences in incidence of IS and HS between different areas within China. Some of this geographical variation may be explained by differences in blood pressure.35 However, much of this variation remains unexplained and may possibly reflect differences in detection (eg, from greater use of brain imaging in certain areas). Inclusion of additional risk factors (eg, sociodemographic factors, alcohol) captured most of the geographical variation in HS risk, but only a fraction for IS risk. Consequently, this study suggests that in studies where explicitly controlling for geographical areas is not feasible, the inclusion of additional risk factors in addition to those included in FSRP could capture some of the regional differences.

Third, this study has significant implications for prevention of different stroke pathological types. Current guidelines in both high-income countries and LMICs advocate the use of blood pressure-lowering medication, lipid-lowering medication and antiplatelet treatment for cardiovascular disease prevention.36–38 However, individual subtypes of stroke are heterogeneous in their aetiology, and likewise, risk factors have heterogeneous effects on individual stroke types.15 18 23–25 This study adds to the available evidence by highlighting differences in the HRs for risk factors such as CHD, diabetes and blood pressure-related variables for IS and for HS, and suggests that evaluating an individual’s risk for separate stroke types (as opposed to total stroke only) may also be informative for primary prevention.

This study also had some limitations. First, AF was not recorded in CKB, so could not be included in the models. However, other population-based studies of comparable age groups in China indicated that the prevalence of AF was substantially lower in China than in Framingham (0.4%–1.7% vs 5.0%),39–41 and in 2012, the AF-related stroke prevalence in China was estimated to be 0.13 per 1000 people.40 Consequently, although AF is a strong predictor of stroke,39 it is likely to affect risk prediction for only a small number of individuals in CKB from 2004 to 2008. As prevalence of AF increases in China,40 41 it may be increasingly important to incorporate AF into future local stroke risk equations. Another limitation of CKB was that recorded stroke events were limited to hospitalised strokes (92% having brain imaging to support diagnosis) and death from stroke.18 In addition, while the risk equations presented in this study are useful for risk prediction, the HRs for individual risk factors cannot be interpreted causally.

Moreover, the Cox models presented do not account for competing risks of death due to other causes, which may affect risk estimates, particularly in older individuals. However, with low rates of censoring in CKB (5.4%, with 4.8% of censoring due to death), the effect of this limitation is small, as indicated by the comparable 9-year stroke incidence rates of stroke after adjusting for competing risk of death and the similar predicted risks between the Cox and Fine-Gray models.

Finally, the risk equations outlined in the present report were not designed for immediate implementation in clinical practice, which would require additional validation in independent populations in China and potentially other LMICs. To our knowledge, there are currently no contemporary regional cohorts of middle-aged and older adults with sufficiently large sample size in China to perform an external validation. As such datasets become available (eg, via the China Precision Medicine Initiative and establishment of regional electronic health records), future studies can use the calculator provided (online supplemental eWorkbook I) to validate these equations.

Conclusions

This study developed novel local risk equations for total stroke, IS and HS and demonstrated modest, but statistically significant, improvements over the widely used 2017 FSRP in Chinese adults. Improvements in stroke risk prediction can be attributed to recalibration of baseline survival, refitting HRs and accounting for geographical differences in stroke incidence in China. The addition of a large number of other risk factors yielded no further material improvements, but may be useful in other studies when area-specific differences are not readily estimated. These techniques can be implemented to improve risk prediction in any Chinese or similar populations with unique and geographically diverse risk profiles for stroke. Moreover, separate risk equations for IS and HS could help to identify individuals at high risk of a particular stroke pathological type and guide treatment decisions for primary prevention. These equations should be validated and refined in independent populations before implementing them for prediction of stroke risk in clinical practice.

Data availability statement

Ethics statements

Patient consent for publication

Ethics approval

This study involves human participants and was approved by Oxford Tropical Research Ethics Committee (OxTREC) (2005)Reference number (OxTREC): 025-04.

Acknowledgments

The chief acknowledgment is to the participants, the project staff, and the China National Centre for Disease Control and Prevention and its regional offices for access to death and disease registries. The Chinese National Health Insurance scheme provides electronic linkage to all hospital admission data.

References

↵
2. Feigin VL ,
3. Krishnamurthi RV ,
4. Parmar P , et al
. Update on the global burden of ischemic and hemorrhagic stroke in 1990-2013: the GBD 2013 study. Neuroepidemiology 2015;45:161–76.doi:10.1159/000441085 pmid:http://www.ncbi.nlm.nih.gov/pubmed/26505981
OpenUrl CrossRef PubMed
↵
1. Institute for Health Metrics and Evaluation
. GBD compare. Seattle, WA: IHME, University of Washington, 2015. https://vizhub.healthdata.org/gbd-compare
↵
2. Liu M ,
3. Wu B ,
4. Wang W-Z , et al
. Stroke in China: epidemiology, prevention, and management strategies. Lancet Neurol 2007;6:456–64.doi:10.1016/S1474-4422(07)70004-2 pmid:http://www.ncbi.nlm.nih.gov/pubmed/17434100
OpenUrl CrossRef PubMed Web of Science
↵
2. Richards A ,
3. Cheng EM
. Stroke risk calculators in the era of electronic health records linked to administrative databases. Stroke 2013;44:564–9.doi:10.1161/STROKEAHA.111.649798 pmid:http://www.ncbi.nlm.nih.gov/pubmed/23204057
OpenUrl FREE Full Text
↵
2. Meschia JF ,
3. Bushnell C ,
4. Boden-Albala B , et al
. Guidelines for the primary prevention of stroke: a statement for healthcare professionals from the American heart Association/American stroke association. Stroke 2014;45:3754–832.doi:10.1161/STR.0000000000000046 pmid:http://www.ncbi.nlm.nih.gov/pubmed/25355838
OpenUrl Abstract/FREE Full Text
↵
2. Wolf PA ,
3. D'Agostino RB ,
4. Belanger AJ , et al
. Probability of stroke: a risk profile from the Framingham study. Stroke 1991;22:312–8.doi:10.1161/01.STR.22.3.312 pmid:http://www.ncbi.nlm.nih.gov/pubmed/2003301
OpenUrl Abstract/FREE Full Text
↵
2. D'Agostino RB ,
3. Wolf PA ,
4. Belanger AJ , et al
. Stroke risk profile: adjustment for antihypertensive medication. The Framingham study. Stroke 1994;25:40–3.doi:10.1161/01.STR.25.1.40 pmid:http://www.ncbi.nlm.nih.gov/pubmed/8266381
OpenUrl Abstract/FREE Full Text
↵
2. Dufouil C ,
3. Beiser A ,
4. McLure LA , et al
. Revised Framingham stroke risk profile to reflect temporal trends. Circulation 2017;135:1145–59.doi:10.1161/CIRCULATIONAHA.115.021275 pmid:http://www.ncbi.nlm.nih.gov/pubmed/28159800
OpenUrl Abstract/FREE Full Text
↵
2. Flueckiger P ,
3. Longstreth W ,
4. Herrington D , et al
. Revised Framingham stroke risk score, nontraditional risk markers, and incident stroke in a multiethnic cohort. Stroke 2018;49:363–9.doi:10.1161/STROKEAHA.117.018928 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29311270
OpenUrl Abstract/FREE Full Text
↵
2. Tsai C-F ,
3. Thomas B ,
4. Sudlow CLM
. Epidemiology of stroke and its subtypes in Chinese vs white populations: a systematic review. Neurology 2013;81:264–72.doi:10.1212/WNL.0b013e31829bfde3 pmid:http://www.ncbi.nlm.nih.gov/pubmed/23858408
OpenUrl CrossRef PubMed
↵
2. Chen JS ,
3. Campbell TC ,
4. JY L , et al
. Lifestyle and mortality in China: a study of the characteristics of 65 Chinese counties. Oxford University Press, 1990.
↵
2. Xing X ,
3. Yang X ,
4. Liu F , et al
. Predicting 10-year and lifetime stroke risk in Chinese population. Stroke 2019;50:2371–8.doi:10.1161/STROKEAHA.119.025553 pmid:http://www.ncbi.nlm.nih.gov/pubmed/31390964
OpenUrl PubMed
↵
2. Chien K-L ,
3. Su T-C ,
4. Hsu H-C , et al
. Constructing the prediction model for the risk of stroke in a Chinese population: report from a cohort study in Taiwan. Stroke 2010;41:1858–64.doi:10.1161/STROKEAHA.110.586222 pmid:http://www.ncbi.nlm.nih.gov/pubmed/20671251
OpenUrl Abstract/FREE Full Text
↵
2. Leung JY ,
3. Lin SL ,
4. Lee RS , et al
. Framingham risk score for predicting cardiovascular disease in older adults in Hong Kong. Hong Kong Med J 2018;24(Suppl 4):S8–11.pmid:http://www.ncbi.nlm.nih.gov/pubmed/30135267
OpenUrl PubMed
↵
2. Sun L ,
3. Clarke R ,
4. Bennett D , et al
. Causal associations of blood lipids with risk of ischemic stroke and intracerebral hemorrhage in Chinese adults. Nat Med 2019;25:569–74.doi:10.1038/s41591-019-0366-x pmid:http://www.ncbi.nlm.nih.gov/pubmed/30858617
OpenUrl CrossRef PubMed
↵
2. Chen Z ,
3. Lee L ,
4. Chen J , et al
. Cohort profile: the Kadoorie study of chronic disease in China (KSCDC). Int J Epidemiol 2005;34:1243–9.doi:10.1093/ije/dyi174 pmid:http://www.ncbi.nlm.nih.gov/pubmed/16131516
OpenUrl CrossRef PubMed Web of Science
↵
2. Chen Z ,
3. Chen J ,
4. Collins R , et al
. China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol 2011;40:1652–66.doi:10.1093/ije/dyr120 pmid:http://www.ncbi.nlm.nih.gov/pubmed/22158673
OpenUrl CrossRef PubMed Web of Science
↵
2. Chen Y ,
3. Wright N ,
4. Guo Y , et al
. Mortality and recurrent vascular events after first incident stroke: a 9-year community-based study of 0·5 million Chinese adults. Lancet Glob Health 2020;8:e580–90.doi:10.1016/S2214-109X(20)30069-3 pmid:http://www.ncbi.nlm.nih.gov/pubmed/32199124
OpenUrl PubMed
↵
2. Breslow NE
. Discussion of the paper by DR. Cox. J R Statist Soc B 1972;34:215–6.
OpenUrl
↵
2. Lin DY
. On the Breslow estimator. Lifetime Data Anal 2007;13:471–80.doi:10.1007/s10985-007-9048-y pmid:http://www.ncbi.nlm.nih.gov/pubmed/17768681
OpenUrl PubMed
↵
2. Friedman J ,
3. Hastie T ,
4. Tibshirani R
. Glmnet: lasso and elastic-net regularized generalized linear models. R package version 3.0-2, 2019. Available: http://CRAN.R-project.org/package=glmnet [Accessed 8 Apr 2020].
↵
2. Hastie T ,
3. Qian J
. Glmnet vignette, 2014. Available: http://www.web.stanford.edu/~hastie/Papers/Glmnet_Vignette.pdf [Accessed 8 Apr 2020].
↵
2. Chen Z ,
3. Iona A ,
4. Parish S , et al
. Adiposity and risk of ischaemic and haemorrhagic stroke in 0·5 million Chinese men and women: a prospective cohort study. Lancet Glob Health 2018;6:e630–40.doi:10.1016/S2214-109X(18)30216-X pmid:http://www.ncbi.nlm.nih.gov/pubmed/29773119
OpenUrl PubMed
↵
2. Holmes MV ,
3. Millwood IY ,
4. Kartsonaki C , et al
. Lipids, Lipoproteins, and Metabolites and Risk of Myocardial Infarction and Stroke. J Am Coll Cardiol 2018;71:620–32.doi:10.1016/j.jacc.2017.12.006 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29420958
OpenUrl FREE Full Text
↵
2. Dong W ,
3. Pan X-F ,
4. Yu C , et al
. Self-rated health status and risk of incident stroke in 0.5 million Chinese adults: the China Kadoorie Biobank study. J Stroke 2018;20:247–57.doi:10.5853/jos.2017.01732 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29886721
OpenUrl PubMed
↵
2. DeLong ER ,
3. DeLong DM ,
4. Clarke-Pearson DL
. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837–45.doi:10.2307/2531595 pmid:http://www.ncbi.nlm.nih.gov/pubmed/3203132
OpenUrl CrossRef PubMed Web of Science
↵
2. D’Agostino RB ,
3. Byung-Ho N
. Evaluation of the performance of survival analysis models: discrimination and calibration measures. Handbook of Statistics 2004;23:1–25.
OpenUrl
↵
2. Demler OV ,
3. Paynter NP ,
4. Cook NR
. Tests of calibration and goodness-of-fit in the survival setting. Stat Med 2015;34:1659–80.doi:10.1002/sim.6428 pmid:http://www.ncbi.nlm.nih.gov/pubmed/25684707
OpenUrl CrossRef PubMed
↵
2. Fine JP ,
3. Gray RJ
. A proportional hazards model for the Subdistribution of a competing risk. J Am Stat Assoc 1999;94:496–509.doi:10.1080/01621459.1999.10474144
OpenUrl CrossRef Web of Science
↵
2. Gerds TA ,
3. Blanche P ,
4. Mortensen R
. riskRegression: risk regression models and prediction scores for survival analysis with competing risks. R package version 2020.12.08, 2020. Available: http://CRAN.R-project.org/package=riskRegression [Accessed 12 Apr 2021].
↵
2. Davidson-Pilon C ,
3. Kalderstam J ,
4. Jacobson N
. CamDavidsonPilon/lifelines: v0.21.1 (version v0.21.1). Zenodo, 2019. Available: http://doi.org/10.5281/zenodo.2652543
↵
2. Pedregosa F ,
3. Varoquaux G ,
4. Gramfort A
. Scikit-learn: machine learning in python. JMLR 2011;12:2825–30.
OpenUrl
↵
2. Damen JA ,
3. Pajouheshnia R ,
4. Heus P , et al
. Performance of the Framingham risk models and pooled cohort equations for predicting 10-year risk of cardiovascular disease: a systematic review and meta-analysis. BMC Med 2019;17:109.doi:10.1186/s12916-019-1340-7 pmid:http://www.ncbi.nlm.nih.gov/pubmed/31189462
OpenUrl CrossRef PubMed
↵
2. Pennells L ,
3. Kaptoge S ,
4. Wood A , et al
. Equalization of four cardiovascular risk algorithms after systematic recalibration: individual-participant meta-analysis of 86 prospective studies. Eur Heart J 2019;40:621–31.doi:10.1093/eurheartj/ehy653 pmid:http://www.ncbi.nlm.nih.gov/pubmed/30476079
OpenUrl PubMed
↵
2. Lewington S ,
3. Li L ,
4. Sherliker P , et al
. Seasonal variation in blood pressure and its relationship with outdoor temperature in 10 diverse regions of China: the China Kadoorie Biobank. J Hypertens 2012;30:1383–91.doi:10.1097/HJH.0b013e32835465b5 pmid:http://www.ncbi.nlm.nih.gov/pubmed/22688260
OpenUrl CrossRef PubMed Web of Science
↵
2. Arnett DK ,
3. Blumenthal RS ,
4. Albert MA , et al
. 2019 ACC/AHA guideline on the primary prevention of cardiovascular disease: a report of the American College of Cardiology/American heart association Task force on clinical practice guidelines. Circulation 2019;140:e596–646.doi:10.1161/CIR.0000000000000678 pmid:http://www.ncbi.nlm.nih.gov/pubmed/30879355
OpenUrl CrossRef PubMed
↵
1. Joint Committee for Guideline Revision
. 2018 Chinese guidelines for prevention and treatment of Hypertension-A report of the revision Committee of Chinese guidelines for prevention and treatment of hypertension. J Geriatr Cardiol 2019;16:182–241.doi:10.11909/j.issn.1671-5411.2019.03.014 pmid:http://www.ncbi.nlm.nih.gov/pubmed/31080465
OpenUrl CrossRef PubMed
↵
1. Joint Committee for Guideline Revision
. 2016 Chinese guidelines for the management of dyslipidemia in adults. J Geriatr Cardiol 2018;15:1–29.doi:10.11909/j.issn.1671-5411.2018.01.011 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29434622
OpenUrl PubMed
↵
2. Zhou Z ,
3. Hu D
. An epidemiological study on the prevalence of atrial fibrillation in the Chinese population of mainland China. J Epidemiol 2008;18:209–16.doi:10.2188/jea.JE2008021 pmid:http://www.ncbi.nlm.nih.gov/pubmed/18776706
OpenUrl CrossRef PubMed Web of Science
↵
2. Guo Y ,
3. Tian Y ,
4. Wang H , et al
. Prevalence, incidence, and lifetime risk of atrial fibrillation in China: new insights into the global burden of atrial fibrillation. Chest 2015;147:109–19.doi:10.1378/chest.14-0321 pmid:http://www.ncbi.nlm.nih.gov/pubmed/24921459
OpenUrl CrossRef PubMed
↵
2. Wang Z ,
3. Chen Z ,
4. Wang X , et al
. The disease burden of atrial fibrillation in China from a national cross-sectional survey. Am J Cardiol 2018;122:793–8.doi:10.1016/j.amjcard.2018.05.015 pmid:http://www.ncbi.nlm.nih.gov/pubmed/30049467
OpenUrl CrossRef PubMed

Supplementary material

Supplementary Data

This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Data supplement 1
Data supplement 2

Footnotes

RC, ZC and BJC are joint senior authors.
Collaborators China Kadoorie Biobank Collaborative Study Group.
Contributors Study concept and design: MC, TZ, DC, BJC and RC. Data collection: RC, DAB, YG, YC, PP, JL, CY, LY, LL and ZC. Data analysis and interpretation: MC, TZ, DC, BJC and RC. Drafting of the manuscript: MC, TZ, DC, BJC and RC. Critical revision of the manuscript: all authors. Final approval: all authors.
Funding The baseline survey was funded by the Kadoorie Charitable Foundation, Hong Kong, China and the funding sources for the long-term continuation of the study included UK Wellcome Trust (202922/Z/16/Z, 104085/Z/14/Z, 088158/Z/09/Z), Chinese National Natural Science Foundation (81390540, 81390541, 81390544) and the National Key Research and Development Program of China (2016YFC0900500, 2016YFC0900501, 2016YFC0900504, 2016YFC1303904). Core funding was also provided to the CTSU, University of Oxford, by the British Heart Foundation (CH/1996001/9454), the UK Medical Research Council, and Cancer Research UK. MC was supported by a Rhodes Scholarship. BJC was supported by a Nuffield Department of Population Health Senior Research Fellowship. The University of Oxford Medical Research Council (MRC) Population Health Research Unit is funded through a strategic partnership between the MRC and the University of Oxford (MC_UU_00017/1, MC_UU_12026/2, MC_U137686851). The research was also supported by the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC).
Map disclaimer The inclusion of any map (including the depiction of any boundaries therein), or of any geographical or locational reference, does not imply the expression of any opinion whatsoever on the part of BMJ concerning the legal status of any country, territory, jurisdiction or area or of its authorities. Any such expression remains solely that of the relevant source and is not endorsed by BMJ. Maps are provided without any warranty of any kind, either express or implied.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

[1] ↵

Feigin VL ,
Krishnamurthi RV ,
Parmar P , et al
. Update on the global burden of ischemic and hemorrhagic stroke in 1990-2013: the GBD 2013 study. Neuroepidemiology 2015;45:161–76.doi:10.1159/000441085 pmid:http://www.ncbi.nlm.nih.gov/pubmed/26505981
OpenUrl CrossRef PubMed

[3] Feigin VL ,

[4] Krishnamurthi RV ,

[5] Parmar P , et al

[6] ↵
Institute for Health Metrics and Evaluation
. GBD compare. Seattle, WA: IHME, University of Washington, 2015. https://vizhub.healthdata.org/gbd-compare

[7] Institute for Health Metrics and Evaluation

[8] ↵

Liu M ,
Wu B ,
Wang W-Z , et al
. Stroke in China: epidemiology, prevention, and management strategies. Lancet Neurol 2007;6:456–64.doi:10.1016/S1474-4422(07)70004-2 pmid:http://www.ncbi.nlm.nih.gov/pubmed/17434100
OpenUrl CrossRef PubMed Web of Science

[10] Liu M ,

[11] Wu B ,

[12] Wang W-Z , et al

[13] ↵

Richards A ,
Cheng EM
. Stroke risk calculators in the era of electronic health records linked to administrative databases. Stroke 2013;44:564–9.doi:10.1161/STROKEAHA.111.649798 pmid:http://www.ncbi.nlm.nih.gov/pubmed/23204057
OpenUrl FREE Full Text

[15] Richards A ,

[16] Cheng EM

[17] ↵

Meschia JF ,
Bushnell C ,
Boden-Albala B , et al
. Guidelines for the primary prevention of stroke: a statement for healthcare professionals from the American heart Association/American stroke association. Stroke 2014;45:3754–832.doi:10.1161/STR.0000000000000046 pmid:http://www.ncbi.nlm.nih.gov/pubmed/25355838
OpenUrl Abstract/FREE Full Text

[19] Meschia JF ,

[20] Bushnell C ,

[21] Boden-Albala B , et al

[22] ↵

Wolf PA ,
D'Agostino RB ,
Belanger AJ , et al
. Probability of stroke: a risk profile from the Framingham study. Stroke 1991;22:312–8.doi:10.1161/01.STR.22.3.312 pmid:http://www.ncbi.nlm.nih.gov/pubmed/2003301
OpenUrl Abstract/FREE Full Text

[24] Wolf PA ,

[25] D'Agostino RB ,

[26] Belanger AJ , et al

[27] ↵

D'Agostino RB ,
Wolf PA ,
Belanger AJ , et al
. Stroke risk profile: adjustment for antihypertensive medication. The Framingham study. Stroke 1994;25:40–3.doi:10.1161/01.STR.25.1.40 pmid:http://www.ncbi.nlm.nih.gov/pubmed/8266381
OpenUrl Abstract/FREE Full Text

[29] D'Agostino RB ,

[30] Wolf PA ,

[31] Belanger AJ , et al

[32] ↵

Dufouil C ,
Beiser A ,
McLure LA , et al
. Revised Framingham stroke risk profile to reflect temporal trends. Circulation 2017;135:1145–59.doi:10.1161/CIRCULATIONAHA.115.021275 pmid:http://www.ncbi.nlm.nih.gov/pubmed/28159800
OpenUrl Abstract/FREE Full Text

[34] Dufouil C ,

[35] Beiser A ,

[36] McLure LA , et al

[37] ↵

Flueckiger P ,
Longstreth W ,
Herrington D , et al
. Revised Framingham stroke risk score, nontraditional risk markers, and incident stroke in a multiethnic cohort. Stroke 2018;49:363–9.doi:10.1161/STROKEAHA.117.018928 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29311270
OpenUrl Abstract/FREE Full Text

[39] Flueckiger P ,

[40] Longstreth W ,

[41] Herrington D , et al

[42] ↵

Tsai C-F ,
Thomas B ,
Sudlow CLM
. Epidemiology of stroke and its subtypes in Chinese vs white populations: a systematic review. Neurology 2013;81:264–72.doi:10.1212/WNL.0b013e31829bfde3 pmid:http://www.ncbi.nlm.nih.gov/pubmed/23858408
OpenUrl CrossRef PubMed

[44] Tsai C-F ,

[45] Thomas B ,

[46] Sudlow CLM

[47] ↵

Chen JS ,
Campbell TC ,
JY L , et al
. Lifestyle and mortality in China: a study of the characteristics of 65 Chinese counties. Oxford University Press, 1990.

[49] Chen JS ,

[50] Campbell TC ,

[51] JY L , et al

[52] ↵

Xing X ,
Yang X ,
Liu F , et al
. Predicting 10-year and lifetime stroke risk in Chinese population. Stroke 2019;50:2371–8.doi:10.1161/STROKEAHA.119.025553 pmid:http://www.ncbi.nlm.nih.gov/pubmed/31390964
OpenUrl PubMed

[54] Xing X ,

[55] Yang X ,

[56] Liu F , et al

[57] ↵

Chien K-L ,
Su T-C ,
Hsu H-C , et al
. Constructing the prediction model for the risk of stroke in a Chinese population: report from a cohort study in Taiwan. Stroke 2010;41:1858–64.doi:10.1161/STROKEAHA.110.586222 pmid:http://www.ncbi.nlm.nih.gov/pubmed/20671251
OpenUrl Abstract/FREE Full Text

[59] Chien K-L ,

[60] Su T-C ,

[61] Hsu H-C , et al

[62] ↵

Leung JY ,
Lin SL ,
Lee RS , et al
. Framingham risk score for predicting cardiovascular disease in older adults in Hong Kong. Hong Kong Med J 2018;24(Suppl 4):S8–11.pmid:http://www.ncbi.nlm.nih.gov/pubmed/30135267
OpenUrl PubMed

[64] Leung JY ,

[65] Lin SL ,

[66] Lee RS , et al

[67] ↵

Sun L ,
Clarke R ,
Bennett D , et al
. Causal associations of blood lipids with risk of ischemic stroke and intracerebral hemorrhage in Chinese adults. Nat Med 2019;25:569–74.doi:10.1038/s41591-019-0366-x pmid:http://www.ncbi.nlm.nih.gov/pubmed/30858617
OpenUrl CrossRef PubMed

[69] Sun L ,

[70] Clarke R ,

[71] Bennett D , et al

[72] ↵

Chen Z ,
Lee L ,
Chen J , et al
. Cohort profile: the Kadoorie study of chronic disease in China (KSCDC). Int J Epidemiol 2005;34:1243–9.doi:10.1093/ije/dyi174 pmid:http://www.ncbi.nlm.nih.gov/pubmed/16131516
OpenUrl CrossRef PubMed Web of Science

[74] Chen Z ,

[75] Lee L ,

[76] Chen J , et al

[77] ↵

Chen Z ,
Chen J ,
Collins R , et al
. China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol 2011;40:1652–66.doi:10.1093/ije/dyr120 pmid:http://www.ncbi.nlm.nih.gov/pubmed/22158673
OpenUrl CrossRef PubMed Web of Science

[79] Chen Z ,

[80] Chen J ,

[81] Collins R , et al

[82] ↵

Chen Y ,
Wright N ,
Guo Y , et al
. Mortality and recurrent vascular events after first incident stroke: a 9-year community-based study of 0·5 million Chinese adults. Lancet Glob Health 2020;8:e580–90.doi:10.1016/S2214-109X(20)30069-3 pmid:http://www.ncbi.nlm.nih.gov/pubmed/32199124
OpenUrl PubMed

[84] Chen Y ,

[85] Wright N ,

[86] Guo Y , et al

[87] ↵

Breslow NE
. Discussion of the paper by DR. Cox. J R Statist Soc B 1972;34:215–6.
OpenUrl

[89] Breslow NE

[90] ↵

Lin DY
. On the Breslow estimator. Lifetime Data Anal 2007;13:471–80.doi:10.1007/s10985-007-9048-y pmid:http://www.ncbi.nlm.nih.gov/pubmed/17768681
OpenUrl PubMed

[92] Lin DY

[93] ↵

Friedman J ,
Hastie T ,
Tibshirani R
. Glmnet: lasso and elastic-net regularized generalized linear models. R package version 3.0-2, 2019. Available: http://CRAN.R-project.org/package=glmnet [Accessed 8 Apr 2020].

[95] Friedman J ,

[96] Hastie T ,

[97] Tibshirani R

[98] ↵

Hastie T ,
Qian J
. Glmnet vignette, 2014. Available: http://www.web.stanford.edu/~hastie/Papers/Glmnet_Vignette.pdf [Accessed 8 Apr 2020].

[100] Hastie T ,

[101] Qian J

[102] ↵

Chen Z ,
Iona A ,
Parish S , et al
. Adiposity and risk of ischaemic and haemorrhagic stroke in 0·5 million Chinese men and women: a prospective cohort study. Lancet Glob Health 2018;6:e630–40.doi:10.1016/S2214-109X(18)30216-X pmid:http://www.ncbi.nlm.nih.gov/pubmed/29773119
OpenUrl PubMed

[104] Chen Z ,

[105] Iona A ,

[106] Parish S , et al

[107] ↵

Holmes MV ,
Millwood IY ,
Kartsonaki C , et al
. Lipids, Lipoproteins, and Metabolites and Risk of Myocardial Infarction and Stroke. J Am Coll Cardiol 2018;71:620–32.doi:10.1016/j.jacc.2017.12.006 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29420958
OpenUrl FREE Full Text

[109] Holmes MV ,

[110] Millwood IY ,

[111] Kartsonaki C , et al

[112] ↵

Dong W ,
Pan X-F ,
Yu C , et al
. Self-rated health status and risk of incident stroke in 0.5 million Chinese adults: the China Kadoorie Biobank study. J Stroke 2018;20:247–57.doi:10.5853/jos.2017.01732 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29886721
OpenUrl PubMed

[114] Dong W ,

[115] Pan X-F ,

[116] Yu C , et al

[117] ↵

DeLong ER ,
DeLong DM ,
Clarke-Pearson DL
. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837–45.doi:10.2307/2531595 pmid:http://www.ncbi.nlm.nih.gov/pubmed/3203132
OpenUrl CrossRef PubMed Web of Science

[119] DeLong ER ,

[120] DeLong DM ,

[121] Clarke-Pearson DL

[122] ↵

D’Agostino RB ,
Byung-Ho N
. Evaluation of the performance of survival analysis models: discrimination and calibration measures. Handbook of Statistics 2004;23:1–25.
OpenUrl

[124] D’Agostino RB ,

[125] Byung-Ho N

[126] ↵

Demler OV ,
Paynter NP ,
Cook NR
. Tests of calibration and goodness-of-fit in the survival setting. Stat Med 2015;34:1659–80.doi:10.1002/sim.6428 pmid:http://www.ncbi.nlm.nih.gov/pubmed/25684707
OpenUrl CrossRef PubMed

[128] Demler OV ,

[129] Paynter NP ,

[130] Cook NR

[131] ↵

Fine JP ,
Gray RJ
. A proportional hazards model for the Subdistribution of a competing risk. J Am Stat Assoc 1999;94:496–509.doi:10.1080/01621459.1999.10474144
OpenUrl CrossRef Web of Science

[133] Fine JP ,

[134] Gray RJ

[135] ↵

Gerds TA ,
Blanche P ,
Mortensen R
. riskRegression: risk regression models and prediction scores for survival analysis with competing risks. R package version 2020.12.08, 2020. Available: http://CRAN.R-project.org/package=riskRegression [Accessed 12 Apr 2021].

[137] Gerds TA ,

[138] Blanche P ,

[139] Mortensen R

[140] ↵

Davidson-Pilon C ,
Kalderstam J ,
Jacobson N
. CamDavidsonPilon/lifelines: v0.21.1 (version v0.21.1). Zenodo, 2019. Available: http://doi.org/10.5281/zenodo.2652543

[142] Davidson-Pilon C ,

[143] Kalderstam J ,

[144] Jacobson N

[145] ↵

Pedregosa F ,
Varoquaux G ,
Gramfort A
. Scikit-learn: machine learning in python. JMLR 2011;12:2825–30.
OpenUrl

[147] Pedregosa F ,

[148] Varoquaux G ,

[149] Gramfort A

[150] ↵

Damen JA ,
Pajouheshnia R ,
Heus P , et al
. Performance of the Framingham risk models and pooled cohort equations for predicting 10-year risk of cardiovascular disease: a systematic review and meta-analysis. BMC Med 2019;17:109.doi:10.1186/s12916-019-1340-7 pmid:http://www.ncbi.nlm.nih.gov/pubmed/31189462
OpenUrl CrossRef PubMed

[152] Damen JA ,

[153] Pajouheshnia R ,

[154] Heus P , et al

[155] ↵

Pennells L ,
Kaptoge S ,
Wood A , et al
. Equalization of four cardiovascular risk algorithms after systematic recalibration: individual-participant meta-analysis of 86 prospective studies. Eur Heart J 2019;40:621–31.doi:10.1093/eurheartj/ehy653 pmid:http://www.ncbi.nlm.nih.gov/pubmed/30476079
OpenUrl PubMed

[157] Pennells L ,

[158] Kaptoge S ,

[159] Wood A , et al

[160] ↵

Lewington S ,
Li L ,
Sherliker P , et al
. Seasonal variation in blood pressure and its relationship with outdoor temperature in 10 diverse regions of China: the China Kadoorie Biobank. J Hypertens 2012;30:1383–91.doi:10.1097/HJH.0b013e32835465b5 pmid:http://www.ncbi.nlm.nih.gov/pubmed/22688260
OpenUrl CrossRef PubMed Web of Science

[162] Lewington S ,

[163] Li L ,

[164] Sherliker P , et al

[165] ↵

Arnett DK ,
Blumenthal RS ,
Albert MA , et al
. 2019 ACC/AHA guideline on the primary prevention of cardiovascular disease: a report of the American College of Cardiology/American heart association Task force on clinical practice guidelines. Circulation 2019;140:e596–646.doi:10.1161/CIR.0000000000000678 pmid:http://www.ncbi.nlm.nih.gov/pubmed/30879355
OpenUrl CrossRef PubMed

[167] Arnett DK ,

[168] Blumenthal RS ,

[169] Albert MA , et al

[170] ↵
Joint Committee for Guideline Revision
. 2018 Chinese guidelines for prevention and treatment of Hypertension-A report of the revision Committee of Chinese guidelines for prevention and treatment of hypertension. J Geriatr Cardiol 2019;16:182–241.doi:10.11909/j.issn.1671-5411.2019.03.014 pmid:http://www.ncbi.nlm.nih.gov/pubmed/31080465
OpenUrl CrossRef PubMed

[171] Joint Committee for Guideline Revision

[172] ↵
Joint Committee for Guideline Revision
. 2016 Chinese guidelines for the management of dyslipidemia in adults. J Geriatr Cardiol 2018;15:1–29.doi:10.11909/j.issn.1671-5411.2018.01.011 pmid:http://www.ncbi.nlm.nih.gov/pubmed/29434622
OpenUrl PubMed

[173] Joint Committee for Guideline Revision

[174] ↵

Zhou Z ,
Hu D
. An epidemiological study on the prevalence of atrial fibrillation in the Chinese population of mainland China. J Epidemiol 2008;18:209–16.doi:10.2188/jea.JE2008021 pmid:http://www.ncbi.nlm.nih.gov/pubmed/18776706
OpenUrl CrossRef PubMed Web of Science

[176] Zhou Z ,

[177] Hu D

[178] ↵

Guo Y ,
Tian Y ,
Wang H , et al
. Prevalence, incidence, and lifetime risk of atrial fibrillation in China: new insights into the global burden of atrial fibrillation. Chest 2015;147:109–19.doi:10.1378/chest.14-0321 pmid:http://www.ncbi.nlm.nih.gov/pubmed/24921459
OpenUrl CrossRef PubMed

[180] Guo Y ,

[181] Tian Y ,

[182] Wang H , et al

[183] ↵

Wang Z ,
Chen Z ,
Wang X , et al
. The disease burden of atrial fibrillation in China from a national cross-sectional survey. Am J Cardiol 2018;122:793–8.doi:10.1016/j.amjcard.2018.05.015 pmid:http://www.ncbi.nlm.nih.gov/pubmed/30049467
OpenUrl CrossRef PubMed

[185] Wang Z ,

[186] Chen Z ,

[187] Wang X , et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Data availability statement

Statistics from Altmetric.com

Request Permissions

Introduction

Methods

Study population

Follow-up for stroke outcomes

Supplemental material

Statistical analyses

Supplemental material

Results

Assessment and update of FSRP for prediction of total stroke

Prediction of total stroke after adjusting for areas in China

Expanded risk equations with additional risk factors

Risk equations for different stroke pathological types

Sensitivity analyses

Opportunities for validation in independent populations

Discussion

Conclusions

Data availability statement

Ethics statements

Patient consent for publication

Ethics approval

Acknowledgments

References

Supplementary material

Supplementary Data

Footnotes

Read the full text or download the PDF:

Log in using your username and password