Development and external validation of a machine-learning based model to predict pre-sarcopenia in MASLD population: Results from NHANES 2017–2018

Yang, Siwei; Yu, Jianan; Chen, Qiyang; Sun, Xuedong; Hu, Yuefeng; Su, Tianhao; Li, Jian; Jin, Long

doi:10.1016/j.aohep.2024.101585

Abstract

Introduction and Objectives

With the rising prevalence of pre-sarcopenia in metabolic dysfunction-associated steatotic liver disease (MASLD), this study aimed to develop and validate a machine learning-based model to identify pre-sarcopenia in the MASLD population.

Materials and Methods

A total of 571 MASLD subjects were screened from the National Health and Nutrition Examination Survey (NHANES) 2017–2018. This cohort was randomly divided into a training set and an internal validation set at a ratio of 7:3. Sixty-six MASLD subjects were collected from our institution as an external validation set. Four binary classifiers, namely Random Forest (RF), support vector machine, extreme gradient boosting and logistic regression, were fitted to identify pre-sarcopenia. The best-performing model was further validated in the external validation set. Model performance was assessed in terms of discrimination and calibration. Shapley Additive explanations were used for model interpretability.

Results

The pre-sarcopenia rate was 17.51% and 15.16% in the NHANES cohort and the external validation set, respectively. RF outperformed the other models, with an area under the receiver operating characteristic curve (AUROC) of 0.819 (95% CI: 0.749, 0.889). When the six top-ranking features by variable importance were retained (weight-adjusted waist, sex, race, creatinine, education and alkaline phosphatase), the final RF model reached AUROCs of 0.824 (95% CI: 0.737, 0.910) and 0.732 (95% CI: 0.529, 0.936) in the internal and external validation sets, respectively. Model robustness was supported by sensitivity analysis. The calibration curve and decision curve analysis indicated good calibration and clinical utility.

Conclusions

This study proposed a user-friendly model using an explainable machine learning algorithm to predict pre-sarcopenia in the MASLD population. A web-based tool was provided to screen for pre-sarcopenia in community and hospital settings.

Keywords: Metabolic dysfunction-associated steatotic liver disease; Sarcopenia; Machine learning; Nutrition surveys

Abbreviations: ALP, AP, ASM, ASMI, AUROC, BMI, CAP, CLD, CT, DCA, DM, DXA, F1, FIB-4, HBV, HCV, HDL, HT, LR, LUTE, MASLD, ML, NAFLD, NHANES, NPV, PPV, RF, ROC, SHAP, SLD, SMI, SVM, WWI

1 Introduction

Non-alcoholic fatty liver disease (NAFLD) is the most common chronic liver disease (CLD) worldwide and is associated with morbidity and mortality higher than previously estimated [1]. In 2023, metabolic dysfunction-associated steatotic liver disease (MASLD) was proposed to replace NAFLD as a subtype of steatotic liver disease (SLD) [2]. It has been reported that there is a high agreement between NAFLD and MASLD individuals, and findings from NAFLD studies could persist under the new MASLD definition [3,4].

Muscle loss and dysfunction share common mechanisms with the early stage of MASLD, such as physical inactivity, insulin resistance, dyslipidaemia and chronic systemic inflammation [5,6]. With the progression of disease, advanced fibrosis or cirrhosis becomes a major predisposing condition for the development of sarcopenia [7]. Sarcopenia, as a disease entity, represents a progressive and generalized skeletal muscle disorder associated with a higher risk of decompensation and mortality [8]. Indeed, skeletal muscle loss and muscle strength reduction often occur and develop asynchronously. Therefore, the European Working Group on Sarcopenia in Older People defined “pre-sarcopenia” as low skeletal muscle mass [9]. This term is also recommended as a phenotypic representation in the cirrhotic population by the American Association for the Study of Liver Diseases [7].

The onset and development of pre-sarcopenia is progressive and silent due to the co-existence of obesity, which is observed in most cases with MASLD [10]. Current evidence confirmed that sarcopenia is a driving force for MASLD progression [11-13]. Furthermore, the concurrence of sarcopenia and MASLD could lead to advanced fibrosis and higher mortality [14,15].

Computed tomography (CT) and dual-energy X-ray absorptiometry (DXA) are used to screen for pre-sarcopenia. Nevertheless, their widespread use remains a challenge because of the potential risks of overt screening, radiation exposure and high cost. Moreover, specialized software and laborious image processing are required. In addition, few existing tools deliver satisfactory performance in predicting sarcopenia, likely because sarcopenia is multifactorial [16]. A non-invasive model was previously proposed to predict sarcopenia in the diabetes mellitus (DM) population [17], but studies in the MASLD population remain sparse. Hence, a well-performing model that identifies early pre-sarcopenia in MASLD is desirable, especially as a rule-out strategy.

This exploratory study aimed to develop and validate a model to predict pre-sarcopenia in the MASLD population, using a selected but limited number of non-invasive variables and a machine learning (ML) approach, without resorting to medical imaging. ML enables an integrated analysis of multidimensional data and can detect nonlinear interactions among large datasets and diverse variables, which may make prediction more precise. Shapley Additive explanations (SHAP) were used to interpret the ML model. Through this model, subjects at high risk of pre-sarcopenia may benefit from a more intensive surveillance strategy and preventive intervention.

2 Materials and methods

2.1 Study participants

The NHANES database provides nationally representative data on the U.S. population using a complex, multistage, stratified sampling design. It is maintained by the Centers for Disease Control and Prevention and is publicly available (https://wwwn.cdc.gov/nchs/nhanes/Default.aspx). The survey was approved by the institutional review board of the National Center for Health Statistics (https://www.cdc.gov/nchs/nhanes/irba98.htm), and written informed consent was obtained from all participants. For the external validation cohort, written informed consent was waived due to the retrospective nature of the study, and the study protocol conforms to the ethical guidelines of the 1975 Declaration of Helsinki as reflected in a priori approval by the Ethics Committee.

Subjects diagnosed with MASLD who underwent DXA in National Health and Nutrition Examination Survey (NHANES) 2017–2018 were collected. This cohort was divided into training set and internal validation set with a ratio of 7:3.

MASLD was defined as the presence of hepatic steatosis with at least one cardiometabolic risk factor (CMRF) and low alcohol consumption. CMRFs are detailed in the supplementary materials. As proposed previously, a controlled attenuation parameter (CAP) ≥274 dB/m on liver ultrasound transient elastography (LUTE) was considered suggestive of MASLD status, with 90% sensitivity in detecting all degrees of liver steatosis [18]. After exclusion of HBV, HCV, autoimmune liver diseases, hepatocellular carcinoma and excessive alcohol consumption, individuals with CAP ≥274 dB/m, irrespective of liver fibrosis, were eligible for the study. DXA and LUTE measurements are detailed in the supplemental material.

Because adults who underwent DXA in NHANES were younger than 60 years, MASLD subjects aged 18–59 years, diagnosed by liver biopsy or non-invasive criteria, were retrospectively collected at our institution between 2020 and 2023 as the external validation set. The medical records of eligible MASLD subjects were reviewed, and CT scans available within 6 months of diagnosis were analyzed. Inclusion and exclusion criteria are detailed in the supplemental material.

2.2 Outcome

In the NHANES cohort, pre-sarcopenia was defined using the Foundation for the National Institutes of Health definition. The appendicular skeletal muscle mass (ASM, kg) of the arms and legs measured by DXA was summed as total ASM [19]. ASM normalized by BMI (ASM/BMI), with cut-off points of <0.512 for females and <0.789 for males, was used to identify pre-sarcopenia [20]. In the external validation set, CT scans available within 6 months of the diagnosis date were analyzed by two radiologists (S.W.Y. and Q.Y.C., with 5 years of radiological experience), who were blinded to all information, using commercially available software (slice-O-matic, version 4.2; Tomovision Inc., Montreal, Quebec, Canada). Any discrepancies were resolved through discussion. This software performs tissue demarcation using established Hounsfield unit thresholds [21]; a representative figure is shown in Figure S1. The cross-sectional area of skeletal muscle was measured on three consecutive plain CT slices at the level of the third lumbar vertebra. The averaged value (cm2) normalized by height squared (m2) was defined as the skeletal muscle index (SMI). The SMI value was converted to an ASM index (ASMI) as follows: ASMI_CT = 0.11 × SMI + 1.17 [22], where ASMI_CT corresponds to ASM/height2. Considering that the pre-sarcopenia cut-off points in NHANES are derived from North American cohorts, pre-sarcopenia in the external validation set was identified using Asia-specific cut-off values (ASM/height2): <7.0 kg/m2 in men and <5.4 kg/m2 in women [23]. For sensitivity analysis, the ethnicity-specific cut-off points of SMI, 42 cm2/m2 for men and 38 cm2/m2 for women, were used [24].
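As a minimal sketch of the outcome definitions in this section, the Python snippet below applies the conversion formula and the cut-off values quoted above. The function and constant names are illustrative assumptions and are not taken from the study code; the strict "below cut-off" comparison for the SMI sensitivity analysis is also an assumption, because the text reports only the cut-off values.

```python
# Cut-off values quoted in Section 2.2; the dictionary names are hypothetical.
ASM_BMI_CUTOFF = {"male": 0.789, "female": 0.512}  # NHANES cohort, ASM/BMI
ASMI_CUTOFF = {"male": 7.0, "female": 5.4}         # external set, kg/m^2
SMI_CUTOFF = {"male": 42.0, "female": 38.0}        # sensitivity analysis, cm^2/m^2


def asmi_from_smi(smi: float) -> float:
    """Convert CT-derived SMI (cm^2/m^2) to ASMI (kg/m^2): 0.11 * SMI + 1.17."""
    return 0.11 * smi + 1.17


def presarcopenia_dxa(asm_kg: float, bmi: float, sex: str) -> bool:
    """NHANES definition: DXA-derived ASM divided by BMI, below the sex-specific cut-off."""
    return asm_kg / bmi < ASM_BMI_CUTOFF[sex]


def presarcopenia_asmi(smi: float, sex: str) -> bool:
    """External-set definition: SMI converted to ASMI, below the Asia-specific cut-off."""
    return asmi_from_smi(smi) < ASMI_CUTOFF[sex]


def presarcopenia_smi(smi: float, sex: str) -> bool:
    """Sensitivity analysis: SMI compared directly (direction '<' is assumed)."""
    return smi < SMI_CUTOFF[sex]
```

For instance, `presarcopenia_dxa(20.0, 30.0, "male")` evaluates a hypothetical man with 20 kg of ASM and a BMI of 30 kg/m2 against the 0.789 cut-off.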

2.3 Data collection

According to prior studies exploring the association between muscle mass and MASLD [12,25], all available features were collected from the NHANES database 2017–2018.

In the training and internal validation sets, a total of 25 features, including demographic, anthropometric, clinical and laboratory data, physical examination findings and self-reported questionnaire items, were collected (detailed in the supplemental material). Any variable with less than 10% missing values was imputed by multiple imputation using predictive mean matching; otherwise, it was excluded from this study. The features in the external validation set were collected as required by the trained model.

2.4 Definitions

In NHANES, excessive alcohol consumption refers to an average alcohol intake of >3 drinks/day in males and >2 drinks/day in females [26].

Fibrosis grade was determined by liver stiffness measurement assessed by LUTE, with cut-off values of 8.2, 9.7 and 13.6 kPa for fibrosis grades ≥F2, ≥F3 and F4, respectively. Significant fibrosis was defined as ≥F2 [18]. DM was identified by any one of the following criteria: glycohemoglobin (HbA1C) ≥6.5%, fasting plasma glucose ≥126 mg/dL (7.0 mmol/L), having been told by a doctor to have diabetes, or taking insulin as indicated in the questionnaire. If only HbA1C ≥5.7% was met, prediabetes was diagnosed. Hypertension (HT) was defined as systolic blood pressure ≥130 mmHg or diastolic blood pressure ≥80 mmHg. Smoking status was classified as current smoker, past smoker or non-smoker. Waist circumference (WC) was measured by placing a measuring tape in a horizontal plane around the abdomen at the level of the iliac crest. Weight-adjusted waist (WWI) was calculated as WC (cm) divided by the square root of body weight (kg). Education level was categorized as high or low (high school or below).

According to the recommended ethnicity-specific cut-off values, BMI was classified as normal weight (BMI, 18.5–22.9 kg/m2 for Asian Americans; BMI, 18.5–24.9 kg/m2 for non-Asian Americans) or overweight/obesity (BMI, ≥23 kg/m2 for Asian Americans; BMI, ≥25 kg/m2 for non-Asian Americans) [27]. Lean MASLD was defined as the MASLD subgroup without overweight or obesity [28]. Additional definitions are provided in the supplemental material.
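The definitions in this section translate directly into simple rules. The following Python sketch illustrates them under stated assumptions: the function names, argument names and returned labels are hypothetical, and the code is not the implementation used in the study.

```python
import math


def wwi(waist_cm: float, weight_kg: float) -> float:
    """Weight-adjusted waist: WC (cm) divided by the square root of body weight (kg)."""
    return waist_cm / math.sqrt(weight_kg)


def fibrosis_grade(lsm_kpa: float) -> str:
    """Grade fibrosis from liver stiffness using the 8.2 / 9.7 / 13.6 kPa cut-offs."""
    if lsm_kpa >= 13.6:
        return "F4"
    if lsm_kpa >= 9.7:
        return "F3"
    if lsm_kpa >= 8.2:
        return "F2"  # significant fibrosis starts here (>=F2)
    return "F0-F1"


def glycaemic_status(hba1c=None, fpg_mg_dl=None,
                     told_by_doctor=False, on_insulin=False) -> str:
    """DM if any criterion is met; pre-DM if only HbA1C >= 5.7% is met."""
    if ((hba1c is not None and hba1c >= 6.5)
            or (fpg_mg_dl is not None and fpg_mg_dl >= 126)
            or told_by_doctor or on_insulin):
        return "DM"
    if hba1c is not None and hba1c >= 5.7:
        return "pre-DM"
    return "non-DM"


def overweight_or_obese(bmi: float, asian: bool) -> bool:
    """Ethnicity-specific threshold: >=23 kg/m^2 (Asian) or >=25 kg/m^2 (non-Asian)."""
    return bmi >= (23.0 if asian else 25.0)
```

With these rules, a subject with a waist circumference of 100 cm and a body weight of 81 kg has a WWI of 100/9, and lean MASLD corresponds to `overweight_or_obese(...)` returning `False`.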

2.5 Feature selection and model construction

Feature selection and model building were performed in the training set and internal validation set. Five-fold cross-validation (CV) was used for hyperparameter tuning. The external validation set was used to test model reliability.

Features with a P-value <0.1 between the outcome-positive and outcome-negative groups were preferred for model construction. For two features with an absolute intervariable correlation coefficient (ICC) >0.5, the one with the lower P-value was retained. Redundant features, i.e., those that were either statistically nonsignificant or highly correlated (absolute ICC >0.5), were excluded.
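One possible reading of this filtering rule is a greedy pass over features in order of increasing P-value, sketched below in Python. The greedy ordering, the function name `select_features` and the input layout (a dictionary of P-values and a dictionary of pairwise correlations keyed by `frozenset`) are assumptions made for illustration; the study's exact procedure is described in its supplemental material.

```python
def select_features(p_values, correlations, p_threshold=0.1, corr_threshold=0.5):
    """Keep features with P < p_threshold; within any pair whose absolute
    correlation exceeds corr_threshold, keep the one with the lower P-value.

    p_values:     {feature_name: P-value}
    correlations: {frozenset({feature_a, feature_b}): correlation coefficient}
                  (pairs that are absent are treated as uncorrelated)
    """
    # Step 1: univariable screening, sorted so that lower P-values come first.
    candidates = sorted(
        (name for name, p in p_values.items() if p < p_threshold),
        key=lambda name: p_values[name],
    )

    # Step 2: accept a candidate only if it is not highly correlated with
    # any feature that has already been accepted (which has a lower P-value).
    selected = []
    for name in candidates:
        highly_correlated = any(
            abs(correlations.get(frozenset((name, kept)), 0.0)) > corr_threshold
            for kept in selected
        )
        if not highly_correlated:
            selected.append(name)
    return selected


# Toy, hypothetical inputs (not study data).
toy_p = {"a": 0.01, "b": 0.03, "c": 0.20, "d": 0.08}
toy_corr = {
    frozenset(("a", "b")): 0.7,
    frozenset(("a", "d")): 0.1,
    frozenset(("b", "d")): -0.2,
}
toy_selected = select_features(toy_p, toy_corr)
```

In the toy input, `c` fails the P-value screen and `b` is dropped because it is highly correlated with `a`, which has the lower P-value.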

Four binary classifiers, namely extreme gradient boosting (XGBoost), Random Forest (RF), support vector machine (SVM) and logistic regression (LR), were fitted to the data. The hyperparameters with the best average area under the receiver operating characteristic curve (AUROC) over five-fold CV were determined in the training set. Afterwards, the optimal model was determined by comparing these models in the internal validation set. For feature reduction, all included features were ranked by variable importance and retained in order of rank. The optimal cut-off was estimated using the Youden index. To avoid overfitting and improve practicability, a final model with the fewest features and a stable AUROC was determined and further tested in the external validation set. Model performance was evaluated with AUROC, accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and F1 score. The calibration curve and Brier score were used to assess model calibration. Decision curve analysis (DCA) was used to evaluate the clinical net benefit of the model at different threshold probabilities. The hyperparameter tuning process and feature reduction are described in the supplemental material.
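To make three of the evaluation quantities named above concrete, the following Python sketch computes the Youden-optimal cut-off, the Brier score and the decision-curve net benefit from predicted probabilities. It uses the standard textbook definitions (Youden J = sensitivity + specificity − 1; net benefit = TP/n − FP/n × pt/(1 − pt)), assumes both outcome classes are present, and runs on toy, hypothetical data; it is not the study's R code.

```python
def confusion(y_true, y_prob, threshold):
    """Return (tp, fp, tn, fn), predicting positive when probability >= threshold."""
    tp = sum(1 for y, p in zip(y_true, y_prob) if y == 1 and p >= threshold)
    fp = sum(1 for y, p in zip(y_true, y_prob) if y == 0 and p >= threshold)
    tn = sum(1 for y, p in zip(y_true, y_prob) if y == 0 and p < threshold)
    fn = sum(1 for y, p in zip(y_true, y_prob) if y == 1 and p < threshold)
    return tp, fp, tn, fn


def youden_threshold(y_true, y_prob):
    """Scan observed probabilities and return (threshold, J) maximizing Youden's J."""
    best_t, best_j = None, -1.0
    for t in sorted(set(y_prob)):
        tp, fp, tn, fn = confusion(y_true, y_prob, t)
        j = tp / (tp + fn) + tn / (tn + fp) - 1.0
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def net_benefit(y_true, y_prob, pt):
    """Decision-curve net benefit at threshold probability pt (0 < pt < 1)."""
    tp, fp, _, _ = confusion(y_true, y_prob, pt)
    n = len(y_true)
    return tp / n - fp / n * pt / (1.0 - pt)


# Toy, hypothetical predictions (not study data).
toy_y = [0, 0, 0, 0, 1, 0, 1, 1]
toy_p = [0.05, 0.10, 0.20, 0.30, 0.35, 0.40, 0.70, 0.90]
toy_cut, toy_j = youden_threshold(toy_y, toy_p)
```

A decision curve is obtained by evaluating `net_benefit` over a grid of threshold probabilities and comparing it with the treat-all and treat-none strategies.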

Moreover, to improve interpretability of black-box ML model, a quantitative model interpretation method, SHAP, was deployed to present the contribution of each feature on decision-making.

2.6 Statistical analyses

Continuous variables, expressed as mean ± standard deviation or median (interquartile range), were compared by Student's t-test or the Mann–Whitney U test according to the results of the Shapiro–Wilk normality test; categorical variables, expressed as number (percentage), were compared by the χ2 test or Fisher's exact test. The DeLong test was used to compare the AUROCs of different models. Goodness-of-fit of the model was evaluated using the Hosmer-Lemeshow test. Multiple imputation of missing data was performed using the “MICE” package. ML analyses were performed with the “tidyverse” package in R (version 4.3.2). A P-value <0.05 was considered indicative of a significant difference.

2.7 Ethical statement

Our institution's ethical review board approved the present study. The written informed consent for treatment was waived due to the retrospective nature of the study.

3 Results

3.1 Participant characteristics

A total of 571 subjects from NHANES were included in this study. Pre-sarcopenia was found in 17.51% (100/571) of subjects. The subjects were randomly split into a training set (n = 399) and an internal validation set (n = 172) (supplemental Table 1). The external validation set included 66 MASLD subjects (supplemental Table 2), and the prevalence of pre-sarcopenia was 15.16% (10/66). The overview of the study design and the patient selection flowchart are shown in Fig. 1.

Fig. 1.

The overview of study design and patient selection flowchart.

The characteristics of NHANES subjects are described in Table 1. Subjects with DM or pre-DM, overweight/obesity and no smoking history were predominant. DM/pre-DM, significant fibrosis and overweight/obesity were more frequent in the pre-sarcopenia group than in the non-pre-sarcopenia group. Subjects in the pre-sarcopenia group were significantly older, more often had a lower education level, and had higher WWI and alkaline phosphatase (ALP), along with lower high-density lipoprotein (HDL) and creatinine, than those without pre-sarcopenia. In the external validation set, subjects with pre-sarcopenia had higher WWI and lower albumin and creatinine.

Table 1.

Characteristics of included participants in NHANES 2017–2018.

Characteristic	Overall (N = 571)	Non-pre-sarcopenia (N = 471)	Pre-sarcopenia (N = 100)	P-value
Age	45.00 (34.00–53.50)	44.00 (34.00–52.00)	51.00 (38.00–55.00)	<0.001
Sex				0.576
Female	280 (49.04%)	234 (49.68%)	46 (46.00%)
Male	291 (50.96%)	237 (50.32%)	54 (54.00%)
Race				<0.001
Mexican American	101 (17.69%)	68 (14.44%)	33 (33.00%)
Non-Hispanic Asian	127 (22.24%)	110 (23.35%)	17 (17.00%)
Non-Hispanic Black	104 (18.21%)	97 (20.59%)	7 (7.00%)
Non-Hispanic White	156 (27.32%)	133 (28.24%)	23 (23.00%)
Other Hispanic	53 (9.28%)	38 (8.07%)	15 (15.00%)
Other/multiracial	30 (5.25%)	25 (5.31%)	5 (5.00%)
HT†				0.466
No	283 (49.56%)	229 (50.78%)	54 (56.84%)
Yes	263 (46.06%)	222 (49.22%)	41 (43.16%)
Diabetes mellitus				0.026
Non-DM	261 (45.71%)	227 (48.20%)	34 (34.00%)
Pre-DM	181 (31.70%)	145 (30.79%)	36 (36.00%)
DM	129 (22.59%)	99 (21.02%)	30 (30.00%)
BMI status				0.05
Normal weight	35 (6.13%)	32 (6.79%)	3 (3.00%)
Overweight/Obesity	536 (93.87%)	439 (93.21%)	97 (97.00%)
Significant fibrosis				0.005
No	497 (87.04%)	419 (88.96%)	78 (78.00%)
Yes	74 (12.96%)	52 (11.04%)	22 (22.00%)
WWI	11.24±0.71	11.12±0.69	11.81±0.54	<0.001
ALT	24.00 (16.00–33.00)	24.00 (16.00–33.00)	23.50 (16.00–33.25)	0.734
ALB	41.00 (38.00–43.00)	41.00 (39.00–43.00)	41.00 (38.00–42.00)	0.076
ALP	78.00 (66.00–93.00)	77.00 (64.00–92.00)	86.50 (71.75–101.75)	<0.001
AST	20.00 (16.00–26.00)	20.00 (16.00–26.00)	20.00 (16.00–25.00)	0.574
BUN	4.64 (3.93–5.71)	4.64 (3.93–5.71)	4.64 (3.93–5.71)	0.817
Creatinine	72.49 (59.23–84.86)	74.26 (60.11–86.63)	65.41 (51.94–78.68)	<0.001
GGT	26.00 (18.00–42.00)	26.00 (18.00–40.00)	28.00 (19.75–51.00)	0.061
TBIL	6.84 (5.13–8.55)	6.84 (5.13–8.55)	6.84 (5.13–8.55)	0.52
HDL	1.16 (1.01–1.35)	1.16 (1.01–1.37)	1.09 (0.98–1.29)	0.047
Total Cholesterol	5.01±0.95	5.00±0.94	5.06±0.99	0.579
vitamin D	53.40 (35.20–69.85)	52.60 (35.15–70.85)	55.50 (38.70–66.20)	0.893
HSCRP	2.86 (1.27–5.75)	2.75 (1.23–5.71)	3.25 (1.42–6.68)	0.097
WBC	7.40 (6.20–8.90)	7.40 (6.20–8.80)	7.25 (6.18–9.25)	0.913
PLT	248.00 (213.50–285.50)	248.00 (214.00–286.50)	239.50 (208.00–278.75)	0.395
HGB	14.20 (13.20–15.30)	14.20 (13.10–15.30)	14.40 (13.38–15.60)	0.396
Education level				<0.001
High	360 (63.05%)	320 (67.94%)	40 (40.00%)
Low	211 (36.95%)	151 (32.06%)	60 (60.00%)
Smoking Group				0.803
Current smoker	70 (12.26%)	56 (11.89%)	14 (14.00%)
Former smoker	93 (16.29%)	76 (16.14%)	17 (17.00%)
Never smoker	408 (71.45%)	339 (71.97%)	69 (69.00%)

NOTE: Results for continuous data are expressed as means ± standard deviations or medians (interquartile ranges) and for categorical data as N (%).

† indicates presence of missing data.

Abbreviations: HT, hypertension (mmHg); DM, diabetes mellitus; BMI, body mass index (kg/m2); ALT, alanine aminotransferase (U/L); ALB, albumin (g/L); ALP, alkaline phosphatase (IU/L); AST, aspartate aminotransferase (U/L); BUN, blood urea nitrogen (mmol/L); Cr, creatinine (μmol/L); GGT, gamma-glutamyl transferase (IU/L); TBIL, total bilirubin (μmol/L); HDL, high-density lipoprotein (mmol/L); HSCRP, high-sensitivity C-reactive protein (mg/L); WBC, white blood cell count (1000 cells/μL); PLT, platelet count (1000 cells/μL); HGB, hemoglobin (g/dL).

3.2 Feature selection

After preprocessing, a total of 23 features were included in the initial model fitted to the training data. Statistically significant differences between groups were noted in 13 features: age, race, DM, BMI status, significant fibrosis, WWI, albumin, ALP, creatinine, gamma-glutamyl transferase, HDL, high-sensitivity C-reactive protein and education level (detailed in the supplemental material).

3.3 Model building and comparison

Four binary classifiers (XGBoost, SVM, RF and LR) were used for model building. RF outperformed the other models in the internal validation set, with an AUROC of 0.819 (95% CI: 0.749, 0.889). The metrics of the four models are shown in Fig. 2 and Table 2. The best hyperparameter combinations are provided in Supplemental Table 3.

Fig. 2.

The metrics of four models in model construction in training set.

Table 2.

All metrics of four models in internal validation set and of final RF model in internal and external validation sets.

Metrics	RF	XGBoost	LR	SVM	Final model: internal validation set	Final model: external validation set (DXA definition)	Final model: external validation set (CT definition)
ROC_AUC	0.819	0.785	0.777	0.817	0.824	0.732	0.745
PR_AUC	0.463	0.468	0.481	0.509	0.610	0.440	0.446
Accuracy	0.721	0.669	0.715	0.738	0.785	0.864	0.833
Sensitivity	0.833	0.667	0.6	0.733	0.633	0.600	0.500
Specificity	0.697	0.669	0.739	0.739	0.817	0.911	0.907
PPV	0.368	0.299	0.327	0.373	0.422	0.545	0.545
NPV	0.952	0.905	0.897	0.929	0.913	0.927	0.891
Precision	0.368	0.299	0.327	0.373	0.422	0.545	0.545
Recall	0.833	0.667	0.6	0.733	0.633	0.600	0.500
F1 score	0.510	0.412	0.424	0.494	0.507	0.571	0.522

Abbreviation: ROC_AUC, Area under receiver operating characteristic curve; PR_AUC, Area under precision and recall curve; PPV, positive predictive value; NPV, negative predictive value; DXA, Dual-energy X-ray Absorptiometry; CT, Computed Tomography; RF, Random Forest; XGBoost, extreme gradient boosting; LR, logistic regression; SVM, support vector machine.
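The threshold-based metrics in Table 2 all follow from a single confusion matrix. As a worked illustration, the Python sketch below computes them from hypothetical counts (25 true positives, 43 false positives, 99 true negatives, 5 false negatives). These counts are not reported in the article; they are a back-calculation, assumed here only because they sum to the 172 subjects of the internal validation set and reproduce the RF column to three decimals. The function name `classification_metrics` is likewise illustrative.

```python
def classification_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Compute the threshold-based metrics of Table 2 from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # identical to recall
    specificity = tn / (tn + fp)
    ppv = tp / (tp + fp)           # identical to precision
    npv = tn / (tn + fn)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * ppv * sensitivity / (ppv + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "f1": f1,
    }


# Assumed (back-calculated) counts for the RF model in the internal validation set.
rf_metrics = classification_metrics(tp=25, fp=43, tn=99, fn=5)
rf_rounded = {name: round(value, 3) for name, value in rf_metrics.items()}
```

The sketch also makes explicit why the Precision and Recall rows of Table 2 repeat the PPV and Sensitivity rows: they are the same quantities under different names.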

3.4 RF model optimization and evaluation

The model performance during feature reduction in the internal validation set is plotted in Fig. 3. When the six top-ranking features were retained, several metrics of the RF model stabilized. A final model incorporating six features (WWI, sex, race, creatinine, education level and ALP) was therefore determined. At the optimal cut-off of 0.251, it achieved AUROCs of 0.917 (95% CI: 0.885, 0.949) and 0.824 (95% CI: 0.737, 0.910) in the training and internal validation sets, respectively (Fig. 4). In the internal validation set, the calibration curve showed a moderate goodness-of-fit, and DCA showed a net benefit at threshold probabilities between approximately 10% and 75% (Fig. 5).

Fig. 3.

The model performance during feature reduction in internal validation set.

Fig. 4.

The comparison of ROC between training set and internal validation set.

Fig. 5.

The calibration and DCA curves in the internal validation set. The calibration curve showed a moderate goodness-of-fit, and DCA showed a net benefit at threshold probabilities between approximately 10% and 75%.

The final model had an AUROC of 0.732 (95% CI: 0.529, 0.936) in the external validation set; other metrics are shown in Table 2. Calibration and DCA curves are plotted in Figure S2.

3.5 Feature contribution assessed by SHAP

The contribution of each feature in the final RF model was visualized using SHAP (Fig. 6). The wider the distribution of SHAP values for a feature, the greater its contribution. A redder dot indicates a higher risk for that case. In the final model, WWI ranked first, followed by sex, race, creatinine, education level and ALP. Higher WWI and ALP, lower education level and creatinine, and male sex were predictive of pre-sarcopenia. As for race, Mexican American and other Hispanic subjects were at higher risk of pre-sarcopenia.

Fig. 6.

SHAP visualization for the model used to identify pre-sarcopenia in the MASLD population. The width of the distribution of SHAP values of a feature along the horizontal axis indicates the level of influence this feature has on model decision-making. A redder dot indicates a higher pre-sarcopenia risk for that case.

3.6 Sensitivity analyses

In the external validation set, when the CT-based SMI definition was used, pre-sarcopenia was observed in 12 (18.18%) subjects. The final model achieved an AUROC of 0.745 (95% CI: 0.575, 0.914). The comparison of ROC curves using the DXA- and CT-based definitions is plotted in Fig. 7 (P > 0.05). Other metrics are shown in Table 2. The calibration and DCA curves are shown in Figure S3. Among the 35 lean MASLD subjects, the final model exhibited an AUROC of 0.912 (0.757–1.000) and a good performance on the DCA curve (Figure S4).

Fig. 7.

The comparison of ROC between DXA and CT definition in external validation set.

3.7 A web-based prediction tool

The final model is available at: https://riskofpresarcopeniainmasld.shinyapps.io/shiny_RiskofPresarcopeiaInNAFLD/.

4 Discussion

In this study, an ML-based model was proposed for identification of pre-sarcopenia in the MASLD population. The model, which uses features commonly available in clinical and community settings, demonstrated good prediction performance, indicating a promising prospect for broad application. Of note, the high NPV and specificity values suggested that this model could serve as a reliable rule-out strategy in screening settings. A public web-based tool was provided.

Due to improved living standards and the popularity of high-glucose and high-salt diets, MASLD and subsequent fibrosis are projected to become more prevalent. Because it is masked by the high BMI and young age of the MASLD population, pre-sarcopenia is not easy to identify, yet it potentially advances hepatic steatosis and fibrosis and increases mortality [29]. To improve the practicability of the model, all included features should be readily available, stable in the short term and easy to obtain. The proposed web-based calculator offers a convenient evaluation of the development and recovery of sarcopenia. Consequently, individuals with suspected pre-sarcopenia could be screened and monitored in real-life practice, with nutrition consultation and clinical intervention provided as necessary.

The prevalence of pre-sarcopenia in MASLD determined by DXA in this study was consistent with previous data, which ranged from 8.7% to 35% [30-32]. The differences among these studies could be explained by different measurement modalities and cut-off points.

SHAP analysis revealed that Mexican American/other Hispanic male subjects with higher WWI and ALP, along with lower creatinine and education level were at higher risk of pre-sarcopenia.

WWI was initially proposed to predict cardiometabolic morbidity and mortality in a large-scale study [33]. The present study showed that WWI was the most important factor contributing to a higher risk of pre-sarcopenia. Several community- or population-based studies showed that WWI was inversely associated with abdominal muscle mass and positively associated with abdominal fat mass [34,35]. A higher WC is commonly found in subjects with abdominal obesity, which is characterized by abundant visceral adipose tissue and a reduced ratio of muscle mass to weight. Normalizing WC by weight allows its values to be compared across individuals.

Loss of skeletal muscle mass has primarily been regarded as an age-related change. In this study, age was not included in the final model, likely because all eligible subjects were less than 60 years old and age was treated as a continuous variable in model construction. Additionally, education level is related to income level. The impact of socioeconomic status on pre-sarcopenia is complex and may act through health awareness, diet and exercise habits, as well as quality of medical care.

This study identified male sex as predictive of pre-sarcopenia in the MASLD population, which is consistent with some prior findings [17,32,36]. However, this conclusion remains open to debate, as mixed results have been reported in cohorts from different regions [12]. Notably, sex is the only feature in the final model without a statistical difference between the pre-sarcopenia and non-pre-sarcopenia groups. The effect of sarcopenia on the risk of metabolic syndrome varies significantly after stratification by sex and WC [37]. We speculate that the role of sex may be altered by potential interactions with other included features.

It is understandable that lower creatinine, influenced by muscle metabolic status, was related to a higher risk of pre-sarcopenia [38,39]. Creatinine is a convenient and available indicator to reflect muscle mass depletion to some extent in population with normal kidney function.

The model performance in the external validation set was lower than that in the development sets. Several points could explain this. Baseline differences in some features between the NHANES data and the external validation set, especially race, may be responsible for the lower performance. However, race was retained in order to improve model versatility and generalizability across diverse populations worldwide. Of note, the proposed model exhibited good prediction capacity and net clinical benefit in the lean MASLD population; however, considering the small sample size and limited outcome events, these results need further validation in future studies. Caution is therefore warranted until prospective studies supporting these findings are available.

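Because subjects without pre-sarcopenia predominate in most MASLD cohorts, accuracy is not a suitable performance metric and can overestimate model performance. For example, labelling every NHANES subject in this study as non-pre-sarcopenic would give an accuracy of 82.49% (471/571) without detecting a single case. Hence, ROC, F1 score and Brier score were selected as the main metrics for model evaluation.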

While a promising model was proposed, it is important to acknowledge its limitations. Firstly, since participants in the external validation set were screened from hospitalized patients, muscle mass was measured on CT. Although a validated conversion formula and sensitivity analysis were used, outcome bias from the different imaging modalities and from baseline differences between the development and external validation sets is inevitable. Nevertheless, the generalization ability of the model was further supported. Secondly, the model was trained and validated in a population aged less than 60 years; therefore, the efficacy of the model in the elderly requires further validation. Thirdly, as many available features as possible were used to build a comprehensive model, at the cost of a reduced sample size; yet the homeostatic model assessment of insulin resistance was not included in model construction because of its high proportion of missing values (≥50%). Lastly, muscle quality is crucial in evaluating sarcopenia. Due to limited available data and the retrospective nature of the study, the term “pre-sarcopenia” was used in this study. More research is needed to investigate both muscle quantity and quality in the MASLD population.

5 Conclusions

In conclusion, this study developed and validated a non-invasive ML-based model for identification of pre-sarcopenia in the MASLD population, derived from the NHANES database and a real-world cohort, and a web-based tool is freely available.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data availability statement

Data that support the findings of this study are available from the authors upon request.

Declaration of generative AI and AI-assisted technologies in the writing process

No AI or AI-assisted technologies were used during the preparation of this work.

Declaration of interests

None.

Appendix

Supplementary materials

References

[1] K. Riazi, H. Azhari, J.H. Charette, F.E. Underwood, J.A. King, E.E. Afshar, et al. The prevalence and incidence of NAFLD worldwide: a systematic review and meta-analysis. Lancet Gastroenterol Hepatol, 7 (2022), pp. 851-861. http://dx.doi.org/10.1016/s2468-1253(22)00165-0

[2] M.E. Rinella, J.V. Lazarus, V. Ratziu, S.M. Francque, A.J. Sanyal, F. Kanwal, et al. A multisociety Delphi consensus statement on new fatty liver disease nomenclature. Hepatology, 78 (2023), pp. 1966-1986. http://dx.doi.org/10.1097/hep.0000000000000520

[3] S.J. Song, J.C. Lai, G.L. Wong, V.W. Wong, T.C. Yip. Can we use old NAFLD data under the new MASLD definition? J Hepatol, 80 (2024), pp. e54-e56. http://dx.doi.org/10.1016/j.jhep.2023.07.021

[4] H. Hagström, J. Vessby, M. Ekstedt, Y. Shang. 99% of patients with NAFLD meet MASLD criteria and natural history is therefore identical. J Hepatol, 80 (2024), pp. e76-e77. http://dx.doi.org/10.1016/j.jhep.2023.08.026

[5] C.H. De Fré, M.A. De Fré, W.J. Kwanten, B.J. Op de Beeck, L.F. Van Gaal, S.M. Francque. Sarcopenia in patients with non-alcoholic fatty liver disease: is it a clinically significant entity? Obes Rev, 20 (2019), pp. 353-363. http://dx.doi.org/10.1111/obr.12776

[6] M. Merli, S. Dasarathy. Sarcopenia in non-alcoholic fatty liver disease: targeting the real culprit? J Hepatol, 63 (2015), pp. 309-311. http://dx.doi.org/10.1016/j.jhep.2015.05.014

[7] J.C. Lai, P. Tandon, W. Bernal, E.B. Tapper, U. Ekong, S. Dasarathy, et al. Malnutrition, frailty, and sarcopenia in patients with cirrhosis: 2021 practice guidance by the American Association for the Study of Liver Diseases. Hepatology, 74 (2021), pp. 1611-1644. http://dx.doi.org/10.1002/hep.32049

[8] A.J. Cruz-Jentoft, G. Bahat, J. Bauer, Y. Boirie, O. Bruyère, T. Cederholm, et al. Sarcopenia: revised European consensus on definition and diagnosis. Age Ageing, 48 (2019), pp. 16-31. http://dx.doi.org/10.1093/ageing/afy169

[9] A.J. Cruz-Jentoft, J.P. Baeyens, J.M. Bauer, Y. Boirie, T. Cederholm, F. Landi, et al. Sarcopenia: European consensus on definition and diagnosis: report of the European Working Group on Sarcopenia in Older People. Age Ageing, 39 (2010), pp. 412-423. http://dx.doi.org/10.1093/ageing/afq034

[10] S. Carias, A.L. Castellanos, V. Vilchez, R. Nair, A.C. Dela Cruz, J. Watkins, et al. Nonalcoholic steatohepatitis is strongly associated with sarcopenic obesity in patients with cirrhosis undergoing liver transplant evaluation. J Gastroenterol Hepatol, 31 (2016), pp. 628-633

http://dx.doi.org/10.1111/jgh.13166 | Medline

[11]

O. El Sherif, A. Dhaliwal, P.N. Newsome, M.J Armstrong.

Sarcopenia in nonalcoholic fatty liver disease: new challenges for clinical practice.

Expert Rev Gastroenterol Hepatol, 14 (2020), pp. 197-205

http://dx.doi.org/10.1080/17474124.2020.1731303 | Medline

[12]

H.C. Hong, S.Y. Hwang, H.Y. Choi, H.J. Yoo, J.A. Seo, S.G. Kim, et al.

Relationship between sarcopenia and nonalcoholic fatty liver disease: the Korean Sarcopenic Obesity Study.

Hepatology, 59 (2014), pp. 1772-1778

http://dx.doi.org/10.1002/hep.26716 | Medline

[13]

H.J. Choe, H. Lee, D. Lee, S.H. Kwak, B.K. Koo.

Different effects of low muscle mass on the risk of non-alcoholic fatty liver disease and hepatic fibrosis in a prospective cohort.

J Cachexia Sarcopenia Muscle, 14 (2023), pp. 260-269

http://dx.doi.org/10.1002/jcsm.13125 | Medline

[14]

D. Kim, K. Wijarnpreecha, K.K. Sandhu, G. Cholankeril, A. Ahmed.

Sarcopenia in nonalcoholic fatty liver disease and all-cause and cause-specific mortality in the United States.

Liver Int, 41 (2021), pp. 1832-1840

http://dx.doi.org/10.1111/liv.14852 | Medline

[15]

B.K. Koo, D. Kim, S.K. Joo, J.H. Kim, M.S. Chang, B.G. Kim, et al.

Sarcopenia is an independent risk factor for non-alcoholic steatohepatitis and significant fibrosis.

J Hepatol, 66 (2017), pp. 123-131

http://dx.doi.org/10.1016/j.jhep.2016.08.019 | Medline

[16]

S.A. Polyzos, I.D. Vachliotis, C.S. Mantzoros.

Sarcopenia, sarcopenic obesity and nonalcoholic fatty liver disease.

Metabolism, 147 (2023),

http://dx.doi.org/10.1016/j.metabol.2023.155676

[17]

R. Li, S. Lin, J. Tu, Y. Chen, B. Cheng, X. Mo, et al.

Establishment and evaluation of a novel practical tool for the diagnosis of pre-sarcopenia in young people with diabetes mellitus.

J Transl Med, 21 (2023), pp. 393

http://dx.doi.org/10.1186/s12967-023-04261-w | Medline

[18]

P.J. Eddowes, M. Sasso, M. Allison, E. Tsochatzis, Q.M. Anstee, D. Sheridan, et al.

Accuracy of fibroscan controlled attenuation parameter and liver stiffness measurement in assessing steatosis and fibrosis in patients with nonalcoholic fatty liver disease.

Gastroenterology, 156 (2019), pp. 1717-1730

http://dx.doi.org/10.1053/j.gastro.2019.01.042 | Medline

[19]

J.A. Batsis, T.A. Mackenzie, J.D. Jones, F. Lopez-Jimenez, S.J. Bartels.

Sarcopenia, sarcopenic obesity and inflammation: results from the 1999-2004 national health and nutrition examination survey.

Clin Nutr, 35 (2016), pp. 1472-1483

http://dx.doi.org/10.1016/j.clnu.2016.03.028 | Medline

[20]

S.A. Studenski, K.W. Peters, D.E. Alley, P.M. Cawthon, R.R. McLean, T.B. Harris, et al.

The FNIH sarcopenia project: rationale, study description, conference recommendations, and final estimates.

J Gerontol A Biol Sci Med Sci, 69 (2014), pp. 547-558

http://dx.doi.org/10.1093/gerona/glu010 | Medline

[21]

R.A. Bhanji, C. Moctezuma-Velazquez, A. Duarte-Rojo, M. Ebadi, S. Ghosh, C. Rose, et al.

Myosteatosis and sarcopenia are associated with hepatic encephalopathy in patients with cirrhosis.

Hepatol Int, 12 (2018), pp. 377-386

http://dx.doi.org/10.1007/s12072-018-9875-9 | Medline

[22]

M. Mourtzakis, C.M. Prado, J.R. Lieffers, T. Reiman, L.J. McCargar, V.E. Baracos.

A practical and precise approach to quantification of body composition in cancer patients using computed tomography images acquired during routine care.

Appl Physiol Nutr Metab, 33 (2008), pp. 997-1006

http://dx.doi.org/10.1139/h08-075 | Medline

[23]

L.K. Chen, J. Woo, P. Assantachai, T.W. Auyeung, M.Y. Chou, K. Iijima, et al.

Asian working group for sarcopenia: 2019 consensus update on sarcopenia diagnosis and treatment.

J Am Med Dir Assoc, 21 (2020), pp. 300-307.e302

http://dx.doi.org/10.1016/j.jamda.2019.12.012 | Medline

[24]

H. Nishikawa, M. Shiraki, A. Hiramatsu, K. Moriya, K. Hino, S. Nishiguchi.

Japan Society of Hepatology guidelines for sarcopenia in liver disease (1st edition): recommendation from the working group for creation of sarcopenia assessment criteria.

Hepatol Res, 46 (2016), pp. 951-963

http://dx.doi.org/10.1111/hepr.12774 | Medline

[25]

J.S. Moon, J.S. Yoon, K.C. Won, H.W. Lee.

The role of skeletal muscle in development of nonalcoholic Fatty liver disease.

Diabetes Metab J, 37 (2013), pp. 278-285

http://dx.doi.org/10.4093/dmj.2013.37.4.278 | Medline

[26]

D. Kim, P. Konyn, K.K. Sandhu, B.B. Dennis, A.C. Cheung, A. Ahmed.

Metabolic dysfunction-associated fatty liver disease is associated with increased all-cause mortality in the United States.

J Hepatol, 75 (2021), pp. 1284-1291

http://dx.doi.org/10.1016/j.jhep.2021.07.035 | Medline

[27]

Appropriate body-mass index for Asian populations and its implications for policy and intervention strategies.

Lancet, 363 (2004), pp. 157-163

http://dx.doi.org/10.1016/s0140-6736(03)15268-3 | Medline

[28]

A.H. Patel, D. Peddu, S. Amin, M.I. Elsaid, C.D. Minacapelli, T.M. Chandler, et al.

Nonalcoholic fatty liver disease in lean/nonobese and obese individuals: a comprehensive review on prevalence, pathogenesis, clinical outcomes, and treatment.

J Clin Transl Hepatol, 11 (2023), pp. 502-515

http://dx.doi.org/10.14218/jcth.2022.00204 | Medline

[29]

S.K. Joo, W. Kim.

Interaction between sarcopenia and nonalcoholic fatty liver disease.

Clin Mol Hepatol, 29 (2023), pp. S68-s78

http://dx.doi.org/10.3350/cmh.2022.0358 | Medline

[30]

Y. Seko, N. Mizuno, S. Okishio, A. Takahashi, S. Kataoka, K. Okuda, et al.

Clinical and pathological features of sarcopenia-related indices in patients with non-alcoholic fatty liver disease.

Hepatol Res, 49 (2019), pp. 627-636

http://dx.doi.org/10.1111/hepr.13321 | Medline

[31]

Y.H. Lee, S.U. Kim, K. Song, J.Y. Park, D.Y. Kim, S.H. Ahn, et al.

Sarcopenia is associated with significant liver fibrosis independently of obesity and insulin resistance in nonalcoholic fatty liver disease: nationwide surveys (KNHANES 2008-2011).

Hepatology, 63 (2016), pp. 776-786

http://dx.doi.org/10.1002/hep.28376 | Medline

[32]

P. Golabi, L. Gerber, J.M. Paik, R. Deshpande, L. de Avila, Z.M. Younossi.

Contribution of sarcopenia and physical inactivity to mortality in people with non-alcoholic fatty liver disease.

JHEP Rep, 2 (2020),

http://dx.doi.org/10.1016/j.jhepr.2020.100171

[33]

Y. Park, N.H. Kim, T.Y. Kwon, S.G. Kim.

A novel adiposity index as an integrated predictor of cardiometabolic disease morbidity and mortality.

Sci Rep, 8 (2018), pp. 16753

http://dx.doi.org/10.1038/s41598-018-35073-4 | Medline