Toolkit/CatBoost C2h-21 model
CatBoost C2h-21 model
Also known as: C2h-21
Taxonomy: Technique Branch / Method. Workflows sit above the mechanism and technique branches rather than replacing them.
Summary
The CatBoost C2h-21 model achieved an ROC-AUC of 0.81/1.00 (train/test) on average across ten random three-fold cross-validations, using engineered features including EVs, clinical features like diabetes and advanced fibrosis, and anthropomorphic data like body mass index and weight for identifying severe steatosis (S3).
Usefulness & Problems
No literature-backed usefulness or problem-fit explainer has been materialized for this record yet.
Published Workflows
Objective: To investigate whether circulating plasma EV characteristics, alone or combined with clinical and anthropomorphic variables, can support non-invasive MASLD steatosis staging using machine learning and explainable artificial intelligence.
Why it works: The workflow pairs non-invasive EV measurements with steatosis/fibrosis staging labels and then uses ML and XAI to learn and interpret relationships between EV and clinical features and steatosis stage.
Stages
- 1.Patient enrollment and eligibility completion(selection)
This stage defines the final analyzable cohort before measurement and modeling.
Selection: Patients with metabolic dysfunction were enrolled, then eligibility criteria and study procedure completion determined the analyzed cohort.
- 2.Non-invasive staging and EV feature acquisition(functional_characterization)
This stage generates the input labels and biomarker features required for downstream ML modeling.
Selection: Obtain steatosis/fibrosis stage labels by transient elastography and EV size/concentration features by nanoparticle tracking.
- 3.Machine learning model development(broad_screen)
This stage explores multiple model/task configurations to identify useful EV-based and multimodal classifiers.
Selection: Develop six models for S0 versus S1-S3 and fourteen models for severe steatosis identification using different feature sets.
- 4.Performance assessment and interpretability analysis(confirmatory_validation)
This stage identifies the best-performing models and explains feature-stage relationships.
Selection: Assess models using ROC-AUC, specificity, sensitivity, correlation analysis, and XAI/SHAP.
Steps
- 1.Enroll patients with metabolic dysfunction
Assemble the initial study population for MASLD-related biomarker analysis.
Enrollment is required before eligibility filtering and study measurements can occur.
- 2.Apply eligibility criteria and complete study procedures
Define the final analyzable cohort with complete data collection.
Eligibility and completion filtering narrows the cohort before feature acquisition and modeling.
- 3.Stage steatosis and fibrosis by transient elastographystaging assay
Generate steatosis and fibrosis stage information for the classification tasks.
Stage labels are needed before supervised model development.
- 4.Measure circulating plasma EV characteristics by nanoparticle trackingEV characterization assay
Generate EV size and concentration features for model input.
Feature acquisition follows cohort definition and provides the biomarker variables used in modeling.
- 5.Develop EV-only and multimodal ML models for steatosis tasksclassification models
Train models to distinguish S0 from S1-S3 and to identify severe steatosis.
Model development requires both stage labels and feature inputs from the prior measurement stage.
- 6.Evaluate model performance by repeated cross-validation
Estimate predictive performance using ROC-AUC, specificity, and sensitivity.
Performance evaluation follows model training to identify the strongest classifiers.
- 7.Interpret feature relationships using correlation analysis and SHAP/XAIinterpretability method
Explain how EV and other features relate to steatosis stages and model predictions.
Interpretability analysis is performed after model development so feature contributions can be examined on trained models.
Taxonomy & Function
Primary hierarchy
Technique Branch
Method: A concrete computational method used to design, rank, or analyze an engineered system.
Mechanisms
model interpretability via shap/xai analysismultimodal feature integrationsupervised machine-learning classificationTechniques
Computational DesignTarget processes
diagnosticValidation
Supporting Sources
Ranked Claims
Combining EV, clinical, and anthropomorphic features improved diagnostic accuracy for identifying severe steatosis compared with EV-only modeling aims described in the study.
The CatBoost C1a model distinguished S0 from S1-S3 using EV features alone.
The CatBoost C2h-21 model identified severe steatosis using engineered EV, clinical, and anthropomorphic features.
Correlation, XAI, and SHAP analyses revealed non-linear relationships between disease features and steatosis stages.
Approval Evidence
The CatBoost C2h-21 model achieved an ROC-AUC of 0.81/1.00 (train/test) on average across ten random three-fold cross-validations, using engineered features including EVs, clinical features like diabetes and advanced fibrosis, and anthropomorphic data like body mass index and weight for identifying severe steatosis (S3).
Source:
Combining EV, clinical, and anthropomorphic features improved diagnostic accuracy for identifying severe steatosis compared with EV-only modeling aims described in the study.
Source:
The CatBoost C2h-21 model identified severe steatosis using engineered EV, clinical, and anthropomorphic features.
Source:
Comparisons
No literature-backed comparison notes have been materialized for this record yet.
Ranked Citations
- 1.