Zicheng Wang (zw3088), Xiaohui Kang (xk2163), Mengyuan Chen (mc5698), Shuai yuan (sy3270), Dang Lin (dl3757)
Exploring the Role of Inflammatory Biomarkers in Predicting Mortality Risks Among UK Adults with Fatty Liver or Cirrhosis, Using Evidence from UKB Data
Fatty liver disease (FLD) is one of the most prevalent chronic diseases and includes non-alcoholic fatty liver disease (NAFLD), metabolic-associated fatty liver disease (MAFLD), and metabolic-associated steatohepatitis-related liver disease (MASLD). Among these, NAFLD is the most common form, and its prevalence is rising rapidly. This condition can progress to severe complications, such as cirrhosis, liver failure, or hepatocellular carcinoma. Systemic inflammation and immune dysregulation are closely associated with FLD, underscoring the need to identify reliable, non-invasive biomarkers. Such biomarkers are critical for predicting disease progression, understanding underlying mechanisms, enabling early detection, guiding interventions, and ultimately improving clinical outcomes.
Immune-related indices like the Lymphocyte-to-Monocyte Ratio (LMR), Systemic Immune-Inflammation Index (SII), Naples Prognostic Score (NPS), and Neutrophil-Percentage-to-Albumin Ratio (NPAR) have gained attention for their potential utility in assessing inflammation and immune responses. These indices are derived from routine blood tests, making them accessible and cost-effective. LMR, for instance, measures the balance between lymphocytes, which are linked to immune surveillance and anti-inflammatory responses, and monocytes, which are associated with inflammation and tissue damage. SII integrates neutrophil, platelet, and lymphocyte counts to provide a comprehensive view of immune activity and thrombosis risk. NPS combines multiple prognostic factors, including inflammatory markers, albumin levels, and platelet counts, offering a broader perspective on patient health. NPAR evaluates the relationship between neutrophil percentages and albumin levels, connecting immune function with nutritional status.
The UK Biobank (UKB) dataset is well-suited for studying the relationships between these immune markers and chronic disease outcomes. With its extensive biomarker, demographic, and longitudinal health data on a representative UK population, the UKB provides an opportunity for detailed cross-sectional and longitudinal analyses. These analyses could help clarify how inflammation and immune stress contribute to chronic disease progression, ultimately advancing both clinical and public health approaches.
Lymphocyte-to-Monocyte Ratio (LMR): \[\text{LMR} = \frac{\text{Absolute Lymphocyte Count}}{\text{Absolute Monocyte Count}}\]
Systemic Immune-Inflammation Index (SII): \[\text{SII} = \frac{\text{Neutrophil Count} \times \text{Platelet Count}}{\text{Lymphocyte Count}}\]
Naples Prognostic Score (NPS): This score is based on a combination of factors, typically including:
The final NPS is the sum of these points (range: 0–4).
Neutrophil-Percentage-to-Albumin Ratio (NPAR): \[\text{NPAR} = \frac{\text{Neutrophil Percentage}}{\text{Albumin Level (g/dL)}}\]
Note: The variables were categorized into tertiles based on the 25% and 75% percentiles after being ranked in ascending order, except for NPS, which was dichotomized based on whether it was ≥ 2.
The UK Biobank (UKB) dataset, which includes: - Biomarker Data: Relevant indices such as LMR, SII, NPAR, etc. - Demographics: Age, gender, ethnicity, socioeconomic status, etc. - Health Outcomes: Mortality data, disease diagnoses, metabolic syndrome indicators, and more.
Throughout the project, we anticipate addressing several coding challenges, including efficiently handling the large-scale UKB dataset, managing missing or inconsistent data, and implementing survival analysis and multivariable regression models in Python and R. Optimizing code for computational efficiency and ensuring reproducibility through clear documentation will be key priorities in overcoming these challenges.
November 1-8: Data collection, cleaning, and preliminary exploration.
November 9-15: Start survival and cross-sectional analyses on mortality and metabolic syndrome.
November 16-25: Conduct analyses related to additional outcomes (e.g., kidney function) and refine survival models.
November 26-30: Finalize all analyses and begin drafting the report.
December 1-7: Complete the draft report with visualizations and interpretation.
December 8-12: Review and finalize the project for submission.