Highlighted Projects
Deciphering Abnormal Platelet Subpopulations in COVID-19, Sepsis and Systemic Lupus Erythematosus through Machine Learning and Single-Cell Transcriptomics
Qiu, Xinru, Meera G. Nair, Lukasz Jaroszewski, and Adam Godzik
2024

Key Findings:
- Platelet-to-T cell ratio (Pla-T ratio) is a strong predictor of patient survival.
- Platelets play a role in lymphocyte activation, proliferation, and differentiation.
- Platelet-monocyte aggregation is associated with disease severity in both COVID-19 and sepsis.
- We identified distinct platelet subpopulations that were overrepresented in convalescent, surviving, and fatal patients.
- Notably, three types of platelets were highly represented in patients with fatal outcomes: a coagulation cluster (C4), a hypoxic cluster (C9), a quiescent cluster (C11).
Implications:
- Monitoring changes in the Pla-T ratio and platelet subpopulations could be valuable for predicting patient outcomes and guiding treatment decisions.
- Understanding the interplay between platelets and other immune cells may pave the way for therapies that modulate the immune response and improve patient outcomes.
- Targeting heparanase activity, which is increased in platelets during sepsis and COVID-19, could be a promising therapeutic strategy.
Bioinformatics Approaches:
- scRNA-seq Data Integration: Combined 413 samples from COVID-19, sepsis, SLE, and healthy individuals
- Focus on Platelets: Extracted platelet expression data from PBMC datasets
- Quality Control and Preprocessing: Used source data annotations, HGNChelper tool, and Harmony software
- Cell Type Identification and Clustering: Used Seurat, SingleR, and Louvain Clustering algorithm
- Differential Gene Expression Analysis: Employed MAST method from Seurat
- Machine Learning: Used XGB and DNN for biomarker identification and feature importance
- Pathway and Gene Set Enrichment Analysis: Used clusterProfiler, AddModuleScore, DOSE, and GSVA packages
- Ligand-Receptor Interaction Analysis: Based on iTALK database
- Pseudotime Trajectory Inference: Constructed using Monocle3
Dynamic changes in human single-cell transcriptional signatures during fatal sepsis
Qiu, Xinru, Jiang Li, Jeff Bonenfant, Lukasz Jaroszewski, Aarti Mittal, Walter Klein, Adam Godzik, and Meera G. Nair
2021

Key Findings:
- Lymphocyte subsets are reduced in sepsis, especially in fatal outcomes.
- Platelets and erythroid precursors emerge in late-stage fatal sepsis.
- Sepsis drives hypoxic stress associated with disease severity and dysfunctional erythropoiesis.
- Monocytes in fatal sepsis undergo immune suppression.
- Increased CD52 expression within hours of sepsis recognition is associated with improved sepsis outcomes.
Implications:
- Antiplatelet therapies may be beneficial for treating severe sepsis.
- S100A9 may be a valuable biomarker to stratify sepsis severity.
- CD52 may serve as both a biomarker for sepsis progression and a therapeutic target to promote immune homeostasis.
Bioinformatics Approaches:
- Data Processing and Quality Control: Used Cell Ranger Software Suite
- Data Integration and Batch Effect Removal: Utilized Seurat v.3
- Dimensionality Reduction and Clustering: Performed using Seurat
- Cell Type Annotation: Used a consensus-based approach combining information from canonical marker genes, SingleR, and scCATCH
- Differential Gene Expression Analysis: Employed MAST method from Seurat
- Pathway Enrichment Analysis: Conducted using clusterProfiler
- Module Score Analysis: Calculated to assess expression of predefined gene sets
Advances in AI for Protein Structure Prediction: Implications for Cancer Drug Discovery and Development
Qiu, Xinru, Han Li, Greg Ver Steeg, and Adam Godzik
2024

Key Findings:
- AI is revolutionizing drug development, streamlining processes and increasing success rates.
- Traditional drug development faces obstacles like high costs, long timelines, and high failure rates.
- AI-driven protein structure prediction provides valuable insights for drug discovery.
- Several AI algorithms offer distinct approaches to protein structure prediction:
- AlphaFold2 (AF2): High accuracy, relies on multiple sequence alignments
- ESMFold: Efficient for orphan proteins, faster than AF2
- RoseTTAFold: Unique "three-track" neural network architecture
- OpenFold: Open-source version of AF2, offers flexibility
- Generative AI is emerging as a powerful tool in drug development.
- AI-driven drug discovery is demonstrating success beyond cancer research.
Implications:
- Widespread adoption of AI could significantly alter the pharmaceutical landscape.
- Understanding strengths and limitations of different AI algorithms is crucial for effective application.
- Integration of AI necessitates careful consideration of data privacy, security, and validation procedures.
- Continued research in AI algorithms is essential for maximizing impact in drug discovery.
- AI-driven successes across diverse disease areas underscore potential to revolutionize healthcare.
Future Directions:
- Develop AI algorithms for predicting protein dynamics and ligand-induced folding.
- Expand applications of generative AI in designing novel proteins and antibodies.
- Integrate AI-driven approaches with traditional drug discovery methods.
- Address regulatory challenges posed by AI in pharmaceutical research.
- Explore AI applications in personalized medicine and rare disease treatment.