Chengxin Zhang, PhD
Research Assistant Professor
Computational Medicine and Bioinformatics
[email protected]

Available to mentor

Chengxin Zhang, PhD
Research Assistant Professor
  • About
  • Qualifications
  • Recent Publications
  • About

    My research focuses on the development of novel algorithms for predicting the structures and functions of proteins and RNA. The goal is to enable scalable, genome-wide annotation of human and microbial proteins, as well as non-coding RNAs. I have previously developed leading methods for predicting protein structure and function. Notable contributions include my work on COFACTOR, MetaGO, and StarFunc. These were among the highest-performing Gene Ontology prediction methods in recent Critical Assessment of Function Annotation (CAFA) challenges. They notably demonstrated the benefits of using computational structure models for protein function prediction. Additionally, I co-developed TripletRes, a deep learning-based program for predicting inter-residue contact and distance within proteins. I also designed D-QUARK, which enables distance-based protein folding guided by distances derived through deep learning. Both D-QUARK and the TripletRes algorithm stood among top servers in the recent Critical Assessment of Structure Prediction (CASP) 13 and 14 experiments. Beyond that, the DeepMSA package, which I created for constructing multiple sequence alignments, is now a commonly utilized tool by several CASP teams. Leveraging my previous expertise in protein structure and function modeling, my recent research has shifted its focus towards exploring new methods of RNA structure and function analysis. In this domain, I have developed tools such as US-align, the first tool capable of aligning the structures of RNAs, DNAs, and proteins in both monomeric and oligomeric forms via a unified TM-score. My rMSA pipeline, another significant contribution, has gained popularity among top-performing CASP teams. It aids in the generation of RNA sequence alignments necessary for deep learning-based RNA structure prediction. Between 2017 and 2022, I had 47 peer-reviewed manuscripts published, with 12 of them, in high-impact journals such as Nature Methods, Nucleic Acid Research, and Bioinformatics, being first author papers. The manuscript introducing the COFACTOR algorithm received over 400 citations. Another manuscript, which explores the zoonotic origin of SARS-CoV-2 using predicted protein structure and metagenomic sequence assembly, has been cited 300 times.

    Qualifications
    • HHMI Postdoctoral Associate
      Yale University, United States, 2023
    • PhD
      University of Michigan–Ann Arbor, Ann Arbor, 2020
    • BSc
      Fudan University, Shanghai, 2015
    Recent Publications See All Publications
    • Journal Article
      Proteomic screens of SEL1L-HRD1 ER-associated degradation substrates reveal its role in glycosylphosphatidylinositol-anchored protein biogenesis.
      Wei X, Lu Y, Lin LL, Zhang C, Chen X, Wang S, Wu SA, Li ZJ, Quan Y, Sun S, Qi L. Nat Commun, 2024 Jan 22; 15 (1): 659 DOI:10.1038/s41467-024-44948-2
      PMID: 38253565
    • Journal Article
      Remodelling of Skeletal Muscle Myosin Metabolic States in Hibernating Mammals.
      Lewis CTA, Melhedegaard EG, Ognjanovic MM, Olsen MS, Laitila J, Seaborne RAE, Gronset MN, Zhang C, Iwamoto H, Hessel AL, Kuehn MN, Merino C, Amigo N, Frobert O, Giroud S, Staples JF, Goropashnaya AV, Fedorov VB, Barnes BM, Toien O, Drew KL, Sprenger RJ, Ochala J. bioRxiv, 2024 Feb 27; DOI:10.1101/2023.11.14.566992
      PMID: 38014200
    • Journal Article
      TM-search: An Efficient and Effective Tool for Protein Structure Database Search.
      Liu Z, Zhang C, Zhang Q, Zhang Y, Yu D-J. J Chem Inf Model, 2024 Feb 12; 64 (3): 1043 - 1049. DOI:10.1021/acs.jcim.3c01455
      PMID: 38270339
    • Journal Article
      Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data.
      Zheng W, Wuyun Q, Li Y, Zhang C, Freddolino PL, Zhang Y. Nat Methods, 2024 Feb; 21 (2): 279 - 289. DOI:10.1038/s41592-023-02130-4
      PMID: 38167654
    • Journal Article
      BioLiP2: an updated structure database for biologically relevant ligand-protein interactions.
      Zhang C, Zhang X, Freddolino PL, Zhang Y. Nucleic Acids Res, 2024 Jan 5; 52 (D1): D404 - D412. DOI:10.1093/nar/gkad630
      PMID: 37522378
    • Preprint
      Predicting the Structural Impact of Human Alternative Splicing.
      Song Y, Zhang C, Omenn GS, O'Meara MJ, Welch JD. 2023 Dec 24; DOI:10.1101/2023.12.21.572928
      PMID: 38187531
    • Preprint
      FURNA: a database for function annotations of RNA structures.
      Zhang C, Freddolino PL. 2023 Dec 19; DOI:10.1101/2023.12.19.572314
      PMID: 38187637
    • Journal Article
      BeEM: fast and faithful conversion of mmCIF format structure files to PDB format.
      Zhang C. BMC Bioinformatics, 2023 Jun 20; 24 (1): 260 DOI:10.1186/s12859-023-05388-9
      PMID: 37340457
    Featured News & Stories
    Department News
    DCMB welcomes new faculty
    DCMB welcomes eight new faculty in 2022-2023.