Modeling and integration of N-glycan biomarkers in a comprehensive biomarker data model

Authors

Document Type

Journal Article

Publication Date

8-4-2022

Journal

Glycobiology

DOI

10.1093/glycob/cwac046

Keywords

Cancer biomarker panel; Data integration; Glyco-informatics; Liver disease; N-linked glycans

Abstract

Molecular biomarkers measure discrete components of biological processes that can contribute to disorders when impaired. Great interest exists in discovering early cancer biomarkers to improve outcomes. Biomarkers represented in a standardized data model, integrated with multi-omics data, may improve understanding and use of novel biomarkers such as glycans and glycoconjugates. Among altered components in tumorigenesis, N-glycans exhibit substantial biomarker potential, when analyzed with their protein carriers. However, such data are distributed across publications and databases of diverse formats, which hampers their use in research and clinical application. Mass spectrometry measures of fifty N-glycans, on seven serum proteins in liver disease, were integrated (as a panel) into a cancer biomarker data model, providing a unique identifier, standard nomenclature, links to glycan resources, and accession and ontology annotations to standard protein, gene, disease, and biomarker information. Data provenance was documented with a standardized FDA-supported BioCompute Object. Using the biomarker data model allows capture of granular information, such as glycans with different levels of abundance in cirrhosis, hepatocellular carcinoma, and transplant groups. Such representation in a standardized data model harmonizes glycomics data in a unified framework, making glycan-protein biomarker data exploration more available to investigators and to other data resources. The biomarker data model we describe can be used by researchers to describe their novel glycan and glycoconjugate biomarkers, can integrate N-glycan biomarker data with multi-source biomedical data, and can foster discovery and insight within a unified data framework for glycan biomarker representation thereby making the data FAIR (Findable, Accessible, Interoperable, Reusable) (https://www.go-fair.org/fair-principles/).

Department

Biochemistry and Molecular Medicine

Share

COinS