Brief Introduction to the Function and Usage of FCCLnc
Discovering the Potentially Disease-associated lncRNAs by SNP-disease Associations
The SNP-disease associations were first collected from GWASdb (Li MJ, et al. Nucleic Acids Res. 44: D869-D876, 2016), NHGRI-EBI GWAS Catalog (Buniello A, et al. Nucleic Acids Res. 47: D1005-D1012, 2019), GRASP2 (Zhong C, et al. BMC Bioinformatics. 20: 276, 2019) together with our comprehesive literature review on PubMed, which led to 24,339 associations between 193 standardized diseases and 22,458 SNPs. Second, the chromosome data of lncRNAs was downloaded from the NONCODEV5 (Fang S, et al. Nucleic Acids Res. 47: 46: D308-D314, 2018) to match the disease-associated SNPs to the lncRNA region. Finally, 10,936 lncRNAs (with at least one disease-associated SNP) were identified to be “potentially disease-associated”.
Detecting the Interindividual Variability of lncRNA by Condition-specific Expression
The interindividual expression variability of lncRNA is assessed using the standard measure ‘coefficient of variation (CV)’ (Ecker S, et al. Genome Med. 7: 8, 2015). A low value of CV denotes a lncRNA in normal cell, while a high value represents the disease-related lncRNA (Signal B, et al. Trends Genet. 32: 620-637, 2016). Herein, the CV is first defined as the ratio between the standard deviation of the lncRNA expression levels measured across the patients and its mean (Ecker S, et al. Genome Med. 7: 8, 2015). Using those “potentially disease-associated” lncRNAs identified in previous section, their CV values were then calculated and ranked. Finally, top-N ranked lncRNAs were identified as “disease-associated”.
Constructing the Co-expression Network Based on lncRNAs’ Neighboring Genes
The comprehensive data of 96,308 lncRNAs and 19,975 protein coding genes were first collected from NONCODEV5 (Fang S, et al. Nucleic Acids Res. 47: 46: D308-D314, 2018) and GENCODEV31 (Frankish A, et al. Nucleic Acids Res. 47: D766-D773, 2019), respectively. Then, the neighboring genes within 5kb ~ 500kb up/downstream of the studied lncRNAs were calculated, which resulted in a collection of neighboring genes of the studied disease-associated lncRNAs. Third, WGCNA (Langfelder P, et al. BMC Bioinformatics. 9: 559, 2008) was used to compute a co-expression network based on the studied lncRNAs and their neighboring genes. The resulting co-expression network was illustrated and downloadable in FCCLnc.
Characterizing the lncRNA Function in Comorbidity Using Common Disease Genes
The mechanism of lncRNAs in comorbid diseases were explained by their shared genetic factors (namely “common disease genes”) (Goh KI, et al. Proc Natl Acad Sci U S A. 104: 8685-8690, 2007; Ko Y, et al. Sci Rep. 6: 39433, 2016). Thus, uploaded matrices containing the data of multiple diseases was allowed. First, the RNA expression data of each disease were analyzed using the sequential steps discussed above, which resulted in multiple co-expression networks. Then, a direct overlap of disease-associated RNAs among multiple diseases was conducted, which identified a set of common disease genes. Third, multiple networks were linked together based on this set of common disease genes, and the resulting network is the network of the comorbidity. Finally, the common disease lncRNAs were characterized as "comorbidity-associated", and their function was annotated by their co-expressed mRNAs.
Table of Contents
1. The Compatibility of Browser and Operating System (OS)
2. Step-by-step Instruction on the Usage of FCCLnc
2.1 Required Formats of the Input Files
2.2 Upload your lncRNA and mRNA expression matrix separately or the Sample Data Provided in FCCLnc
2.3 Discover Disease-associated lncRNAs
2.4 Construct Co-expression Network
2.5 Annotate lncRNA Function
3. FCCLnc platform intergrated the SNP associated-disease based on GWAS from various publicly database and manually searching
3.1 NHGRI
3.2 GWASdb
3.3 GRASP2
4. WGCNA computation methods
5. GO terms(BP, MF, CC) and KEGG pathway
5.1 GO
5.2 KEGG Pathway
6. Combining several computational methods
6.1 Differential expression
6.2 Guilt-by-association
6.3 Condition-specific expression
6.4 Combining several computational methods
2. Step-by-step Instruction on the Usage of FCCLnc
2.1 Required Formats of the Input Files
3. FCCLnc platform intergrated the SNP associated-disease based on GWAS from various publicly database and manually searching
6.3 Condition-specific expression
6.4 Combining several computational methods
@ ZJU
Please feel free to visit our website at https://idrblab.org
Dr. Jing Tang (tangj@cqu.edu.cn)
Ms. Hengyong Wang (yhwang@stu.cqmu.edu.cn)
Dr. Jianbo Fu (fujianbo@zju.edu.cn)
Ms. Xianglu Wu (wuxl@stu.cqmu.edu.cn)
Prof. Feng ZHU* (zhufeng@zju.edu.cn)
Address
College of Pharmaceutical Sciences,
Zhejiang University,
Hangzhou, China
Postal Code: 310058
Phone/Fax