Professional Experience
Director of BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, China, 2017 - Present
Professor in “100-Talent” Program of CAS, Beijing Institute of Genomics, Chinese Academy of Sciences (CAS), China, 2017 - Present
Staff Scientist, National Center for Biotechnology Information (NCBI)/NLM/NIH, USA, 2005 - 2017
Viral Genome Scientist, Computercraft Corporation (as a government contractor working at NCBI), USA, 2001 - 2005
Postdoctoral Associate and Senior Research Associate, Noble Foundation, USA, 1994 - 2001
Teaching and Research Assistant, Peking University, China, 1987 - 1991
Education
PhD in Genetics, John Innes Centre (through University of East Anglia), UK, 1994
BS in Biochemistry, Peking University, China, 1987
Research Interests
Bioinformatics
Viral Genomics
Academic Activities
Journal Reviewer: Archives of Virology; Bioinformatics; BMC Bioinformatics; BMC Microbiology; Computers in Biology; Current Genomics; Database; Infection, Genetics and Evolution; Journal of Computational Biology; Journal of Genetics and Genomics; Journal of Virology; Molecular Phylogenetics and Evolution; Nucleic Acids Research; Plant Molecular Biology; PLoS ONE; PNAS; Vaccine
Member: Virus Data Subcommittee, International Committee on Taxonomy of Viruses (ICTV), 2011 - 2017
Publications
1. Bao Y. as co-corresponding author in CNCB-NGDC Members and Partners. (2021). Database resources of the National Genomics Data Center, China National Center for Bioinformation in 2021. Nucleic Acids Research 49, D18-D28.
2. Bao Y. as co-corresponding author in Aging Atlas Consortium. (2021). Aging Atlas: a multi-omics database for aging biology. Nucleic Acids Research 49, D825-D830.
3. Gong Z., Zhu J.W., Li C.P., Jiang S., Ma L.N., Tang B.X., Zou D., Chen M.L., Sun Y.B., Song S.H., Zhang Z., Xiao J.F., Xue Y.B., Bao Y.M., Du Z.L., Zhao W.M. (2020). An online coronavirus analysis platform from the National Genomics Data Center. Zoological Research 41, 705-708.
4. Nawaz M.S., Asghar R., Pervaiz N., Ali S., Hussain I., Xing P., Bao Y., Abbasi A.A. (2020). Molecular evolutionary and structural analysis of human UCHL1 gene demonstrates the relevant role of intragenic epistasis in Parkinson's disease and other neurological disorders. BMC Evolutionary Biology 20, 130.
5. Zhang Z., Song S., Yu J., Zhao W., Xiao J., Bao Y. (2020). The elements of data sharing. Genomics Proteomics Bioinformatics 18, 1-4.
6. Chen M., Ma Y., Li R., Bao Y. (2020). Current status and prospects of genomics data analysis methods. Frontiers of Data and Computing 2, 1-19.
7. Shah S., Malik A.H., Zhang B., Bao Y., Qazi J. (2020). Metagenomic analysis of relative abundance and diversity of bacterial microbiota in Bemisia tabaci infesting cotton crop in Pakistan. Infection, Genetics and Evolution 84, 104381.
8. Zhao W.M., Song S.H., Chen M.L., Zou D., Ma L.N., Ma Y.K., Li R.J., Hao L.L., Li C.P., Tian D.M., Tang B.X., Wang Y.Q., Zhu J.W., Chen H.X., Zhang Z., Xue Y.B., Bao Y.M. (2020). The 2019 novel coronavirus resource. Yi Chuan 42, 212-221.
9. Xiong Z., Li M., Yang F., Ma Y., Sang J., Li R., Li Z., Zhang Z., Bao Y. (2020). EWAS Data Hub: a resource of DNA methylation array data and metadata. Nucleic Acids Research 48, D890-D895.
10. Bao Y. as co-corresponding author in National Genomics Data Center Members and Partners. (2020). Database Resources of the National Genomics Data Center in 2020. Nucleic Acids Research 48, D24-D33.
11. Pervaiz N., Shakeel N., Qasim A., Zehra R., Anwar S., Rana N., Xue Y., Zhang Z., Bao Y., Abbasi A.A. (2019). Evolutionary history of the human multigene families reveals widespread gene duplications throughout the history of animals. BMC Evolutionary Biology 19, 128.
12. Ma L.N., Cao J., Liu L., Li Z., Shireen H., Pervaiz N., Batool F., Raza R., Zou D., Bao Y., Abbasi A.A., Zhang Z. (2019). Community curation and expert curation of human long non-coding RNAs. Current Protocols in Bioinformatics 67, e82.
13. Amarasinghe G.K., Ayllon M.A., Bao Y., Basler C.F., et al. (2019). Taxonomy of the order Mononegavirales: update 2019. Archives of Virology 164, 1967-1980.
14. Wang G., Yin H., Li B., Yu C., Wang F., Xu X., Cao J., Bao Y., Wang L., Abbasi A.A., Bajic V.B., Ma L., Zhang Z. (2019). Characterization and identification of long non-coding RNAs based on feature relationship. Bioinformatics 35, 2949-2956.
15. Seemab S., Pervaiz N., Zehra R., Anwar S., Bao Y., Abbasi A.A. (2019). Molecular evolutionary and structural analysis of familial exudative vitreoretinopathy associated FZD4 gene. BMC Evolutionary Biology 19, 72.
16. Bao Y. as co-corresponding author among BIG Data Center Members. (2019). Database Resources of the BIG Data Center in 2019. Nucleic Acids Research 47, D8-D14.
17. Li M., Zou D., Li Z., Gao R., Sang J., Zhang Y., Li R., Xia L., Zhang T., Niu G., Bao Y., Zhang Z. (2019). EWAS Atlas: a curated knowledgebase of epigenome-wide association studies. Nucleic Acids Research 47, D983-D988.
18. Tang B., Zhou Q., Dong L., Li W., Zhang X., Lan L., Zhai S., Xiao J., Zhang Z., Bao Y., Zhang Y-P., Wang G-D., Zhao W. (2019). iDog: an integrated resource for domestic dogs and wild canids. Nucleic Acids Research 47, D793-D800.
19. Zhao Y., Wang J., Liang F., Liu Y., Wang Q., Zhang H., Jiang M., Zhang Z.W., Zhao W., Bao Y., Zhang Z., Wu J., Asmann Y.W., Li R., Xiao J. (2019). NucMap: a database of genome-wide nucleosome positioning map across species. Nucleic Acids Research 47, D163-D169.
20. Ma Y. & Bao Y. (2018). Prospects for national biological big data centers. Hereditas (Beijing) 40, 938-943.
21. Pavesi A., Vianelli A., Chirico N., Bao Y., et al. (2018). Overlapping genes and the proteins they encode differ significantly in their sequence composition from non-overlapping genes. PLoS ONE 13, e0202513.
22. Bao Y. & Xue Y. (2018). Current status and prospect of life and health big data. Bulletin of the Chinese Academy of Sciences 33, 861-865.
23. Bao Y. & Kuhn J.H. (2018). Preliminary classification of novel hemorrhagic fever-causing viruses using sequence-based PAirwise Sequence Comparison (PASC) analysis. Methods in Molecular Biology 1604, 43-53.
24. Maes P., Alkhovsky S.V., Bao Y., et al. (2018). Taxonomy of the family Arenaviridae and the order Bunyavirales: update 2018. Archives of Virology 163, 2295-2310.
25. Li R.J., Liang F., ..., Bao Y., et al. (2018). MethBank 3.0: a database of DNA methylomes across a variety of species. Nucleic Acids Research 46, D288-D295.
26. Song S.H., Tian D.M., ..., Bao Y., et al. (2018). Genome Variation Map: a data repository of genome variations in BIG Data Center. Nucleic Acids Research 46, D944-D949.
27. Bao Y. as co-corresponding author among BIG Data Center Members. (2018). Database Resources of the BIG Data Center in 2018. Nucleic Acids Research 46, D14-D20.
28. Sang J., Wang Z., ..., Bao Y., et al. (2018). ICG: a wiki-driven knowledgebase of internal control genes for RT-qPCR normalization. Nucleic Acids Research 46, D121-D126.
Group Members
Staff
Dr. LI Rujiao, Senior Engineer
Dr. CHEN Meili, Assistant Professor
Dr. MA Yingke, Assistant Professor
Dr. ZHENG Xinchang, Engineer
Postdoctoral Fellows
LI Lun, 2020
Graduate Students
XING Peiqi, 2017
GONG Zheng, 2018
YANG Fei, 2018
XIONG Zhuang, 2017
ZHANG Tao, 2017
LI Zhaohua, 2018
KANG Hongen, 2018
ZONG Wenting, 2018
ZHANG Mochen, 2019
WU Song, 2019
JIN Tong, 2019
WANG Guoliang, 2020
ZHAO Wei, 2020