Open Access

Identifying Scientific Project-generated Data Citation from Full-text Articles: An Investigation of TCGA Data Citation


eISSN: 2543-683X
Language: English
Publication timeframe: 4 times per year
Journal Subjects: Computer Sciences, Information Technology, Project Management, Databases and Data Mining