Perspective

Data-driven Discovery: A New Era of Exploiting the Literature and Data

  • Ying Ding ,
  • Kyle Stirling
Expand
  • Department of Information and Library Science, Indiana University, Bloomington, IN 47405, USA

Received date: 2016-08-30

  Revised date: 2016-09-19

  Online published: 2016-09-19

Abstract

In the current data-intensive era, the traditional hands-on method of conducting scientific research by exploring related publications to generate a testable hypothesis is well on its way of becoming obsolete within just a year or two. Analyzing the literature and data to automatically generate a hypothesis might become the de facto approach to inform the core research efforts of those trying to master the exponentially rapid expansion of publications and datasets. Here, viewpoints are provided and discussed to help the understanding of challenges of data-driven discovery.

Cite this article

Ying Ding , Kyle Stirling . Data-driven Discovery: A New Era of Exploiting the Literature and Data[J]. Journal of Data and Information Science, 2016 , 1(4) : 1 -9 . DOI: 10.20309/jdis.201622

References

American Political Science Association (APSA). (2012). A guide to professional ethics in political science (2nd ed.). Washington, DC:The American Political Science Association. Retrieved on August 15, 2016, from www.apsanet.org/Portals/54/APSA%20Files/publications/ethicsguideweb.pdf.
Ali, O.A., Emerich, D., Dranoff, G., & Mooney, D.J. (2009). In situ regulation of DC subsets and T cell mediates tumor regression in mice. Science Translational Medicine, 1(8), 8ra19.
Bekhuis, T. (2006). Conceptual biology, hypothesis discovery, and text mining:Swanson's legacy. Biomedical Digital Library, 3, 2.
Blagosklonny, M.V., & Pardee, A.B. (2002). Conceptual biology:Unearthing the gems. Nature, 416(6879), 373.
Brynjolfsson, E., & McAfee, A. (2014). The second machine age:Work, progress, and prosperity in a time of brilliant technologies. New York:W.W. Norton & Company Inc.
Chansanchai, A. (2014). Microsoft research shows off advances in artificial intelligence with Project Adam. Microsoft Blog, July 14. Retrieved on September 2, 2016, from blogs.microsoft.com/next/2014/07/14/microsoft-research-shows-advances-artificial-intelligence-project-adam.
Chen, B., Ding, Y., & Wild, D. (2012). Assessing drug target association using semantic linked data. PLoS Computational Biology, 8(7), e1002574.
Editorial (2009). Data's shameful neglect. Nature, 461, 145.
Ding, Y., Song, M., Han, J., Yu, Q., Yan, E., Lin, L., & Chambers, T. (2013). Entitymetrics:Measuring the impact of entities. PLoS One, 8(8), 1-14.
Evans, J.A., & Foster, J.G. (2011). Metaknowledge. Science, 332(6018), 721-725.
Flanagan, M. (2004). Barriers to the implementation of best practice in wound care. Wounds UK, 74-84. Retrieved on September 2, 2016, from www.woundsinternational.com/pdf/content_87.pdf.
Groth, P., & Moreau, L. (2013). PROV-Overview:An overview of the PROV family of documents. Retrieved on September 2, 2016, from www.w3.org/TR/prov-overview.
Jinha, A.E. (2010). Article 50 million:An estimate of the number of scholarly articles in existence. Learned Publishing, 23(3), 258-263.
Keiser, M.J., Setola, V., Irwin, J.J., Laggner, C., Abbas, A.I., Hufeisen, S.J., … Roth, B.L. (2009). Predicting new molecular targets for known drugs. Nature, 462(7270), 175-181.
Kell, D.B. (2006). Metabolomics, modelling and machine learning in systems biology:Towards an understanding of the languages of cells. FEBS Journal, 273(5), 873-894.
Klahr, D. (2000). Exploring science:The cognition and development of discovery processes. Cambridge, MA:MIT Press.
Kostoff, R.N. (2012). Literature-related discovery and innovation update. Technological Forecasting & Social Change, 79(4), 789-800.
Lamb, J., Crawford, E.D., Peck, D., Modell, J.W., Blat, I.C., Wrobel, M.J., … Golub, T.R. (2006). The Connectivity Map:Using gene-expression signatures to connect small molecules, genes, and disease. Science, 313(5795), 1929-1935.
McKinsey (2009). Hal Varian on how the web challenges managers. Retrieved on September 2, 2016, from www.mckinsey.com/insights/innovation/hal_varian_on_how_the_web_challenges_managers.
Mons, B., Van Haagen, H., Chichester, C., Hoen, P.B.T., Den Dunnen, J.T., … Schultes, E. (2011). The value of data. Nature Genetics, 43(4), 281-283.
Moravcsik, A. (2014). Transparency:The revolution in qualitative research. Political Science & Politics, 47(1), 48-53.
Oprea, T.I., Tropsha, A., Faulon, J., & Rintoul, M.D. (2007). Systems chemical biology. Nature Chemical Biology, 3, 447-450.
Schulz, K. (2011). What is distance reading. New York Times, Jan 24. Retrieved on September 2, 2016, from www.nytimes.com/2011/06/26/books/review/the-mechanic-muse-what-is-distantreading.html?pagewanted=all&_r=0.
Song, M., Han, N., Kim, Y., Ding, Y., & Chambers, T. (2013). Discovering implicit entity relation with the gene-citation-gene network. PLoS One, 8(12), e84639.
Spangler, S., Wilkins, A.D., Bachman, B.J., Nagarajan, M., Dayaram, T., Haas, P., … Lichtarge, O.(2014). Automated hypothesis generation based on mining scientific literature. Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp 1878-1886). New York, USA.
Swanson, D.R. (1986). Fish oil, Raynaud's syndrome and undiscovered public knowledge. Perspectives in Biology and Medicine, 30(1), 7-18.
Swanson, D.R., Smalheiser, N.R., & Bookstein, A. (2001). Information discovery from complementary literatures:Categorizing viruses as potential weapons. Journal of the American Society for Information Science and Technology, 52(10), 797-812.
Thomsen, M. (2015). Microsoft's deep learning project outperforms humans in image recognition. Forbes, February 19. Retrieved on September 2, 2016, from www.forbes.com/sites/michaelthomsen/2015/02/19/microsofts-deep-learning-project-outperforms-humans-in-imagerecognition.
Upbin, B. (2013). IBM's Watson gets its first piece of business in healthcare. Forbes, February 8. Retrieved on September 2, 2016, from www.forbes.com/sites/bruceupbin/2013/02/08/ibmswatson-gets-its-first-piece-of-business-in-healthcare.
You, J. (2015). Beyond the Turing test. Science, 347(6218), 116.
Outlines

/

京ICP备05002861号-43

Copyright © 2023 All rights reserved Journal of Data and Information Science

E-mail: jdis@mail.las.ac.cn Add:No.33, Beisihuan Xilu, Haidian District, Beijing 100190, China

Support by Beijing Magtech Co.ltd E-mail: support@magtech.com.cn