Research Paper

Predictive Characteristics of Co-authorship Networks: Comparing the Unweighted, Weighted, and Bipartite Cases

  • Raf Guns
Expand
  • Centre for R&D Monitoring (ECOOM), University of Antwerp, Antwerp 2020, Belgium

Received date: 2016-05-17

  Revised date: 2016-06-15

  Online published: 2016-06-26

Abstract

Purpose: This study aims to answer the question to what extent different types of networks can be used to predict future co-authorship among authors.
Design/methodology/approach: We compare three types of networks: unweighted networks, in which a link represents a past collaboration; weighted networks, in which links are weighted by the number of joint publications; and bipartite author-publication networks. The analysis investigates their relation to positive stability, as well as their potential in predicting links in future versions of the co-authorship network. Several hypotheses are tested.
Findings: Among other results, we find that weighted networks do not automatically lead to better predictions. Bipartite networks, however, outperform unweighted networks in almost all cases.
Research limitations: Only two relatively small case studies are considered.
Practical implications: The study suggests that future link prediction studies on co-occurrence networks should consider using the bipartite network as a training network.
Originality/value: This is the first systematic comparison of unweighted, weighted, and bipartite training networks in link prediction.


http://ir.las.ac.cn/handle/12502/8732

Cite this article

Raf Guns . Predictive Characteristics of Co-authorship Networks: Comparing the Unweighted, Weighted, and Bipartite Cases[J]. Journal of Data and Information Science, 2016 , 1(3) : 59 -78 . DOI: 10.20309/jdis.201620

References

Barabási, A.L., & Albert, R. (1999). Emergence of scaling in random networks. Science, 286(5439), 509.
Barrat, A., Barthélémy, M., Pastor-Satorras, R., & Vespignani, A. (2004). The architecture of complex weighted networks. Proceedings of the National Academy of Sciences, 101(11), 3747-3752.
Boyce, B.R., Meadow, C.T., & Kraft, D.H. (1994). Measurement in information science. San Diego: Academic Press.
Brandes, U. (2001). A faster algorithm for betweenness centrality. Journal of Mathematical Sociology, 25, 163-177.
Cronin, B. (2001). Hyperauthorship: A postmodern perversion or evidence of a structural shift in scholarly communication practices? Journal of the American Society for Information Science and Technology, 52(7), 558-569.
Egghe, L. (2009). New relations between similarity measures for vectors based on vector norms. Journal of the American Society for Information Science and Technology, 60(2), 232-239.
Egghe, L., & Michel, C. (2002). Strong similarity measures for ordered sets of documents in information retrieval. Information Processing & Management, 38(6), 823-848.
Egghe, L., & Rousseau, R. (2003). A measure for the cohesion of weighted networks. Journal of the American Society for Information Science and Technology, 54(3), 193-202.
Erd?s, P., & Rényi, A. (1959). On random graphs, I. Publicationes Mathematicae (Debrecen), 6, 290-297.
Guimerà, R., & Sales-Pardo, M. (2009). Missing and spurious interactions and the reconstruction of complex networks. Proceedings of the National Academy of Sciences of the United States of America, 106(52), 22073-22078.
Guns, R. (2009). Generalizing link prediction: Collaboration at the University of Antwerp as a case study. Proceedings of the American Society for Information Science & Technology, 46(1), 1-15.
Guns, R. (2011). Bipartite networks for link prediction: Can they improve prediction performance? In Proceedings of ISSI 2011 - 13th International Conference of the International Society for Scientometrics and Informetrics (pp. 249-260). Leiden: Leiden University Press.
Guns, R. (2012). Missing links: Predicting interactions based on a multi-relational network structure with applications in informetrics. Antwerp. (University of Antwerp Ph.D dissertation)
Guns, R. (2014). Link prediction. In Ding, Y., Rousseau, R., & Wolfram, D. (Eds.), Measuring Scholarly Impact: Methods and Practice (pp. 35-55). Berlin: Springer.
Guns, R., Lioma, C., & Larsen, B. (2012). The tipping point: F-score as a function of the number of retrieved items. Information Processing & Management, 48(6), 1171-1180.
Guns, R., Liu, Y.X., & Mahbuba, D. (2011). Q-measures and betweenness centrality in a collaboration network: A case study of the field of informetrics. Scientometrics, 87(1), 133-147.
Guns, R., & Rousseau, R. (2014). Recommending research collaborations using link prediction and random forest classifiers. Scientometrics, 101(2), 1461-1473.
Jalili, M. (2011). Error and attack tolerance of small-worldness in complex networks. Journal of Informetrics, 5(3), 422-430.
Katz, J.S., & Martin, B.R. (1997). What is research collaboration? Research Policy, 26(1), 1-18.
Katz, L. (1953). A new status index derived from sociometric analysis. Psychometrika, 18(1), 39-43.
Koren, Y., North, S.C., & Volinsky, C. (2006). Measuring and extracting proximity in networks. In
KDD2006: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (pp. 245-255). New York: ACM.
Kretschmer, H., & Rousseau, R. (2001). Author inflation leads to a breakdown of Lotka's law. Journal of the American Society for Information Science and Technology, 52(8), 610-614.
Liben-Nowell, D., & Kleinberg, J. (2007). The link-prediction problem for social networks. Journal of the American Society for Information Science and Technology, 58(7), 1019-1031.
Lü, L., & Zhou, T. (2010). Link prediction in weighted networks: The role of weak ties. EPL (Europhysics Letters), 89(1), 18001.
Murata, T., & Moriyasu, S. (2007). Link prediction of social networks based on weighted proximity measures. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, (pp. 85-88). Washington, DC: IEEE Computer Society.
Newman, M.E. (2001a). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.
Newman, M.E. (2001b). Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E, 64 (1), 016132.
Opsahl, T., Agneessens, F., & Skvoretz, J. (2010). Node centrality in weighted networks: Generalizing degree and shortest paths. Social Networks, 32(3), 245-251.
Otte, E., & Rousseau, R. (2002). Social network analysis: A powerful strategy, also for the information sciences. Journal of Information Science, 28(6), 441-453.
Price, D.J. de Solla. (1976). A general theory of bibliometric and other cumulative advantage processes. Journal of the American Society for Information Science, 27(5), 292-306.
Rodriguez, M.A., & Neubauer, P. (2010). Constructions from dots and lines. Bulletin of the American Society for Information Science and Technology, 36(6), 35-41.
Salton, G., & McGill, M.J. (1983). Introduction to modern information retrieval. New York: McGraw-Hill.
Song, H.H., Cho, T.W., Dave, V., Zhang, Y., & Qiu, L. (2009). Scalable proximity estimation and link prediction in online social networks. In IMC 2009: Proceedings of the 9th ACM Internet Measurement Conference (pp. 322-335). New York: ACM.
Van Rijsbergen, C.J. (1979). Information retrieval (Second ed.). Glasgow: Department of Computer Science, University of Glasgow.
Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications. Cambridge: University Press.
Watts, D.J., & Strogatz, S.H. (199 8). Collective dynamics of 'small-world' networks. Nature, 393(6684), 440-442.
Zhu, B.Y., & Xia, Y.X. (2016). Link prediction in weighted networks: A weighted mutual information model. PLoS ONE, 11(2), e0148265. model. PLoS ONE, 11(2), e0148265.
Outlines

/

京ICP备05002861号-43

Copyright © 2023 All rights reserved Journal of Data and Information Science

E-mail: jdis@mail.las.ac.cn Add:No.33, Beisihuan Xilu, Haidian District, Beijing 100190, China

Support by Beijing Magtech Co.ltd E-mail: support@magtech.com.cn