Dr G. (Giovanni) Colavizza

Assistant Professor
Faculty of Humanities
Department of Media Studies

Visiting address
  • Turfdraagsterpad 9
  • Room number: 1.03
Postal address
  • Postbus 94550
    1090 GN Amsterdam
Contact details
  • Profile

    Giovanni is Assistant Professor of Digital Humanities at UvA, visiting researcher at The Alan Turing Institute and at the Centre for Science and Technology Studies (CWTS), Leiden University.

    He did his PhD at the Digital Humanities Laboratory of the EPFL in Lausanne, working on methods for text mining and citation analysis of scholarly publications, and is co-founder of Odoma, a start-up offering customised machine learning techniques in the cultural heritage domain. Giovanni was also a Co-investigator on the Living with Machines project and convenes the AI for Arts interest group at the Turing.

    Giovanni's interests range from AI for cultural heritage (part of UvA CREATE) to crypto art markets and the public understanding of science.

    Prior to joining the UvA, Giovanni was part of the Research Engineering Group of The Alan Turing Institute and a researcher at Leiden University (CWTS), the Leibniz Institute of European History in Mainz, and the University of Oxford. He studied computer science (BSc) and history (BA, MA) in Udine, Milan, Padua and Venice in Italy.

  • Publications


    • Colavizza, G., Blanke, T., Jeurgens, K. J. P. F. M., & Noordegraaf, J. J. (2022). Archives and AI: an overview of current debates and future perspectives. ACM Journal on Computing and Cultural Heritage (JOCCH), 15(1), [4].



    • Colavizza, G. (2020). COVID-19 research in Wikipedia. Quantitative Science Studies, 1(4), 1349-1380.
    • Colavizza, G., Cella, R., & Bellavitis, A. (2020). Apprenticeship in Early Modern Venice. In M. Prak, & P. Wallis (Eds.), Apprenticeship in Early Modern Europe (pp. 106-137). Cambridge University Press.
    • Colavizza, G., Hrynaszkiewicz, I., Staden, I., Whitaker, K., & McGillivray, B. (2020). The citation advantage of linking publications to research data. PLoS ONE, 15(4), [e0230416].
    • Daquino, M., Peroni, S., Shotton, D., Colavizza, G., Ghavimi, B., Lauscher, A., Mayr, P., Romanello, M., & Zumstein, P. (2020). The OpenCitations Data Model. In J. Z. Pan, V. Tamma, C. d’Amato, K. Janowicz, B. Fu, A. Polleres, O. Seneviratne, & L. Kagal (Eds.), The Semantic Web – ISWC 2020 : 19th International Semantic Web Conference Athens, Greece, November 2–6, 2020 : Proceedings (Vol. II, pp. 447-463). (Lecture Notes in Computer Science; Vol. 12507). Springer.
    • Franceschet, M., & Colavizza, G. (2020). Quantifying the higher-order influence of scientific publications. Scientometrics, 125(2), 951–963.
    • Lazzari, G., Colavizza, G., Bortoluzzi, F., Drago, D., Erboso, A., Zugno, F., Kaplan, F., & Salathé, M. (2020). A digital reconstruction of the 1630–1631 large plague outbreak in Venice. Scientific Reports, 10, [17849].
    • Piccardi, T., Redi, M., Colavizza, G., & West, R. (2020). Quantifying Engagement with Citations on Wikipedia. In The Web Conference 2020: Proceedings of The World Wide Web Conference WWW 2020 : April 20–24, 2020 Taipei, Taiwan (pp. 2365-2376). International World Wide Web Conference Committee.
    • Spinaci, G., Colavizza, G., & Peroni, S. (2020). Preliminary Results on Mapping Digital Humanities Research. In C. Marras, M. Passarotti, G. Franzini, & E. Litta (Eds.), La svolta inevitabile: sfide e prospettive per l'Informatica Umanistica: Atti del IX Convegno Annuale dell'Associazione per l'Informatica Umanistica e la Cultura Digitale (AIUCD) : 15-17 gennaio 2020, Milano, Università Cattolica del Sacro Cuore (pp. 246-252). (Quaderni di Umanistica Digitale. Supplement). Associazione per l'Informatica Umanistica e la Cultura Digitale.
    • Todorov, K., & Colavizza, G. (2020). Transfer Learning for Historical Corpora: An Assessment on Post-OCR Correction and Named Entity Recognition. In F. Karsdorp, M. McGillivray, A. Nerghes, & M. Wevers (Eds.), CHR 2020 : Computational Humanities Research 2020: Proceedings of the Workshop on Computational Humanities Research (CHR 2020) : Amsterdam, the Netherlands, November 18-20, 2020 (pp. 310-339). (CEUR Workshop Proceedings; Vol. 2723). CEUR-WS.
    • Waltman, L., Boyack, K. W., Colavizza, G., & van Eck, N. J. (2020). A principled methodology for comparing relatedness measures for clustering publications. Quantitative Science Studies, 1(2), 691-713.
    • van Strien, D., Beelen, K., Ardanuy, M. C., Hosseini, K., McGillivray, B., & Colavizza, G. (2020). Assessing the Impact of OCR Quality on Downstream NLP Tasks. In A. Rocha, L. Steels, & J. van den Herik (Eds.), ICAART 2020 : proceedings of the 12th International Conference on Agents and Artificial Intelligence : Valletta, Malta, February 22-24, 2020 (Vol. 1, pp. 484-496). ScitePress.


    • Colavizza, G. (2019). Are We Breaking the Social Contract? Journal of Cultural Analytics.
    • Colavizza, G., & Romanello, M. (2019). Citation Mining of Humanities Journals: The Progress to Date and the Challenges Ahead. Journal of European Periodical Studies, 4(1), 36-53.
    • Colavizza, G., Franceschet, M., Traag, V. A., & Waltman, L. (2019). Quantifying the long-term influence of scientific publications. In G. Catalano, C. Daraio, M. Gregori, H. F. Moed, & G. Ruocco (Eds.), 17th International Conference on Scientometrics and Informetrics, ISSI 2019 - Proceedings (Vol. 1, pp. 1301-1306). International Society for Scientometrics and Informetrics.
    • Colavizza, G., Franssen, T., & van Leeuwen, T. (2019). An empirical investigation of the tribes and their territories: Are research specialisms rural and urban? Journal of Informetrics, 13(1), 105-117.
    • Colavizza, G., Romanello, M., Giuliano, A., Mataloni, M. C., & Grandin, D. (2019). Linked Books: un indice citazionale per la storia di Venezia. DigItalia, 14(1), 132-146.
    • Filgueira, R., Jackson, M., Roubickova, A., Krause, A., Ahnert, R., Hauswedell, T., Nyhan, J., Beavan, D., Hobson, T., Ardanuy, M. C., Colavizza, G., Hetherington, J., & Terras, M. (2019). defoe: A Spark-based Toolbox for Analysing Digital Historical Textual Data. In IEEE 15th International Conference on eScience: proceedings : 24-27 September 2019, San Diego, California (pp. 235-242). [9041813] (Proceedings - IEEE 15th International Conference on eScience, eScience 2019). IEEE Computer Society.
    • Madge, J., Colavizza, G., Hetherington, J., Guo, W., & Wilson, A. (2019). Assessing Simulations of Imperial Dynamics and Conflict in the Ancient World. Cliodynamics, 10(2), 99-114.


    • Albertin, F., Balliana, E., Pizzol, G., Colavizza, G., Zendri, E., & Raines, D. (2018). Printing materials and technologies in the 15th–17th century book production: An undervalued research field. Microchemical Journal, 138, 147-153.
    • Boyack, K. W., van Eck, N. J., Colavizza, G., & Waltman, L. (2018). Characterizing in-text citations in scientific articles: A large-scale analysis. Journal of Informetrics, 12(1), 59-73.
    • Colavizza, G. (2018). A diachronic study of historiography. Scientometrics, 117(3), 2117-2131.
    • Colavizza, G. (2018). Understanding the history of the humanities from a bibliometric perspective: Expansion, conjunctures, and traditions in the last decades of Venetian historiography (1950–2013). History of Humanities, 3(2), 377-406.
    • Colavizza, G., Boyack, K. W., van Eck, N. J., & Waltman, L. (2018). The Closer the Better: Similarity of Publication Pairs at Different Cocitation Levels. Journal of the Association for Information Science and Technology, 69(4), 600-609.
    • Colavizza, G., Romanello, M., & Kaplan, F. (2018). The references of references: a method to enrich humanities library catalogs with citation data. International Journal on Digital Libraries, 19(2-3), 151-161.



    • Colavizza, G. (2016). Epidemics in Venice: on the small or large nature of the pre-modern world. In B. Bozic, G. Mendel-Gleason, C. Debruyne, & D. O'Sullivan (Eds.), Computational History and Data-Driven Humanities - 2nd IFIP WG 12.7 International Workshop O4CHDDH 2016, Revised Selected Papers (pp. 33-40). (IFIP Advances in Information and Communication Technology; Vol. 482). Springer.
    • Colavizza, G., & Franceschet, M. (2016). Clustering citation histories in the Physical Review. Journal of Informetrics, 10(4), 1037-1051.


    • Colavizza, G., Infelise, M., & Kaplan, F. (2015). Mapping the early modern news flow: An enquiry by robust text reuse detection. In D. McFarland, & L. M. Aiello (Eds.), Social Informatics - SocInfo 2014 International Workshops, Revised Selected Papers (pp. 244-253). (Lecture Notes in Computer Science; Vol. 8852). Springer.


    • Piotrowski, M., Colavizza, G., Thiery, F., & Bruhn, K. C. (2014). The labeling system: A new approach to overcome the vocabulary bottleneck. In P. Schmitz, Q. Dombrowski, & L. Pearce (Eds.), Proceedings of the 2nd International Workshop: Collaborative Annotations in Shared Environments: Metadata, Tools, and Techniques in the Digital Humanities, DH-CASE 2014 - Co-located with ACM DocEng 2014 [a1] (ACM International Conference Proceeding Series; Vol. 16). Association for Computing Machinery.


    • Piccardi, T., Redi, M., Colavizza, G., & West, R. (2021). On the Value of Wikipedia as a Gateway to the Web. In The Web Conference 2021: proceedings of the World Wide Web Conference WWW 2021 : April 19-23, 2021, Ljubljana, Slovenia (pp. 249-260). Association for Computing Machinery.



    • Boyack, K. W., Van Eck, N. J., Colavizza, G., & Waltman, A. L. (2017). Reference behavior in the full text of scientific articles: A large-scale analysis. 787-798. Paper presented at 16th International Conference on Scientometrics and Informetrics, ISSI 2017, Wuhan, China.
    • Colavizza, G., Boyack, K. W., Van Eck, N. J., & Waltman, L. (2017). Exploring the similarity of articles co-cited at different levels. 549-560. Paper presented at 16th International Conference on Scientometrics and Informetrics, ISSI 2017, Wuhan, China.
    • Waltman, L., Boyack, K. W., Colavizza, G., & Van Eck, N. J. (2017). A principled methodology for comparing relatedness measures for clustering publications. 691-702. Paper presented at 16th International Conference on Scientometrics and Informetrics, ISSI 2017, Wuhan, China.


    • Bussière, L., Colavizza, G., Fernández, R. & Bellomo, A. (2019-2021). ILLC Diversity Committee.

    Media appearance


    • Fraumann, G. (organiser), Zahedi, Z. (organiser), Waltman, L. (organiser) & Colavizza, G. (participant) (8-9-2020). Doing science in times of crisis: Science studies perspectives on COVID-19 (organising a conference, workshop, ...).
    • Colavizza, G. (participant) & Waltman, L. (organiser) (20-5-2020). Doing science in times of crisis: Science studies perspectives on COVID-19 (organising a conference, workshop, ...).


    This list of publications is extracted from the UvA-Current Research Information System.
  • Ancillary activities
    • Giovanni Colavizza
      Sole proprietorship (eenmanszaak) for consulting
    • Odoma Sàrl
    • Nifty Value Sarl
    • University of Bologna
      Visiting professor