Work-in-progress Dashboard for "What was Theoretical Biology? A Topic-Modelling Analysis of a Multilingual Corpus of Monographs and Journals, 1914-1945" for DHd 2022
Built with Bokeh and pyLDAvis.

Alexander Böhm, Stefan Reiners-Selbach, Jan Baedke, Alejandro Fábregas Tejeda, Daniel J. Nicholson, Vera Straetmanns

References
Angelov, Dimo (2020): Top2Vec: Distributed Representations of Topics, arXiv:2008.09470v1. https://arxiv.org/abs/2008.09470v1
Blei, D. M.; Ng, A. Y.; Jordan, M. I. (2003): Latent Dirichlet allocation. J Mach Learn Res, 3 (March), pp. 993–1022.
Bokeh Development Team (2018): Bokeh: Python library for interactive visualization. http://www.bokeh.pydata.org
De Vries, Erik; Schoonvelde, Martijn; Schumacher, Gijs (2018): No Longer Lost in Translation: Evidence that Google Translate Works for Comparative Bag-of-Words Text Applications. Political Analysis, 26 (4), pp. 417–430. https://doi.org/10.1017/pan.2018.26
Honnibal, Matthew; Montani, Ines; Van Landeghem, Sofie; Boyd, Adriane (2020): spaCy 3.1: Industrial-strength Natural Language Processing in Python. https://spacy.io/
Jockers, Matthew (2013): “Secret” recipe for topic modeling themes. https://www.matthewjockers.net/2013/04/12/secret-recipe-for-topic-modeling-themes/
Malaterre, Christophe (2021): Topic-modeling of multilingual non-parallel corpora: Applying machine-translation to a philosophy of science corpus. Talk at the DS² 2021 online Conference, March 16, 2021. https://youtu.be/FTzmpNYZs3E
Malaterre, Christophe; Chartier, Jean-Francois; Pulizzotto, Davide (2019): What is this thing called philosophy of science? A computational topic-modeling perspective 1934–2015. HOPOS, 9 (2), pp. 215–249. https://doi.org/10.1086/704372
Malaterre, Christophe; Lareau, Francis; Pulizzotto, Davide; St-Onge, Jonathan (2020): Eight journals over eight decades: a computational topic-modeling approach to contemporary philosophy of science. Synthese. https://doi.org/10.1007/s11229-020-02915-6
McCallum, Andrew Kachites (2002): MALLET: A Machine Learning for Language Toolkit. http://mallet.cs.umass.edu
McInnes, L.; Healy, J. (2018): UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv:1802.03426v3. https://arxiv.org/abs/1802.03426v3
Noichl, Maximilian (2019): Modeling the structure of recent philosophy, Synthese. https://doi.org/10.1007/s11229-019-02390-8
Rehurek, Radim; Sojka, Petr (2010): [gensim]. Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks, pp. 45–50. https://radimrehurek.com/gensim/
Sievert, Carson; Shirley, Kenneth E. (2014): LDAvis: A method for visualizing and interpreting topics. In: Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces, pp. 63–70.
Smith, Ray (2019): tesseract 4.1.1. https://tesseract-ocr.github.io/