Volume 488 - International Symposium on Grids & Clouds (ISGC2025) (ISGC2025) - Social Sciences, Art and Humanities (SSAH) Applications
Easily accessible (small) LLMs in historical studies: opportunities, limitations, pitfalls
M. Groep-foncke* and T. De Moor
*: corresponding author
Full text: pdf
Published on: October 20, 2025
Abstract
Automation of certain parts of data collection and data processing in the field of Social Sciences and Humanities would enable researchers to skip some of the more painstaking tasks, leaving more time for the actual analysis and opening up the possibility to work with larger data sets. With the current and imminent generations of open-source Large Language Models (LLMs) and small Large Language Models (sLLMs) it seems already attainable for individual researchers to speed up onerous but necessary tasks such as qualitative data coding using personal computers, while keeping control of their datasets at all times. Using the qualitative coding process of the former Common Rules project of research group Social Enterprises \& Institutions for Collective Action (SEICA, Erasmus University Rotterdam) as a case study, we explored whether open source, low-threshold (s)LLMs are already able to perform specific tasks such as qualitative data coding, and what are the limitations and pitfalls that current and future researchers in the field of Social Sciences and Humanities have to reckon with when applying such AI-driven aides.
DOI: https://doi.org/10.22323/1.488.0004
How to cite

Metadata are provided both in article format (very similar to INSPIRE) as this helps creating very compact bibliographies which can be beneficial to authors and readers, and in proceeding format which is more detailed and complete.

Open Access
Creative Commons LicenseCopyright owned by the author(s) under the term of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.