QCD in Language Models: What they really know about QCD?
P.L.S. Connor* and A. Sulc
*: corresponding author
Full text: pdf
Pre-published on: January 13, 2026
Published on:
Abstract
Thisstudypresentsananalysisofmodernopen-sourcelargelanguagemodels(LLMs)–including
Llama, Qwen, and Gemma – to evaluate their encoded knowledge of Quantum Chromodynamics (QCD). Through reverse engineering of these models’ representations, we uncover the naturally idiosyncratic patterns in how foundational QCD concepts are embedded within their parameter spaces. Our methodology combines targeted probing techniques and knowledge extraction protocols to assess the models’ understanding of critical QCD principles like color confinement, asymptotic freedom, and the running coupling constant. This work provides a tool for utilizing LLMs as an assistant in physics research, while also highlighting current limitations in their representation of advanced quantum field theory concepts that future model development should address.
DOI: https://doi.org/10.22323/1.485.0658
How to cite

Metadata are provided both in article format (very similar to INSPIRE) as this helps creating very compact bibliographies which can be beneficial to authors and readers, and in proceeding format which is more detailed and complete.

Open Access
Creative Commons LicenseCopyright owned by the author(s) under the term of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.