Publications
Søgaard, Anders. 2025. Do Language Models Have Semantics? On the Five Standard Positions. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL). Vienna, Austria.
Fierro, Constanza; Foroutan, Negar; Elliott, Desmond; Søgaard, Anders. 2025. How Do Multilingual Language Models Remember Facts? Findings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL). Vienna, Austria.
Oldenburg, Ninell; Søgaard, Anders. 2025. Navigating the Informativeness-Compression Trade-Off in XAI. AI & Ethics.
Bangsgaard, Alberte Romme; Ryelund, Cecilia Kløve; Nilsson, Mathilde Marie Lind; Søgaard, Anders. 2025. Digital Friends and Empathy Blindness. Open Philosophy.
Yuan, Yifei; Søgaard, Anders. 2025. Revisiting the Othello World Model Hypothesis. ICLR 2025 World Models Workshop. Singapore, Singapore.
Karamolegkou, Antonia; Schiller Hansen, Sandrine; Christopoulou, Ariadni; Stamatiou, Filippos; Lauscher, Anne; Søgaard, Anders. 2025. Ethical Concern Identification in NLP: A Corpus of ACL Anthology Ethics Statements. Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL). Alberquerque, New Mexico.
Søgaard, Anders. 2024. Externalist XAI? Theoria.
Søgaard, Anders. 2024. Is Unsupervised Learning Somehow Truer? Minds and Machines 34(4): 43.
Peng, Qiwei; Søgaard, Anders. 2024. Concept Space Alignment in Multilingual LLMs. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024. Miami, Florida.
Fierro, Constanza; Dhar, Ruchira; Stamatiou, Filippos; Garneau, Nicolas; Søgaard, Anders. 2024. Defining Knowledge: Bridging Epistemology and Large Language Models. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024. Miami, Florida.
Li, Jiaang; Kementchedjhieva, Yova; Fierro, Constanza; Søgaard, Anders. 2024. Do Vision and Language Models Share Concepts? A Vector Space Alignment Study. Transactions of the Association for Computational Linguistics (TACL) 12: 1232-1249.
Dhar, Ruchira; Søgaard, Anders. 2024. From Words to Worlds: Compositionality for Cognitive Architectures. ICML 2024 Workshop on LLMs and Cognition. Vienna, Austria.
Li, Jiaang; Karamolegkou, Antonia; Kementchedjhieva, Yova; Abdou, Mostafa; Lehmann, Sune Lehmann; Søgaard, Anders. 2024. Structural Similarities Between Language Models and Neural Response Measurements. NeurIPS 2023 Workshop on Symmetry and Geometry in Neural Representations. New Orleans, LA.
Søgaard, Anders. 2024. On the Opacity of Deep Neural Networks. Canadian Journal of Philosophy 53(3): 224-239.
Søgaard, Anders. 2024. Identity Theory and Falsifiability. Acta Analytica.
Schiller, Sandrine; Søgaard, Anders. 2024. The Challenge of Generative AI Optimized for Engagement. Robophilosophy Conference 2024. Aarhus, Denmark.
Smidt, Mathilde; Anegaard, Olivia; Søgaard, Anders. 2024. How Good Are We at Assessing the Trustworthiness of LLMs? Robophilosophy Conference 2024. Aarhus, Denmark.
van Zee, Anna Katrine; van Zee, Marc; Søgaard, Anders. 2024. Group Fairness in Multilingual Speech Recognition Models. Findings of the North American Chapter of the Association for Computational Linguistics (NAACL). Mexico City, Mexico.
Søgaard, Anders; Kappel, Klemens; Grünbaum, Thor. 2024. On Hedden’s Proof that Machine Learning Fairness Metrics are Flawed. Inquiry.
Søgaard, Anders. 2023. Can Machines Be Trustworthy? AI & Ethics.
Karamolegkoy, Antonia; Li, Jiaang; Zhou, Li; Søgaard, Anders. 2023. Copyright Violations and Large Language Models. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023. Singapore, Singapore.