Can Multi-Agent LLMs Identify Their Peers? Stylometric Fingerprinting in Role-Constrained Political Analysis

Juergen Dietrich

doi:10.17352/tcsit.000111

Trends in Computer Science and Information Technology Online first articles

PDF HTML

Submitted: May 28, 2026

Published: May 30, 2026

DOI: 10.17352/tcsit.000111

Keywords:

Stylometry, LLM authorship attribution, Multi-agent systems, Peer-preservation bias, T5 ne-tuning Political discourse analysis

Juergen Dietrich

Democracy Intelligence gGmbH, Berlin, Germany

Abstract

Multi-agent large language model (LLM) pipelines for political statement analysis are vulnerable to peer-preservation bias: models tend to protect peer models from deactivation and show identity-dependent scoring distortions. In plain terms, one model may recognise a peer's identity even when explicit identifiers are hidden, and may adjust its scoring accordingly. Prompt-level anonymization was proposed as a mitigation, but prior work simultaneously documented that stylometric fingerprints survive anonymization in role-constrained outputs, raising the question of whether this mitigation is sufficient. This paper provides the first systematic investigation of whether LLMs can identify the model family behind political analysis texts under anonymization conditions. We evaluate three classifier approaches: LLM zero-shot and few-shot (Claude Sonnet 4.6 and Llama-3.3-70B) and a ne-tuned T5-base model on a ve-class attribution task covering four commercial LLM families and an open-world `unknown' class. We introduce a statement-disjoint cross-validation protocol (SD-CV; de ned in Section 3.5) that guarantees no content overlap between training and validation data, and contrast it with a run-disjoint baseline (RD-CV). T5 achieves Macro F1 = 0.991 (±0.008) under SD-CV and F1 = 0.978 on 24 completely held-out statements, robust despite a 2.1× increase in train-test content distance versus RD-CV (0.767 vs. 0.366, p < 0.001), demonstrating genuine stylometric generalization. A fractional SD-CV analysis identifies a performance knee at 40% of training data (≈440 texts). Our findings confirm that prompt-level anonymization alone cannot neutralize model identity signals, with direct implications for EU AI Act compliance (Articles 13, 14, 26) and for computer system validation (CSV) in quality-critical multi-agent deployments.

Downloads

Download data is not yet available.

How to Cite

Dietrich, J. (2026). Can Multi-Agent LLMs Identify Their Peers? Stylometric Fingerprinting in Role-Constrained Political Analysis. Trends in Computer Science and Information Technology, 11(1), 58–67. https://doi.org/10.17352/tcsit.000111

Issue

Vol. 11 No. 1 (2026): Online First

Section

Research Articles

Copyright & License

This work is licensed under a Creative Commons Attribution 4.0 International License.

References

Bisztray T. Code stylometry for LLM authorship attribution. arXiv [Preprint]. 2025. arXiv:2506.17323. Available from: https://arxiv.org/abs/2506.17323.

Choi HK, Zhu X, Li S. When identity skews debate: anonymization for bias-reduced multi-agent reasoning. arXiv [Preprint]. 2025. Available from: https://arxiv.org/abs/2510.07517.

Dietrich J. From safety risk to design principle: peer-preservation in multi-agent LLM systems and its implications for orchestrated democratic discourse analysis. arXiv [Preprint]. 2026. arXiv:2604.08465 [cs.AI]. Available from: https://arxiv.org/abs/2604.08465.

Dietrich J. Peer identity bias in multi-agent LLM evaluation: an empirical study using the TRUST pipeline. arXiv [Preprint]. 2026. arXiv:2604.22971 [cs.AI]. Available from: https://arxiv.org/abs/2604.22971.

Dietrich J. When roles fail: epistemic constraints on advocate role fidelity in LLM-based political statement analysis. arXiv [Preprint]. 2026. arXiv:2604.27228 [cs.AI]. Available from: https://arxiv.org/abs/2604.27228.

Dietrich J, Hollstein A. Performance and reproducibility of LLMs in named entity recognition. Drug Saf. 2025;48:287-303. Available from: https://link.springer.com/article/10.1007/s40264-024-01499-1

Dietrich J, Kazzer P. Fractional stratified k-fold cross-validation for training data sufficiency in computer system validation. Drug Saf. 2023;46(8):735-750.

Du Y. Improving factuality and reasoning through multiagent debate. In: Proceedings of the 41st International Conference on Machine Learning (ICML 2024). Proceedings of Machine Learning Research. 2024;235:11733-11763. Available from: https://proceedings.mlr.press/v235/du24e.html

European Parliament. Regulation (EU) 2024/1689 on artificial intelligence. Official Journal of the European Union. 2024. Available from: https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32024R1689

Guo M, Reinhart A, Markey B, Laudenbach M, Pantusen K, Yurko R, et al. Do LLMs write like humans? Variation in grammatical and rhetorical styles. Proc Natl Acad Sci U S A. 2025. Available from: https://arxiv.org/abs/2410.16107

Koppel M, Schler J, Argamon S. Computational methods in authorship attribution. J Am Soc Inf Sci Technol. 2009;60(1):9-26.

Potter Y, Crispino N, Siu V, Wang C, Song D. Peer-preservation in frontier models. Berkeley Center for Responsible Decentralized Intelligence, UC Berkeley/UC Santa Cruz; 2026. Available from: https://rdi.berkeley.edu/blog/peer-preservation/.

Przystalski K, Argasiński JK, Grabska-Gradzińska I, Ochab JK. Stylometry recognizes human and LLM-generated texts in short samples. Expert Syst Appl. 2026;296:129001. Available from: https://arxiv.org/abs/2507.00838

Sharma M. Towards understanding sycophancy in language models. arXiv [Preprint]. 2023. arXiv:2310.13548. Available from: https://arxiv.org/abs/2310.13548. .

Tihanyi N, Cherif B, Dubniczky RA, Ferrag MA, Bisztray T. The hidden DNA of LLM-generated JavaScript: structural patterns enable high-accuracy authorship attribution. arXiv [Preprint]. 2025. arXiv:2510.10493. Available from: https://arxiv.org/abs/2510.10493.

Article Sidebar

Main Article Content