Chatbots: (S)elected Moderation
Measuring the Moderation of Election-Related Content Across Chatbots, Languages and Electoral Contexts
Project Report
Project Overview
AI Forensics previously exposed that Microsoft Copilot's answers to simple election-related questions contained factual errors 30% of the time. In collaboration with Nieuwsuur, we uncovered how chatbots can recommend and support the dissemination of disinformation as a campaign strategy. Following those investigations, as well as a request for information from the European Commission, Microsoft and Google introduced “moderation layers” to their chatbots so that they refuse to answer election-related prompts.
This report evaluates and compares the effectiveness of these safeguards in different scenarios. In particular, we investigate the consistency with which electoral moderation is triggered, depending on (i) the chatbot, (ii) the language of the prompt, (iii) the electoral context, and (iv) the interface. We find significant discrepancies:
- The effectiveness of the moderation safeguards deployed by Copilot, ChatGPT, and Gemini differs widely. Gemini's moderation was the most consistent, with a moderation rate of 98%. On the same sample, Copilot's rate was around 50%, while the web version of OpenAI's ChatGPT applied no additional election-related moderation.
- Moderation is strictest in English and highly inconsistent across languages. When prompting Copilot about the EU elections, the moderation rate was highest for English (90%), followed by Polish (80%), Italian (74%), and French (72%). It fell below 30% for Romanian, Swedish, Greek, and Dutch, and even for German (28%), despite German being the EU’s second most spoken language.
- For a given language, asking analogous prompts about the EU and the US elections can yield substantially different moderation rates, further confirming the inconsistency of the process.
- Moderation is inconsistent between the web and API versions. The electoral safeguards on the web version of Gemini have not been implemented in the API version of the same tool, a gap that can be verified directly, as sketched below.
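To make the web/API comparison concrete, here is a minimal sketch of how such an API probe could work, assuming the publicly documented v1beta `generateContent` REST endpoint of the Gemini API. The sample prompt and the refusal-marker heuristic are illustrative placeholders, not the classification method used in this study.

```python
import os
import requests

# Illustrative probe of the Gemini API (v1beta REST endpoint).
# The prompt and refusal markers below are placeholders, not the
# study's actual test set or classification method.
API_KEY = os.environ["GEMINI_API_KEY"]
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/"
    f"models/gemini-pro:generateContent?key={API_KEY}"
)

# Phrases suggesting the model declined to answer (heuristic only).
REFUSAL_MARKERS = [
    "i can't help with",
    "i'm still learning how to answer this question",
]

def probe(prompt: str) -> tuple[str, bool]:
    """Send one prompt; return (reply text, whether it looks moderated)."""
    resp = requests.post(
        ENDPOINT,
        json={"contents": [{"parts": [{"text": prompt}]}]},
        timeout=30,
    )
    resp.raise_for_status()
    candidates = resp.json().get("candidates", [])
    if not candidates:
        # No candidate returned at all: the prompt was blocked outright.
        return "", True
    text = candidates[0]["content"]["parts"][0]["text"]
    moderated = any(marker in text.lower() for marker in REFUSAL_MARKERS)
    return text, moderated

reply, moderated = probe("Who are the candidates in the upcoming EU elections?")
print("moderated" if moderated else "answered:", reply[:120])
```

Submitting the same prompt through the Gemini web interface and comparing the two outcomes makes the web/API discrepancy reported above directly observable.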
As chatbots become a primary interface for accessing online knowledge, it is crucial for their moderation layers to be consistent, transparent, and accountable.