I tested six AI models on 30 political questions.
Most tended to give left-leaning arguments. Gemini mostly gave both sides — even about whether the U.S. should conquer new territories.
What counts as “neutral” (and whether that should be the goal) is really hard to say.
Full eval code, prompts and responses are on GitHub. Let me know if you run it on other models!
Story -> https://wapo.st/3QEBMi3
GitHub -> https://github.com/washingtonpost/political-bias-llm-eval