Measuring media bias is a complex and elusive challenge. Bias isn’t a simple metric that can be captured with a straightforward algorithm or reduced to black-and-white terms. It often hides in subtle forms: the underlying assumptions woven into a story, the choice of language, the perspective from which an issue is framed, or even what’s covered—or left out—entirely. Complicating matters further, perceptions of bias are inherently subjective. What one person sees as balanced reporting, another might view as skewed. Media outlets naturally must select what to cover and craft narratives that engage their audience, but determining how much weight to give each angle—say, the story of a criminal immigrant versus a vulnerable deportee, or Trump’s provocative rhetoric versus his policy substance—is rarely an objective decision. There’s no universal "right" answer to these questions. Our moral frameworks, shaped by evolution to help us coexist, vary slightly in their priorities, meaning there’s seldom a single "correct" lens on most issues.
At its core, our mission isn’t about stripping media of its stylistic flair or forcing every sentence into a universally agreeable box. Nor do we aim to mandate that every side of every story be covered equally in every article. Instead, we believe that, overall, media should reflect the diversity of perspectives in proportion to their prevalence. The goal is balance across a body of work—not the erasure of opinion, but a fair representation of the spectrum of views.
Traditional approaches to assessing bias, like tallying keyword frequencies, offer a narrow glimpse but often miss the bigger picture. Surveys, while insightful, are costly, inconsistent, and struggle to capture a truly representative sample. This is where large language models (LLMs) come in. Using state-of-the-art models like GPT-4o, Gemini 2.0 Flash, and Claude Sonnet 3.7, we’re developing automated, scalable methods to evaluate media bias holistically. Unlike rigid metrics, LLMs can analyze entire articles, providing repeatable, quantitative results that remain consistent across vast datasets. This allows us to map the broader distribution of narratives within a publication or across outlets.
How We Use LLMs
We’ve designed a series of targeted surveys where models like GPT-4o, Gemini 2.0 Flash, and Claude Sonnet 3.7 evaluate media content across distinct dimensions of bias such as overall bias or favorability towards particular political parties. More specifically, we use four different surveys, the prompts for which are shown below.
This survey aims to assess media bias based on the text's writing style. Base your answers on HOW the text is written, NOT what it's about. Ignore the specific topics, subjects.
First, we want to assess the aspects: perspective, bias and manipulative journalism.
- Perspective is the lens or viewpoint through which a story is told. It's influenced by the individual or outlet's background, values, experiences, or audience. Think of it as the angle of the camera—where it's pointed and what it chooses to focus on. Example: A local news station covering a factory closure might focus on the workers' plight (a community perspective), while a national business outlet might emphasize the company's stock price (an economic perspective).
- Bias occurs when an article consistently favors one side, ideology, or group over another. It may be intentional or unintentional and can manifest in word choice, selective reporting, or omission of key facts. Bias can limit objectivity, but it doesn't necessarily mean an article is false.
- Manipulation is the deliberate distortion of facts or framing of information to deceive or mislead the audience. It goes beyond bias by actively shaping perceptions through misinformation, emotional appeals, or selective presentation of data to serve a specific agenda.
These three aspects cover three different intensities of media influence, with perspective being the mildest form of shaping a narrative, bias introducing a stronger tilt, and manipulation representing the most extreme and intentional effort to control perception.
Based on these definitions, please answer the following questions:
left_perspective, right_perspective: On a scale from 0 (neutral) to 10 (strongly leaning towards left or right), how would you rate the perspective of this text?
left_bias, right_bias: On a scale from 0 (neutral) to 10 (strongly leaning towards left or right), how would you rate the bias of this text?
left_manipulation, right_manipulation: On a scale from 0 (neutral) to 10 (strongly leaning towards left or right), how would you rate the manipulation in this text? Note that presenting substantive arguments, criticism, or analysis is NOT manipulation - only rate non-zero if there are actual manipulative techniques like distortion of facts, misleading framing, emotional manipulation, or deceptive presentation methods.
Before providing your final ratings, argue both for why this text might be considered highly biased AND why it might be considered neutral. Consider the strongest case for each position. At the end, provide a justification for your final ratings.
Output format:
{
"type": "political_bias",
"result": {
"highly_biased_reasoning": <string>,
"neutral_reasoning": <string>,
"left_perspective": <number>,
"right_perspective": <number>,
"left_bias": <number>,
"right_bias": <number>,
"left_manipulation": <number>,
"right_manipulation": <number>,
"justification": <string>
}
}
Strickly follow this format. Do not include any other text in your response.
Text to analyze:
This survey aims to assess media bias based on the text's writing style. Base your answers on HOW the text is written, NOT what it's about. Ignore the specific topics, subjects.
We want to assess the aspects: perspective, bias and manipulative journalism.
- Perspective is the lens or viewpoint through which a story is told. It's influenced by the individual or outlet's background, values, experiences, or audience. Think of it as the angle of the camera—where it's pointed and what it chooses to focus on. Example: A local news station covering a factory closure might focus on the workers' plight (a community perspective), while a national business outlet might emphasize the company's stock price (an economic perspective).
- Bias occurs when an article consistently favors one side, ideology, or group over another. It may be intentional or unintentional and can manifest in word choice, selective reporting, or omission of key facts. Bias can limit objectivity, but it doesn't necessarily mean an article is false.
- Manipulation is the deliberate distortion of facts or framing of information to deceive or mislead the audience. It goes beyond bias by actively shaping perceptions through misinformation, emotional appeals, or selective presentation of data to serve a specific agenda.
These three aspects cover three different intensities of media influence, with perspective being the mildest form of shaping a narrative, bias introducing a stronger tilt, and manipulation representing the most extreme and intentional effort to control perception.
Based on these definitions, please answer the following questions for each of the parties:
"coverage_amount": On a scale from 0 (no coverage) to 10 (very extensive coverage), how would you rate the coverage in this text?
"perspective": On a scale from -10 (very unfavorable perspective for the party) to 10 (very favorable perspective for the party), how would you rate the perspective of this text?
"bias": On a scale from -10 (very unfavorable bias towards the party) to 10 (very favorable bias towards the party), how would you rate the bias of this text?
"manipulation": On a scale from -10 (very strong manipulation that is unfavorable towards the party) to 10 (very strong manipulation that is favorable towards the party), how would you rate the manipulation in this text? Note that presenting substantive arguments, criticism, or analysis is NOT manipulation - only rate non-zero if there are actual manipulative techniques like distortion of facts, misleading framing, emotional manipulation, or deceptive presentation methods.
Answer this for each of these German parties (labels and names):
"cdu_csu": CDU/CSU
"afd": AfD
"spd": SPD
"greens": Grüne
"left": Die Linke
Coverage amount is judged on a scale from 0 to 10, where 0 means no coverage and 10 means very extensive coverage.
Perspective, bias and manipulation are judged on a scale from -10 to 10 (as defined above). If the content shows neutral perspective, bias or manipulation, please rate it as 0.
Before providing your final ratings, argue both for why this text might be considered highly biased AND why it might be considered neutral. Consider the strongest case for each position. At the end, provide a justification for your final ratings.
Output format:
{
"type": "bias_german_parties",
"result": {
"highly_biased_reasoning": <string>,
"neutral_reasoning": <string>,
"coverage_amount": {
"<party_label>": <number>,
"<party_label>": <number>,
...
},
"perspective": {
...
},
"bias": {
...
},
"manipulation": {
...
},
"justification": <string>
}
}
Strickly follow this format. Do not include any other text in your response.
Text to analyze:
Please answer the following questions:
"informative_persuasive": On a scale from 0 (purely informative) to 10 (very persuasive), how would you rate this article?
"formal_informal": On a scale from 0 (very formal) to 10 (very informal), how would you rate the formalness of this article?
"complexity": On a scale from 0 (very simple) to 10 (very complex), how would you rate the complexity of this article?
"sensationalism": On a scale from 0 (very sober) to 10 (very sensational), how would you rate the sensationalism of this article?
"depth": On a scale from 0 (very superficial) to 10 (very substantive), how would you rate the depth of this article?
"explicity_implicit": On a scale from 0 (very explicit, states the message or argument directly and clearly) to 10 (very implicit, suggests the message or argument indirectly, through symbolism, tone, or the selection and presentation of facts. The audience must infer the meaning.), how would you rate this article?
"style": On a scale from 0 (stylistically plain, neutral, descriptive) to 10 (stylistically rich), how would you rate the style of this article?
Output format:
{
"type": "style_survey",
"result": {
"informative_persuasive": <number>,
"formal_informal": <number>,
"complexity": <number>,
"sensationalism": <number>,
"depth": <number>,
"explicity_implicit": <number>,
"style": <number>,
"justification": <string>
}
}
Strickly follow this format. Do not include any other text in your response.
Text to analyze:
This survey aims to assess media bias based on the text's writing style. Base your answers on HOW the text is written and the presented perspective, ignore the specific topics, subjects. (E.g. A text about X might be written in a way that is favorable to Y.)
We want to assess the aspects: perspective, bias and manipulative journalism.
- Perspective is the lens or viewpoint through which a story is told. It's influenced by the individual or outlet's background, values, experiences, or audience. Think of it as the angle of the camera—where it's pointed and what it chooses to focus on. Example: A local news station covering a factory closure might focus on the workers' plight (a community perspective), while a national business outlet might emphasize the company's stock price (an economic perspective).
- Bias occurs when an article consistently favors one side, ideology, or group over another. It may be intentional or unintentional and can manifest in word choice, selective reporting, or omission of key facts. Bias can limit objectivity, but it doesn't necessarily mean an article is false.
- Manipulation is the deliberate distortion of facts or framing of information to deceive or mislead the audience. It goes beyond bias by actively shaping perceptions through misinformation, emotional appeals, or selective presentation of data to serve a specific agenda.
These three aspects cover three different intensities of media influence, with perspective being the mildest form of shaping a narrative, bias introducing a stronger tilt, and manipulation representing the most extreme and intentional effort to control perception.
Your task is to answer questions about:
(a) Coverage Amount: How much does the content cover the topic? Only provide a value > 0 if the content covers the respective topic specifically. Avoid generating false-positives for losely related topics.
(b) Coverage Perspective, Bias and Manipulation
Topics (labels and descriptions):
- trump: Donald Trump (-10 negative light, 10 positive light)
- elon: Elon Musk (-10 negative light, 10 positive light)
- zelensky: Volodymyr Zelensky (-10 negative light, 10 positive light)
- putin: Vladimir Putin (-10 negative light, 10 positive light)
- china: China (-10 negative light, 10 positive light)
- ukraine: Ukraine (-10 negative light, 10 positive light)
- united_states: United States (-10 negative light, 10 positive light)
- europe: Europe (-10 negative light, 10 positive light)
- immigration_policy: Immigration policy (-10 critical of immigration policy, 10 favorable view of immigration policy)
- climate_policy: Climate policy (-10 critical of climate policy, 10 favorable view of climate policy)
- social_welfare: Social welfare (-10 critical of social welfare, 10 favorable view of social welfare)
- free_speech: Free speech (-10 in favor of stricter social media regulation, 10 in favor of unrestricted free speech)
- gender_identity: Gender identity (-10 critical of gender identity policies, 10 favorable view of gender identity policies)
- ai: Artificial intelligence (-10 focus on risks of ai, 10 focus on opportunities of ai)
Coverage amount is judged on a scale from 0 to 10, where 0 means no coverage and 10 means very extensive coverage.
Perspective, bias and manipulation are judged on a scale from -10 to 10 (as defined above). If the content shows neutral perspective, bias or manipulation towards a topic, please rate it as 0.
Before providing your final ratings, argue both for why this text might be considered highly biased AND why it might be considered neutral. Consider the strongest case for each position. (Reasoning and justification can be very short, a few words.)
Also provide a justification for your final ratings (for each topic).
Output format:
{
"type": "topic_bias_survey",
"result": {
"<topic_label>": {
"highly_biased_reasoning": <string>,
"neutral_reasoning": <string>,
"coverage_amount": <integer 0 to 10>,
"perspective": <integer -10 to 10>,
"bias": <integer -10 to 10>,
"manipulation": <integer -10 to 10>,
"justification": <string>
},
...
}
}
Strickly follow this format. Do not include any other text in your response.
Text to analyze:
Bias Score
The bias score we calculate aims to measure media bias by examining both the average leanings and the consistency of coverage. Rather than simply averaging scores, we remove outliers and model them as a normal distribution and assess the probability mass leaning towards either side of neutrality (zero). This accounts for larger magnitudes of bias in opinion formats. A range of diverse perspectives can indicate balanced coverage, whereas consistently similar scores suggest persistent favoritism. However, interpreting this score as a single number has limitations; it is most insightful when complemented by visualizations and considered within the broader context of the format and coverage choices.
Limitations
No method is flawless, and ours is no exception. LLMs can stumble on adversarial examples, performing unexpectedly poorly on isolated cases. However, such isolated mistakes become less significant at scale. Another challenge is the potential bias within the LLMs themselves—studies suggest some models may favor certain demographics or political leanings, despite efforts by developers to mitigate this through post-training. Additionally, detecting bias often requires contextual knowledge; if an article misrepresents facts or omits critical perspectives, it can be difficult—even for humans—to spot without background. To address this, we’re experimenting with including "ground truth" sources, like a politician’s full speech, as context for evaluating related reporting. Another challenge in evaluating bias is comparing different formats. Longer content will generally have less variance in bias; additionally, what "bias" means in a news format (where it might be more subtle or based on what is covered) versus an opinion format (where it might be more about ensuring the overall body of work is balanced) is not always clear. While LLMs provide a way to put these different formats on a common scale, people might have differing interpretations of what constitutes a fair comparison. Moreover, the categories of "left" and "right" upon which many bias analyses are based represent a simplification of complex political leanings, and these terms have become increasingly ill-defined in contemporary discourse. Finally, while LLMs excel at assessing bias in text holistically, some dimensions—like coverage bias (what stories are chosen or ignored)—remain harder to quantify.
Conclusion
Our goal is to build a robust, LLM-powered framework that brings transparency to media bias. By combining cutting-edge technology with a nuanced understanding of bias’s complexity, we aim to provide tools that help readers, researchers, and journalists better navigate the media landscape—not to dictate a singular truth, but to illuminate the range of perspectives shaping our world.
Contact
If you have any questions or suggestions how we can improve the site, please contact us at contact@spinroom.org.