← Back to feed
2026-05-27data

Stance Detection in Prediction Markets: Addressing Imbalanced Trader Commentary via Counterfactual Augmentation and Market Context

Thomas Mbrice

PDF preview for Stance Detection in Prediction Markets: Addressing Imbalanced Trader Commentary via Counterfactual Augmentation and Market Context
Read on arXiv →

Key claim

Market context greatly improves stance detection performance.

This paper explores stance detection in prediction market comments, revealing that market context significantly improves recall for opposing stances. The optimal augmentation strategy is found to be 50% synthetic samples, which enhances performance without degrading it.

In plain English

This paper explores stance detection in prediction market comments, revealing that market context significantly improves recall for opposing stances. The optimal augmentation strategy is found to be 50% synthetic samples, which enhances performance without degrading it.

Novelty
7.0/10

This work introduces stance detection in prediction market commentary, a novel application of NLP techniques.

Reliability
8.0/10

The study employs rigorous ablation studies and provides clear evidence for its claims.

Deep reliability assessment

The methodology supports an internal ablation claim: adding market-question context improves RoBERTa stance detection on this small Polymarket dataset, and LLM counterfactual augmentation has dose-dependent effects. Broader claims about general prediction-market stance detection, cross-platform robustness, and mechanistic explanations from attention maps are less well supported.

Reproducibility

Yes: the paper states that code, data, and trained model checkpoints are available via a Google Drive link in the footnote, but no GitHub repository is mentioned.

Discussion questions

  1. 1.Does prepending the market question truly teach stance, or does it mainly let the model exploit market-specific lexical shortcuts in a small dataset?
  2. 2.For builders, is comment-level stance detection useful enough beyond market price, volume, and trader history to justify production deployment?
  3. 3.What happens if the model is evaluated on new markets, future Polymarket slang, or another prediction-market platform with no overlap in topics?

Key figure

The key setup is a RoBERTa-base stance classifier whose input is optionally augmented by prepending the market question to a short trader comment, with optional LLM-generated Pro-to-Anti counterfactual samples added during training.

Benchmark results

~2,229 annotated Polymarket comments across 12 marketsAnti recall: 0.45vs RoBERTa-base 3-class without market context+0.35
~2,229 annotated Polymarket comments across 12 marketsAnti F1: 0.24vs Weak RoBERTa-base configuration without counterfactual augmentation+0.14
~2,229 annotated Polymarket comments across 12 marketsmacro F1: 0.5vs RoBERTa-base 2-class with market context and no augmentation-0.18