System Overview

ML-Driven Edge Detection
for Prediction Markets

74 Data Sources | Category Ensemble AUC 0.686 | Backtested Sharpe 3.97

Data Sources

335

Features

256,296

Markets

Services

55.9%

Win Rate

3.97

Sharpe

Development Pipeline

01Data InfrastructureComplete

74 data sources across 14 categories. 86 harvest scripts. data_processor.py (1,800+ lines) produces 335 features per market. Output: market_features.parquet (256,296 x 335, 33 MB).

02Model Training60%

Category ensemble AUC 0.686 (+6.2%). 6 custom models (Sports 0.955, Health 0.794, Entertainment 0.731). LightGBM 80% + MLP 10% + XGBoost 10%. Autoresearch: 129+ experiments on 3090.

03Live Market FeedComplete

live_scanner.py (1,501 lines). Gamma API polling, 246 features with batch enrichment, model scoring in 4.4 seconds.

04Real-Time FeaturesComplete

246 features generated per market. Classification (instant), financial markets (near-instant), batch enrichment (volume rank, uniqueness, network).

05Edge DetectionComplete

Bayesian blend: P_final = 0.3 * P_model + 0.7 * P_market. Trump-code + MiroFish enrichment.

06Position SizingComplete

Kelly criterion (25% fractional). Max 10% per position, 50% portfolio, 30% per category. 20% drawdown breaker.

07Liquidity FilterComplete

Order book depth, bid-ask spread analysis, slippage estimation. Rejects trades with slippage > 1%.

08ExecutionComplete

Paper trading (default) + live via py-clob-client. Limit orders, manual approval, kill switch.

09P&L TrackingComplete

Sharpe ratio, max drawdown, Calmar, calibration analysis. Category-level breakdown. CSV export.

10Live News & SentimentComplete

7 real-time sources: USGS, HN, Reddit, GDELT, NWS, Google Trends, financial markets.

Phase 2

Model Architecture

Category-specific ensemble with Bayesian market-price anchoring and multi-agent AI consensus

Base Model

0.745validation AUC

LightGBM · 232 trees · 247 features · 15s training

Category Ensemble

0.686+6.2%

6 custom models routing by market category

MLP Neural Net

0.661

3-layer MLP (128-64-32) · 10% ensemble weight

Bayesian Blend (Live Scoring)

P_final = 0.3 × P_model + 0.7 × P_market

Market price receives 70% weight. Model identifies mispricings via 30% contribution. Adversarial AUC = 1.0 confirms significant distribution shift.

MiroFish AI Consensus

AnalystContrarianInsiderBayesianSuperforecaster

5 agents debate each opportunity via Kimi/Moonshot LLM. Confidence-weighted consensus.

Validation

Backtesting Results

Out-of-sample performance on 7,346 resolved 2026 markets

55.9%

Win Rate

4.49

Profit Factor

3.97

Sharpe Ratio

8.5%

Max Drawdown

Category Performance

Category	Trades	Win Rate	Status
Geopolitics	10	70.0%	Best
Other	98	61.2%	Strong
Crypto	194	56.2%	Good
Sports	51	43.1%	Weak

Monthly P&L

Key Metrics

Markets Scanned: 7,346

Trades Taken: 358 (4.9% selectivity)

Avg Edge at Entry: 9.6%

Edge Accuracy: 55.9%

Live System

Paper Trading

Active paper trading with $500 bankroll, scanning every 5 minutes

Bankroll

$500

Deployed

$250

50% exposure

Positions

P&L

$0.00

Open Positions

#	Market	Side	Entry	Size	Edge
1	BTC above $70K March 16	NO	0.983	$50	-27.0%
2	BTC above $72K March 16	NO	0.885	$50	-24.2%
3	No change Fed rates April	NO	0.935	$50	-24.2%
4	BTC reach $75K March	NO	0.855	$50	-22.3%
5	Gen.G vs JD Gaming (LoL)	NO	0.885	$50	-21.8%

System Design

Architecture

Data

74 Sources

→

Features

335 Signals

→

ML Model

AUC 0.686

→

Bayesian

30/70 Split

→

Edge

9.6% Avg

→

Kelly

25% Frac

→

Executor

Paper/Live

News Monitor

7 real-time sources. USGS, HN, Reddit, GDELT, NWS, Trends, Financial.

Trump-Code

100 validated rules. 414 days Truth Social. 6.2hr early signal window.

MiroFish

5 AI agents via Kimi/Moonshot. Analyst, Contrarian, Insider, Bayesian, Superforecaster.

Reference

Commands

Trading Commands

# Full orchestrator python -m services.orchestrator --bankroll 500 --interval 300 # Scan only python -m services.live_scanner --top 20 --min-volume 5000 # Paper trading python run_paper_trading.py --bankroll 500 --loop 300 # Dashboard server python -m services.dashboard_server --port 8080

3090 VM Commands

# Check autoresearch ssh douglaswhittingham@10.0.0.3 "tail ~/autoresearch_log.txt" # Train model ~/ml_venv/bin/python ~/train_model.py # Category models python -m backtesting.scripts.train_category_models # Historical backtest python -m backtesting.scripts.backtest_engine --bankroll 500

Main Dashboard

Hero Stats

Key system metrics. 74 data sources feed 335 features across 256,296 markets. Win rate and Sharpe from out-of-sample 2026 backtesting.

Pipeline

10-phase development roadmap. Click phases to expand. All complete except model optimization (ongoing autoresearch on 3090).

Data Sources

All 74 sources in 14 categories. Green dot = full 2023-2026 coverage. Ranges from Reddit to lunar phases to satellite imagery.

Models

Base LightGBM improved by category routing (+6.2%). Bayesian blend: 30% model / 70% market price. MiroFish = 5 AI agents debating each market.

Backtesting

Simulated trading on 7,346 resolved 2026 markets. Win rate 55.9%, profit factor 4.49, Sharpe 3.97.

Paper Trading

Live simulated trades, $500 bankroll. BUY NO = model thinks overpriced. Edge = model vs market disagreement.

Whitepaper

Full 14-section technical documentation. Click to expand each section.

ML-Driven Edge Detection
for Prediction Markets

74 Data Sources

Model Architecture

Backtesting Results

Paper Trading

Technical Whitepaper

Visualization Suite

Architecture

Commands

ML-Driven Edge Detectionfor Prediction Markets

74 Data Sources

Model Architecture

Backtesting Results

Paper Trading

Technical Whitepaper

Visualization Suite

Architecture

Commands

ML-Driven Edge Detection
for Prediction Markets