What this is

Raw Sports Vault is a premium sports data library. We started with baseball; cricket is the second sport added. Today the cricket catalog is five bundles of pre-cleaned, pre-enriched datasets ranging from $19 (CSV/Excel of the historical cricket record) to $299 (every dataset in every format including a pre-loaded SQLite database with 16.6M rows).

The product isn't the data — Cricsheet, Sportmonks, public odds feeds, and Wikipedia are all out there. The product is what we did to it: cleaning, deduplication, schema standardization, leakage-checked feature engineering, team-name canonicalisation across sources, and packaging in formats you can actually use.

If you've ever spent a weekend trying to merge a Cricsheet ball-by-ball archive with a Dream11 scoring formula and a venue-by-venue dew probability lookup and gave up, that's the problem we solved.

What's in the cricket library

  • 21,665 matches across all formats (T20, ODI, Test) and 180 seasons — Cricsheet ball-by-ball plus Sportmonks fixture and player metadata.
  • 282k Dream11 fantasy points pre-calculated using the standard T20 ruleset, with captain (2x) and vice-captain (1.5x) multipliers ready to apply.
  • 1.5M ball-by-ball win-probability snapshots from a logistic-regression model trained on every Cricsheet match.
  • 11.1M individual deliveries classified by type — yorker, bouncer, full, short, good length — using over-position, runs, and wicket-kind heuristics.
  • 20+ bookmakers historical odds from 2018-2026 for IPL, PSL, T20 World Cup, T20 Internationals, ODIs and more — opening and closing lines, line movement, sharp-money flags, and CLV calculations.
  • Updated annually after each major cricket season ends.