Triskel Data 132B+ Clean Tokens for Under $200

Just launched: https://triskeldata.au
Structured, cleaned, and tokenized AI training datasets no junk, no scraping, no bloat.

What’s Available (Full Access):

Total: 132.389 Billion tokens all for under $200 USD


Why So Cheap?

  • I’m covering hosting + processing costs only
  • Datasets are cleaned, deduplicated, .jsonl, and ingestion-ready
  • Built to support the dev community not exploit it

:locked_with_key: License:

  • Use for R&D, personal fine-tuning, private AI builds
  • No resale, redistribution, or commercial deployment

Stop burning compute on messy junk.
Train on clean signal.