Building a Multi Lingual Multi Task Model in Finance Domain

Pimpcat-AU · June 10, 2025, 2:40am

You’re on the right path with multi-lingual, multi-task models in the finance domain. I’ve built Triskel Data, a curated archive of high-value, structured legal and financial datasets, including:

CourtListener (legal rulings)
SEC filings (fully extracted)
Federal Register (regulatory history)
AI patent datasets

All of these are cleaned and tokenization-ready in .jsonl format, not raw scrapes.
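For anyone who hasn't worked with .jsonl before: it is one JSON object per line, so you can stream it without loading the whole file. Below is a minimal sketch using only the Python standard library. The field names (`text`, `lang`, `task`) and the sample records are hypothetical placeholders for illustration, not the actual Triskel Data schema, so adjust them to whatever the files really contain.

```python
import io
import json


def read_jsonl(stream):
    """Yield one dict per non-empty line of a JSON Lines stream."""
    for line_no, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        try:
            yield json.loads(line)
        except json.JSONDecodeError as exc:
            raise ValueError(f"line {line_no}: invalid JSON") from exc


# Hypothetical sample records; the field names are assumed, not a real schema.
sample = io.StringIO(
    '{"text": "Revenue rose 4% year over year.", "lang": "en", "task": "sentiment"}\n'
    "\n"
    '{"text": "Der Umsatz stieg um 4 %.", "lang": "de", "task": "sentiment"}\n'
)

records = list(read_jsonl(sample))
texts = [r["text"] for r in records]  # ready to hand to a tokenizer
```

With a real file you would pass `open(path, encoding="utf-8")` instead of the in-memory `StringIO`, and carrying the language and task labels on each record makes it easy to build per-language or per-task batches later.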

A Developer Tier is available with limited access for serious users. While not free, it’s accessible enough to get started without the typical scraping or cleanup burden.
