What is the best LLM to fine-tune for extracting data lineage from PySpark scripts?

autumn24 · June 26, 2023, 6:20pm

I am working on a project that requires an LLM to extract data lineage from many PySpark scripts. My plan is to take a Hugging Face LLM and fine-tune it on training data. The fields I am interested in are TargetTable, SourceTable, TargetColumn, SourceColumn, and Transformation logic; these will be sent to Collibra for a subsequent report.
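To make the target output concrete, here is a minimal sketch of how one training example might be laid out, assuming a prompt/completion JSON Lines format. The class `LineageRecord`, the helper `to_training_example`, the snake_case field names, and the sample table and column names are all hypothetical and chosen only for illustration; the actual format depends on the model and the fine-tuning tool that ends up being used.

```python
import json
from dataclasses import dataclass, asdict
from typing import List


@dataclass
class LineageRecord:
    # Hypothetical schema: one row per target column, mirroring the five
    # fields named in the post (snake_case spelling is an assumption).
    target_table: str
    source_table: str
    target_column: str
    source_column: str
    transformation: str


def to_training_example(script: str, records: List[LineageRecord]) -> str:
    """Serialize one (script, lineage) pair as a single JSON Lines row."""
    return json.dumps({
        "prompt": script,
        # The completion is itself a JSON string, so the model learns to
        # emit machine-readable lineage that can be parsed downstream.
        "completion": json.dumps([asdict(r) for r in records]),
    })


# Made-up PySpark snippet used purely as sample input text.
script = (
    'df = spark.table("sales.orders")\n'
    'out = df.withColumn("amount_usd", df["amount"] * 1.1)\n'
    'out.write.saveAsTable("reporting.orders_usd")\n'
)

records = [
    LineageRecord(
        target_table="reporting.orders_usd",
        source_table="sales.orders",
        target_column="amount_usd",
        source_column="amount",
        transformation='df["amount"] * 1.1',
    )
]

line = to_training_example(script, records)
print(line)
```

Keeping the completion as strict JSON means each model output can be validated with `json.loads` before anything is forwarded to a downstream system.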

I’m relatively new to this field. Any advice or suggestion is greatly appreciated!
