Experience with and extending LLMs for software engineering

Hi from a newbie to this exciting forum :slight_smile:

I have been using various models to supercharge code generation and learning. Here, in order of preference and usefulness from my experience (FWIW), are the latest models of:
claude.ai
ChatGPT
perplexity.ai
GitHub Copilot - either in PyCharm :upside_down_face: or Neovim

I am looking to go from requesting snippets of code and methods/functions via shortish prompts to a more ambitious next step: passing more extensive parts of a software project to the LLM and requesting useful methods/functions.

The code I want to pass is a series of Pydantic BaseModel classes, together with example objects, definitions, and explanatory text. I want to generate compliant code for the key methods that operate on these models.
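
For concreteness, here is a minimal sketch of the kind of prompt I have in mind. The `Order` model and the `apply_discount` request are made up purely for illustration:

```python
import inspect

from pydantic import BaseModel


# Hypothetical model, standing in for our real Pydantic v2 models
class Order(BaseModel):
    order_id: int
    items: list[str]
    total: float


# Bundle the model source, an example instance, and the request into one prompt
model_source = inspect.getsource(Order)  # works when the class lives in a file
example = Order(order_id=1, items=["widget"], total=9.99)

prompt = f"""Given this Pydantic v2 model:

{model_source}

Example instance (JSON): {example.model_dump_json()}

Write a function `apply_discount(order: Order, pct: float) -> Order` that
returns a new Order with `total` reduced by `pct` percent.
"""
```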

I am writing before actually giving this a whirl. I know, of course, that there are limits on the number of tokens one can pass to and receive from an LLM.

So, I am wondering if there are any resources emerging, ideally in this great HF ecosystem, that are suited to this scope.

Any advice on how to get to the next (wo)man-machine level is much appreciated.

Thanks to and for :hugs:

E

Hi @Allom

Great to see your journey in code generation :slight_smile:! For passing extensive code to LLMs:

  1. Chunking: Break code into smaller parts (see the sketch after this list).
  2. Fine-tuning: Customize a model for your needs.
  3. External Memory: Use vector databases.
  4. Documentation: Provide clear examples.
  5. Hybrid Approaches: Combine tools and models.
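
For (1), here is one rough way to chunk Python source along top-level definitions so that each piece fits a context window. This is just a sketch, not a library API; a real pipeline would count tokens with the model's tokenizer rather than characters:

```python
import ast


def chunk_python_source(source: str, max_chars: int = 4000) -> list[str]:
    """Split a module into top-level defs/classes, packing small ones together.

    max_chars is a crude stand-in for a real token budget; a single
    oversized definition still becomes one chunk and would need further splitting.
    """
    tree = ast.parse(source)
    segments = [ast.get_source_segment(source, node) or "" for node in tree.body]

    chunks: list[str] = []
    current = ""
    for seg in segments:
        if current and len(current) + len(seg) > max_chars:
            chunks.append(current)
            current = ""
        current += seg + "\n\n"
    if current:
        chunks.append(current)
    return chunks
```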

Check out Hugging Face for resources like model cards and datasets. Have you explored specific models here?

Thank you @LLUMOAI for that kickstart. To the extent that I can get my head around it, it makes a lot of sense.

I can see vector databases as useful if you have some standard content that you want to be persistently available. Or if your chunks are really massive… (But that is just newbie thinking).

In my case it's just several thousand lines of Python.
We use Pydantic v2 models extensively and use an LLM to generate methods that act on instances of these models. As you say, we provide clear examples and documentation to the LLM.
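
At that scale I suspect a full vector database is overkill; a rough in-memory retrieval sketch (assuming `pip install sentence-transformers`; the embedding model name is just one common choice) might be enough:

```python
import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # one choice among many


def top_k_chunks(query: str, chunks: list[str], k: int = 3) -> list[str]:
    """Return the k code chunks most similar to the query."""
    chunk_vecs = embedder.encode(chunks, normalize_embeddings=True)
    query_vec = embedder.encode([query], normalize_embeddings=True)[0]
    scores = chunk_vecs @ query_vec  # cosine similarity, vectors are normalized
    best = np.argsort(scores)[::-1][:k]
    return [chunks[int(i)] for i in best]
```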

I know nothing of the model cards and datasets available from Hugging Face and would love to learn more about how these could help to generate high-quality code to specification.

Many thanks for your support and help and super excited about learning more.
Eric

Hi @Allom

Glad to hear it helped! For learning about model cards and datasets on Hugging Face, explore their model hub and dataset section for detailed info and resources. Happy to assist further!
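
For example, pulling a dataset from the Hub takes a couple of lines (assuming `pip install datasets`; the dataset id below is a placeholder, so browse the datasets section for real options):

```python
from datasets import load_dataset

# "someuser/python-code-dataset" is a placeholder, not a real dataset id
ds = load_dataset("someuser/python-code-dataset", split="train")
print(ds[0])  # inspect one record to see which fields hold the code
```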

Hi @LLUMOAI, hi all

We are now closing in on the task of choosing a good model and then training and fine-tuning it with datasets.

The objective is to generate the code for methods that operate on our defined Python Pydantic and Enum class instances.

Of course we want to use models and datasets that are well tested and suited to this task; models that allow us to train and fine-tune with our own code (mainly the model definitions themselves) and relevant projects; and, of course, any Hugging Face datasets considered useful.

Ideally the model would be convenient to deploy and train on the Hugging Face platform (or locally) without extensive effort.
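
As a first smoke test, something like the following is what I would call "without extensive effort" (assuming `pip install transformers torch`; the checkpoint name is just one example of a Hub code model, not a recommendation):

```python
from transformers import pipeline

# Any text-generation code model from the Hub can be swapped in here
generator = pipeline("text-generation", model="bigcode/starcoder2-3b")

prompt = (
    "from pydantic import BaseModel\n\n"
    "class Item(BaseModel):\n"
    "    name: str\n"
    "    price: float\n\n"
    "def total_price(items: list[Item]) -> float:\n"
)
print(generator(prompt, max_new_tokens=64)[0]["generated_text"])
```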

What is working:
Searching for models/datasets by name and then ranking the results by likes, downloads, etc.

What is not working:
Searching for these via full-text search [Search terms: text-to-code code generation python pydantic enum] and then ranking by likes, downloads, etc.
With Hugging Face's full-text search, a long list is returned and it is difficult to judge quality without such a ranking.
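
One workaround I am trying: run the search programmatically with `huggingface_hub` and sort by downloads myself. Note that `list_models(search=...)` matches against model ids rather than doing true full-text search, so this is only an approximation:

```python
from huggingface_hub import HfApi

api = HfApi()
models = api.list_models(search="code generation python",
                         sort="downloads", direction=-1, limit=20)
for m in models:
    print(f"{(m.downloads or 0):>12,}  likes={m.likes or 0:>5}  {m.id}")
# api.list_datasets(...) takes the same arguments for the dataset side
```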

Here are some of the seemingly better-established models that might be suitable for this task.

Any pointers on how to make a quickstart with well established models and datasets would be much appreciated.

:hugs:

E