Combinatorial Optimization with LLMs/Transformers

wlakinsson · May 12, 2023, 9:30am

I am curious whether a well-designed Transformer can solve something like the job-shop scheduling problem (JSSP) at the same level as genetic algorithms (GA) and other heuristic approaches.

My reasoning is that text is a sequence of words, and a JSSP instance can be transformed into a sequence of tasks no matter what the precedence graph looks like. The final solution would be an ordered set of tasks, just as an LLM produces a sequence of words that makes up a story…
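To make the "JSSP as a sequence" idea concrete, here is a minimal sketch of the operation-based encoding often used with GAs: a solution is a list of job ids, and the k-th occurrence of a job id means "schedule that job's k-th operation next". The toy instance, the `decode` function, and all names below are illustrative assumptions, not taken from any paper in this thread. The sketch covers the classic case where each job is a simple chain of operations; a general precedence graph would need a topological ordering instead.

```python
from typing import Dict, List, Tuple

# Hypothetical toy instance: jobs[j] is the ordered list of
# (machine, duration) operations that job j must run through.
Job = List[Tuple[int, int]]
jobs: List[Job] = [
    [(0, 3), (1, 2)],  # job 0: machine 0 for 3 units, then machine 1 for 2
    [(1, 2), (0, 4)],  # job 1: machine 1 for 2 units, then machine 0 for 4
]


def decode(sequence: List[int], jobs: List[Job]):
    """Turn an operation-based sequence into a schedule.

    Assumes `sequence` contains each job id j exactly len(jobs[j]) times.
    Returns (makespan, schedule), where each schedule entry is
    (job, operation_index, machine, start, end).
    """
    next_op = [0] * len(jobs)            # next unscheduled operation per job
    job_ready = [0] * len(jobs)          # when each job's previous operation ends
    machine_ready: Dict[int, int] = {}   # when each machine becomes free
    schedule = []
    for j in sequence:
        machine, duration = jobs[j][next_op[j]]
        # An operation starts once both its job and its machine are free.
        start = max(job_ready[j], machine_ready.get(machine, 0))
        end = start + duration
        schedule.append((j, next_op[j], machine, start, end))
        job_ready[j] = end
        machine_ready[machine] = end
        next_op[j] += 1
    return max(job_ready), schedule


makespan, schedule = decode([0, 1, 0, 1], jobs)
print(makespan)  # 7 for this toy instance
```

Any permutation of the job ids decodes to a feasible schedule under this encoding, which is what would make it convenient as an output "vocabulary" for a sequence model: the model only has to emit job ids, and the decoder enforces the precedence and machine constraints.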

I did find some literature on this, but the problems are usually very small: a few dozen tasks with very simple, streamlined rules.

MLalex · June 5, 2023, 2:43pm

Yes, I’d be very interested in this as well.

katarinayuan · June 15, 2023, 9:20pm

Has the available JSSP data scaled up by now, for example to millions of job shop schedules?

davidkiller · July 25, 2023, 4:02am

I’m interested in LLM4CO too! Could you please share the literature on this topic?

CoOL31 · January 23, 2024, 2:51am

Me too. Here are some related papers I found recently. However, I have doubts about the promising reported performance, since LLMs are not that controllable:

[2310.19046] Large Language Models as Evolutionary Optimizers
(ICLR24-Google DeepMind) [2309.03409] Large Language Models as Optimizers

fliu36 · August 30, 2024, 7:33am

Check out this regularly updated list on LLM4Opt, covering combinatorial optimization and other related work (GitHub - FeiLiu36/LLM4Opt: A Collection on Large Language Models for Optimization).
Here is an ICML Oral paper on LLM4CO (GitHub - FeiLiu36/EoH: Evolution of Heuristics).

xemos61890 · February 8, 2025, 9:22pm

An interesting paper that uses an LLM for end-to-end optimisation of the job-shop scheduling problem (JSSP):
LLMs can Schedule
https://arxiv.org/pdf/2408.06993

Starjob: Dataset for LLM-Driven Job Shop Scheduling
https://openreview.net/pdf?id=z4Ho599uOL
