Any update on this topic? Have been looking into this kind of extraction too recently. Currently, I assume I have to train/finetune a model with high value for sequence length specifically for this task. Well trained and finetuned chat models like chat gpt can do it, but that seems like a bit of an overkill plus I need an open source solution.
1 Like