Open-source LLMs and tools for scientific PDFs data extraction and to MD conversion

Hi all, I’m searching for the open-source LLMs or other tools for scientific PDFs data extraction with further conversion to Markdown format. I’m aware of some paid solutions or python libraries, but the last ones do not perform very good with scientific texts.

Are there any available models for this purpose? Thank you in advance for potential suggestions.