Hi all,
I’m Darshan Hiranandani, currently developing a system that processes raw transcripts by splitting them into meaningful paragraphs. Each transcript is structured as JSON: every paragraph is an item in an array, and each paragraph contains its words, stored as objects with start and end timestamps.
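Roughly, the structure looks like this (a minimal sketch; the field names `paragraphs`, `words`, `text`, `start`, and `end` are illustrative, not the exact schema):

```python
# Illustrative transcript structure as described above;
# the real field names in my schema may differ.
transcript = {
    "paragraphs": [
        {
            "words": [
                {"text": "Hello", "start": 0.00, "end": 0.42},
                {"text": "everyone", "start": 0.45, "end": 0.98},
            ]
        },
        {
            "words": [
                {"text": "Today", "start": 1.20, "end": 1.55},
                {"text": "we", "start": 1.58, "end": 1.70},
            ]
        },
    ]
}

# Reconstruct each paragraph's plain text from its word objects.
for i, para in enumerate(transcript["paragraphs"]):
    text = " ".join(w["text"] for w in para["words"])
    print(f"Paragraph {i}: {text}")
```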
OpenAI models have been useful so far for answering questions based on the transcript, but with large transcripts the responses come back truncated.
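One workaround I’m considering is chunking the paragraph array so each request stays under a token budget. Here’s a rough sketch (the 3,000-token budget is an arbitrary number I picked, and I’m estimating token counts with `tiktoken`):

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

def paragraph_text(para):
    # Flatten a paragraph's word objects back into plain text.
    return " ".join(w["text"] for w in para["words"])

def chunk_paragraphs(paragraphs, max_tokens=3000):
    """Group whole paragraphs into chunks that fit a token budget,
    so each model call receives a bounded slice of the transcript."""
    chunks, current, current_tokens = [], [], 0
    for para in paragraphs:
        n = len(enc.encode(paragraph_text(para)))
        # Start a new chunk once adding this paragraph would exceed the budget.
        if current and current_tokens + n > max_tokens:
            chunks.append(current)
            current, current_tokens = [], 0
        current.append(para)
        current_tokens += n
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be sent as a separate request and the answers merged afterwards, but that loses cross-chunk context, which is why I’d prefer a model whose context window is large enough to take the whole transcript (plus its timestamp metadata) in one go.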
Given this, I’m looking for recommendations on the most suitable model for handling large transcripts with their accompanying metadata. Has anyone dealt with a similar issue, or can you suggest models that might handle this use case more effectively?
Would appreciate any insights!
Regards,
Darshan Hiranandani