Background
I’m a content creator who produces short videos. To improve my craft, I regularly study high-quality videos to learn techniques and gain inspiration. The short video industry is extremely competitive, and I’m looking to using AI technology to enhance my creative efficiency and video quality.
My Goals
-
Quality Assessment: I need an AI model that can identify high-quality TikTok videos and explain what makes them excellent (innovative filming techniques, interesting narrative structures, high visual quality, etc.)
-
Content Potential Recognition: I want the model to automatically identify high-potential content elements within videos, helping me quickly filter out the most valuable creative materials.
Current Resources
I have watched and manually annotated numerous videos, marking the reasons why certain videos are considered high quality.
Proposed Approach
I’m interested in developing a model inspired by Deepseek R1’s reasoning capabilities to evaluate TikTok videos. This model would need reflective and reasoning abilities since video quality standards aren’t strictly quantifiable. It should provide multi-dimensional evaluations covering aspects such as:
-
Content themes
-
Visual effects
-
Narrative structure
-
Audience engagement techniques
-
Emotional resonance
-
And more
Questions
-
What would be the most effective architecture for such a model?
-
How should I structure my training data?
-
What evaluation metrics would be appropriate?
-
Are there existing models I could fine-tune rather than build from scratch?
-
What technical challenges should I anticipate?
-
How much labeled data would I need for reasonable performance?
I appreciate any insights, suggestions, or references to similar projects. Thank you!