Technical Skill classification model

A raw dataset of over 30k data points is given which contains technical skills and a lot of jargon
mixed in. We need to develop a code that can clean this dataset and extract Technical (Hard) skills. Some 900 random examples of technical skills are also given to go through them to understand the
pattern and sequence. How should we go about this problem?