1. Notes organization: different topics
2. Approach to be used: semi-supervised learning
1. Types of multimodal data: text, image/video, audio
2. Imbalanced data: could be that notes of some topics are more frequent
3. Metadata has information on type of data (#1): multiple types can be contained in one note
4. Setting this as a single-label problem: each note can be categorized into only one topic.