**Preparing** \- Since I can’t get user data, I had to create a pipeline for synthetic data generation.
**Training** \- Just boring stuff. Used Modal.
Planning to fine-tune whisper as well. Also trying to create next version for HyprLLM for multi-lingual support; our user base is global.
Would love to get any tips on synthetic dataset generation or suggestions on models!