My language has under 1000 hours of training data, so apparently it is not well supported. How can I help add more training data? I actually have tens of hours of transcriptions of my own voice in my language, because I have taken many voice notes going back almost two decades. Most of it is very personal, but I could probably set aside a good portion for this and other projects.
https://keyboard.futo.org/whisper-training-data-breakdown