Processing potentially noisy metadata occasionally caused the langdetect library to throw exceptions, crashing the data preparation pipeline. The language detection step is now wrapped in a try-except block to silently skip problematic inputs and keep the generator flowing. Additionally, the model checkpointing frequency was reduced to every 20 epochs, significantly cutting down on disk I/O and boosting overall training speed.