Improved the flexibility of model evaluation by allowing beam search parameter 'k' to exceed 3 while still restricting output to the top 3 predictions. Also updated 'make_submissions' to support batch processing multiple .pickle files simultaneously, streamlining the submission workflow for large datasets.