OK, so to be sure I understood it correctly: I use, e.g., a batch size of 8 and 32,000 iterations (I don't know if that is right; I have 1271 samples, about 1.5 h of audio, so I wonder how many iterations I should run for such a small set). I do the aggregation (I assume it must be a small script run somewhere?), and then, if I want to reach a batch size of 32, I do it 4 times.
Edit: I've found a nice model presenting aggregation, though:
https://www.ijcai.org/proceedings/2019/0363.pdf (pp. 4-5)
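
To check the arithmetic, here is a minimal sketch in plain Python. It assumes that "aggregation" means gradient accumulation: averaging the gradients of 4 batches of 8 before a single weight update. The toy model `y = w * x`, the helper `grad_mse`, and the synthetic data are all made up for illustration and are not taken from any particular training script.

```python
import math
import random

def grad_mse(w, xs, ys):
    """Gradient of the mean squared error for the toy model y = w * x."""
    n = len(xs)
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n

# Hypothetical toy data: 32 samples, i.e. one "effective" batch.
random.seed(0)
xs = [random.uniform(-1.0, 1.0) for _ in range(32)]
ys = [3.0 * x + random.gauss(0.0, 0.1) for x in xs]
w = 0.5

micro_batch = 8   # what fits in memory
accum_steps = 4   # 4 * 8 = 32 effective batch size

# Reference: one gradient computed over all 32 samples at once.
full_grad = grad_mse(w, xs, ys)

# Accumulation: average the gradients of 4 batches of 8,
# then (in real training) apply a single optimizer step.
accum_grad = 0.0
for i in range(accum_steps):
    lo = i * micro_batch
    hi = lo + micro_batch
    accum_grad += grad_mse(w, xs[lo:hi], ys[lo:hi]) / accum_steps

# Bookkeeping for the numbers in this post.
n_samples = 1271
iters_per_epoch = math.ceil(n_samples / micro_batch)          # 159
updates_per_epoch = math.ceil(iters_per_epoch / accum_steps)  # 40
epochs_for_32000_iters = 32000 / iters_per_epoch              # about 201

print(full_grad, accum_grad)
print(iters_per_epoch, updates_per_epoch, round(epochs_for_32000_iters))
```

If this assumption is right, no separate script is needed: the accumulation happens inside the training loop, and the two gradients agree up to floating-point rounding. The equivalence holds for a mean loss with equal-sized batches; layers that depend on batch statistics, such as batch normalization, still see only 8 samples at a time.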