SPRADC9 july 2023 AM62A3 , AM62A7
Data augmentation in machine learning includes generating altered copies of the pictures in a dataset. Data augmentation can be as simple as adding noise to the dataset in order to generate enough variation to prevent model overfitting. It is also used to expand the dataset by adding those altered copies. Two geometrical augmentation methods are applied in this demo: flip right-left and rotation. First flipped copies are created for each picture, which brings the total number of pictures to 400x2=800. Then five rotated copies of each picture are created, which brings the total number of pictures up to 800+800x5=4800 pictures. The rotation angle is randomly selected for each picture. This step substantially increases the total number of pictures in the dataset without the burden of data capturing and annotation.