If you have ever worked on a computer vision project, you probably know that using augmentations to diversify the dataset is a common practice. On this page, we will:
Cover the Longest max size augmentation;
Check out its parameters;
See how Longest max size affects an image;
And check out how to work with Longest max size using Python through the Albumentations library.
Let's jump in.
To define the term, Longest max size is a data augmentation technique that rescales an image so that its longest side equals a specified maximum length.
Note that the image's aspect ratio is preserved, and hence the other side of the image will also be proportionally scaled after the Longest max size application.
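The proportional scaling described above can be sketched in plain Python. This is an illustration only: the function name longest_max_size_shape is hypothetical, and rounding to the nearest integer is an assumption rather than a statement about how any particular library rounds.

```python
def longest_max_size_shape(height, width, max_size):
    """Return (new_height, new_width) after scaling the longest side to max_size.

    Hypothetical helper for illustration; the rounding rule is an assumption.
    """
    longest = max(height, width)
    # Multiply before dividing so exact ratios stay exact in floating point.
    new_height = round(height * max_size / longest)
    new_width = round(width * max_size / longest)
    return new_height, new_width


# A 1000 x 750 image with max_size=500 is halved on both sides.
print(longest_max_size_shape(1000, 750, 500))  # (500, 375)
```

Both sides are multiplied by the same factor (max_size divided by the longest side), which is why the aspect ratio stays the same.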
The max size parameter can take several values. In this case, the maximum length for each image is chosen randomly from the specified values. You might use this option to make your dataset more diverse.
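As a rough sketch of the random selection, the snippet below picks a maximum length per image from a list of candidates. The helper sample_max_size and the candidate values are hypothetical and only illustrate the idea; they are not taken from any library.

```python
import random


def sample_max_size(candidates, rng):
    """Pick one maximum length at random from the candidate values (hypothetical helper)."""
    return rng.choice(list(candidates))


candidate_sizes = [256, 384, 512]  # illustrative values, not recommendations
rng = random.Random(0)             # seeded so that repeated runs give the same draws

# Each image in a batch of five gets its own randomly chosen maximum length.
sampled = [sample_max_size(candidate_sizes, rng) for _ in range(5)]
print(sampled)
```

In Albumentations, the equivalent would presumably be passing a list of values to max_size; check the documentation of your installed version before relying on this.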
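- Interpolation - specifies the interpolation algorithm used to resize the image. In Albumentations it is passed as an OpenCV interpolation flag; the value 1 used in the example corresponds to cv2.INTER_LINEAR.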
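For reference, the common OpenCV interpolation flags and their integer values can be written out as a small lookup table. The dictionary and the helper flag_name are hypothetical conveniences for this page; the integer values are those of the corresponding cv2 constants.

```python
# Integer values of common OpenCV interpolation flags (cv2.INTER_*).
INTERPOLATION_FLAGS = {
    "INTER_NEAREST": 0,   # nearest neighbour: fastest, blocky results
    "INTER_LINEAR": 1,    # bilinear: a common default
    "INTER_CUBIC": 2,     # bicubic: smoother, slower
    "INTER_AREA": 3,      # pixel-area resampling: often preferred for downscaling
    "INTER_LANCZOS4": 4,  # Lanczos over an 8x8 neighbourhood
}


def flag_name(value):
    """Return the flag name for an integer value (hypothetical helper)."""
    for name, flag in INTERPOLATION_FLAGS.items():
        if flag == value:
            return name
    raise ValueError(f"unknown interpolation flag: {value}")


print(flag_name(1))  # INTER_LINEAR
```

Writing cv2.INTER_LINEAR instead of the bare integer 1 tends to make a transform definition easier to read.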
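import albumentations as albu
import numpy as np
from PIL import Image

# Rescale so the longest side is 500 px; interpolation=1 is cv2.INTER_LINEAR, p=1 always applies the transform
transform = albu.LongestMaxSize(max_size=500, interpolation=1, p=1)
# Load the image as an RGB NumPy array of shape (height, width, 3)
image = np.array(Image.open('/some/image/file/path').convert('RGB'))
# Transforms return a dict; the augmented image is stored under the 'image' key
image = transform(image=image)['image']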