Recognition problems: Semantic Segmentation
At the core of computer vision lies the task of recognizing objects and structures in images. Depending on the desired output granularity, these problems are called image classification (one label per image), object detection (one bounding box per object) or semantic segmentation (one label per pixel). The data can be RGB images, Lidar data, video or three-dimensional medical images. In any case, the solution is normally based on a convolutional neural network (CNN) or a vision transformer that is learnt from data. Whether you call it machine learning, deep learning, artificial intelligence or even a filterbank is up to you.
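
To make the idea of output granularity concrete, here is a minimal sketch in plain Python. It starts from a toy per-pixel label mask (the semantic segmentation output) and derives the coarser outputs from it: a bounding box, as in object detection, and a set of image-level labels, as in image classification. The mask, the class ids and the helper names `mask_to_box` and `mask_to_labels` are illustrative assumptions, not part of any particular library, and the sketch treats all pixels of a class as one object.

```python
# Toy 4x6 "image" annotated at the finest granularity: one class id per pixel.
# The class ids are hypothetical: 0 = background, 1 = cat.
CLASS_NAMES = {0: "background", 1: "cat"}

segmentation_mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]


def mask_to_box(mask, class_id):
    """Return the tightest (row_min, col_min, row_max, col_max) box
    around all pixels of class_id, or None if the class is absent."""
    coords = [
        (r, c)
        for r, row in enumerate(mask)
        for c, value in enumerate(row)
        if value == class_id
    ]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols))


def mask_to_labels(mask):
    """Return the names of all non-background classes present in the mask."""
    return {CLASS_NAMES[value] for row in mask for value in row if value != 0}


# Coarser output: one box around the object (detection-style).
detection_box = mask_to_box(segmentation_mask, 1)

# Coarsest output: labels for the whole image (classification-style).
image_labels = mask_to_labels(segmentation_mask)

print(detection_box)
print(image_labels)
```

Going from mask to box to label only discards information, which is why the finer-grained tasks need more detailed annotations and are generally harder to solve.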