Image
An Image consists of a set of pixels, which are the buildings blocks for any image. Every Pixels defines the color or the intensity of light.
An Image can be of two type :-
- Grayscale
- Color
A Grayscale image can have a pixel value between 0 and 255, here 0 means the pixel is 'Black' and 255 means the pixel is 'White'. All the values in between represents various shades of gray. The matrix obtained from a Grayscale Image is 2-Dimensional ie it has width and height.
A Color image is represented in RGB color space.The matrix obtained from a color image is a 3D matrix with parameter of Width, Height and Depth. Pixels in the RGB color space are no longer a scalar value like in a grayscale/single channel image – instead, the pixels are represented by a list of three values: one value for the Red component, one for Green, and another for Blue.
Image processing libraries such as OpenCv and skit-image represents RGB images as multi-dimensional Numpy arrays with shape (height, width, depth). They also store the RGB channels in reverse order ie BGR. ,the depth is fixed at depth=3.Before feeding the images to a neural network the image processing is required for scaling the images.The size/aspect ratio of the set of images should be same.The Common choices for width and height image sizes inputted to Convolutional Neural Networks include 32×32, 64×64, 224×224, 227×227, 256×256, and 299×299.
Loading an Image using OpenCV Library
import cv2
image = cv2.imread("example.png")
print(image.shape)
cv2.imshow("Image", image)
cv2.waitKey(0)
*pip install cv2 if module not found
Comments
Post a Comment