Skip to main content

Posts

Showing posts with the label Computer Vision

Intro to Generative Adversarial Networks | GANs 001

   GANs consist of three terms Generative Adversarial Network. Let's understand these three terms first. Generative : A Generative Model takes input training sample from some distribution and learns to represents that distribution. Adversarial : It basically means Conflicting or Opposing. Networks : These are basically neural networks. So,Generative Adversarial Networks are deep neural network architecture comprising of two neural networks compete with each other to make a generative model. A GAN consist of two class models : Discriminative Model :- It is the one that discriminate between two different classes of data.It tries to identify real data from fakes created by the generator Generative Model :- The Generator turns noise into an imitation of the data to try to trick the discriminator Mathematically, A Generative Model 'G' to be trained on training data 'X' sampled from some true distribution 'D' is the one which, given some standard random distrib

How to resize an Image using OpenCV

Media files tends to have a lot of information and processing it need a lot of time and computation. Resizing of image and videos is done to prevent computational strain Resizing is basically modifying the width and height. Many image recognition and machine learning applications benefit from scaling. By Resizing, the training time of a neural network can be significantly reduced. We'll use CV2.resize() method :- cv2.resize(src, dsize, interpolation)   src - takes the input image   dsize - take the output dimension as input in Tuple   Interpolation take three method as input  cv2.INTER_AREA : This is used when we need to shrink an image.It is the preferred method  cv2.INTER_CUBIC : A Bicubic method, is slow but more efficient.  cv2.INTER_LINEAR : This is primarily used when zooming is required. This is the default interpolation technique in OpenCV   The Function defined below will always work for Images, Video and Live Camera Feed . CODE FOR RESIZING. import cv2 frame= cv2.imre

An Image For Computer Vision,Everything You Need To Know

 Image An Image consists of a set of pixels, which are the buildings blocks for any image. Every Pixels defines the color or the intensity of light.      Suppose an image has a resolution of 1000 x 750,which mean that it is 1000 pixels wide and 750 pixels tall. So the total number of pixels in our image will be 1000 * 750 = 7,50,000 pixels. An Image can be of two type :- Grayscale Color A Grayscale image can have a pixel value between 0 and 255, here 0 means the pixel is 'Black' and 255 means the pixel is ' White'. All the values in between represents various shades of gray. The matrix obtained from a Grayscale Image is 2-Dimensional ie it has width and height. A Color image is represented in RGB color space.The matrix obtained from a color image is a 3D matrix with parameter of Width, Height and Depth. Pixels in the RGB color space are no longer a scalar value like in a grayscale/single channel image – instead, the pixels are represented by a list of three values: