Skip to main content

Intro to Generative Adversarial Networks | GANs 001

   GANs consist of three terms Generative Adversarial Network. Let's understand these three terms first. Generative : A Generative Model takes input training sample from some distribution and learns to represents that distribution. Adversarial : It basically means Conflicting or Opposing. Networks : These are basically neural networks. So,Generative Adversarial Networks are deep neural network architecture comprising of two neural networks compete with each other to make a generative model. A GAN consist of two class models : Discriminative Model :- It is the one that discriminate between two different classes of data.It tries to identify real data from fakes created by the generator Generative Model :- The Generator turns noise into an imitation of the data to try to trick the discriminator Mathematically, A Generative Model 'G' to be trained on training data 'X' sampled from some true distribution 'D' is the one which, given some standard random distrib

Must Know Computer Vision Tools | 2021

Computer vision is an interdisciplinary scientific field which deals with how digital images or videos can obtain high-level understanding from computers. It attempts to understand and automate activities that the human visual system can do from the point of view of engineering.Computer vision is also used in convenience stores, Driverless car training, everyday medical diagnostics, and in tracking the health of crops and animals, as an AI system that helps machines to interpret and mark images. We have seen from our study that machines are talented in identifying pictures.

5 Must Know Computer Vision Tools are:

  • YOLO

    YOLO (You Only Look Once) is an open source object detection approach, which has a number of benefits compared to the other approaches. YOLO nicely learns the context and can learn 'generalized' representation so well that could be used on images with different objects. YOLO is extremely fast.

    Paper :-https://arxiv.org/pdf/1506.02640v5.pdf

  • Scikit-Image

     It is an open source image processing library for the python programmers. It is intended to interoperate with the NumPy and SciPy numerical and science libraries in Python, by representing image object as native Numpy arrays.SciKit-image is a set of image processing algorithms. It is open free of charge and unlimited. 

     Paper :-https://peerj.com/articles/453/

  • Pillow

     Python Imaging Library is an open source library for the python for image processing. It's able to identify, read and saves files in various formats, and is compatible with other modules such as scikit-image.

    Documentation:-  https://pillow.readthedocs.io/en/stable/

  • PyTorch

     It is an open source machine learning Library (based on Torch), developed by Facebook's AI Research Lab. It is used for computer vision and Natural language processing. It is free and open-source software released under the Modified BSD license.PyTorch is an optimized tensor library for deep learning using GPUs and CPUs.

    Documentation:- https://pytorch.org/docs/stable/index.html

  • OpenCV 

    OpenCV is a library of programming functions specifically targeted at computer vision in real-time. Originally developed by Intel, Willow Garage, then Itseez, later sponsored it. The library is free for use under the open source Apache 2 License and is cross-platform. It mainly focuses on image processing, video capture and analysis including features like face detection and object detection.

     Documentation:- https://docs.opencv.org/

 

Comments

Popular posts from this blog

Best Platforms to Improve Machine Learning Skills | 2020

 Machine learning is one of the most exciting techniques one has ever encountered.The field of study that gives computers the ability to learn without being explicitly programmed is machine learning.Their are platforms that can help you improve your Machine Learning skills. Today I've come up with the list of some of my favorite platforms.   Platforms to Improve Machine Learning Skills 1. Kaggle The online community of data scientists and machine learning practitioners is Kaggle, a subsidiary of Google LLC. Kaggle is the largest data science community in the world.Kaggle enables users in a web-based data-science environment to find and publish data sets, explore and build models, work with other data scientists and machine learning engineers, and enter competitions to solve challenges in data science.With it's free GPUs, high paying competitions,massive community , thousands on datasets and notebooks, this platform helps a lot. 2. Seedbank It was launched by  'TensorFlow