Today, we are going to implement a simple file transfer TCP client-server program in the Python programming language. Here, the server is able to handle multiple clients simultaneously by using...
In this blog post, we will learn how to train a Convolutional Neural Network (CNN) to detect human facial landmarks, such as the eyes, mouth, nose, jawline and more. We...
This article summarizes the research paper "MobileViT: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transformers". MobileViT is a lightweight and general-purpose vision transformer for mobile vision tasks. It...
In this blog post, we are going to learn about the Vision Transformer (ViT). It is a pure Transformer-based architecture used for image classification tasks. Vision Transformer (ViT) has...
In this work, we present a lightweight matting objective decomposition network (MODNet) for portrait matting in real-time with a single input image. MODNet inputs a single RGB image and applies...
In this tutorial, we are going to implement the U-Net architecture in TensorFlow, where we will replace its encoder with a pre-trained VGG19 architecture. The VGG19 is already trained on...
With recent advances, deep learning has become a frontier in solving challenging problems in computer vision, games, self-driving cars and many more. Deep learning...
PP-LiteSeg is a lightweight encoder-decoder architecture designed for real-time semantic segmentation. (Figure: the block diagram of the PP-LiteSeg architecture.) It consists of three modules: Encoder: Lightweight network; Aggregation: Simple Pyramid Pooling Module...
In this tutorial, we are going to learn how to use deep learning to remove the background from images with TensorFlow. In short, we'll use DeepLabV3+, a semantic-segmentation-based model...