Object detection is a fundamental task in computer vision that involves identifying and locating objects within an image or video. In this post, we will be discussing a simple method for object detection using bounding box regression in TensorFlow. Bounding box regression is a technique used to predict the location Continue Reading
TensorFlow vs PyTorch
TensorFlow and PyTorch are both popular open-source frameworks for building and training machine learning models. Both frameworks have their own strengths and weaknesses, and the choice between them depends on the specific needs of the project. Introduction to TensorFlow and PyTorch TensorFlow TensorFlow, which was developed by Google, is a Continue Reading
Implementing Linear Regression in TensorFlow
TensorFlow is a powerful library for machine learning that allows for the easy implementation of various algorithms, including linear regression. In this tutorial, we will be using TensorFlow tape gradient to implement a linear regression model and plot the loss graph and x and y on matplotlib. First, we will Continue Reading
What is ChatGPT?
ChatGPT is a state-of-the-art natural language processing model developed by OpenAI. It is based on transformer architecture and is trained on a massive amount of conversational data. In this blog post, we will take a closer look at what ChatGPT is, how it works, and its applications in the field Continue Reading
Activation Function – Basics of Deep Learning
Deep learning is a subset of machine learning that utilizes neural networks with multiple layers to analyze and identify patterns in data. One of the key components of deep learning is the activation function, which is responsible for determining the output of each neuron in the network. In this blog Continue Reading
Multithreaded TCP File Transfer in Python
Today, we are going to implement a simple file transfer TCP client-server program in the python programming language. Here, the server is able to handle multiple clients simultaneously by using multiple threads. The server assigns each client a thread to handle communication with that client. Outline Architecture Functions of the Continue Reading
Human Face Landmark Detection in TensorFlow using Pre-trained MobileNetv2
Today, in this blog post, we will learn how to train a Convolutional Neural Network (CNN) to detect human facial landmarks, such as eyes, mouth, nose, jawline and more. We will use the pre-trained MobileNetv2 from TensorFlow to build our model and then train it on Landmark Guided Face Parsing Continue Reading
What is MobileViT?
This article covers an overall summary of the MobileViT: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transformers research paper. MobileViT is a lightweight and general-purpose vision transformer for mobile vision tasks. It combines the strength of the standard CNN (Convolutional Neural Network) and the Vision Transformers. It has outperformed several CNNs and Continue Reading
Vision Transformer – An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale
In this blog post, we are going to learn about the Vision Transformer (ViT). It is a pure Transformer based architecture used for image classification tasks. Vision Transformer (ViT) has the ability to replace the standard CNNs while achieving excellent results. The Vision Transformer (ViT) attains excellent results when pre-trained Continue Reading
MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition
In this work, we present a lightweight matting objective decomposition network (MODNet) for portrait matting in real-time with a single input image. MODNet inputs a single RGB image and applies explicit constraints to solve matting sub-objectives simultaneously in one stage. The research paper is accepted at AAAI 2022 conference. Research Continue Reading