Idiot Developer

Simple Object Detection with Bounding Box Regression in TensorFlow

Posted on 26th January 202326th January 2023 by Nikhil Tomar

Object detection is a fundamental task in computer vision that involves identifying and locating objects within an image or video. In this post, we will be discussing a simple method for object detection using bounding box regression in TensorFlow. Bounding box regression is a technique used to predict the location Continue Reading

TensorFlow vs PyTorch

Posted on 24th January 202324th January 2023 by Nikhil Tomar

TensorFlow and PyTorch are both popular open-source frameworks for building and training machine learning models. Both frameworks have their own strengths and weaknesses, and the choice between them depends on the specific needs of the project. Introduction to TensorFlow and PyTorch TensorFlow TensorFlow, which was developed by Google, is a Continue Reading

Implementing Linear Regression in TensorFlow

Posted on 21st January 202321st January 2023 by Nikhil Tomar

TensorFlow is a powerful library for machine learning that allows for the easy implementation of various algorithms, including linear regression. In this tutorial, we will be using TensorFlow tape gradient to implement a linear regression model and plot the loss graph and x and y on matplotlib. First, we will Continue Reading

What is ChatGPT?

Posted on 21st January 202321st January 2023 by Nikhil Tomar

ChatGPT is a state-of-the-art natural language processing model developed by OpenAI. It is based on transformer architecture and is trained on a massive amount of conversational data. In this blog post, we will take a closer look at what ChatGPT is, how it works, and its applications in the field Continue Reading

Activation Function – Basics of Deep Learning

Posted on 20th January 202321st January 2023 by Nikhil Tomar

Deep learning is a subset of machine learning that utilizes neural networks with multiple layers to analyze and identify patterns in data. One of the key components of deep learning is the activation function, which is responsible for determining the output of each neuron in the network. In this blog Continue Reading

Multithreaded TCP File Transfer in Python

Posted on 24th November 20227th February 2023 by Nikhil Tomar

Today, we are going to implement a simple file transfer TCP client-server program in the python programming language. Here, the server is able to handle multiple clients simultaneously by using multiple threads. The server assigns each client a thread to handle communication with that client. Outline Architecture Functions of the Continue Reading

Human Face Landmark Detection in TensorFlow using Pre-trained MobileNetv2

Posted on 23rd November 202223rd November 2022 by Nikhil Tomar

Today, in this blog post, we will learn how to train a Convolutional Neural Network (CNN) to detect human facial landmarks, such as eyes, mouth, nose, jawline and more. We will use the pre-trained MobileNetv2 from TensorFlow to build our model and then train it on Landmark Guided Face Parsing Continue Reading

What is MobileViT?

Posted on 15th November 202215th November 2022 by Nikhil Tomar

This article covers an overall summary of the MobileViT: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transformers research paper. MobileViT is a lightweight and general-purpose vision transformer for mobile vision tasks. It combines the strength of the standard CNN (Convolutional Neural Network) and the Vision Transformers. It has outperformed several CNNs and Continue Reading

Vision Transformer – An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

Posted on 2nd November 20226th February 2023 by Nikhil Tomar

In this blog post, we are going to learn about the Vision Transformer (ViT). It is a pure Transformer based architecture used for image classification tasks. Vision Transformer (ViT) has the ability to replace the standard CNNs while achieving excellent results. The Vision Transformer (ViT) attains excellent results when pre-trained Continue Reading

MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition

Posted on 31st October 20226th February 2023 by Nikhil Tomar

In this work, we present a lightweight matting objective decomposition network (MODNet) for portrait matting in real-time with a single input image. MODNet inputs a single RGB image and applies explicit constraints to solve matting sub-objectives simultaneously in one stage. The research paper is accepted at AAAI 2022 conference. Research Continue Reading