Human Face Detection using Multi-task Cascaded Convolutional Networks in TensorFlow

In recent years, advances in machine learning and deep learning techniques have revolutionized the field of computer vision. With the help of these techniques, we can now detect and identify objects in real time with remarkable accuracy. One of the most popular tasks in computer vision is human face detection, Continue Reading

[Paper Summary] Class-Aware Adversarial Transformers for Medical Image Segmentation

Transformer-based models have shown remarkable progress in the field of medical image segmentation. However, the existing methods still suffer from limitations such as loss of information and inaccurate segmentation label maps. In this article, we will discuss a new type of adversarial transformer, the Class-Aware Adversarial Transformer (CASTformer), which overcomes Continue Reading

What is U2-Net or U-square Net

U2-Net is a simple and powerful architecture designed for the purpose of salient object detection (SOD). It is a two-level nested U-shaped architecture built using the proposed ReSidual U-blocks (RSU). The U2-Net does not use any pre-trained architecture and is trained from scratch. The architecture comes with two variants: U2-Net Continue Reading

Simple Object Detection with Bounding Box Regression in TensorFlow

Object detection is a fundamental task in computer vision that involves identifying and locating objects within an image or video. In this post, we will be discussing a simple method for object detection using bounding box regression in TensorFlow. Bounding box regression is a technique used to predict the location Continue Reading

TensorFlow vs PyTorch

TensorFlow and PyTorch are both popular open-source frameworks for building and training machine learning models. Both frameworks have their own strengths and weaknesses, and the choice between them depends on the specific needs of the project. Introduction to TensorFlow and PyTorch TensorFlow TensorFlow, which was developed by Google, is a Continue Reading

Implementing Linear Regression in TensorFlow

TensorFlow is a powerful library for machine learning that allows for the easy implementation of various algorithms, including linear regression. In this tutorial, we will be using TensorFlow tape gradient to implement a linear regression model and plot the loss graph and x and y on matplotlib. First, we will Continue Reading

What is MobileViT?

This article covers an overall summary of the MobileViT: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transformers research paper. MobileViT is a lightweight and general-purpose vision transformer for mobile vision tasks. It combines the strength of the standard CNN (Convolutional Neural Network) and the Vision Transformers. It has outperformed several CNNs and Continue Reading

Vision Transformer – An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

In this blog post, we are going to learn about the Vision Transformer (ViT). It is a pure Transformer based architecture used for image classification tasks. Vision Transformer (ViT) has the ability to replace the standard CNNs while achieving excellent results. The Vision Transformer (ViT) attains excellent results when pre-trained Continue Reading