[Paper Summary] Class-Aware Adversarial Transformers for Medical Image Segmentation

Transformer-based models have shown remarkable progress in the field of medical image segmentation. However, the existing methods still suffer from limitations such as loss of information and…


What is U2-Net or U-square Net

U2-Net is a simple and powerful architecture designed for salient object detection (SOD). It is a two-level nested U-shaped architecture built using the proposed…



Simple Object Detection with Bounding Box Regression in TensorFlow

Object detection is a fundamental task in computer vision that involves identifying and locating objects within an image or video. In this post, we will be discussing…


Human Face Landmark Detection in TensorFlow using Pre-trained MobileNetv2

In this blog post, we will learn how to train a Convolutional Neural Network (CNN) to detect human facial landmarks, such as eyes, mouth, nose, jawline…


Vision Transformer – An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

In this blog post, we are going to learn about the Vision Transformer (ViT). It is a pure Transformer-based architecture used for image classification tasks. Vision…


MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition

In this work, we present a lightweight matting objective decomposition network (MODNet) for portrait matting in real-time with a single input image. MODNet inputs a single RGB…


Why Deep Learning is not Artificial General Intelligence (AGI)

As deep learning has advanced, it has become a frontier for solving many challenging problems in computer vision, games, self-driving cars and many…


PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

PP-LiteSeg is a lightweight encoder-decoder architecture designed for real-time semantic segmentation. It consists of three modules: Encoder: Lightweight network; Aggregation: Simple Pyramid Pooling Module (SPPM); Decoder: Flexible…


Deep Learning based Background Removal from Images using TensorFlow and Python

In this tutorial, we are going to learn how to use deep learning to remove the background from images with TensorFlow. In short, we’ll use DeepLabV3+, a semantic…


VGG16 UNET Implementation in TensorFlow

In this article, we are going to implement UNET, one of the most widely used image segmentation architectures. We are going to replace the UNET encoder with the…
