Idiot Developer

Tag: cnn

Computer Vision, TensorFlow

Human Face Landmark Detection in TensorFlow using Pre-trained MobileNetv2

23rd November 2022

Nikhil Tomar

Today, in this blog post, we will learn how to train a Convolutional Neural Network (CNN) to detect human facial landmarks, such as eyes, mouth, nose, jawline and more. We will use the pre-trained MobileNetv2 from TensorFlow to build our model and then train it on Landmark Guided Face Parsing (LaPa) dataset. Outline What are…
Read more: Human Face Landmark Detection in TensorFlow using Pre-trained MobileNetv2
Computer Vision, TensorFlow

VGG19 UNET Implementation in TensorFlow

18th October 2022

Nikhil Tomar

In this tutorial, we are going to implement the U-Net architecture in TensorFlow, where we will replace its encoder with a pre-trained VGG19 architecture. The VGG19 is already trained on the ImageNet classification dataset. Therefore, it would have already learned the required features, which would help to boost the overall performance of the VGG19-UNET. The…
Read more: VGG19 UNET Implementation in TensorFlow
Deep Learning

Why Deep Learning is not Artificial General Intelligence (AGI)

7th October 2022

Nikhil Tomar

With the development in the field of deep learning, it has become a frontier in solving multiple challenging problems in computer vision, games, self-driving cars and many more. Deep learning has even achieved superhuman performance in some tasks, but still, it lacks some fundamental features which are required for a truly intelligent system. In this…
Read more: Why Deep Learning is not Artificial General Intelligence (AGI)
Computer Vision, Deep Learning, TensorFlow

VGG16 UNET Implementation in TensorFlow

3rd December 2021

Nikhil Tomar

In this article, we are going to implement the most widely used image segmentation architecture called UNET. We are going to replace the UNET encoder with the VGG16 implementation from the TensorFlow library. The UNET encoder would learn the features from scratch, while the VGG16 is already trained on the Image ImageNet classification dataset. Therefore…
Read more: VGG16 UNET Implementation in TensorFlow
Computer Vision, Deep Learning, Python, PyTorch, TensorFlow

Squeeze and Excitation Implementation in TensorFlow and PyTorch

1st December 2021

Nikhil Tomar

The Squeeze and Excitation network is a channel-wise attention mechanism that is used to improve the overall performance of the network. In today’s article, we are going to implement the Squeeze and Excitation module in TensorFlow and PyTorch. What is Squeeze and Excitation Network? The squeeze and excitation attention mechanism was introduced in the year…
Read more: Squeeze and Excitation Implementation in TensorFlow and PyTorch
Deep Learning

Semi-supervised Learning – Fundamentals of Deep Learning

27th November 2021

Nikhil Tomar

Semi-supervised learning is a type of machine learning where we use a combination of a large amount of unlabelled data and a small amount of labelled data to train the model. It is a hybrid approach between supervised learning and unsupervised learning. The basic difference between the two is that supervised learning algorithms use labelled…
Read more: Semi-supervised Learning – Fundamentals of Deep Learning
Computer Vision, Deep Learning

What is UNET?

19th January 2021

Nikhil Tomar

UNET is an architecture developed by Olaf Ronneberger and his team at the University of Freiburg in 2015 for biomedical image segmentation. It is a highly popular approach for semantic segmentation tasks. It is a fully convolutional neural network that is designed to learn from fewer training samples. This architecture is an improvement over the…
Read more: What is UNET?
Computer Vision, Python, TensorFlow

UNET Segmentation with Pretrained MobileNetV2 as Encoder

16th June 2020

Nikhil Tomar

In this tutorial, we are going to work on UNet segmentation and use it for biomedical image segmentation tasks. This time we are going to use pre-trained MobileNetV2 as the encoder for the UNet architecture. We are going to integrate the pre-trained MobileNetV2 with the UNet and have an efficient network architecture. The MobileNetV2 is…
Read more: UNET Segmentation with Pretrained MobileNetV2 as Encoder
Computer Vision

What is Data Augmentation?

19th March 2020

Nikhil Tomar

Data augmentation is a process that enables you to increase the amount of training data by making reasonable modifications in your existing data. It helps you to increase the diversity of your training data which is essential for developing a robust model. This then, generally speaking, improves the performance of deep learning models. Although data…
Read more: What is Data Augmentation?

About Us

Nikhil Kumar Tomar

AI Researcher and a part-time blogger and YouTuber. Most of my research is focused medical imaging.

Categories

Featured Posts

Visual Question Answering from Scratch using TensorFlow
by Nikhil Tomar
15th January 2025
What is Dice Coefficient?
by Nikhil Tomar
28th August 2024
ResUNet++ Implementation in TensorFlow
by Nikhil Tomar
20th April 2024
UNet 3+ Implementation in TensorFlow
by Nikhil Tomar
11th April 2024
ResUNET: A TensorFlow Implementation for Semantic Segmentation
by Nikhil Tomar
1st February 2024
ColonSegNet Implementation In TensorFlow
by Nikhil Tomar
25th January 2024