OpenAI GPT-3: The successor of OpenAI GPT-2

The research lab OpenAI has released a preprint arXiv paper, titled “Language Models are Few-Shot Learners” or OpenAI GPT-3, which is a continuation of their previous work entitled “Language Models are Unsupervised Multitask Learners” or GPT-2. As a recap. GPT-2 is a language model based on the transformer architecture with Continue Reading

Building Convolutional Autoencoder using TensorFlow 2.0

We are going to continue our journey on the autoencoders. In this article, we are going to build a convolutional autoencoder using the convolutional neural network (CNN) in TensorFlow 2.0. Let us first revise, what are autoencoders?  Autoencoders are neural networks that attempt to mimic its input as closely as Continue Reading

Review: One Model To Learn Them All

Recent advancement in the field of deep learning has enabled us to develop models that yield impressive results across various fields, from image classification, object detection, to speech recognition. However, developing the architecture for each problem is a challenge in itself. In this paper, the authors present a single deep Continue Reading