https://medium.com/@richardcngo/visualizing-the-deep-learning-revolution-722098eb9c5
This post aims to convey three ideas using a series of illustrative examples:
I’ll focus on four domains: vision, games, language-based tasks, and science.
Image recognition has been a focus of AI for many decades. Early research focused on simple domains like handwriting; performance has now improved significantly, beating human performance on many datasets.
In 2014, AI image generation advanced significantly with the introduction of Generative Adversarial Networks (GANs). However, the first GANs could only generate very simple or blurry images.
The key underlying factor for today's progress was scaling up the amount of computing and data used during training.
In 2019, although the videos had some realistic features, almost all of the individual videos were noticeably malformed.
More recently, researchers have focused on producing videos in response to text prompts.