Kreiman Lab

Biological and Computer Vision

Gabriel Kreiman

Additional Materials

Chapter IX: Towards a world with intelligent machines that can interpret the visual world

Fueled by the availability of large training datasets, increased computational power, and biologically inspired architectures, there has been rapid progress in the development of computer vision systems that find applicability in a plethora of real-world pattern recognition problems. Computer vision has thrived, and often surpasses humans, in tasks like face recognition, clinical diagnosis based on imaging techniques, classifying the shapes of galaxies, flora and fauna, and scene interpretation for self-driving cars. We may soon have computer vision systems that can help blind people interpret the world around them. An intriguing corollary of modern computer vision systems is the possibility of creating quasi-realistic images by inverting the architectures, as in generative adversarial networks. Despite the multiple successes of computer vision systems during the last decade, current algorithms remain fragile, can be easily fooled via adversarial images, and often fail to generalize. The quod-erat-demonstrandum (qed) will be a model that can pass the Turing for vision, that is, a model that can answer any arbitrary question about any image, and those answers are indistinguishable from the ones provided by humans.

[1] Figures in powerpoint format for teaching

[2] Further reading

[3] Style transfer demos and deepdream

[4] Image captioning

[5] Image classification

[6] Other

[1] Style transfer demos and deepdream

Deep Style

Arbitrary style transfer

Deep Fake: Bill Hader channels Tom Cruise

Deepdream generator

Feature visualization: how neural netwroks build up their understanding of images

[2] Image captioning

Captionbot

Diverse beam search

Image to text

[3] Image classification demos

Image classification

[4] Other

A great collection of computer vision datasets

What neural networks see

Quickdraw

Cartoonify

How the police use facial recognition, and where it falls short. New York Times, Jan 14, 2020

A long list of interesting computer vision datasets

Adversarial robustness

The power of self-learning systems (lecture by Demis Hassabis, March 20, 2019) [1:03:43 duration]

The convergence of machine learning and artificial intelligence towards enabling autonomous driving (lecture by Amnon Shashua, March 24, 2017) [1:15:30 duration]