Learn about the history of science fiction/science fact CV/ML, challenges in CV, techniques including blob detection and deep learning for computer vision

Computer vision is an instance of the “if it’s easy for a human, it’s hard for a computer” truism. Seeing is so natural to us that we underestimate how hard it is. A significant portion of our brain is devoted to processing the signals from our eyes- and we only understand a small part of those brain structures. Computer vision started out in the 60s trying to identify the location and orientation of parts in manufacturing so that a robot effector could pick them up; that led to a set of techniques referred to as blob analysis. Model-based vision techniques try to recognize an object by its structural shape, but that can be difficult—for example, a hanging basket chair is as much a chair as a standard dining room chair with four legs. That led to more biologically inspired techniques where the computer attempted to recognize the functional properties of the objects, e.g., sittability, or the properties of being the right height and surface area to sit on. The biological methods also produced optic flow systems that can detect time to collision without recognizing what is the object the robot is about to hit. Face recognition or even recognizing a human remained difficult, with slow progress in indirect cues such as gait detection, which is described in Autonomous. There was little dangerous of a surveillance culture in The Robots of Gotham. But then Moore’s Law enabled artificial neural networks to have more than three layers and deep (as in many layers in the network, not deep as in intellectual or thoughtful) learning took off. Unfortunately, deep learning is reaching a plateau and the results appears to be very sensitive to the training examples-- so much so that science fiction now talks about nurseries where all the inputs to a robot or AI system can be controlled. But the real challenges are not object recognition but scene understanding and object permanence, topics explored in Kill Decision.  Introduction to AI Robotics second edition (out in Sept 2019) is a good reference as well as the Springer Handbook of Robotics second edition.

June 4, 2019

Science fiction has generally ignored the science behind computer vision and its use of neural networks for machine learning. Robot vision, like robot intelligence, simply existed in stories; it did not have to be explained...

Read the article here...  http://robot...

May 22, 2019

The first science fiction story about computer vision appears to be Jack Williamson’s 1931 short story The Doom from Planet 4. That story featured alien robots who are noteworthy for their camera eyes that actively tracked the movements of an island’s two inhabitants (...

April 20, 2019

Kill Decision, is the 2012 bestseller by Daniel Suarez, It is a fast, fun, and accurate introduction to robot swarms that also covers the ethics of lethal autonomous weapons. 

Most non-roboticists think of swarms as just lots and lots of robots, what we call more precis...

October 23, 2018

Recommendation: If you have a low tolerance for GameLit, you may want to skip this book-- but you’d be missing a smorgasbord of interesting ideas about AI and robotics and the opportunity to learn about computer vision.

The Robots of Gotham is a highly rated GameLit adv...

July 12, 2018

Robots: Humanoid

What it gets right about robotics: the complexities of computer vision and natural language communication.

Recommended reading: don’t let the reviews about robot sex make you think this is a fembot soft porn novel; it’s a thoughtful book about humans and...

Please reload


  • History of science fiction/science fact: A Brief Science Fiction/Science Fact History of Computer Vision and Machine Learning

  • Challenges in computer vision: Science Robotics May 2019: Computer Vision and Machine Learning in Science Fiction

  • The oldest computer vision story: The Doom from Planet 4

  • Active perception: The Doom from Planet 4

  • Blob detection: A Bad Day for Sales, Westworld (1973)

  • Deep learning: Autonomous, Kill Decision, The Robots of Gotham

  • Multi-modal recognition: Autonomous, Predator 2

  • Sobel operator: Predator 2

  • Surveillance: Kill Decision, The Robots of Gotham

  • Thermal vision: Predator, Westworld (1973)

  • Unrealistic: Blade Runner 

For further reading:

  • Springer Handbook of Robotics 2nd Edition

  • Grey Twitter Icon
  • Grey Facebook Icon
  • Grey YouTube Icon
  • Grey Instagram Icon
  • Grey Pinterest Icon