Programming Computer Vision with Python: Tools and algorithms for analyzing images

"O'Reilly Media, Inc.", Jun 19, 2012 - Computers - 264 pages

If you want a basic understanding of computer vision’s underlying theory and algorithms, this hands-on introduction is the ideal place to start. You’ll learn techniques for object recognition, 3D reconstruction, stereo imaging, augmented reality, and other computer vision applications as you follow clear examples written in Python.

Programming Computer Vision with Python explains computer vision in broad terms that won’t bog you down in theory. You get complete code samples with explanations on how to reproduce and build upon each example, along with exercises to help you apply what you’ve learned. This book is ideal for students, researchers, and enthusiasts with basic programming and standard mathematical skills.

  • Learn techniques used in robot navigation, medical image analysis, and other computer vision applications
  • Work with image mappings and transforms, such as texture warping and panorama creation
  • Compute 3D reconstructions from several images of the same scene
  • Organize images based on similarity or content, using clustering methods
  • Build efficient image retrieval techniques to search for images based on visual content
  • Use algorithms to classify image content and recognize objects
  • Access the popular OpenCV library through a Python interface
 

Contents

Basic Image Handling and Processing
Local Image Descriptors
Image to Image Mappings
Camera Models and Augmented Reality
Multiple View Geometry
Clustering Images
Searching Images
Classifying Image Content
Image Segmentation
OpenCV
Installing Packages
Image Datasets
Image Credits
References
Index
About the Author
Copyright

About the author (2012)

Jan Erik Solem is a Python enthusiast and a computer vision researcher and entrepreneur. He is an applied mathematician who has worked as an associate professor and startup CTO, and is now also a book author. He sometimes writes about computer vision and Python on his blog, www.janeriksolem.net. He has used Python for computer vision in teaching, research, and industrial applications for many years. He currently lives in San Francisco.
