Introduction. Selim Aksoy. Bilkent University

Similar documents
Introduction. Selim Aksoy. Bilkent University

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1

The Fallacy in Intelligent Design

APAS assistant flexible production assistant

A Scientific Model Explains Spirituality and Nonduality

Carolina Bachenheimer-Schaefer, Thorsten Reibel, Jürgen Schilder & Ilija Zivadinovic Global Application and Solution Team

From Machines To The First Person

Grids: Why, How, and What Next

Benefit of Chemo Brain Recovery

This is a relatively new term used by those in things like Speech recognition software development or robotic engineering or the internet searches.

Under the command of algorithms

Classical Hebrew Poetry: A Guide To Its Techniques (Journal For The Study Of The Old Testament Supplement Series) By Wilfred G. E.

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith

Gesture recognition with Kinect. Joakim Larsson

Automatic Recognition of Tibetan Buddhist Text by Computer. Masami Kojima*1, Yoshiyuki Kawazoe*2 and Masayuki Kimura*3

Professor Manovich, welcome to the Thought Project. Thank you so much. I love your project name. I can come back any time.

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring)

Living Life Radiantly

End of the year test day 2 #3

Copyrighted material Boys Guide to Discovering His Bible, A.indd 3 12/23/14 2:45 PM

What the Manuscripts Reveal

What is belief, such that first person authority can exist?

Actuaries Institute Podcast Transcript Ethics Beyond Human Behaviour

APRIL 2017 KNX DALI-Gateways DG/S x BU EPBP GPG Building Automation. Thorsten Reibel, Training & Qualification

Ms. Shruti Aggarwal Assistant Professor S.G.G.S.W.U. Fatehgarh Sahib

Biometrics Prof. Phalguni Gupta Department of Computer Science and Engineering Indian Institute of Technology, Kanpur. Lecture No.

Monday, December 10, Robert DeLaurentis with Citizen of the World, the Gulfstream Turbine Commander 900 plane that will complete the mission

INF5020 Philosophy of Information: Ontology

Biometrics Prof. Phalguni Gupta Department of Computer Science and Engineering Indian Institute of Technology, Kanpur. Lecture No.

Natural Language Processing (NLP) 10/30/02 CS470/670 NLP (10/30/02) 1

IIF Symposium Toronto Julie Nagam

TRG PROGRAMME PG COURSES SPRING SEMESTER Course: MS-2016 (EE) Duration: 20 Feb to 23 Jun 2017 Discipline: Electrical Engineering No of Weeks: 18

Mark V. Shaney. Comp 140

The 15 th KCMCW. Karniel Computational Motor Control Workshop Ben-Gurion University of the Negev

Health Information Exchange

LOMBARD POLICE DEPARTMENT RESPONSE LETTER TO REQUESTOR

Apologetics. Course Description. Rationale. Prerequisite. Biblical Integration Outcomes. Measurable Learning Outcomes BIB1000

Year 4 Medium Term Planning

Network-based. Visual Analysis of Tabular Data. Zhicheng Liu, Shamkant Navathe, John Stasko

>> THE_PARABLES_PROJECT >> TEACHERS NOTES

An Easy Retail Management System

Artificial Intelligence Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras

Namesake. Leader Guide Sally Sharpe, Contributor. Nashville

The Last Judgment in Cyberspace

Year 4 Medium Term Planning

DALI power line communication

Seldom do we know any specific information or history. The Other Martial Henry by Rob Kassab #4144LB

NPTEL ONLINE CERTIFICATION COURSES. Course on Reinforced Concrete Road Bridges

Artificial Intelligence or Real Wisdom

PRODUCTS AND SERVICES CATALOGUE 2017 EDITION

FOURTH GRADE. WE LIVE AS CHRISTIANS ~ Your child recognizes that the Holy Spirit gives us life and that the Holy Spirit gives us gifts.

ABB STOTZ-KONTAKT GmbH ABB i-bus KNX DGN/S DALI Gateway for emergency lighting

Inimitable Human Intelligence and The Truth on Morality. to life, such as 3D projectors and flying cars. In fairy tales, magical spells are cast to

Wendy E. Mackay. INRIA, France

STEP SEVEN-INTUITION. Gut instinct Psychic Ability Pattern Recognition. The only real valuable thing is intuition. Einstein

A Path Forward for NDE Reliability

A REFLECTION Walk in My Shoes

Potten End Church of England Primary School Curriculum Map. Year 6

Human Factors/Ergonomics for Societal Transformation: A Tale of Two Cities. Nancy J. Cooke. HFES President

Bringing the Overview Effect Down to Earth

Humans were created scientifically

THE WAY TO PRACTISE VIPASSANA MEDITATION

ST PETER S CATHOLIC PARISH PRIMARY SCHOOL Thursday 7th September 2017 Week 8

In a previous lecture, we used Aristotle s syllogisms to emphasize the

Sounds of Love. Intuition and Reason

Q & A with author David Christian and publisher Karen. This Fleeting World: A Short History of Humanity by David Christian

Review for Test III #1

Treasure: An Interview with Chelsea Bartlett


A scene from WHEN WE DEAD AWAKEN. by HENRIK IBSEN. adapted for the stage by WALTER WYKES. CHARACTERS RUBEK: A sculptor IRENE: A former model

Mind Manipulation Ancient And Modern Ninja Techniques Haha Lung

Modern neuroscience: Room for the soul? John Beggs

REVELATION UNDERSTAND. You Can. the Book of A CLEAR GUIDE TO INTERPRETING PROPHECY

Overview of the National Aeronautics and Space AdministrationÕs Nondestructive Evaluation (NDE) Program

The curious case of Mark V. Shaney. Comp 140 Fall 2008

Index. Symbols. brahmi script 246, 249 broadcast news (BN) S framework 503, 504, 591

MY METACOGNITIVE JOURNEY THE POWER OF UNDERSTANDING YOURSELF

The Other 90% by David Franklin Farkas

The Punk Rock Murders: An R. Blaise Conte Mystery by Robert Jamelli

Prentice Hall World Geography: Building A Global Perspective 2003 Correlated to: Colorado Model Content Standards for Geography (Grade 9-12)

Dennett's Reduction of Brentano's Intentionality

Stay Connected this Summer!

Bachelor of Computer and Information Sciences inapplied Information Systems (ISY)

T.M. Luhrmann. When God Talks Back: Understanding the American Evangelical Relationship

Minneapolis Multi-Faith Network. How can religious communities help bend the moral arc of the universe toward justice?

Applying Data Mining to Field Quality Watchdog Task

Refrigeration Training System

Punjab University, Chandigarh. Kurukshetra University, Haryana. Assistant Professor. Lecturer

The Problem of the External World

THIRD GRADE CURRICULUM RELIGION

Minds, Brains and Turing

correlated to the Massachussetts Learning Standards for Geometry C14

This image cannot currently be displayed. Course Catalog. Bible Doctrine Glynlyon, Inc.

September 9, 2012 Page 1 of 5

Al Qaeda Financing and Conflict Diamonds A Sentinel TMS Analysis

Junior Soldiers. Talking to God about myself & others. Consider & Prepare. Unit 3 : Lesson 8

Reading Standards for the Archdiocese of Detroit Kindergarten

Apostle Ronald Harden Founder /Overseer/Pastor

Auroville Core Mobility Group, Final Presentation, 30 April 2010, 1

Device Agnostic: Why You Need to Transform Now

Transcription:

Introduction Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr

What is computer vision? Analysis of digital images by a computer. Stockman and Shapiro: making useful decisions about real physical objects and scenes based on sensed images. Trucco and Verri: computing properties of the 3D world from one or more digital images. Ballard and Brown: construction of explicit, meaningful description of physical objects from images. Forsyth and Ponce: extracting descriptions of the world from pictures or sequences of pictures. CS 484, Spring 2010 2010, Selim Aksoy 2

Why study computer vision? Possibility of building intelligent machines is fascinating. Capability of understanding the visual world is a prerequisite for such machines. Much of the human brain is dedicated to vision. Humans solve many visual problems effortlessly, yet we have little understanding of visual cognition. CS 484, Spring 2010 2010, Selim Aksoy 3

Why study computer vision? Fast growing collections and many useful applications. Goals of vision research: Give machines the ability to understand scenes. Aid understanding and modeling of human vision. Automate visual operations. Adapted from CSE 455, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 4

Applications Medical image analysis Security Biometrics Surveillance Tracking Target recognition Remote sensing Robotics Industrial inspection, quality control Document analysis Multimedia Assisted living Human-computer interfaces CS 484, Spring 2010 2010, Selim Aksoy 5

Medical image analysis http://www.clarontech.com CS 484, Spring 2010 2010, Selim Aksoy 6

Medical image analysis http://www.clarontech.com CS 484, Spring 2010 2010, Selim Aksoy 7

Medical image analysis http://www.clarontech.com CS 484, Spring 2010 2010, Selim Aksoy 8

Medical image analysis 3D imaging: MRI, CT Image guided surgery Grimson et al., MIT Adapted from CSE 455, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 9

Medical image analysis Cancer detection and grading CS 484, Spring 2010 2010, Selim Aksoy 10

Medical image analysis Slice of lung Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 11

Medical image analysis CS 484, Spring 2010 2010, Selim Aksoy 12

Biometrics Adapted from Anil Jain, Michigan State CS 484, Spring 2010 2010, Selim Aksoy 13

Biometrics Adapted from Anil Jain, Michigan State CS 484, Spring 2010 2010, Selim Aksoy 14

Surveillance and tracking University of Central Florida, Computer Vision Lab CS 484, Spring 2010 2010, Selim Aksoy 15

Surveillance and tracking Adapted from Octavia Camps, Penn State CS 484, Spring 2010 2010, Selim Aksoy 16

Surveillance and tracking Adapted from Martial Hebert, CMU CS 484, Spring 2010 2010, Selim Aksoy 17

Surveillance and tracking Generating traffic patterns University of Central Florida, Computer Vision Lab CS 484, Spring 2010 2010, Selim Aksoy 18

Surveillance and tracking Tracking in UAV videos Adapted from Martial Hebert, CMU, and Masaharu Kobashi, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 19

Vehicle and pedestrian protection Lane departure warning, collision warning, traffic sign recognition, pedestrian recognition, blind spot warning http://www.mobileye-vision.com CS 484, Spring 2010 2010, Selim Aksoy 20

Smart cars Adapted from CSE 455, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 21

Forest fire monitoring system Early warning of forest fires Adapted from Enis Cetin, Bilkent University CS 484, Spring 2010 2010, Selim Aksoy 22

Land cover classification CS 484, Spring 2010 2010, Selim Aksoy 23

Land cover classification CS 484, Spring 2010 2010, Selim Aksoy 24

Object recognition CS 484, Spring 2010 2010, Selim Aksoy 25

Object recognition Recognition of buildings and building groups CS 484, Spring 2010 2010, Selim Aksoy 26

Content-based retrieval Finding similar regions: airports CS 484, Spring 2010 2010, Selim Aksoy 27

Robotics Adapted from CSE 455, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 28

Robotics Adapted from Steven Seitz, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 29

Autonomous navigation Michigan State University General Dynamics Robotics Systems http://www.gdrs.com CS 484, Spring 2010 2010, Selim Aksoy 30

Industrial automation Automatic fruit sorting Color Vision Systems http://www.cvs.com.au CS 484, Spring 2010 2010, Selim Aksoy 31

Industrial automation Industrial robotics; bin picking http://www.braintech.com CS 484, Spring 2010 2010, Selim Aksoy 32

Postal service automation General Dynamics Robotics Systems http://www.gdrs.com CS 484, Spring 2010 2010, Selim Aksoy 33

Optical character recognition Digit recognition, AT&T labs http://www.research.att.com/~yann License place recognition Adapted from Steven Seitz, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 34

Document analysis Adapted from Shapiro and Stockman CS 484, Spring 2010 2010, Selim Aksoy 35

Document analysis Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 36

Sports video analysis Tennis review system http://www.hawkeyeinnovations.co.uk CS 484, Spring 2010 2010, Selim Aksoy 37

Scene classification CS 484, Spring 2010 2010, Selim Aksoy 38

Organizing image archives Adapted from Pinar Duygulu, Bilkent University CS 484, Spring 2010 2010, Selim Aksoy 39

Photo tourism: exploring photo collections Building 3D scene models from individual photos Adapted from Steven Seitz, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 40

Content-based retrieval CS 484, Spring 2010 2010, Selim Aksoy 41

Content-based retrieval CS 484, Spring 2010 2010, Selim Aksoy 42

Content-based retrieval Online shopping catalog search http://www.like.com CS 484, Spring 2010 2010, Selim Aksoy 43

Face detection and recognition Adapted from CSE 455, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 44

Object recognition Adapted from Rob Fergus, MIT CS 484, Spring 2010 2010, Selim Aksoy 45

3D scanning Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 46

3D reconstruction Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 47

3D reconstruction Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 48

Motion capture Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 49

Visual effects Adapted from CSE 455, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 50

Mozaic Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 51

Mozaic Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 52

Critical issues What information should be extracted? How can it be extracted? How should it be represented? How can it be used to aid analysis and understanding? CS 484, Spring 2010 2010, Selim Aksoy 53

Challenge What do you see in the picture? A hand holding a man A hand holding a shiny sphere An Escher drawing Adapted from Octavia Camps, Penn State CS 484, Spring 2010 2010, Selim Aksoy 54

Perception and grouping Subjective contours CS 484, Spring 2010 2010, Selim Aksoy 55

Perception and grouping Subjective contours Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 56

Perception and grouping Adapted from Gonzales and Woods CS 484, Spring 2010 2010, Selim Aksoy 57

Perception and grouping Adapted from Gonzales and Woods CS 484, Spring 2010 2010, Selim Aksoy 58

Copyright A.Kitaoka 2003 CS 484, Spring 2010 2010, Selim Aksoy 60

Perception and grouping Occlusion Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 61

Perception and grouping The shape of junctions constrains the possible interpretations of the scene. Ambiguous: paint and surface boundaries can be confused. Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 62

Challenges 1: view point variation Michelangelo 1475-1564 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 63

Challenges 2: illumination Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 64

Challenges 3: occlusion Magritte, 1957 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 65

Challenges 4: scale Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 66

Challenges 5: deformation Xu, Beihong 1943 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 67

Challenges 6: background clutter Klimt, 1913 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 68

Challenges 7: intra-class variation Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2010 2010, Selim Aksoy 69

Recognition How can different cues such as color, texture, shape, motion, etc., can be used for recognition? Which parts of image should be recognized together? How can objects be recognized without focusing on detail? How can objects with many free parameters be recognized? How do we structure very large model bases? CS 484, Spring 2010 2010, Selim Aksoy 70

Color Adapted from Martial Hebert, CMU CS 484, Spring 2010 2010, Selim Aksoy 71

Texture Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 72

Segmentation Original Images Color Regions Texture Regions Line Clusters Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 73

Segmentation Adapted from Jianbo Shi, U Penn CS 484, Spring 2010 2010, Selim Aksoy 74

Shape Recognized objects Model database Adapted from Enis Cetin, Bilkent University CS 484, Spring 2010 2010, Selim Aksoy 75

Motion Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 76

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 77

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 78

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 79

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 80

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 81

Recognition CS 484, Spring 2010 2010, Selim Aksoy 82

Recognition Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 83

Detection Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 84

Detection Adapted from David Forsyth, UC Berkeley CS 484, Spring 2010 2010, Selim Aksoy 85

Detection Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 86

Parts and relations Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 87

Parts and relations Adapted from Michael Black, Brown University CS 484, Spring 2010 2010, Selim Aksoy 88

Context Adapted from Antonio Torralba, MIT CS 484, Spring 2010 2010, Selim Aksoy 89

Context Adapted from Antonio Torralba, MIT CS 484, Spring 2010 2010, Selim Aksoy 90

Context Adapted from Derek Hoiem, CMU CS 484, Spring 2010 2010, Selim Aksoy 91

Context Adapted from Derek Hoiem, CMU CS 484, Spring 2010 2010, Selim Aksoy 92

Stages of computer vision Low-level image image Mid-level image features / attributes Image analysis / image understanding High-level features making sense, recognition CS 484, Spring 2010 2010, Selim Aksoy 93

Low-level sharpening blurring Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 94

Low-level Canny original image Mid-level edge image ORT edge image data structure circular arcs and line segments Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 95

Mid-level K-means clustering (followed by connected component analysis) original color image regions of homogeneous color data structure Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 96

Low-level to high-level low-level edge image mid-level high-level consistent line clusters Adapted from Linda Shapiro, U of Washington CS 484, Spring 2010 2010, Selim Aksoy 97