Introduction. Selim Aksoy. Bilkent University

Similar documents
Introduction. Selim Aksoy. Bilkent University

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1

The Fallacy in Intelligent Design

A Scientific Model Explains Spirituality and Nonduality

APAS assistant flexible production assistant

Grids: Why, How, and What Next

From Machines To The First Person

Professor Manovich, welcome to the Thought Project. Thank you so much. I love your project name. I can come back any time.

Under the command of algorithms

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith

Gesture recognition with Kinect. Joakim Larsson

Actuaries Institute Podcast Transcript Ethics Beyond Human Behaviour

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring)

Carolina Bachenheimer-Schaefer, Thorsten Reibel, Jürgen Schilder & Ilija Zivadinovic Global Application and Solution Team

Natural Language Processing (NLP) 10/30/02 CS470/670 NLP (10/30/02) 1

IIF Symposium Toronto Julie Nagam

ST PETER S CATHOLIC PARISH PRIMARY SCHOOL Thursday 7th September 2017 Week 8

What the Manuscripts Reveal

Mark V. Shaney. Comp 140

Year 4 Medium Term Planning

What is belief, such that first person authority can exist?

Automatic Recognition of Tibetan Buddhist Text by Computer. Masami Kojima*1, Yoshiyuki Kawazoe*2 and Masayuki Kimura*3

This is a relatively new term used by those in things like Speech recognition software development or robotic engineering or the internet searches.

Year 4 Medium Term Planning

ABB STOTZ-KONTAKT GmbH ABB i-bus KNX DGN/S DALI Gateway for emergency lighting

INF5020 Philosophy of Information: Ontology

Prentice Hall World Geography: Building A Global Perspective 2003 Correlated to: Colorado Model Content Standards for Geography (Grade 9-12)

FOURTH GRADE. WE LIVE AS CHRISTIANS ~ Your child recognizes that the Holy Spirit gives us life and that the Holy Spirit gives us gifts.

Spatial History - "Using GIS and Open Web Resources to Reconstruct Great Tang Records of the Western Regions on the Silk Route"

Q & A with author David Christian and publisher Karen. This Fleeting World: A Short History of Humanity by David Christian

THIRD GRADE CURRICULUM RELIGION

Potten End Church of England Primary School Curriculum Map. Year 6

An Easy Retail Management System

Biometrics Prof. Phalguni Gupta Department of Computer Science and Engineering Indian Institute of Technology, Kanpur. Lecture No.

The curious case of Mark V. Shaney. Comp 140 Fall 2008

>> THE_PARABLES_PROJECT >> TEACHERS NOTES

Artificial Intelligence Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras

The Last Judgment in Cyberspace

Biometrics Prof. Phalguni Gupta Department of Computer Science and Engineering Indian Institute of Technology, Kanpur. Lecture No.

End of the year test day 2 #3

Living Life Radiantly

#FollowTheStar. Church of England Christmas campaign Find out more and join in at

STEP SEVEN-INTUITION. Gut instinct Psychic Ability Pattern Recognition. The only real valuable thing is intuition. Einstein

A Path Forward for NDE Reliability

Bringing the Overview Effect Down to Earth

Buddha Images in Mudras Representing Days of a Week: Tactile Texture Design for the Blind

The 15 th KCMCW. Karniel Computational Motor Control Workshop Ben-Gurion University of the Negev

YOUR GOD-SELF Manuscript on hidden Knowledge

Laboratory Exercise Saratoga Springs Temple Site Locator

Sounds of Love. Intuition and Reason

Doctor Faustus and the Universal Machine

Presidents Day Resources

March 17, FCC Director of Children s Ministries

Classical Hebrew Poetry: A Guide To Its Techniques (Journal For The Study Of The Old Testament Supplement Series) By Wilfred G. E.

APRIL 2017 KNX DALI-Gateways DG/S x BU EPBP GPG Building Automation. Thorsten Reibel, Training & Qualification

Ms. Shruti Aggarwal Assistant Professor S.G.G.S.W.U. Fatehgarh Sahib

Modern neuroscience: Room for the soul? John Beggs

Copyrighted material Boys Guide to Discovering His Bible, A.indd 3 12/23/14 2:45 PM

Network-based. Visual Analysis of Tabular Data. Zhicheng Liu, Shamkant Navathe, John Stasko

Health Information Exchange

LOMBARD POLICE DEPARTMENT RESPONSE LETTER TO REQUESTOR

Human Resource Management (HRM) 199 hybrid managers 392

T.M. Luhrmann. When God Talks Back: Understanding the American Evangelical Relationship

The Problem of the External World

Wendy E. Mackay. INRIA, France

Proposed Motion: Sponsor Persons of Contact: Guest Speakers Date:

NPTEL ONLINE CERTIFICATION COURSES. Course on Reinforced Concrete Road Bridges

correlated to the Massachussetts Learning Standards for Geometry C14

Minds, Brains and Turing

This image cannot currently be displayed. Course Catalog. Bible Doctrine Glynlyon, Inc.

TRG PROGRAMME PG COURSES SPRING SEMESTER Course: MS-2016 (EE) Duration: 20 Feb to 23 Jun 2017 Discipline: Electrical Engineering No of Weeks: 18

Reading Standards for the Archdiocese of Detroit Kindergarten

Junior Soldiers. Talking to God about myself & others. Consider & Prepare. Unit 3 : Lesson 8

Human Factors/Ergonomics for Societal Transformation: A Tale of Two Cities. Nancy J. Cooke. HFES President

... come into my parlor said the spider... The Creation

Changes sermon--the Tao that Can be Known Rev. Julie Forest October 22, 2017

Monday, December 10, Robert DeLaurentis with Citizen of the World, the Gulfstream Turbine Commander 900 plane that will complete the mission

Curriculum Catalog

CREATE. CONNECT. LIVE. Ed Hepler Winner of the Qualcomm Tricorder XPRIZE

Seldom do we know any specific information or history. The Other Martial Henry by Rob Kassab #4144LB

Inimitable Human Intelligence and The Truth on Morality. to life, such as 3D projectors and flying cars. In fairy tales, magical spells are cast to

Hindu Paradigm of Evolution

THE WAY TO PRACTISE VIPASSANA MEDITATION

Can a Machine Think? Christopher Evans (1979) Intro to Philosophy Professor Douglas Olena

PRODUCTS AND SERVICES CATALOGUE 2017 EDITION

GO ASK ALICE BY AUTHOR ANONYMOUS

MINISTRY OPPORTUNITIES

PARTNERSHIP OPPORTUNITIES

Multi-faith Creation Stories. Cave paintings. Printing

The Great Commission: Part 2 Lesson Aim: To tell other people about Jesus.

Artistic Engagement with Halakhah

Follow this and additional works at: Part of the Fine Arts Commons

INFORMATION FOR DVC MATH STUDENTS in Math 75, 110, 120, 121, 124 and 135 Distance Education Hours by Arrangement (HBA) - Summer 2010

Scripture Focus: Deuteronomy 28:1-9, Galatians 3:26-29, Exodus 19:3-6, 1 Peter 2:4-10

Lecture 7.1 Berkeley I

DOWNLOAD OR READ : WESTERN IMAGE OF THE SIKH RELIGION A S PDF EBOOK EPUB MOBI

Paul and King Agrippa Lesson Aim: To know we can pray for others to know who Jesus is.

Mind Manipulation Ancient And Modern Ninja Techniques Haha Lung

LESSON 1: Determining Your Legacy

OUR BODY LIFE SERIES: PBC DNA: WHY WE DO WHAT WE DO. Catalog No Ephesians 4:11-16 Second Message Paul Taylor January 14, 2018

Transcription:

Introduction Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr

What is computer vision? Analysis of digital images by a computer. Stockman and Shapiro: making useful decisions about real physical objects and scenes based on sensed images. Trucco and Verri: computing properties of the 3D world from one or more digital images. Ballard and Brown: construction of explicit, meaningful description of physical objects from images. Forsyth and Ponce: extracting descriptions of the world from pictures or sequences of pictures. CS 484, Spring 2009 2009, Selim Aksoy 2

Why study computer vision? Possibility of building intelligent machines is fascinating. Capability of understanding the visual world is a prerequisite for such machines. Much of the human brain is dedicated to vision. Humans solve many visual problems effortlessly, yet we have little understanding of visual cognition. CS 484, Spring 2009 2009, Selim Aksoy 3

Why study computer vision? An image is worth 1000 words. Images and videos are everywhere. Fast growing collections and many useful applications. Goals of vision research: Give machines the ability to understand scenes. Aid understanding and modeling of human vision. Automate visual operations. CS 484, Spring 2009 2009, Selim Aksoy 4

Applications Medical image analysis Security Biometrics Surveillance Tracking Target recognition Remote sensing Robotics Industrial inspection, quality control Document analysis Multimedia Assisted living Human-computer interfaces CS 484, Spring 2009 2009, Selim Aksoy 5

Medical image analysis http://www.clarontech.com CS 484, Spring 2009 2009, Selim Aksoy 6

Medical image analysis http://www.clarontech.com CS 484, Spring 2009 2009, Selim Aksoy 7

Medical image analysis http://www.clarontech.com CS 484, Spring 2009 2009, Selim Aksoy 8

Medical image analysis Cancer detection and grading CS 484, Spring 2009 2009, Selim Aksoy 9

Medical image analysis Slice of lung Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 10

Biometrics Adapted from Anil Jain, Michigan State CS 484, Spring 2009 2009, Selim Aksoy 11

Biometrics Adapted from Anil Jain, Michigan State CS 484, Spring 2009 2009, Selim Aksoy 12

Surveillance and tracking University of Central Florida, Computer Vision Lab CS 484, Spring 2009 2009, Selim Aksoy 13

Surveillance and tracking Adapted from Octavia Camps, Penn State CS 484, Spring 2009 2009, Selim Aksoy 14

Surveillance and tracking Adapted from Martial Hebert, CMU CS 484, Spring 2009 2009, Selim Aksoy 15

Surveillance and tracking Generating traffic patterns University of Central Florida, Computer Vision Lab CS 484, Spring 2009 2009, Selim Aksoy 16

Surveillance and tracking Tracking in UAV videos Adapted from Martial Hebert, CMU, and Masaharu Kobashi, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 17

Vehicle and pedestrian protection Lane departure warning, collision warning, traffic sign recognition, pedestrian recognition, blind spot warning http://www.mobileye-vision.com CS 484, Spring 2009 2009, Selim Aksoy 18

Forest fire monitoring system Early warning of forest fires Adapted from Enis Cetin, Bilkent University CS 484, Spring 2009 2009, Selim Aksoy 19

Land cover classification CS 484, Spring 2009 2009, Selim Aksoy 20

Object recognition CS 484, Spring 2009 2009, Selim Aksoy 21

Object recognition Recognition of buildings and building groups CS 484, Spring 2009 2009, Selim Aksoy 22

Object recognition Automatic mapping; agriculture CS 484, Spring 2009 2009, Selim Aksoy 23

Content-based retrieval Finding similar regions: airports CS 484, Spring 2009 2009, Selim Aksoy 24

Robotics Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 25

Robotics Adapted from Steven Seitz, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 26

Autonomous navigation Michigan State University General Dynamics Robotics Systems http://www.gdrs.com CS 484, Spring 2009 2009, Selim Aksoy 27

Industrial automation Automatic fruit sorting Color Vision Systems http://www.cvs.com.au CS 484, Spring 2009 2009, Selim Aksoy 28

Industrial automation Industrial robotics; bin picking http://www.braintech.com CS 484, Spring 2009 2009, Selim Aksoy 29

Postal service automation General Dynamics Robotics Systems http://www.gdrs.com CS 484, Spring 2009 2009, Selim Aksoy 30

Document analysis Digit recognition, AT&T labs http://www.research.att.com/~yann Adapted from Steven Seitz, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 31

Document analysis Adapted from Shapiro and Stockman CS 484, Spring 2009 2009, Selim Aksoy 32

Document analysis Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 33

Sports video analysis Tennis review system http://www.hawkeyeinnovations.co.uk CS 484, Spring 2009 2009, Selim Aksoy 34

Scene classification CS 484, Spring 2009 2009, Selim Aksoy 35

Organizing image archives Adapted from Pinar Duygulu, Bilkent University CS 484, Spring 2009 2009, Selim Aksoy 36

Photo tourism: exploring photo collections Building 3D scene models from individual photos Adapted from Steven Seitz, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 37

Content-based retrieval CS 484, Spring 2009 2009, Selim Aksoy 38

Content-based retrieval CS 484, Spring 2009 2009, Selim Aksoy 39

Content-based retrieval Online shopping catalog search http://www.like.com CS 484, Spring 2009 2009, Selim Aksoy 40

Face detection and recognition CS 484, Spring 2009 2009, Selim Aksoy 41

Object recognition Adapted from Rob Fergus, MIT CS 484, Spring 2009 2009, Selim Aksoy 42

3D scanning Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 43

3D reconstruction Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 44

3D reconstruction Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 45

Motion capture Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 46

Visual effects Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 47

Mozaic Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 48

Mozaic Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 49

Critical issues What information should be extracted? How can it be extracted? How should it be represented? How can it be used to aid analysis and understanding? CS 484, Spring 2009 2009, Selim Aksoy 50

Challenge What do you see in the picture? A hand holding a man A hand holding a shiny sphere An Escher drawing Adapted from Octavia Camps, Penn State CS 484, Spring 2009 2009, Selim Aksoy 51

Perception and grouping Subjective contours CS 484, Spring 2009 2009, Selim Aksoy 52

Perception and grouping Subjective contours Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 53

Perception and grouping Adapted from Gonzales and Woods CS 484, Spring 2009 2009, Selim Aksoy 54

Perception and grouping Adapted from Gonzales and Woods CS 484, Spring 2009 2009, Selim Aksoy 55

Perception and grouping Occlusion Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 57

Perception and grouping The shape of junctions constrains the possible interpretations of the scene. Ambiguous: paint and surface boundaries can be confused. Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 58

Challenges 1: view point variation Michelangelo 1475-1564 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 59

Challenges 2: illumination Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 60

Challenges 3: occlusion Magritte, 1957 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 61

Challenges 4: scale Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 62

Challenges 5: deformation Xu, Beihong 1943 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 63

Challenges 6: background clutter Klimt, 1913 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 64

Challenges 7: intra-class variation Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2009 2009, Selim Aksoy 65

Recognition How can different cues such as color, texture, shape, motion, etc., can be used for recognition? Which parts of image should be recognized together? How can objects be recognized without focusing on detail? How can objects with many free parameters be recognized? How do we structure very large model bases? CS 484, Spring 2009 2009, Selim Aksoy 66

Color Adapted from Martial Hebert, CMU CS 484, Spring 2009 2009, Selim Aksoy 67

Texture Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 68

Segmentation Original Images Color Regions Texture Regions Line Clusters Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 69

Segmentation Adapted from Jianbo Shi, U Penn CS 484, Spring 2009 2009, Selim Aksoy 70

Shape Recognized objects Model database Adapted from Enis Cetin, Bilkent University CS 484, Spring 2009 2009, Selim Aksoy 71

Motion Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 72

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 73

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 74

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 75

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 76

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 77

Recognition CS 484, Spring 2009 2009, Selim Aksoy 78

Recognition Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 79

Detection Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 80

Detection Adapted from David Forsyth, UC Berkeley CS 484, Spring 2009 2009, Selim Aksoy 81

Detection Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 82

Parts and relations Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 83

Parts and relations Adapted from Michael Black, Brown University CS 484, Spring 2009 2009, Selim Aksoy 84

Context Adapted from Antonio Torralba, MIT CS 484, Spring 2009 2009, Selim Aksoy 85

Context Adapted from Antonio Torralba, MIT CS 484, Spring 2009 2009, Selim Aksoy 86

Context Adapted from Derek Hoiem, CMU CS 484, Spring 2009 2009, Selim Aksoy 87

Context Adapted from Derek Hoiem, CMU CS 484, Spring 2009 2009, Selim Aksoy 88

Stages of computer vision Low-level image image Mid-level image features / attributes Image analysis / image understanding High-level features making sense, recognition CS 484, Spring 2009 2009, Selim Aksoy 89

Low-level sharpening blurring Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 90

Low-level Canny original image Mid-level edge image ORT edge image data structure circular arcs and line segments Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 91

Mid-level K-means clustering (followed by connected component analysis) original color image regions of homogeneous color data structure Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 92

Low-level to high-level low-level edge image mid-level high-level consistent line clusters Adapted from Linda Shapiro, U of Washington CS 484, Spring 2009 2009, Selim Aksoy 93