2015

Jean Ponce, Ivan Laptev, Cordelia Schmid and Josef Sivic

Course Information

Room: ENS Ulm Salle UV aile Rataud, 45 rue d'Ulm

Class time: Tuesday 16:15-19:15

News:

Internship proposals at Willow
Final project reports due on Jan 9.
Schedule of the final project presentations (Tue Dec 16 and Fri Dec 19. See the link for details.)
Assignment 3 “Neural networks” is out. Due on Nov 18.
Final project topics are out. Due on Nov 21.
Assignment 2 “Image classification” is out. Due on Nov 4.
2nd Matlab tutorial will be organized on October 13, 10:00-12:00 at INRIA/Willow lab, Salle Verte 1.
Doodle for Matlab Tutorial http://doodle.com/k23khuxa98rfvpz8 Due on Oct 8.
Assignment 1 “Instance-level recognition” is out. Due on Oct 21.
Matlab tutorial will be organized on October 6, 14:00-16:00 at INRIA/Willow lab, Salle Jaune. Materials for the tutorial are available here.

List of received reports:

https://docs.google.com/spreadsheet/ccc?key=0Aso5oi2c4UB5dHZaZVJtUjJvSmF4ZUxLaUVhOUxpakE#gid=0

Course description

Automated object recognition -- and more generally scene analysis -- from photographs and videos is the grand challenge of computer vision. This course presents the image, object, and scene models, as well as the methods and algorithms, used today to address this challenge.

Assignments

There will be three programming assignments representing 50% (10% + 20% + 20%) of the grade. The supporting materials for the programming assignments and final projects will be in Matlab.

Final project

The final project will represent 50% of the grade. Suggested topics for final projects will be added here. See examples from the last year.

Computer vision and machine learning talks

You are welcome to attend seminars in the Willow group. Please see the current seminar schedule. Typically, these are one hour research talks given by visiting speakers. The talks are at 23 avenue d'Italie. Ring the bell to get into the building, then take the elevator to the 5th floor.

Course schedule (subject to change):

Lecture	Date	Topic and reading materials.	Slides
1	Sep 30	Introduction (J. Ponce); Instance-level recognition I. - Camera geometry (J. Ponce) Class logistics, assignments, final projects (I. Laptev and J. Sivic) Background materials: History: J. Mundy - Object recognition in the geometric era: A retrospective.; Camera geometry: Forsyth&Ponce Ch.1-2. Hartley&Zisserman - Ch.6	PDF
2	Oct 7	Instance-level recognition II. - Local invariant features (1.5hrs, C. Schmid); Correspondence and image matching (1.5hrs, I. Laptev) Materials: Mikolajczyk & Schmid, Scale and affine invariant interest point detectors, IJCV 2004; D. Lowe, Distinctive image features from scale-invariant keypoints, IJCV 2004, R. Szeliski (pdf), Sections 4.1, 4.1.1 and 4.1.2 from Chapter 4: Feature detection and matching;R. Szeliski (pdf), Sections 4.1.3 (feature matching) and 6.1 (feature-based alignment); Assignment: Assignment 1 out.	PDF1 PDF2 PDF3
3	Oct 14	Instance-level recognition III. - Efficient visual search (1.5hrs, J. Sivic); Materials: Muja & Lowe, Fast approx. nearest neighbors with automatic algorithm configuration, VISAPP'09; Sivic & Zisserman, Video Google: Efficient visual search of videos (chapter from this book), Philbin et al., Object retrieval with large vocabularies and fast spatial matching, CVPR'07. Instance-level recognition IV. - Very large scale image indexing (1.5hrs, J. Sivic) Materials: Jegou et al., Improving bag-of-features for large scale image search, IJCV 2010; Jegou et al., Aggregating local image descriptors into compact codes, PAMI 2011;	PDF1 PDF2
4	Oct 21	Sparse coding and dictionary learning for image analysis (1.5hrs, J. Ponce) Materials: Bach, Mairal, Ponce, Sapiro, Tutorial on sparse coding and dictionary learning for image analysis, at CVPR'10. Bag-of-feature models for category-level recognition (1.5hrs, C. Schmid) Materials: Csurka et al., Visual categorization with bags of keypoints, 2004 Assignments: Assignment 1 due. Assignment 2 out.	PDF1 PDF2
5	Oct 28	Neural networks; Optimization methods (N. Le Roux) Materials: 1. Python examples 2. For more details on neural networks you can watch the video lectures by Hugo Larochelle. The website also includes links to useful reading materials such as “Practical Recommendations for Gradient-Based Training of Deep Architectures” by Y. Bengio. 3. The draft of the book on deep learning by Y. Bengio Assignments: Topic suggestions for the final project are out.	PDF
6	Nov 4	Convolutional neural networks for visual recognition (J. Sivic and I. Laptev) Materials: Y. LeCun et al., Gradient-based learning applied to document recognition, Proceedings of the IEEE 86(11): 2278–2324, 1998. M.D. Zeiler, R. Fergus, Visualizing and Understanding Convolutional Networks, ECCV 2014. M. Oquab et al., Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks, CVPR 2014 Assignments: Assignment 2 due. Assignment 3 out.	PDF1 PDF2
	Nov 11	No lecture - public holiday
7	Nov 18	Structured models for category-level localization and pose estimation (J. Sivic) Materials: Felzenszwalb et al., A Discriminatively Trained, Multiscale, Deformable Part Model, CVPR’08; Pascal VOC Challenge; Yang and Ramanan, Articulated Human Detection with Flexible Mixtures of Parts, PAMI’13. P. Felzenszwalb and D. Huttenlocher, Distance Transforms of Sampled Functions. Assignments: Assignment 3 due. Final project proposal due (Nov 21).	PDF
8	Nov 25	Motion and human actions I. (I. Laptev) Materials: Laptev et al., Learning realistic human actions from movies, CVPR’08; Want et al., Dense trajectories and motion boundary descriptors for action recognition, CVPR’11. Assignments:	PDF1 PDF2
9	Dec 2	Motion and human actions II. (C. Schmid) Materials:	PDF1 PDF2
10	Dec 9	Scenes, Objects and 3D reasoning (I. Laptev, J. Sivic) Materials: A. Oliva and A. Torralba: Modeling the shape of the scene: A holistic representation of the spatial envelope, IJCV 2001; J. Xiao et al.: Sun database: Large-scale scene recognition from abbey to zoo, CVPR 2010; D.Hoiem et al.: Putting Objects in Perspective, CVPR 2006; C. Desai et al.: Discriminative models for multi-class object layout, CVPR 2009; N. Kumar et al.: Attribute and simile classifiers for face verification, ICCV 2009.	PDF1 PDF2
11	Dec 16 Dec 19	Final project presentations and evaluation (I. Laptev, J. Sivic) Tue Dec 16 presentations are at the standard class location and time (16:15-19:15 ENS Ulm). Fri Dec 19 (14:00-17:00) presentations are at INRIA, 23 Av. d’Italie, 75013. See presentation schedule.

Relevant literature:

[1]	D.A. Forsyth and J. Ponce, "Computer Vision: A Modern Approach", Prentice-Hall, 2nd edition, 2011
[2]	J. Ponce, M. Hebert, C. Schmid and A. Zisserman "Toward Category-Level Object Recognition", Lecture Notes in Computer Science 4170, Springer-Verlag, 2007
[3]	O. Faugeras, Q.T. Luong, and T. Papadopoulo, "Geometry of Multiple Images", MIT Press, 2001.
[4]	R. Hartley and A. Zisserman, "Multiple View Geometry in Computer Vision", Cambridge University Press, 2004.
[5]	J. Koenderink, "Solid Shape", MIT Press, 1990
[6]	R. Szeliski, "Computer Vision: Algorithms and Applications", 2009. A draft of a new book, which can be downloaded online.

Reconnaissance d’objets et vision artificielle 2014/2015Object recognition and computer vision 2014/2015Jean Ponce, Ivan Laptev, Cordelia Schmid and Josef Sivic

Course schedule (subject to change):

Reconnaissance d’objets et vision artificielle 2014/2015
Object recognition and computer vision 2014/2015

Jean Ponce, Ivan Laptev, Cordelia Schmid and Josef Sivic