[SCS dragon logo]
15-463
(15-862): Computational
Computer Science Department
Carnegie
Mellon University
COURSE OVERVIEW:
Computational Photography is an emerging new field created by the
convergence
of computer graphics, computer vision and photography. Its role is to
overcome the
limitations of the traditional camera by using computational techniques
to
produce a richer, more vivid, perhaps more perceptually meaningful
representation of our visual world.
The aim of this advanced undergraduate course is to study ways in which samples from the real world (images and video) can be used to generate compelling computer graphics imagery. We will learn how to acquire, represent, and render scenes from digitized photographs. Several popular image-based algorithms will be presented, with an emphasis on using these techniques to build practical systems. This hands-on emphasis will be reflected in the programming assignments, in which students will have the opportunity to acquire their own images of indoor and outdoor scenes and develop the image analysis and synthesis tools needed to render and view the scenes on the computer.
TOPICS TO BE COVERED:
PREREQUISITES:
Programming experience and familiarity with linear algebra and calculus
is
assumed.Some background in computer
graphics, computer vision, or image processing is helpful.This class does not significantly overlap with
15-462 and can be taken concurrently.
Graduate Students: a small number of graduate students will be
allowed
to take the graduate version of this course (15-862) with the
permission of the
instructor. Students taking 15-862 will be required to do more
substantial
assignments as well as a research-level final paper.
Note: if the system doesn’t
let you sign up, or puts you on the waitlist, do talk to me.
PROGRAMMING ASSIGNMENTS:
Results
Class
Choice Award:Xiaoyuan
Jiang
Class Choice Award: Ajoux Philippe Vincent
Project 3: Face morphing and modeling:
See
results
Class Choice Award: TBA
Project 4: Stitching Photo Mosaics (including
autostitching)
See
results
Class Choice Award:TBA
TEXT:
This is the first year when we will be using:
Computer
Vision: Algorithms and Applications. Richard Szeliski (available
online)
Various course notes and papers will be made available.Furthermore, there is an optional textbook that you might find helpful.It will be placed on reserve at the Wean Hall library:
Computer
Vision: The Modern Approach, Forsyth and
There is a number of other fine texts that you can use for general reference:
Photography
(8th
edition), London and Upton, (a great
general guide to taking pictures)
Vision Science: Photons to
Phenomenology, Stephen Palmer (great book
on human visual perception)
Digital Image Processing, 2nd edition, Gonzalez and
Woods (a good general image processing
text)
The Art and Science of Digital Compositing, Ron Brinkmann (everything about compositing)
Multiple View Geometry in
Computer Vision, Hartley & Zisserman (a
bible on recovering 3D geometry) [on reserve]
The Computer Image, Watt and Policarpo (a
nice “vision for graphics” text, somewhat dated)
3D Computer Graphics (3rd Edition), Watt (a
good general graphics text)
Fundamentals of Computer Graphics, Peter Shirley (another
good general graphics text)
Linear Algebra and its Applications, Gilbert Strang (a
truly wonderful book on linear algebra)
CLASS
NOTES
The instructor is extremely grateful to a large number of researchers
for
making their slides available for use in this course.Steve Seitz
and Rick Szeliski
have been
particularly kind in letting me use their wonderful lecture notes.In addition, I would like to thank Paul Debevec, Stephen Palmer, Paul Heckbert, David Forsyth, Steve Marschner and
others, as noted
in the slides.The instructor gladly
gives permission to use and modify any of the slides for academic and
research
purposes. However, please do also acknowledge the original sources
where appropriate.
CLASS SCHEDULE:
CLASS DATE
TOPICS
Material
Th Feb 2
Image
Blending and Compositing
Additional
Burt and Adelson, A multiresolution spline with application to
image mosaics, ACM ToG (1983)
Agarwala et al, Interactive
Digital Photomontage, SIGGRAPH 2004
Data-driven
Methods: Faces
Slides (ppt , pdf )
Rowland
and Ferrett, “Manipulating Facial Appearance through Shape
and Color” ,
CG&A, 1995
Additional
Homographies and
Mosaics
Tu
Automatic Alignment
Th
More Mosaic Madness
キAdditional
Tu
Automatic Alignment
Data-driven Methods: Video and Texture
Additional
1. im2gps
2.Creating and Exploring a Large
Photorealistic Virtual Space
Single
View Reconstruction
single viewsingle view
キ
Image-based Lighting
CAMERAS:
Although it is not required, students are highly encouraged to obtain a
digital
camera for use in the course (one can get a pretty
good camera for under 150ドル). A camera might be available on load
from the
instructor.
METHOD OF EVALUATION:
Grading will be based on a set of programming and written assignments
(60%), an
exam (20%) and a final project (20%).For the programming assignments, students will be allowed a
total of 5
(five) late days per semester; each additional late day will incur a
10%
penalty.
Students taking 15-862 will also be required to submit a conference-style paper describing their final project.
COMPUTING
FACILITIES:
All students will have access to
the graphics cluster in Gates Hall. You will need to setup your Andrew
account in the right way before you can login.
MATLAB:
Students will be encouraged to
use Matlab (with the Image Processing Toolkit) as their primary
computing
platform.Besides being a great
prototyping environment, Matlab is particularly well-suited for working
with
image data and offers tons of build-in image processing functions.Here is a link to some useful
Matlab resources
PREVIOUS OFFERINGS OF THIS
COURSE:
Previous offerings of this course can be found here.
SIMILAR COURSES IN OTHER UNIVERSITIES:
Page design courtesy of Doug James