Advanced Computer Vision 2026

CGU AICV Lab Computer Vision Lab of the Department of Artificial Intelligence at Chang Gung University

Advanced Computer Vision 2026

Spring 2026, Friday 1:10pm to 4:00pm, Classroom: The second Medical Science Building Room 104
Instructor: Chih-Yuan Yang

Course Information

This course is titled Advanced Computer Vision. However, the field is vast—in 2025 alone, CVPR saw over 2,800 papers published, not to mention specialized CV conferences like ICCV and WACV, or major AI venues like NeurIPS, ICLR, ICML, and AAAI. At an advanced level, we must shift our focus toward specific areas of expertise rather than attempting to cover fundamental knowledge. In this class, I want to guide students through the latest research papers to explore their new ideas, the problems they aim to solve, their current limitations, and their relevance to students’ own research. I expect students to answer these core questions: What is the paper proposing? Is the source code available? Are the results reproducible? How does their approach benefit your research? And finally, are there ways to improve upon their solutions? As this is a literature-heavy course, I assume students already possess foundational knowledge in Computer Vision. While I do not plan to lecture on basic concepts like pixels, color spaces, filters, or neural networks, I will step in to clarify concepts or provide necessary background if discussions become confusing or technical gaps arise.

Prerequisites

In this course, I want students to read the latest papers from top computer vision conferences and journals, which are the state-of-the-art research reports. Students need to present their findings, understanding, reproduced experimental results, and ideas for improvements in the classroom. By understanding those cutting-edge methods, students should gain knowledge and get some ideas for their own research. This course requires programming experience and fundamental knowledge of computer vision.

Microsoft Teams link

https://teams.microsoft.com/meet/48242579078635?p=UaJzY7gYU1iQivpZ10 It will be activated only when asked.

Syllabus

Week	Date	Topic	Slides	Recording	Action
1	2/27				Holiday: Peace Memorial Day Compensation Day
2	3/6	Introduction to this course and the top computer vision conferences. Presented paper: 2025 ICCV Towards Proactive Social Robots: Distilling Visual Knowledge from Large Vision-Language Models	pptx	YouTube
3	3/13	2025 ICCV GeoFormer: Geometry Point Encoder for 3D Object Detection with Graph-based Transformer 2026 arXiv LoopViT: Scaling Visual ARC with Looped Transformers
4	3/20	2025 ICCV From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning 2022 WACV SeaDronesSee: A Maritime Benchmark for Detecting Humans in Open Water
5	3/27	Paper presentation and discussion 3
6	4/3				Holiday: Children’s Day
7	4/10	Term project proposal / Paper presentation and discussion 4
8	4/17	Paper presentation and discussion 5
9	4/24	Paper presentation and discussion 6
10	5/1				Holiday: Labor Day
11	5/8	Midterm presentation / Paper presentation and discussion 7
12	5/15	Paper presentation and discussion 8
13	5/22	Paper presentation and discussion 9
14	5/29	Paper presentation and discussion 10
15	6/5	Paper presentation and discussion 11
16	6/12	Term project presentation
17	6/19				Final report due

Term Project Topics, Slides, and Reports

Topic	Slides	Report	Code

Textbook

We do not have a textbook because the knowledge reported by latest research papers is too new to be covered by a textbook. An evoling large language model is more useful than a textbook for you to retrive new knowledge.

Reference Books

Available online for free offered by the authors.
- Computer Vision: Algorithms and Applications by Richard Szeliski (2022)
- Programming Computer Vision with Python: Tools and algorithms for analyzing images by Jam Solem. (2012)
- Deep Learning by Ian Goodfellow et al. (2016). A third-party-made PDF is available at GitHub.
- Computer Vision: Models, Learning, and Inference by Simon J.D. Prince. (2012)
- Data Driven Science and Engineering by Steven L. Brunton and J. Nathan Kutz. (2017)
- Learn Computer Vision Using OpenCV With Deep Learning CNNs and RNNs by Sunila Gollapudi (2019)
No free PDF offered by the authors, but available at school library.
- Digital Image Processing 3th edition by Rafael Gonzalez and Richard Woods. (2008) There is a 4th edition published in 2017.
- Digital Image Processing using Matlab 2nd edition by Rafael Gonzalez et al. (2010)
- Learning OpenCV3 by Adrian Kaehler & Gary Bradski. (2017)
No free PDF offered by the authors, but code available at GitHub.
- OpenCV 3.x with Python By Example 2nd edition by Gabriel Garrido and Prateek Joshi. (2018)

Existing Full-length Course Lecture Recordings

Existing Online Lecture Videos for Computer Vision Knowledge Points

Columbia Computer Science 2021

Existing Computer Vision Course Slides for Self-Learning

Alexei Efros at UC Berkeley https://cs280-berkeley.github.io/
Derek Hoiem at UIUC https://courses.engr.illinois.edu/cs543/sp2017/
David Forsyth at UIUC http://luthuli.cs.uiuc.edu/~daf/courses/CV23/planned.html
James Hays at Georgia Tech https://faculty.cc.gatech.edu/~hays/compvision2022fall/
Steve Seitz at U Washington https://courses.cs.washington.edu/courses/cse576/20sp/calendar/
Min Sun at NTHU https://aliensunmin.github.io/teaching/cv2022/index.html
Justin Johnson at UMich https://web.eecs.umich.edu/~justincj/teaching/eecs498/WI2022/

Grading

Your final grade will be made up from

50% Your paper presentations in the classroom
10% Discussion participation in the classroom
40% Term project, including proposal (5%), midterm presentation (10%), final project presentation (15%), and final project report (10%). Maximum 5 members each group.
late policy
I do not have a strict late policy because there are only a few students taking this course. I will directly ask students why I do not see their submissions via Teams messages.

Contact Info and Office Hour

Chih-Yuan Yang: cyyang@cgu.edu.tw
Office hours: Tue 10:30~11:30 Management Building Room 1416