User Tools

Site Tools


computer_vision

Table of Contents

# Computer Vision

see: Digital image processing

## Contents

### Image enhancement 图像增强

* Image denoising * Image histogram * Inpainting * Histogram equalization * Tone mapping * Retinex * Gamma correction * Anisotropic Diffusion (Perona-Malik equation)

### Transformations 转换

* Affine transform * Homography (computer vision) * Hough transform * Radon transform * Walsh–Hadamard transform

### Filtering, Fourier and wavelet transforms and image compression 过滤,压缩

* Image compression * Filter bank * Gabor filter * JPEG 2000 * Adaptive filtering

### Color vision, 颜色视觉

* Visual perception * Human visual system model * Color matching function * Color space * Color appearance model * Color management system * Color mapping * Color model * Color profile

### Feature extraction 特征提取

* Active contour * Blob detection * Canny edge detector * Contour detection * Edge detection * Edge linking * Harris corner detector * Random sample consensus (RANSAC) * Scale-invariant feature transform (SIFT)

### Pose estimation 姿态估计

* Bundle adjustment * Articulated body pose estimation (BoPoE) * Direct linear transformation (DLT) * Epipolar geometry * Fundamental matrix * Pinhole camera model * Projective geometry * Trifocal tensor

### Registration

* Active appearance model (AAM) * Cross correlation * Geometric hashing * Graph cut segmentation * Least squares estimation * Image pyramid * Image segmentation * Level set method * Markov random fields * Medial axis * Motion field * Motion vector * Multispectral imaging * Normalized cut segmentation * Optical flow * Particle filtering * Scale space

### Visual Recognition 视觉识别

* Object recognition * Image classification * Object detection/object localization * Face detection * Face recognition * Image Description

* Scene Understanding

#### Video analysis

* Video classification * Video Captioning * Visual Question Answering * Action recognition 行为识别 * Temporal action localization, Temporal Action Detection 时序行为检测 * Multimedia event detection

### Motion analysis

### Scene reconstruction

### Image restoration

## Applications

* 3D reconstruction from multiple images * Audio-visual speech recognition * Augmented reality * Augmented reality-assisted surgery * Automated optical inspection * Automatic image annotation * Automatic number plate recognition * Automatic target recognition * Check weigher * Closed-circuit television * Computer stereo vision * Contextual image classification * DARPA LAGR Program * Digital video fingerprinting * Document mosaicing * Facial recognition systems * GazoPa * Geometric feature learning * Gesture recognition * Image collection exploration * Image retrieval

  • Content-based image retrieval
  • Reverse image search

* Image-based modeling and rendering * Integrated mail processing * Iris recognition * Machine vision * Mobile mapping * Navigation system components for:

  • Autonomous cars
  • Mobile robots

* Object detection * Optical braille recognition * Optical character recognition

  • Intelligent character recognition

* Pedestrian detection * People counter * Physical computing * Red light camera * Remote sensing * Smart camera * Traffic enforcement camera * Traffic sign recognition * Vehicle infrastructure integration * Velocity Moments * Video content analysis * View synthesis * Visual sensor network * Visual Word * Water remote sensing

## Books

* Computer Vision: Algorithms and Applications, Richard Szeliski, Springer, 2010. 书中对计算机视觉领域最新的一些算法进行了汇编,包括图像分割,特征检测和匹配,运动检测,图像缝合,3D重建,对象识别等图像处理的诸多方面,借助本书我们可以对最新主流图像处理算法有个全局把握。 * Learning OpenCV, by Gary Bradski & Adrian Kaehler, O'Reilly Media, 2008. * Multiple View Geometry in Computer Vision, 2nd Edition, by R. Hartley, and A. Zisserman, Cambridge University Press, 2004. * Computer Vision: A Modern Approach,by D.A. Forsyth and J. Ponce, Prentice Hall, 2002.一本不错的计算机视觉教材,全书理论联系实际,并加入了计算机视觉领域的最新研究成果。 * Pattern Classification (2nd Edition), by R.O. Duda, P.E. Hart, and D.G. Stork, Wiley-Interscience, 2000.

## Courses

* Stanford: http://vision.stanford.edu/teaching.html

### 2017

* CSCI 1430: Introduction to Computer Vision http://cs.brown.edu/courses/cs143/ * CS231n: Convolutional Neural Networks for Visual Recognition. http://cs231n.github.io/

## Datasets

### Video

* sport-1m * Youtube-8M 数据集大小:~1.5TB 下载地址:https://research.google.com/youtube8m/ * YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video 含23类共500万手动注释的、紧密贴合对象边界的边界框

### Image

* ImageNet * COCO(监督学习) * YFCC100M(无监督学习数据集) * OpenImages https://github.com/openimages/dataset

## Links

### Paper

* http://openaccess.thecvf.com/menu.py * http://www.cvpapers.com

### Organization

* The Computer Vision Foundation https://www.cv-foundation.org/

### Others

* https://en.wikipedia.org/wiki/Computer_vision * https://en.wikipedia.org/wiki/Outline_of_computer_vision * Awesome Deep Vision: http://jiwonkim.org/awesome-deep-vision/ https://github.com/kjw0612/awesome-deep-vision

References

2015

  • Ali Borji, Ming-Ming Cheng, Huaizu Jiang, Jia Li, Salient Object Detection: A Benchmark, arXiv eprint, 2015. [pdf] [Project page]
  • A. Betancourt,P. Morerio, C. S. Regazzoni, and M. Rauterberg, TheEvolution of First Person Vision Methods: A Survey, IEEE TRANSACTIONS ONCIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 25, NO. 5, MAY 2015
  • J.Galbally, S. Marcel, J. Fierrez, Biometric AntispoofingMethods: A Survey in Face Recognition, IEEE ACCESS, date of publicationDecember 18, 2014
  • T. Li,H. Chang, M. Wang, B.B. Ni, R.C. Hong, S.C. Yan, CrowdedScene Analysis: A Survey, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FORVIDEO TECHNOLOGY, VOL. 25, NO. 3, MARCH 2015
  • E.Sariyanidi, H. Gunes, A. Cavallaro, Automatic Analysisof Facial Affect: A Survey of Registration, Representation, and Recognition,IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 37, NO. 6,JUNE 2015
  • L.Shao, F. Zhu, and X.L. Li, Transfer Learning for VisualCategorization: A Survey, IEEE TRANSACTIONS ON NEURAL NETWORKS ANDLEARNING SYSTEMS, VOL. 26, NO. 5, MAY 2015
  • Freek Stulp, Olivier Sigaud, Many regression algorithms, one unified model: A review, Neural Networks, June 2015
  • B.Tian, B. T. Morris, M. Tang, Y.Q. Liu, Y. J. Yao, C. Guo, D.Y. Shen, S.H. Tang, Hierarchical and Networked Vehicle Surveillance in ITS:A Survey, IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL.16, NO. 2, APRIL 2015
  • Z.Zhang, Y. Xu, J. Yang, X.L. Li, D. Zhang, A Survey of Sparse Representation: Algorithms and Applications, IEEE ACCESS, date ofpublication May 6, 2015
  • Qixiang Ye, David Doermann,Text Detection and Recognition in Imagery: A Survey, IEEE TPAMI, July 2015

2014

  • Ali Borji, Ming-Ming Cheng, Huaizu Jiang, Jia Li. Salient Object Detection: A Survey. arXiv eprint, 2014.
  • S. Fu,H. B. He, Z.G. Hou, Learning Race from Face: A Survey, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 36, NO.12, DECEMBER 2014
  • H.L. Zhou,A. Mian, L. Wei, D. Creighton, M. Hossny, and S. Nahavandi, Recent Advances on Singlemodal and Multimodal FaceRecognition: A Survey, IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, VOL.44, NO. 6, DECEMBER 2014

2013

  • O. D. Lara, M.A. Labrador, A Survey on Human Activity Recognition using WearableSensors, IEEE COMMUNICATIONS SURVEYS & TUTORIALS, VOL. 15, NO. 3,THIRD QUARTER 2013
  • A. Sotiras, C. Davatzikos, Nikos. Paragios, Deformable Medical Image Registration: A Survey, IEEETRANSACTIONS ON MEDICAL IMAGING, VOL. 32, NO. 7, JULY 2013
  • A. Alrahayfeh, M. Faezipour, Eye Tracking and Head Movement Detection: A State-of-ArtSurvey, IEEE Journal of Translational Engineering in Health andMedicine, 2013
  • P.V.K. Borges, N. Conci, and A. Cavallaro, Video-Based Human Behavior Understanding: A Survey, IEEETRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 23, NO. 11,NOVEMBER 2013
  • Mao Ye, Qing Zhang, Liang Wang, Jiejie Zhu, Ruigang Yang, Juergen Gall. A Survey on Human Motion Analysis from Depth Data. Time-of-Flight and Depth Imaging. 2013.
computer_vision.txt · Last modified: 2017/12/15 06:07 by localhost_tm