References

Reference

[1] Moravec HP. Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover. Department of Computer Science, Stanford University; 1980.

[2] Harris C, Stephens M. A combined corner and edge detector. 50. Alvey Vision Conference. 1988;vol. 15 Manchester, UK,

[3] Rohr K. Localization properties of direct corner detectors. J. Math. Imaging Vision. 1994;4(2):139–150.

[4] Tomasi C, Kanade T. Detection and Tracking of Point Features. Citeseer; 1991.

[5] Shi J, Tomasi C. Good features to track. In: IEEE International Conference on Computer Vision and Pattern Recognition; 1994:593–600.

[6] Kenney CS, Zuliani M, Manjunath BS. An axiomatic approach to corner detection. 191–197. IEEE International Conference on Computer Vision and Pattern Recognition. 2005;vol. 1.

[7] Schmid C, Mohr R, Bauckhage C. Comparing and evaluating interest points. In: IEEE International Conference on Computer Vision; 1998:230–235.

[8] Lowe DG. Object recognition from local scale-invariant features. In: IEEE International Conference on Computer Vision, Corfu, Greece; 1999:1150–1157.

[9] Lowe DG. Distinctive image features form scale-invariant keypoints. Int. J. Comput. Vision. 2004;20(2):91–110.

[10] Mikolajczyk K, Schmid C. A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 2005;27(10):1615–1630.

[11] Hubel DH, Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Physiol. 1962;160(1):106–154.

[12] Marr D. Vision: A Computational Investigation into the Human Representation and Processing of Visual Information. New York, NY, USA: Henry Holt and Co., Inc. 1982.

[13] Zhu SC, Wu YN, Mumford D. Minimax entropy principle and its application to texture modeling. Neural Comput. 1997;9(8):1627–1660.

[14] Poggio T, Girosi F. Networks for approximation and learning. Proc. IEEE. 2002;78(9):1481–1497.

[15] Li F-F, Pietro P. A Bayesian hierarchical model for learning natural scene categories. In: IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil; 2007.

[16] Nister D, Stewenius H. Scalable recognition with a vocabulary tree. In: IEEE International Conference on Computer Vision and Pattern Recognition, New York, USA; 2006.

[17] Xie X, Lu L, Jia M, Li H, Seide F, Ma W-Y. Mobile search with multimodal queries. Proc. IEEE. 2008;4:589–601.

[18] Ke Y, Sukthankar R. PCA-SIFT: a more distinctive representation for local image descriptors. In: IEEE International Conference on Computer Vision and Pattern Recognition, Washington, DC, USA; 2004:506–513.

[19] Belongie S, Malik J, Puzicha J. Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 2002;24(4):509–522.

[20] Lazebnik S, Ponce J. A sparse texture representation using local affine regions. IEEE Trans. Pattern Anal. Mach. Intell. 2005;27(8):1265–1278.

[21] Brown M, Szeliski R, Winder S. Multi-image matching using multi-scale oriented patches. In: IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, USA; 2005:510–517.

[22] Simon A, Winder J. Learning local image descriptors. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[23] Hua G, Brown M, Winder S. Discriminant embedding for local image descriptors. In: IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil; 2007.

[24] Philbin J, Chum O, Isard M, Sivic J, Zisserman A. Object retrieval with large vocabularies and fast spatial matching. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[25] Indyk P, Thaper N. Fast image retrieval via embeddings. In: In 3rd International Workshop on Statistical and Computational Theories of Vision, Nice, France; 2003:1–15.

[26] Jegou H, Douze M, Schmid C. Hamming embedding and weak geometric consistency for large scale image search. In: European Conference on Computer Vision, Marseille, France, Springer; 2008:304–317.

[27] Schindler G, Brown M. City-scale location recognition. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[28] Salton G, Buckley C. Term-Weighting Approaches in Automatic Text Retrieval. San Francisco, USA: Morgan Kaufmann Publishers, Inc.; 1988.

[29] Matas J, Chum O, Urban M, Pajdla T. Robust wide-baseline stereo from maximally stable extremal regions. Image Vision Comput. 2004;22(10):761–767.

[30] Sivic J, Philipin J, Zisserman A. Video Google: a text retrieval approach to object matching in videos. In: IEEE International Conference on Computer Vision, Nice, France; 2003:1470–1477.

[31] Jurie F, Triggs B. Creating efficient codebooks for visual recognition. In: IEEE International Conference on Computer Vision, Beijing, China; 2005:604–610.

[32] Yang J, Jiang Y, Hauptmann AG, Ngo C-W. Evaluating bag-of-visual-words representations in scene classification. In: ACM Multimedia Information Retrieval Conference, Augsburg, Germany; 2007:197–206.

[33] Wang L. Toward a discriminative codebook: codeword selection across multi-resolution. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[34] Leung T, Malik J. Representing and recognizing the visual appearance of materials using 3-D textons. Int. J. Comput. Vis. 2001;43(1):29–44.

[35] Jegou H, Harzallah H, Schmid C. A contextual dissimilarity measure for accurate and efficient image search. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[36] MacQueen D. Information Theory, Inference and Learning Algorithms. Cambridge, United Kingdom: Cambridge Press; 2003.

[37] Comaniciu D, Meer P. Mean Shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 2002;24(5):603–619.

[38] Basu S, Bilenko M, Mooney RJ. A probabilistic framework for semi-supervised clustering. In: ACM Conference on Knowledge and Data Discovery, Seattle, USA; 2004:59–68.

[39] Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A. Supervised dictionary learning. Advances in Neural Information Processing Systems. Vancouver, Canada: Neural Information Processing Systems Foundation; 2007 pp. 481–488.

[40] Lazebnik S, Raginsky M. Supervised learning of quantizer codebooks by information loss minimization. IEEE Trans. Pattern Anal. Mach. Intell. 2009;31(7):1294–1309.

[41] Moosmann F, Triggs B, Jurie F. Fast discriminative visual codebooks using randomized clustering forests. Advances in Neural Information Processing Systems. Vancouver, Canada: Neural Information Processing Systems Foundation; 2006 pp. 481–488.

[42] Perronnin F, Dance C, Csurka G, Bressan M. Adapted vocabularies for generic visual categorization. In: European Conference on Computer Vision, Graz, Austria, Springer; 2006:464–475.

[43] Zhang J, Marszalek M, Lazebnik S, Schmid C. Local features and kernels for classification of texture and object categories: a comprehensive review. Int. J. Comput. Vision. 2007;73(2):213–238.

[44] Liu J, Yang Y, Shah M. Learning semantic visual vocabularies using diffusion distance. In: IEEE International Conference on Computer Vision and Pattern Recognition, Miami, USA; 2009.

[45] Kohonen T. Learning vector quantization for pattern recognition, Technical Report, TKK-F-A601. Helsinki Institute of Technology; 1996.

[46] Kohonen T. Self-Organizing Maps. third ed. 2000 Springer, Cambridge, United Kingdom.

[47] Rao A, Miller D, Rose K, Gersho A. A generalized VQ method for combined compression and estimation. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse, France; 1996:2032–2035.

[48] Leibe B, Leonardis A, Schiele B. Combined object categorization and segmentation with an implicit shape model. In: European Conference on Computer Vision, Prague, Czech, Springer; 2004:17–23.

[49] Agarwal S, Roth D. Learning a sparse representation for object detection. In: European Conference on Computer Vision, Prague, Czech, Springer; 2002:97–101.

[50] Bosch A, Zisserman A, Munoz X. Scene classification using a hybrid generative/discriminative approach. IEEE Trans. Pattern Anal. Mach. Intell. 2008;30(4):712–727.

[51] Gionis A, Indyk P, Motwani R. Similarity search in high dimensions via hashing. In: International Conference on Very Large Data Bases, Edinburgh, Scotland, Morgan Kaufmann; 2002:518–529.

[52] Shakhnarovich G, Darrell T, Indyk P. Nearest-Neighbor Methods in Learning and Vision: Theory and Practice. Cambridge, Massachusetts: MIT Press; 2006.

[53] Shakhnarovich G, Viola P, Darrell T. Fast pose estimation with parameter-sensitive hashing. In: International Conference on Computer Vision, Nice, France; 2003:750–757.

[54] Torralba A, Weiss Y, Fergus R. Small codes and large databases of images for object recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Anchorage, United States; 2008.

[55] Weiss Y, Torralba A, Fergus R. Spectral hashing. Advances in Neural Information Processing Systems. Vancouver, Canada: MIT Press; 2008.

[56] Kulis B, Grauman K. Kernelized locality-sensitive hashing for scalable image search. In: IEEE International Conference on Computer Vision, Kyoto, Japan; 2009.

[57] Raginsky M, Lazebnik S. Locality-sensitive binary codes from shift-invariant kernels. Advances in Neural Information Processing Systems. Vancouver, Canada: MIT Press; 2009.

[58] Beis J, Lowe D. Indexing without invariants in 3D object recognition. IEEE Trans. Pattern Anal. Mach. Intell. 1999;21(10):1000–1015.

[59] Arya S, Mount D, Netanyahu N, Silverman R, Wu A. An optimal algorithm for approximate nearest neighbor searching in fixed dimensions. J. ACM. 1998;45(6):891–923.

[60] Liang L, Liu C, Xu Y, Guo B, Shum H. Real-time texture synthesis by patch-based sampling, ACM Trans. Graph. 2001;20(3):127–150.

[61] Hjaltason G, Samet H. Index-driven similarity search in metric spaces. ACM Trans. Database Syst. 2003;28(4):517–580.

[62] Nene S, Nayar S. A simple algorithm for nearest neighbor search in high dimensions. IEEE Trans. Pattern Anal. Mach. Intell. 1997;19(9):989–1003.

[63] Grauman K, Darrell T. Approximate correspondences in high dimensions. Advances in Neural Information Processing Systems. Vancouver, Canada: Neural Information Processing Systems Foundation; 2007 pp. 481–488.

[64] Muja M, Lowe D. Fast approximate nearest neighbors with automatic algorithm configuration. In: IEEE International Conference on Computer Vision Theory and Applications, Lisbon, Portugal; 2009.

[65] Bay H, Tuytelaars T, Gool LV. SURF: speeded up robust features. In: European Conference on Computer Vision, Graz, Austria, Springer. 2006:404–417.

[66] Csurka G, Bray C, Dance C, Fan L. Visual categorization with bags of keypoints. In: European Conference on Computer Vision, Workshop on Statistical Learning in Computer Vision, Prague, Czech, Springer; 2004:1–22.

[67] Fergus R, Perona P, Zisserman A. Object class recognition by unsupervised scale-invariant learning. In: IEEE International Conference on Computer Vision and Pattern Recognition, Madison, USA; 2003:264–271.

[68] Crandall, Felzenszwalb P, Hutternlocher D. Spatial priors for part-based recognition using statistical models. In: IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, USA; 2005:10–17.

[69] Sivic J, Zisserman A. Video data mining using configurations of viewpoint invariant regions. In: IEEE International Conference on Computer Vision and Pattern Recognition, Washington, DC, USA; 2004:488–495.

[70] Quack T, Ferrari V, Gool LV. Video mining with frequent item set configurations. In: International Conference on Content-Based Image and Video Retrieval, Tempe, USA, Springer; 2006:360–369.

[71] Yuan J, Wu Y, Yang M. Discovery of collocation patterns: from visual words to phrase. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[72] Quack T, Ferrari V, Gool LV. Efficient mining of frequent and distinctive feature configurations. In: IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil; 2007.

[73] Fischler MA, Bolles RC. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM. 1981;24:381–395.

[74] Li T, Mei T, Kweon I-S, Hua X-S. Contextual bag-of-words for visual categorization. IEEE Trans. Circuits Syst. Video Technol. 2011;21(4):381–392.

[75] Wu Z, Ke Q, Isard M, Sun J. Bundling features for large scale partial-duplicate web image search. In: IEEE International Conference on Computer Vision and Pattern Recognition, Miami, United States; 2009.

[76] Brin S, Page L. The anatomy of a large-scale hypertextual (web) search engine. In: International World Wide Web Conference; 1998.

[77] Hofmann T. Probabilistic latent semantic indexing. In: ACM International Conference on Information Retrieval; 1999:50–57.

[78] Blei D, Ng AY, Jordan M. Latent dirichlet allocation. J. Mach. Learn. Res. 2003;3:993–1022.

[79] Harris C, Stephens M. A combined corner and edge detector. In: Alvey Vision Conference, Haifa, Israel, Alvey Publisher; 1988:147–152.

[80] Mikolajczyk K, Schmid C. Indexing based on scale invariant interest points. In: IEEE International Conference on Computer Vision, Vancouver, Canada; 2001:525–531.

[81] Mikolajczyk K, Schmid C. Scale and affine invariant interest point detectors. Int. J. Comput. Vision. 2004;60(1):63–86.

[82] Hubel D. Eye, Brain and Vision. New York: Scientific American Library; 1995.

[83] Gazzaniga M, Ivry R, Mangun G. Cognitive Neuroscience: The Biology of the Mind. second ed. New York: W.W. Norton; 2002.

[84] Viola P, Jones M. Rapid object detection using a boosted cascade of simple features. In: IEEE International Conference on Computer Vision and Pattern Recognition, Hawaii, USA; 2001:511–518.

[85] Lin H, Si J, Abousleman GP. Dynamic point selection in image mosaicking. Opt. Eng. 2006;45(3):030501–2–030501-3.

[86] Paletta L, Fritz G, Seifert C. Q-learning of sequential attention for visual object recognition from informative local descriptors. In: International Conference on Machine Learning, Bonn, Germany, International Machine Learning Society; 2005:649–656.

[87] Lazebnik S, Schmid C, Ponce J. Semi-local affine parts for object recognition. In: Britism Machine Vision Conference, London, United Kingdom, Britism Machine Vision Society; 2004:959–968.

[88] Bruckstein A, Rivlin E, Weiss I. Scale space semi-local invariants. Image Vision Comput. 1997;15(5):335–344.

[89] Bileschi S, Wolf L. Image representations beyond histograms of gradients: the role of gestalt descriptors. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[90] Torralba A, Oliva A. Contextual guidance of attention in natural scenes: the role of global features on object search. Psychol. Rev. 2006;113(4):766–786.

[91] Torralba A, Murphy KP, Freeman WT. Contextual models for object detection using boosted random fields. Advances in Neural Information Processing Systems. Vancouver, Canada: Neural Information Processing Systems Foundation; 2004 pp. 1401–1408.

[92] Torralba A. Contextual priming for object detection. Int. J. Comput. Vision. 2003;53(2):169–191.

[93] Jegou H, Schmid C, Harzallah H, Verbeek J. Accurate image search using the contextual dissimilarity measure. IEEE Trans. Pattern Anal. Mach. Intell. 2009;32(1):2–11.

[94] Itti L, Koch C, Niebur E. A model for saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 1998;20(11):1254–1259.

[95] Serre T, Wolf L, Bileschi S, Riesenhuber M, Poggio T. Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 2006;29(3):411–426.

[96] Hou X, Zhang L. Saliency detection: a spectral residual approach. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[97] Jamieson M, Dickinson S, Stevenson S, Wachsmuth S. Using language to drive the perceptual grouping of local image features. In: IEEE International Conference on Computer Vision and Pattern Recognition, New York, USA; 2006.

[98] Fukunaga K. Statistical Pattern Recognition. second ed. Boston, MA, USA: Boston Academic Publishers, Inc.; 1990.

[99] Huang Y, Shekhar S, Xiong H. Discovering collocation patterns from spatial data sets: a general approach. IEEE Trans. Knowl. Data Eng. 2004;16(12):1472–1485.

[100] Mikolajczyk K, Tuytelaars T, Schmid C, Zisserman A, Matas J, Schaffalitzky F, et al. A comparison of affine region detectors. Int. J. Comput. Vision. 2006;65(1–2):43–72.

[101] Cheng Y. Mean shift, model seeking and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 1995;17(8):790–799.

[102] Dundar M, Bi J. Joint optimization of cascaded classifiers for computer aided detection. In: IEEE International Conference on Computer Vision and Pattern Recognition, Minneapolis, USA; 2007.

[103] Torralba, WordNet Structure in LabelMe, Available from: http://people.csail.mit.edu/torralba/research/LabelMe/wordnet/test.html.

[104] Path Labeling Corespondence Dataset Released in CVPR 2010. Towards semantic embedding in visual vocabulary, Available from: http://vilab.hit.edu.cn/~rrji/index_files/SemanticEmbedding.htm.

[105] Fellbaum C. WordNet: An Electronic Lexical Database. Massachusetts, USA: MIT Press; 1998.

[106] Pedersen T, Patwardhan S, Michelizzi J. WordNet: similarity-measuring the relatedness of concepts. In: Association for the Advancement of Artificial Intelligence Conference, San Jose, USA, Association for the Advancement of Artificial Intelligence; 2004:1024–1025.

[107] Li W, Sun M. Automatic Image Annotation Based on WordNet and Hierarchical Ensemble. Springer, Computational Linguistics and Intelligent Text Processing; 2006.

[108] Geman S, Geman D. Stochastic relaxation, gibbs distributions and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 1984;6:721–741.

[109] Hammersley JM, Clifford P. Markov fields on finite graphs and lattices. Unpublished manuscript; 1971.

[110] Winn J, Criminisi A, Minka T. Object categorization by learned universal visual dictionary. In: IEEE International Conference on Computer Vision, Beijing, China; 2005.

[111] PASCAL, Pascal voc database, Available from: http://www.PASCAL-network.org/challenges/VOC/.

[112] Chen D, Tsai S, Chandrasekhar V, Takacs G, Singh J, Girod B. Tree histogram coding for mobile image matching. DCC. 2009.

[113] Chandrasekhar V, Takacs G, Chen D, Tsai S, Grzeszczuk R, Girod B. CHoG: compressed histogram of gradients a low bit-rate feature descriptor. CVPR. 2009.

[114] Li F-F, Perona P. A Bayesian hierarchical model for learning natural scene categories. CVPR. 2005.

[115] Bosch A, Zisserman A, Munoz X. Scene classification using a hybrid generative/discriminative approach. PAMI. 2008.

[116] Weiss Y, Torralba A, Fergus R. Spectral hashing. NIPS. 2008.

[117] Hofmann T. Unsupervised learning by probabilistic latent semantic analysis. ML Journal. 2001.

[118] Yang J, Yu K, Gong Y, Huang T. Linear spatial pyramid matching using sparse coding for image classification. CVPR. 2009.

[119] Fergus R, Perona P, Zisserman A. A sparse object category model for efficient learning and exhaustive recognition. CVPR. 2005.

[120] Ji R, Duan L-Y, Chen J, Yao H, Yuan J, Rui Y, Gao W. Location discriminative vocabulary coding for mobile landmark search. IJCV. 2011.

[121] Snavely N, Seitz SM, Szeliski R. PhotoTourism: exploring photo collections in 3D. SIGGRAPH. 2006.

[122] Agrawal R, Imielinski T, Swami AN. Mining association rules between sets of items in large database. In: ACM Conference on Management of Data, Barcelona, Spain; 1993:207–216.

[123] Ji R, Duan L-Y, Chen J, Gao W. Towards compact topical descriptor. CVPR. 2012.

[124] Tibshirani R. Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society. 1997.

[125] Jegou H, Douze M, Schmid C, Perez P. Aggregating local descriptors into a compact image representation. CVPR. 2010.

[126] Winder S, Brown M. Learning local image descriptors. CVPR. 2007.

[127] Salton G, Wong A, Yang CS. A vector space model for automatic indexing. Commun. ACM. 1975;18(11):613–620.

[128] Yang J, Hauptamann A. A text categorization approach to video scene classification using keypoint features. 2006 CMU Technical Report.

[129] Mitra M, Buckley C, Cardie C, Singhal A. An analysis of statistical and syntactic phrases. In: Recherche d’Information Assistée par Ordinateur, New York, USA; 1997:200–217.

[130] ETH-Zurich, Zurich building image database, Available from: http://www.vision.ee.ethz.ch/showroom/zubud/index.en.html.

[131] Shao T, Svoboda V, Ferrari T, Tuytelaars LV. Gool, Fast indexing for image retrieval based on local appearance with re-ranking. In: IEEE International Conference on Image Processing, Barcelona, Spain; 2003:737–740.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.142.255.140