Works Cited
Anandakrishnan, R.; Scogland, T.R.; Fenley, A.T.; Gordon, J.C.; Feng, W.-c.; Onufriev, A.V., Accelerating electrostatic surface potential calculation with multi-scale approximation on graphics processing units, Journal of Molecular Graphics and Modelling 28 (8) (2010) 904910.
Ananthanarayanan, R.; Esser, S.K.; Simon, H.D.; Modha, D.S., The cat is out of the bag: cortical simulations with 10^9 neurons, 10^13 synapses, Supercomputing 2009 (2009).
Anderson, J.A.; Lorenz, C.D.; Travesset, A., General purpose molecular dynamics simulations fully implemented on graphics processing units 227 (10) (2008) 53425359.
Andrieu, C.; de Freitas, N.; Doucet, A.; Jordan, M.I., An Introduction to MCMC for Machine Learning, In: Machine Learning, Volume 50 (2003) Kluwer Academic Publishers, The Netherlands, pp. 543.
Beichel, I.; Sullivan, F., The Metropolis algorithm, Computing in Science & Engineering (2000) 6569.
Bell, N.; Garland, M., Implementing sparse matrix-vector multiplication on throughput-oriented processors Networking, Storage and Analysis, In: Proceeding SC '09 Proceedings of the Conference on High Performance Computing (2009) ACM, New York, NY.
Bianchi, S.; Di Leonardo, R., Real-time optical micro-manipulation using optimized holograms generated on the GPU, Computer Physics Communications 181 (8) (2010) 14441448.
BigDFT. (n.d.). Retrieved from Institut Nanosciences et Cryogénie: http://inac.cea.fr/L_Sim/BigDFT/.
Botelho, S.S.; Lautenschlger, W.; de Figueiredo, M.B.; Centeno, T.M.; Mata, M.M., Dimensional Reduction of Large Image Datasets Using Non-linear Principal Components, In: (Editors: Gallagher, M.; Hogan, J.; Maire, F.) Intelligent Data Engineering and Automated Learning - IDEAL 2005, Vol. 3578 (2005) Springer Berlin/Heidelberg, pp. 3140.
Botelho, S.S.; Lautenschlger, W.; de Figueiredo, M.B.; Centeno, T.M.; Mata, M.M., Dimensional Reduction of Large Image Datasets Using Non-linear Principal Components, In: (Editors: Gallagher, M.; Hogan, J.; Maire, F.) Intelligent Data Engineering and Automated Learning - IDEAL 2005, Vol. 3578 (2005) Springer Berlin/Heidelberg, pp. 3140.
Brandes, U., Drawing Graphs, In: (Editors: M., Kaufmann; D., Wagner) Drawing on physical analogies (2001) Springer-Verlag, pp. 7186.
Cao, W.; Yao, L.; Li, Z.; Wang, Y.; Wang, Z., Implementing Sparse Matrix-Vector multiplication using CUDA based on a hybrid sparse matrix format. Computer Application and System Modeling (ICCASM). (2010) IEEE, Taiyuan; pp. V11-161–V11-165.
Catanzaro, B.; Sundaram, N.; Keutzer, K., Fast support vector machine training and classification on graphics processors. Proceedings of the 25th international conference on Machine learning. (2008) ACM, New York.
Che, S.; Boyer, M.; Meng, J.; Tarjan, D.; Sheaffer, J.; Lee, S.; et al., Rodinia: A Benchmark Suite for Heterogeneous Computing, In: Proceedings of the IEEE International Symposium on Workload CharacterizationIISWC. (2009) IEEE, pp. 4454.
Che, S.; Sheaffer, J.; Boyer, M.; Szafaryn, L.G.; Szafaryn, L.; Wang, L.; et al., A Characterization of the Rodinia Benchmark Suite with Comparison to Contemporary. IEEE International Symposium on Workload. (2010) IEEE.
Chen, C.; Schmidt, B.; Weiguo, L.; Müller-Wittig, W., GPU-MEME: Using Graphics Hardware to Accelerate Motif Finding in DNA Sequences, In: (Editors: Chetty, M.; Ngom, A.; Ahmad, S.) Pattern Recognition in Bioinformatics (2008) Springer Berlin, Heidelberg, pp. 448459.
Christley, S.; Lee, B.; Dai, X.; Nie, Q., Integrative multicellular biological modeling: a case study of 3D epidermal development using GPU algorithms, BMC Systems Biology 4 (1) (2010) 107.
Coddington, P., Random Number Generators for Parallel Computers, The NHSE Review (1997).
Coon, B. W., Mills, P. C., Oberman, S. F., & Siu, M. Y. (2008). Patent No. 7434032. United States of America.
Corley, C.D.; Farber, R.M.; Reynolds, W.N., Thought Leaders During Crises in Massive Social Networks, Statistical Analysis and Data Mining (2011); to be published.
Cormen, T.H.; Leiserson, C.E.; Rivest, R.L., Introduction to algorithms. 2nd ed. (2001) The MIT Press.
Cormen, T.H.; Leiserson, C.E.; Rivest, R.L., Introduction to algorithms. 2nd ed. (2001) The MIT Press, Cambridge, MA.
Cover, T.M.; Thomas, J.A., Elements of information theory. (2006) John Wiley and Sons.
Cox, M.A.; Cox, T.F., Multidimensional Scaling. Springer Handbooks of Computational Statistics. (2008) .
Craighead, M. (2002). NV_primitive_restart. Retrieved June 2011, from opengl.org: http://www.opengl.org/registry/specs/NV/primitive_restart.txt.
Dean, J.; Ghemawat, S., MapReduce: a flexible data processing tool. (2010) ACM.
Dehne, F., & Yogaratnam, K. (2010, Feb). http://arxiv.org/abs/1002.4482. Retrieved June 2011, from Cornell University: http://arxiv.org/abs/1002.4482.
Dematte, L.; Prandi, D., GPU computing for systems biology, Brief Bioinform 11 (3) (2010) 323333.
Di Battista, G.; Eades, P.; Tamassia, R.; Tollis, I.G., Graph Drawing: Algorithms for the Visualization of Graphs. (1999) Prentice Hall, Englewood Cliffs, NJ.
Diamantras, K.I.; Kung, S.Y., Principal Component Neural Networks. (1996) John Wiley and Sons.
Diamos, G. (2009). The Design and Implementation of Ocelot's Dynamic Binary Translator from PTX to Multi-Core x86. CERCS Tech Report.
Duda, R.O.; Hart, P.E., Pattern Classification and Scene Analysis. (1973) Wiley, New Yourk.
Dyer, M.; Frieze, A.; Kannan, R., A random polynomial-time algorithm for approximating the volume, Journal of the ACM (1991) 117.
Eades, P., A heuristic for graph drawing, Congressus Nutnerantiunt (1984) 149160.
Eastman, P.; Pande, V.S., Efficient nonbonded interactions for molecular dynamics on a graphics processing unit, J. Comput. Chem. 31 (6) (2010) 12681272.
Ediger, D.; Jiang, K.; Riedy, J.; Bader, D.A.; Corley, C.; Farber, R.; et al., Massive Social Network Analysis: Mining Twitter for Social Good. 39th International Conference on Parallel Processing. (2010) IEEE; (pp. 583–593).
El Zein, A.H.; Rendell, A.P., Generating optimal CUDA sparse matrix–vector product implementations for evolving GPU hardware, Concurrency and Computation: Practice and Experience (2011).
Fang, Q.; Boas, D.A., Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units, Optics Express (2009) 2017820190.
Farber, R., HPC balance and common sense, Scientific Computing (2007, February) 12+.
Farber, R. (2008, November 1). Extending High-level languages with CUDA. Retrieved June 2011, from Doctor Dobb's Journal: http://drdobbs.com/high-performance-computing/211800683.
Farber, R., Numerical Precision: How much is enough?Scientific Computing (2009, July/August) p. 14+.
Farber, R., Redefining What is Possible, Scientific Computing (2010, November).
Farber, R.M., Efficiently Modeling Neural Networks on Massively Parallel Computers. Proceedings of the Third International Workshop on Neural Networks and Fuzzy Logic. (1992) NASA, Houston; (pp. 3–11).
Farber, R.M.; Lapedes, A.S.; Rico-Martinez, R.; Kevrekidis, I.G., Identification of continuous-time dynamical systems: Neural network based algorithms and parallel implementation, In: Society for Industrial and Applied Mathematics (SIAM) conference on parallel processing for scientific computingNorfolk, VA. (1993).
Farber, R.M.; Lapedes, A.S.; Rico-Martinez, R.; Kevrekidis, I.G., Identification of continuous-time dynamical systems: Neural network based algorithms and parallel implementation. Society for Industrial and Applied Mathematics (SIAM) conference on parallel processing for scientific computing. (1993) American Mathematical Society, Norfolk, VA.
Farber, R.; Trease, H., ssively Parallel Near-Linear Scalability Algorithms with Application to Unstructured Video Analysis, In: TACC TeraGrid08 ConferenceLas Vegas. (2008).
Farber, R.; Lapedes, A.; Sirotkin, K., Determination of Eukaryotic Protein Coding Regions Using Neural Networks and Information Theory, J. Mol. Biology (1992) 471479.
Farooqui, N.; Kerr, A.; Diamos, G.; Yalamanchili, S.; Schwan, K., A Framework for Dynamically Instrumenting GPU. GPGPU-4 Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units. (2011) ACM, New York, NY.
Fatica, M., Accelerating linpack with CUDA on heterogenous clusters. Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units. (2009) ACM.
Frishman, Y.; Ayellet, T., Online Dynamic Graph Drawing, IEEE Transactions on Visualization and Computer Graphics (2008).
Fruchterman, T.M.; Reingold, E.M., Graph Drawing by Force-directed Placement, Journal Software—Practice & Experience (1991) 11291164.
Genovese, L., Ospici, M., Deutsch, T., Méhaut, J.-F., Neelov, A., & Goedecker, S. (2009). Density Functional Theory calculation on many-cores hybrid CPU-GPU architectures.
Godiyal, A.; Hoberock, J.; Garland, M.; Hart, J.C., Rapid Multipole Graph Drawing on the GPU, In: (Editors: Tollis, I.G.; Patrignani, M.) Graph Drawing (2009) Springer-Verlag, Berlin, Heidelberg, pp. 90101.
Gropp, W.; Lusk, E.L.; Skjellum, A., Using MPI. (1999) The MIT Press.
Haixiang, S.; Schmidt, B.; Weiguo, L.; Müller-Wittig, W., A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware, Journal of Computational Biology 17 (4) (2010) 603615.
Hampton, S., Agarwal, P. K., Alam, S. R., & Crozier, P. S. (2010). Towards microsecond biological molecular dynamics simulations on hybrid processors., (pp. 98–107).
Haque, I.S.; Pande, V.S.; Walters, W.P., SIML: A Fast SIMD Algorithm for Calculating LINGO Chemical Similarities on GPUs and CPUs, Journal of Chemical Information and Modeling 50 (4) (2010) 560564.
Harish, P.; Narayanan, P.J., Accelerating large graph algorithms on the GPU using CUDA. Proceeding HiPC'07 Proceedings of the 14th international conference on High performance computing. (2007) Springer-Verlag, Berlin.
Harvey, M.J.; De Fabritiis, G., Swan: A tool for porting CUDA programs to OpenCL, Computer Physics Communications (2010) 10931099.
Hertz, J.A.; Krogh, A.S.; Palmer, R.G., Introduction to the Theory of Neural Computation. (1991) Addison-Wesley, Redwood City, CA.
Hinton, G. E. (2011). Geoffrey E. Hinton. Retrieved 2011, from University of Toronto: http://www.cs.toronto.edu/~hinton/.
Hinton, G.E.; Salakhutdinov, R.R., Reducing the Dimensionality of Data with Neural Networks, SCIENCE (2006, July 28) 504507.
Hinton, G.E.; Krizhevesky, A.; Want, S.D., Transforming Auto-encoders, In: ICANNEspoo, Finland. (2011), pp. 4451.
Hong-tao, B.; Li-li, H.; Dan-tong, O.; Zhan-shan, L.; He, L., K-Means on Commodity GPUs with CUDA, World Congress on Computer Science and Information Engineering (2009) 651655.
Hopcroft, J.E.; Ullman, J.D., Introduction to Automata Theory, Languages, and Computation. (2006) Addison-Wesley, Reading, MA.
Hopfield, J.J.; Tank, D.W., “Neural” Computation of Decisions in Optimization Problems, Biological Cybernetics (1985) 141152.
Hsieh, W.W., Nonlinear principal component analysis by neural networks, Tellus (2001) 599615.
Hsieh, W.W., Nonlinear multivariate and time series analysis by neural network methods, Rev. Geophys. (2004) 125.
Hubbard, D., The Failure of Risk Management: Why It's Broken and How to Fix It. (2009) Wiley.
Humphrey, W.; Dalke, A.; Schulten, K., VMD - Visual Molecular Dynamics, 14 (1996) 3338.
Hwu, W.-m.W., GPU Computing Gems. (2011) Morgan Kaufmann.
In: (Editor: Hwu, W.-m.W.) GPU Computing Gems Emerald Edition (2011) Morgan Kaufmann.
Hwu, W.-m.W., GPU Computing Gems Emerald Edition. (2011) Morgan Kaufmann.
Ines, E.; Hirschmüller, H., Mutual Information Based Semi-Global Stereo Matching on the GPU, In: (Editors: Bebis, G.; Boyle, R.; Parvin, B.; Koracin, D.; Remagnino, P.; Porikli, F.; et al.) Advances in Visual Computing (2008) Springer Berlin, Heidelberg, pp. 228239.
Ingram, S.; Munzner, T.; Olano, M., Glimmer: Multilevel MDS on the GPU, IEEE Transactions on Visualization and Computer Graphics (2009) 249261.
Jerrum, M.; Sinclair, A., The Markov chain Monte Carlo method: an approach to approximate counting, In: (Editor: Hochbaum, D.) Approximation algorithms for NP-hard problems (1996) PWS Publishing, pp. 482519.
Kakumanu, P.; Makrogiannis, S.; Bourbakis, N., A survey of skin-color modeling and detection methods, Pattern Recognition (2007) 11061122.
Kirk, d.; Hwu, W.-m.W., Programming Massively Parallel Processors: A Hands-on Approach. (2010) Morgan Kaufmann.
Kolda, T.G.; Lewis, R.M.; Torczon, V., Optimization by direct search: new perspectives on some classical and modern methods, SIAM J. Sci. Comput (2007) 25072530.
Korber, B.T.; Farber, R.M.; Wolpert, D.H.; Lapedes, A.S., Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis, PNAS (1993) 71767180.
Kramer, M.A., Nonlinear Principle Component Analysis Using Autoassociative Neural Netowrks, AIChE Journal (1991) 233243.
Kumar, N.; Satoor, S.; Buck, I., Fast Parallel Expectation Maximization for Gaussian Mixture Models on GPUs Using CUDA, In: High Performance Computing and Communications, 2009. HPCC '09. 11th IEEE International Conference onSeoul: IEEE. (2009), pp. 103109.
Lapedes, A.S.; Farber, R., How Neural Networks Work. Proceeding of IEEE Denver Conference on Neural Netorks. (1987) IEEE, Denver.
Lapedes, A.; Farber, R., Nonlinear signal processing using neural networks: Prediction and system modelling, In: Nonlinear signal processing using neural networks: Prediction and system modellingSan Diego. (1987).
Laxmikant, K.; Skeel, R.; Bhandarkar, M.; Brunner, R.; Gursoy, A.; Krawetz, N.; et al., NAMD2: Greater scalability for parallel molecular dynamics 151 (1999) 283312.
Little, J., proof for the queuing formula: L = w, Operations research (1961) 383387.
Ma, W., & Agrawal, G. (n.d.). A translation system for enabling data mining applications on GPUs. Proceeding ICS '09 Proceedings of the 23rd international conference on Supercomputing. New York, NY: ACM.
MAGMA. (n.d.). (The University of Tennessee) Retrieved from Innovative Computing Laboratory: http://icl.cs.utk.edu/magma.
Maintz, S.; Eck, B.; Dronskowski, R., Speeding up plane-wave electronic-structure calculations using graphics-processing units, Computer Physic Communications (2011) 14211427.
Malony, A.D.; Biersdorff, S.; Shende, S.; Jagode, H.; Tomov, S.; Jukeland, G.; et al., Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs, ICPP2011 (2011).
Malony, A.D.; Biersdorff, S.; Spear, W.; Mayanglamba, S., An experimental approach to performance measurement of heterogeneous parallel applications using CUDA, Proceedings of the 24th ACM International Conference on Supercomputing (2010) 127136.
McKinnon; McKinnon, K.I., Convergence of the Nelder–Mead simplex method to a non-stationary point, SIAM J Optimization (1999) 148158.
Micikevicius, P., 3D Finite Difference Computation on GPUs using CUDA. Proceeding GPGPU-2 Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units. (2010) ACM, New York, NY.
Micikevicius, P. (2010). Analysis-Driven Optimization (GTC 2010). Retrieved 2011, from 2010 NVIDIA GTC: http://www.nvidia.com/content/GTC-2010/pdfs/2012_GTC2010.pdf.
Minsky, M.; Papert, S., Perceptrons: An Introduction to Computational Geometry. (1969) The MIT Press, Cambridge, MA.
Mitchell, T., Machine Learning. (1997) McGraw Hill.
Monahan, A.H., Nonlinear Principal Component Analysis by Neural Networks: Theory and Application to the Lorenz System, Journal of Climate (2000) 821835.
Narayanan, R.; Ozisikyilmaz, B.; Zambreno, J.; Jayaprakash, P.; Memik, G.; Choudhary, A., MineBench: A Benchmark Suite for Data Mining Workloads. Proceedings of the International Symposium on Workload Characterization (IISWC). (2006) IEEE, San Jose; pp. 182–188.
Nath, R., Stanimire, T., & Dongerra, J. (2010, July 20). An Improved MAGMA GEmm for Fermi. Retrieved April 2011, from http://icl.cs.utk.edu: http://icl.cs.utk.edu/projectsfiles/magma/pubs/fermi_gemm.pdf.
Nelder, J.A.; Mead, R., A Simplex Method for Function Minimization, The Computer Journal (1965) 308313.
Oja, E., Simplified neuron model as a principal component analyzer, Journal of Mathematical Biology (1982) 267273.
Papakonstantinou, A., Gururaj, K., Stratton, J. A., Chen, D., Cong, J., & Hwu, W.-m. W. (2009). FCUDA: Enabling Efficient Compilation of CUDA Kernels onto FPGAs. Proceedings of the Symposium on Application Specific Processors. Proc. IEEE Symp. Application.
Petrini, F.; Kerbyson, D.J.; Pakin, S., The Case of the Missing Supercomputer Performance. SC03. (2003) ACM.
Petrini, F.; Kerbyson, D.J.; Pakin, S., The Case of the Missing Supercomputer Performance: Achieving, In: Proceedings of Supercomputing 2003Phoneix. (2003).
Press, W.H.; Teukolsky, S.A.; Vetterling, W.T., Numerical Recipes 3rd Edition: The Art of Scientific Computing. (2007) Cambridge University Press.
Roberts, E., Stone, J. E., Sepulveda, L., Hwu, W.-M. W., & Luthey-Schulten, Z. (2009). Long time-scale simulations of in vivo diffusion using GPU hardware., (pp. 1–8).
Rodrigues, C.I.; Hardy, D.J.; Stone, J.E.; Schulten, K.; Hwu, W.-M.W., GPU acceleration of cutoff pair potentials for molecular modeling applications. (2008) ACM; pp. 273–282.
Rummelhardt, D.E.; Hinton, G.E.; Williams, R.J., Learning representations by back-propagating errors, Nature 323 (1986) 533536.
Rummelhart, D.E.; McClelland, J.L., The PDP Research Group. Parallel Distributed Processing. (1987) The MIT Press.
Saunders, M.; Simon, H.; Yip, E., Two Conjugate Gradient-Type Methods of Unsymmetric Linear Equations, SIAM J. Num. Anal. (1988) 927940.
In: (Editor: Schmidt, B.) Bioinformatics: High Performance Parallel Computer Architectures (2010) Francis and Taylor.
Schölkopf, B.; Klaus-Robert Müller, A., Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation (1998) 12991319.
Scholz, M., Analysing Periodic Phenomena by Circular PCA, Lecture Notes in Computer Science, 4414/2007 (2007) 3847.
Scholz, M. (2011). Nonlinear PCA. Retrieved 2011, from nlpca: nlpca.org.
Sedláček, M., Evaluation of RGB and HSV Models in Human Faces Detection. Central European Seminar on Computer Graphics, Budmerice, CompSysTech’2004 (2004) 125131.
Sejnowski, T.J.; Rosenberg, C.R., Parallel networks that learn to pronounce English text, Complex Systems (1987) 145168.
Shalom, S.A.; Dash, M.; Tue, M., Efficient K-means Clustering Using Accelerated Graphics Processors, In: (Editors: Song, I.-Y.; Eder, J.; Nguyen, T.) Data Warehousing and Knowledge Discovery (2008) Springer Berlin, Heidelberg, pp. 166175.
Shi, Z., & Zhang, B. (2011, June). http://bioinfo.vanderbilt.edu/gpu-fan/. Retrieved June 2011, from Vanderbilt.edu: http://bioinfo.vanderbilt.edu/gpu-fan/.
Stanislaw, M.R.; Carbonell, J.G.; Mitchell, T.M., Machine learning: an artificial intelligence approach. (1985) Morgan Kaufmann.
Stevens, R.W., Advanced Programming in the UNIX Environment. (2005) Addison-Wesley Professional.
Stivala, A.; Stuckey, P.; Wirth, A., Fast and accurate protein substructure searching with simulated annealing and GPUs, BMC bioinformatics 11 (1) (2010) 446.
Stone, J.E.; Hardy, D.J.; Ufimtsev, I.S.; Schulten, K., GPU-accelerated molecular modeling coming of age, Journal of Molecular Graphics and Modelling 29 (2) (2010) 116125.
Stone, J.E.; Ufimtsev, I.S.; Schulten, K.; Hardy, D.J., GPU-accelerated molecular modeling coming of age, Journal of Molecular Graphics and Modelling (2010) 116125.
Stone, J.; Kohlmeyer, A.; Vandivort, K.; Schulten, K., Immersive Molecular Visualization and Interactive Modeling with Commodity Hardware, In: (Editors: Bebis, G.; Boyle, R.; Parvin, B.; Koracin, D.; Chung, R.; Hammound, R.) Advances in Visual Computing, Vol. 6454 (2010) Springer Berlin/Heidelberg, pp. 382393.
Stratton, J.A.; Stone, S.S.; Hwu, W.-m.W., MCUDA: An Efficient Implementation of CUDA Kernels for Multi-Core CPUs. The 21st International Workshop on Languages and Compilers for Parallel Computing. (2008) Springer LNCS 2008; (pp. 16–30).
Stuart, J.A.; Owens, J.D., Multi-GPU MapReduce on GPU Clusters. Proceedings of the 25th IEEE International Parallel and Distributed Processing Symposium. (2011) IEEE, Anchorage, AK.
Suchard, M.A.; Wanq, Q.; Chan, C.; Frelinger, J.; Cron, A.; West, M., Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures, J Comput Graph Stat. 19 (2) (2010, Jun 1) 419438.
Suchard, M.; Rambaut, A., Many-Core Algorithms for Statistical Phylogenetics, Bioinformatics (2009) 13701376.
Suchard, M.; Wang, Q.; Chan, C.; Frelinger, J.; Cron, A.; West, M., Understanding GPU Programming for Statistical Computation:, Journal of Computational & Graphical Statistics (2010) 419438.
The Max Planck Institute. (2004). Fractal Landscape and Texture Generation. Retrieved June 2011, from mpi-inf.mpg.de: http://www.mpi-inf.mpg.de/departments/irg3/ws0405/cg/rcomp/29/x173.html.
Thearling, K., Massively Parallel Architectures and Algorithms for Time Series Analysis, In: (Editors: Nadel, L.; Stein, D.) 1993 Lectures in Complex Systems (1995) Addison-Wesley.
Townsend, R.; Sankaralingam, K.; Sinclair, M.D., Leveraging the untapped computation power of GPUs: fast spectral synthesis using texture interpolation, In: (Editor: Hwu, W.-m.W.) GPU Computing Gems (2011) Morgan Kaufmann, p. 886.
Tresadern, P.; Ionita, M.C.; Cootes, T., Real-Time Facial Feature Tracking on a Mobile Device, International Journal of Computer Vision (2011).
Ufimtsev, I.S.; Martinez, T.J., Quantum Chemistry on Graphical Processing Units. 1. Strategies for Two-Electron Integral Evaluation, Journal of Chemical Theory and Computation 4 (2) (2008) 222231.
Ufimtsev, I.S.; Martinez, T.J., Quantum Chemistry on Graphical Processing Units. 2. Direct Self-Consistent-Field Implementation, Journal of Chemical Theory and Computation 5 (4) (2009) 10041015.
Ufimtsev, I.S.; Martinez, T.J., Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular Dynamics, Journal of Chemical Theory and Computation 5 (10) (2009) 26192628.
Vezhnevets, V.; Sazonov, V.; Andreeva, A., A Survey on Pixel-Based Skin Color Detection Techniques, GRAPHICON03 (2003) 8592.
Volkov, V., Programming inverse memory hierarchy: case of stencils on GPUs, GPU Workshop for Scientific Computing, International Conference. (2010).
Volkov, V. (2010, September 22). Volkov 10-GTC. Retrieved April 21, 2011, from cs.berkeley.edu: http://www.cs.berkeley.edu/~volkov/volkov10-GTC.pdf.
Volkov, V. (2010, June 30). volkov 10-PMAA. Retrieved April 2011, from http://eech.berkeley.edu: http://www.eecs.berkeley.edu/~volkov/volkov10-PMAA.pdf.
Vuduc, R. (2010, August 2). Teragrid Conference 2010. Retrieved April 2011, from Analysis and Tuning Case Study: http://www.hpcgarage.org/tg10--gpu-tutorial/.
Wong, H.; Papadopoulou, M.-M.; Sadooghi-Alvandi, M.; Moshovos, A., Demystifying GPU Microarchitecture through microbenchmarking. 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS). (2010) IEEE, ^3, ^4; pp. 235–246.
Wu, R.; Zhang, B.; Hsu, M., Clustering billions of data points using GPUs. Proceedings of the combined workshops on UnConventional high performance computing workshop plus memory access workshop. (2009) ACM.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.189.170.134