Barrai I, Rodriguez-Larralde A, Marnolini E, Manni F, and Scapolini C. 2001. Isonymy structure of USA population. American Journal of Physical Anthropology114(2), 109–123.
Barratt N. 2008. Who Do You Think You Are? Encyclopedia of Genealogy: The Definite Reference Guide to Tracing Your Family History. HarperCollins, New York, NY.
Beaulieu A. 2009. Learning SQL. O'Reilly, Sebastopol, CA.
Berners-Lee T. 2000. Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by its Inventor. HarperCollins, New York, NY.
Berners-Lee T, Fielding R, and Frystyk H. 1996. Hypertext transfer protocol—http/1.0. RFC 1945. http://tools.ietf.org/html/rfc1945 (Last accessed December 14, 2013).
Bivand R, Keitt T, and Rowlingson B. 2013a. rgdal: Bindings for the Geospatial Data Abstraction Library. R package version 0.8-14. http://CRAN.R-project.org/package=rgdal
Bivand RS, Pebesma E, and Gómez-Rubio V. 2013b. Applied Spatial Data Analysis with R UseR! Series 2nd edn. Springer, Heidelberg/New York.
Bratton K and Rouse SM. 2011. Networks in the legislative arena: How group dynamics affect cosponsorship. Legislative Studies Quarterly36(3), 423–460.
Broniatowski DA, Paul MJ, and Dredze M. 2013. National and local influenza surveillance through twitter: An analysis of the 2012-2013 influenza epidemic. Plos One8(12), doi:10.1371/journal.pone.0083672.
Burkett T. 1997. Cosponsorship in the United States Senate: A Network Analysis of Senate Communication and Leadership, 1973-1990. University of South Carolina, Columbia.
Castro E and Hyslop B. 2014. HTML and CSS: Visual QuickStart Guide. Peachpit Press.
Cerami E. 2002. Web Services Essentials. O'Reilly, Sebastopol, CA.
Chamberlain S, Boettiger C, Ram K, Barve V, and Mcglinn D. 2013. rgbif: Interface to the Global Biodiversity Information Facility API. R package version 0.4.0.
Chamberlin DD and Boyce RF. 1974. Sequel: A Structured English Query Language. Proceedings of the 1974 ACM SIGFIDET Workshop on Data Description, Access and Control, May 1974, Ann Arbor, MI, pp. 249–264.
Chesney T. 2006. An empirical examination of Wikipedia's credibility. First Monday11(11), doi:10.5210/fm.v11i11.1413.
Cho WKT and Fowler JH. 2010. Legislative success in a small world: Social network analysis and the dynamics of congressional legislation. Journal of Politics72(1), 124–135.
Christian P. 2012. The Genealogist's Internet: The Essential Guide to Researching Your Family History Online. 5th revised edition. A & C Black Business Information and Development, London.
Clauson KA, Polen HH, Boulos MNK, and Dzenowagis JH. 2008. Scope, completeness, and accuracy of drug information in wikipedia. The Annals of Pharmacotherapy42(12), 1814–1821.
Codd EF. 1970. A relational model of data for large shared data banks. Communications of the ACM13(6), 377–387.
Conway J, Eddelbuettel D, Nishiyama T, Prayaga SK (during 2008), and Tiffin N. 2013. RPostgreSQL: R Interface to the PostgreSQL Database System. R package version 0.4.
Döring H. 2013. The collective action of data collection: A data infrastructure on parties, elections and cabinets. European Union Politics14(1), 161–178.
Dreyer AJ and Stockton J. 2013. Internet ‘data scraping’: A primer for counseling clients. New York Law Journal July, 1–3.
Eisenberg JD. 2002. SVG Essentials. O'Reilly, Sebastopol, CA.
Fowler JH. 2006a. Connecting the congress: A study of cosponsorship networks. Political Analysis14(4), 456–487.
Fowler JH. 2006b. Legislative cosponsorship networks in the us house and senate. Social Networks28(4), 454–465.
Fox WR and Lasker GW. 1983. The distribution of surname frequencies. International Statistical Review51, 81–87.
Franks J, Hallam-Baker P, Hostetler J, Lawrence S, Leach P, Luotonen A, and Stewart L. 1999. Http authentication: Basic and digest access authentication. RFC 2617. http://tools.ietf.org/html/rfc2617 (Last accessed December 14, 2013).
Freier A, Karlton P, and Kocher P. 2011. The secure sockets layer (ssl) protocol version 3.0. RFC 6101. http://tools.ietf.org/html/rfc6101 (Last accessed December 14, 2013).
Garfinkel S. 2002. Web Security, Privacy and Commerce. O'Reilly, Sebastopol, CA.
Gennick J. 2011. SQL Pocket Guide. 3rd edn. O'Reilly, Sebastopol, CA.
Ghomi AA, Shirzadi E, and Movassaghi A. 2013. Predicting the Academy Awards’ result by analyzing tweets. Global Journal of Science, Engineering and Technology. 8, 39–47.
Giles J. 2005. Internet encyclopae dias go head to head. Nature438, 900–901.
Gliozzo A, Strapparava C, and Dagan I. 2009. Improving text categorization bootstrapping via unsupervised learning. ACM Transactions on Speech and Language Processing6(1), 1–24.
Gourley D and Totty B. 2002. HTTP. The Definitive Guide. O'Reilly, Sebastopol, CA.
Grimmer J and Stewart BM. 2013. Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political Analysis21(3), 267–297.
Grolemund G and Wickham H. 2011. Dates and times made easy with lubridate. Journal of Statistical Software40(3), 1–25.
Harold ER and Means WS. 2004. XML in a Nuthsell: A Desktop Quick Reference. 3rd edn. O'Reilly, Sebastopol, CA.
Harrington JL. 2009. Relational Database Design and Implementation. Morgan Kaufmann Series in Data Management Systems. 3rd edn. Elsevier, Amsterdam.
Hemenway K and Calishain T. 2003. Spidering Hacks. O'Reilly, Sebastopol, CA.
Hintze JL and Nelson RD. 1998. Violin plots: A box plot-density trace synergism. The American Statistician52(2), 181–184.
Hogarth RM. 1978. A note on aggregating opinions. Organizational Behavior and Human Performance21, 40–46.
Holdener III AT. 2008. Ajax: The Definitive Guide. O'Reilly, Sebastopol, CA.
Holzner S. 2003. XPath Kick Start: Navigating XML with XPath 1.0 and 2.0. Sams Publishing, Indianapolis, IN.
Hopkins D and King G. 2010. A method of automated nonparametric content analysis for social science. American Journal of Political Science54(1), 229–247.
Hornik K, Buchta C, and Zeileis A. 2009. Open-source machine learning: R meets weka. Computational Statistics24(2), 225–232.
Hu M and Liu B. 2004. Mining and Summarizing Customer Reviews. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2004), August 2004, Seattle, WA.
James DA and DebRoy S. 2013. RMySQL: R Interface to the MySQL Database. Version 0.9-3. http://biostat.mc.vanderbilt.edu/RMySQL (Last accessed August 29, 2013).
Jurka TP, Collingwood L, Boydstun AE, Grossman E, and Atteveldt WV. 2013. Rtexttools: A supervised learning package for text classification. The R Journal5(1), 6–12.
Kahle D and Wickham H. 2013. ggmap: Spatial visualization with ggplot2. The R Journal5(1), 144–161.
Kriegel A and Trukhnov BM. 2008. SQL Bible. 2nd edn. John Wiley & Sons, Hoboken, NJ.
Leithner A, Maurer-Ertl W, Glehr M, Friesenbichler J, Leithner K, and Windhager R. 2010. Wikipedia and osteosarcoma: A trustworthy patients’ information? Journal of the American Medical Informatics Association17(4), 373–374.
Liu B. 2012. Sentiment Analysis and Opinion Mining. Morgan and Claypool, San Rafael, CA.
Liu B, Hu M, and Cheng J. 2005. Opinion Observer: Analyzing and Comparing Opinions on the Web. Proceedings of the 14th International World Wide Web Conference (WWW-2005), May 2005, Chiba, Japan.
Liu C and Albitz P. 2006. DNS and BIND. O'Reilly, Sebastopol, CA.
Manning CD, Paghavan P, and Schütze H. 2008. Introduction to Information Retrieval. Cambridge University Press, Cambridge.
McJones P, Bamford R, Blasgen M, Chamberlin D, Cheng J, Daudenarde JJ, Finkelstein S, Gray J, Jolls B, Lindsay B, Lorie R, Mehl J, Miller R, Mohan C, Nauman J, Pong M, Price T, Putzolu F, Schkolnick M, Selinger B, Selinger P, Slutz D, Traiger I, Wade B, and Yost B. 1997. The 1995 SQL reunion: People, projects, and politics. http://www.scs.stanford.edu/∼dbg/readings/SRC-1997-018.pdf (Last accessed October 30, 2013).
Meng Y. 2012. Sentiment analysis: A study on product features. Dissertation. University of Nebraska, Lincoln.
Mukherjee S and Bhattacharyya P. 2012. Feature specific sentiment analysis for product reviews. In: Computational Linguistics and Intelligent Text Processing. Gelbukh A, ed. Springer, Berlin. pp. 475–487.
Murrell P. 2009. Introduction to Data Technologies. Chapman & Hall/CRC, Boca Raton, FL.
Ray ET. 2003. Learning XML. 2nd edn. O'Reilly, Sebastopol, CA.
Reavley N, Mackinnon A, Morgan A, Alvarez-Jimenez M, Hetrick S, Killackey E, Nelson B, Purcell R, Yap M, and Jorm A. 2012. Quality of information sources about mental disorders: a comparison of Wikipedia with centrally controlled web and printed sources. Psychological Medicine42(8), 1753–1762.
Rector LH. 2008. Comparison of Wikipedia and other encyclopedias for accuracy, breadth, and depth in historical articles. Reference Services Review36(1), 7–22.
Richardson L, Amundsen M, and Ruby S. 2013. RESTful Web APIs. O'Reilly, Sebastopol, CA.
Ripley B and Lapsley M. 2013. RODBC: ODBC Database Access. Version 1.3-7, Lapsley participated from 1999 to 2002.
Schrenk M. 2012. Webbots, Spiders, and Screen Scrapers. A Guide to Developing Internet Agents with PHP/Curl , 2nd ed. No Starch Press, San Francisco, CA.
Temple Lang D. 2013c. XML: Tools for Parsing and Generating XML Within R and S-Plus. R package version 3.95-0.2. http://CRAN.R-project.org/package=XML
Temple Lang D, Keles S, and Dudoit S. 2012. RHTMLForms: Programmatically Create R Functions Corresponding to Web/HTML Forms. R package version 0.6-0. http://www.omegahat.org/RHTMLForms
Tennison J. 2001. XSLT and XPath on the Edge. John Wiley & Sons, Hoboken, NJ.
Torgo L. 2010. Data Mining with R: Learning with Case Studies. Chapman & Hall/CRC, Boca Raton, FL.
Tumasjan A, Sprenger TO, Sandner PG, and Welpe IM. 2011. Election forecasts with twitter. How 140 characters reflect the political landscape. Social Science Computer Review29(4), 402–418.
Witten IH and Frank E. 2005. Data Mining: Practical Machine Learning Tools and Techniques, 2nd ed. Morgan Kaufmann, San Francisco, CA.
Wong C. 2000. HTTP Pocket Reference. O'Reilly, Sebastopol, CA.
Yasuda N, Cavalli-Sforza L, Skolnick M, and Moroni A. 1974. The evolution of surnames: An analysis of their distribution and extinction. Theoretical Population Biology5(1), 123–142.
Zagibalov T and Carroll J. 2008. Automatic Seed Word Selection for Unsupervised Sentimen Classification of Chinese Text. Proceedings of the 22nd International Conference on Computational Linguistics, August 2008, Manchester, UK, pp. 1073–1080.
Zakas NC. 2010. High Performance JavaScript. O'Reilly, Sebastopol, CA.
Zhang Y, Friend AJ, Traud AL, Porter MA, Fowler JH, and Mucha PJ. 2008. Community structure in congressional cosponsorship networks. Physica A387(7), 1705–1712.
Zhao Y. 2012. R and Data Mining. Examples and Case Studies. Elsevier Academic Press, Waltham, MA.
Zumel N and Mount J. 2014. Practical Data Science with R. Manning, Greenwich, CT.