References

Abu-Halimeh A, Pullen D, Tudoreanu M.E. Perception of value-added through a visual join operation. 2013. International Conference on Information Quality. 2013 November 7–9, 2013, Little Rock, AR, pp. 326–337.

ANSI. Data quality part 110: Master data: Exchange of characteristic data: Syntax, semantic encoding, and conformance to data specification. 2009 International Standard ISO 8000-11:2009(E) First edition 2009-11-15. Downloaded from. ansi.org on January 3, 2012.

Baxter R, Christen P, Churches T. A comparison of fast blocking methods for record linkage. First Workshop on Data Cleaning, Record Linkage, and Object Consolidation. 2003 KDD-2003, Washington, DC, August 24–27, 2013.

Benjelloun O, Garcia-Molina H, Menestrina D, Su Q, Whang S.E, Widom J. Swoosh: A generic approach to entity resolution. The VLDB Journal. 2009;18(1):255–276.

Benjelloun O, Garcia-Molina H, Su Q, Widom J. Swoosh: A Generic Approach to Entity Resolution Stanford InfoLab Technical Report. 2005. dbpubs.stanford.edu/pub/2005-5.

Berson A, Dubov L. Master data management and data governance. New York, NY: McGraw Hill; 2011.

Bianco G.D, Galante R, Heuser C.A. A fast approach for parallel deduplication on multicore processors. SAC’11. 2011 March 21–25, 2011, TaiChung, Taiwan.

Borgman C, Siegfried S. Getty’s Synoname™ and its cousins: A survey of applications of personal name-matching algorithms. Journal of the American Society for Information Science. 1992;43(7):459–476.

Caballero I, Parody L, Bermejo I, Lopez T.G, Gasca R, Piattini M. Service level agreement for data quality governed by ISO 8000-1X0. 2014 The 19th International Conference on Information and Data Quality (ICIQ-2014). Xi’an, China, August 1–3, 2014, pp. 114–127.

Center for Identity. Identity threat assessment and prediction (ITAP). 2014 Available at: http://identity.utexas.edu/research/model.

Cervo D, Allen M. Master data management in practice: Achieving true customer MDM. Wiley; 2011.

Chen C, Hanna J, Talburt J.R, Brochhausen M, Hogan W.R. A demonstration of entity identity information management applied to demographic data in a referent tracking system. International Conference on Biomedical Ontology (ICBO 2013). 2013 Montreal, Canada, July 7–12, 2013, pp. 136–137.

Chen C, Mohammed M, Talburt J.R. Visualization tools for results of entity resolution. The 2013 International Conference on Information and Knowledge Engineering (IKE’13). 2013 Las Vegas, Nevada, July 22–25, 2013, CSREA Press, pp. 87–91.

Chiang C, Talburt J, Wu N, Pierce E, Heien C, Gulley E, Moore J. A case study in partial parsing unstructured text. Fifth International Conference on Information Technology: New Generations. 2008 Las Vegas, NV, IEEE Press, pp. 447–452.

Christen P. A comparison of personal name matching: techniques and practical issues. Sixth IEEE International Conference on Data Mining Workshops. 2006:290–294.

Christen P. Febrl – A freely available record linkage system with a graphical user interface. Proceedings of the Australian Workshop on Health Data and Knowledge Management (HDKM). 2008 Conferences in Research and Practice in Information Technology (CRPIT), Wollongong, Australia, January 2008, vol. 80.

Christen P. Data matching: Concepts and techniques for record linkage, entity resolution, and duplicate detection Springer. 2012.

Deaton R, Doan T, Schweiger T. Semantic data matching: Principles and performance. In: Chan Y, Talburt J, Talley T, eds. Data Engineering: Mining, Information and Intelligence. Springer; 2010:17–38.

Decker W, Liu F, Talburt J.R, Wang P, Wu N. A case study on data quality, privacy, and entity resolution. In: Yeoh W, Talburt J.R, Zhou Y, eds. Information Quality and Governance for Business Intelligence. 2013 IGI Global, pp. 66–87.

Doan A, Halevy A, Ives Z. Principles of data integration. Morgan Kaufmann. 2012.

Dreibelbis A, Eberhard H, Milman I, Oberhofer M, van Run P, Wolfson D. Enterprise master data management: An SOA approach to managing core information. IBM Press; 2008.

Dyché J, Levy E. Customer data integration: Reaching a single version of the truth. New York: Wiley; 2006.

English L. Improving data warehouse and business information quality: Methods for reducing costs and increasing profits. New York: Wiley; 1999.

Fellegi I, Sunter A. A theory for record linkage. Journal of the American Statistical Association. 1969;64(328):1183–1210.

Gibson N, Talburt J. Visualizing student growth: Applications of student growth models. Ninth Annual Conference on Applied Research in Information Technology. 2010 University of Central Arkansas, Conway, AR, April 9, 2010, pp. 9–13. research.acxiom.com/publications.

Hashemi R, Talburt J, Wang R. Significance test for the Talburt-Wang Similarity Index. In: Talburt J, Pierce E, Wu N, Campbell T, eds. 11th International Conference on Information Quality. Cambridge, MA: MIT IQ Publishing; 2006:125–132.

Heien C, Wu N, Talburt J. Methods to Measure Importance of Data Attributes to Consumers of Information Products. AMCIS 2010 Proceedings. 2010 Paper 582. http://aisel.aisnet.org/amcis2010/582.

Herzog T.N, Scheuren F.J, Winkler W.E. Data quality and record linkage techniques. New York: Springer; 2007.

Holland G, Talburt J. A framework for evaluating information source interactions. In: Hu C, Berleant D, eds. 2008 Conference on Applied Research in Information Technology. Conway, AR: University of Central Arkansas; 2008 pp. 13–19. http://research.acxiom.com/publications.html.

Holland G, Talburt J. An entity-based integration framework for modeling and evaluating data enhancement products. Journal of Computing Sciences in Colleges. 2010;24(5):65–73.

Holland G, Talburt J. q-Gram Tetrahedral Ratio (qTR) for approximate pattern matching. Ninth Annual Conference on Applied Research in Information Technology. 2010 University of Central Arkansas, Conway, AR, April 9, 2010, pp. 14–17. research.acxiom.com/publications.

Holmes D, McCabe C. Improving precision and recall for Soundex retrieval. In Proc. of the IEEE International Conference on Information Technology – Coding and Computing. 2002 Las Vegas, NV.

Huang K, Lee Y.W, Wang R.Y. Quality Information and Knowledge Management. Prentice Hall; 1999.

International Association for Information and Data Quality (IAIDQ). IQCPSM – Information Quality Certified Professional Available from. http://iaidq.org/iqcp/iqcp.shtml, 2014.

Isele R, Jentzsch A, Bizer C. Efficient multidimensional blocking for link discovery without losing recall. Fourteenth International Workshop on the Web and Databases. 2011 WebDB-2011, June 12, 2011, Athens, Greece.

Jaro M.A. Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida. Journal of the American Statistical Association. 1989;84(406):414–420.

Jonas J. To know semantic reconciliation is to love semantic reconciliation. 2007 Downloaded from: http://jeffjonas.typepad.com/jeff_jonas/2007/04/to_know_semanti.html on December 25, 2014.

Josang A, Pope S. User Centric Identity Management. 2005 In: Proceedings of AusCERT Conference.

Jugulum R. Competing with high-quality data: Concepts, tools, and techniques for building a successful approach to data quality. Wiley; 2014.

Juran J.M. Juran on leadership for quality. The Free Press; 1989.

Kardes H, Konidena D, Agarwal S, Huff M, Sun A. Graph-based approaches for organizational entity resolution in MapReduce. Proceedings of the TextGraphs-8 Workshop. 2013 October 18, 2013, Seattle, WA, pp. 70–78.

Kirsten T, Kolb L, Hartung M, Gross A, Kopche H, Rahm E. Data partitioning for parallel entity matching. Proceedings of the VLDB Endowment. 2010;Vol. 3 No. 2.

Kobayashi F, Talburt J.R. Probabilistic Scoring Methods to Assist Entity Resolution Systems Using Boolean Rules. The 2013 International Conference on Information and Knowledge Engineering (IKE’13). 2013 Las Vegas, Nevada, July 22–25, 2013, CSREA Press, pp. 101–107.

Kobayashi F, Talburt J.R. Decoupling Identity Resolution from the Maintenance of Identity Information. 11th Information and Knowledge Engineering Conference. 2014 July 21–24, 2014, Las Vegas, NV, pp. 349–354.

Kobayashi F, Talburt J.R. Improving the Quality of Entity Resolution for School Enrollment Data through Affinity Scores. 19th MIT International Conference on Information Quality. 2014 August 1–3, 2014, Xi’an, China.

Kobayashi F, Nelson E.D, Talburt J.R. Design consideration for identity resolution in batch and interactive architectures. International Conference on Information Quality (ICIQ 2011). 2011 Adelaide, Australia, 2011.

Kolb L, Thor A, Rahm E. Block-based load balancing for entity resolution with MapReduce. CIKM’11, October 24–28, 2011. Scotland: Glasgow; 2011 pp. 2397–2400.

Kotter J.P. Leading change. Harvard Business Review Press; 1996.

Landauer T.K, Foltz P.W, Laham D. Introduction to latent semantic analysis. Discourse Processes. 1998;25:259–284.

Lawley E. Building a health data hub. 2010 March 29, 2010. Nashville Post (online version, downloaded July 24, 2010).

Lee Y, Madnick S, Wang R, Wang F, Zhang H. A cubic framework for the Chief Data Officer: Succeeding in a world of big data. MIS Quarterly Executive. 2014 March 2014 (13:1).

Lee Y, Pierce E, Talburt J, Wang R, Zhu H. A curriculum for a master of science in information quality. The Journal of Information Systems Education. 2007;18(2):233–242.

Lee Y.W, Pipino L.L, Funk J.D, Wang R.Y. Journey to Data Quality. Cambridge, MA: MIT Press; 2006.

Levenshtein V. Binary Codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady. 1966;10(8):707–710.

Loshin D. Master data management. Knowledge Integrity, Inc; 2009.

Mahata D, Talburt J.R. A framework for collecting and managing entity identity information from social media. 19th MIT International Conference on Information Quality. 2014 August 1–3, 2014, Xi’an, China, pp. 216–233.

Maydanchik A. Data Quality Assessment. Technics Publications; 2007.

Mazzucchi-Augel P.N, Ceballos H.G. An alignment comparator for entity resolution with multi-valued attributes. 13th Mexican International Conference on Artificial Intelligence (MICAI),. 2014;8857(2):272–284 November 2014.

Mazzucchi-Augel P.N. An aggregation and alignment operator to solve the entity matching problem. Master’s thesis, Instituto Tecnológico y de Esudios Superiores de Monterrey. 2014 Mexico, December 2014.

McGilvray D. Executing Data Quality Projects: Ten Steps to Quality Data and Trusted Information. Morgan Kaufmann; 2008.

Menestrina D, Whang S.E, Garcia-Molina H. Evaluating entity resolution results. Proceedings of the VLDB Endowment. 2010;Vol. 3 No. 1.

Naumann F, Herschel M. An introduction to duplicate detection. Synthesis Lectures on Data Management. 2010 Morgan and Claypool Publishers.

Nelson E, Talburt J. Improving the quality of law enforcement information through entity resolution. In: Hu C, Berleant D, eds. 2008 Conference on Applied Research in Information Technology. Conway, AR: University of Central Arkansas; 2008:113–118. http://research.acxiom.com/publications.html.

Nelson E, Talburt J. Entity resolution for longitudinal studies in education using OYSTER. Proceedings: 2011 Information and Knowledge Engineering Conference (IKE 2011). 2011 Las Vegas, NV, July 18–20, 2011, pp. 286–290.

Oberhofer M, Hechler E, Milman I, Schumacher S, Wolfson D. Beyond Big Data: Using social MDM to drive deep customer insight. IBM Press; 2014.

Odell M, Russell R. U.S. patent number 1,261,167. Washington, DC: U.S. Patent Office; 1918.

Osesina I, Talburt J. A data-intensive approach to named entity recognition combining contextual and intrinsic indicators. International Journal of Business Intelligence Research. 2012;3(1):55–71.

Papadakis G, Ioannou E, Niederée C, Palpanas T, Nedjl W. WSDM’12 February 8–12, 2012, Seattle, WA, pp. 53–62. 2012.

Penning M, Talburt J.R. Information quality assessment and improvement of student information in the university environment. The 2012 International Conference on Information and Knowledge Engineering (IKE’12). 2012 Las Vegas, Nevada, July 16–29, 2012, pp. 351–357.

Philips L. The double-metaphone search algorithm. 2000 C/C++ User’s Journal, 18(6).

PiLog. Master data quality solutions. 2014 Website available at: http://www.pilog.in/.

Power D, Hunt J. The 8 worst practices in master data management and how to avoid them. 2013 White paper downloaded from: http://www.informationbuilders.com on December 22, 2014.

Power D, Lyngsø. Multidomain MDM – Why it’s a superior solution. Inside Analysis online newsletter. 2013 on Downloaded from: http://insideanalysis.com/2013/08/multidomain-mdm/ on December 22, 2014.

Provost F, Fawcett T. Data science for business: What you need to know about data mining and data-analytic thinking O’Reilly. 2013.

Pullen D. Developing and refining matching rules for entity resolution. 2012 International Conference on Information and Knowledge Engineering (IKE’12). 2012;2012 Las Vegas, NV, pp. 345–350.

Pullen D, Wang P, Talburt J.R, Wu N. A false positive review indicator for entity resolution systems using Boolean rules. The 18th International Conference on Information Quality (ICIQ-2013). 2013 University of Arkansas at Little Rock, November 7–9, 2013, pp. 26–36.

Pullen D, Wang P, Wu N, Talburt J.R. Mitigating data quality impairment on entity resolution errors in student enrollment data. 2013 Information and Knowledge Engineering Conference. 2013 July 21–24, 2013, Las Vegas, NV, pp. 96–100.

Rand W.M. Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association. 1971;66:846–850.

Redman T.C. Data quality for the information age. Artech House; 1996.

Redman T.C. The impact of poor data quality on the typical enterprise. Communications of the ACM. 1998;41(2):79–82.

Redman T.C. Data driven: Profiting from your most important business asset. Boston, MA: Harvard Business Press; 2008.

Schumacher S. The need for accuracy in today’s data world. Database Trends and Applications (online newsletter). 2010 Downloaded from: http://www.dbta.com on December 28, 2014.

Sebastian-Coleman L. Measuring data quality for ongoing improvement. Morgan Kaufmann; 2013.

Sedgewick R, Wayne K. Algorithms. Fourth Edition. Addison Wesley; 2011.

Shannon C.E. A mathematical theory of communication. Bell System Technical Journal. 1948.

Soares S. Big Data governance: An emerging imperative. MC Press Online; 2013.

Soares S. IBM InfoSphere: A platform for Big Data governance and process data governance. MC Press Online; 2013.

Soares S. Data governance tools: Evaluation criteria, Big Data governance, and alignment with enterprise data management. MC Press Online; 2014.

Sørensen H.L. The Liliendahl 101 on MDM. 2011 Downloaded from: http://liliendahl.com/mdm-notes on December 22, 2014.

Sørensen H.L. Beyond True Positives in Deduplication. Blog Post. 2012 Downloaded from: http://liliendahl.com/2012/11/20/beyond-true-positives-in-deduplication on December 22, 2014.

Syed H, Talburt J.R, Liu F, Pullen D, Wu N. Developing and refining matching rules for entity resolution. The 2012 International Conference on Information and Knowledge Engineering (IKE’12). 2012 Las Vegas, Nevada, July 16–29, 2012, pp. 345–350.

Taguchi G, Chowdhury S, Wu Y. Taguchi's Quality Engineering Handbook In: Part III: Quality Loss Function. Wiley-Interscience, NJ; 2005 2005, pp. 171 –98.

Talburt J, Hashemi R. A formal framework for defining entity-based, data source integration. In: Arabnia H, Hashemi R, eds. 2008 International Conference on Information and Knowledge Engineering. Las Vegas, NV: CSREA Press; 2008:394–398.

Talburt J, Nelson E. CoDoSA: A light-weight, XML framework for integrating unstructured textual information. 15th Americas Conference on Information Systems. 2009 San Francisco, CA, AIS Electronic Library (aisel.asnet.org), Paper 489.

Talburt J, Zhou Y. OYSTER: An open source entity resolution system supporting identity information management. ID360 – The Global Forum on Identity. 2012 Austin, TX, April 23–24, 2012, Best Paper Award, pp. 69–86.

Talburt J, Zhou Y. A practical guide to entity resolution with OYSTER. In: Sadiq Shazia, ed. Handbook on Research and Practice in Data Quality. Springer; 2013:235–270.

Talburt J, Kuo E, Wang R, Hess K. An algebraic approach to data quality metrics for customer recognition. In: Chengular-Smith S, Raschid L, Long J, Seko C, eds. 9th International Conference on Information Quality. Cambridge, MA: MIT IQ Publishing; 2004:234–247.

Talburt J, Morgan C, Talley T, Archer K. Using commercial data integration technologies to improve the quality of anonymous entity resolution in the public sector. In: Naumann F, Gertz M, Madnick S, eds. 10th International Conference on Information Quality. Cambridge, MA: MIT IQ Publishing; 2005:133–142.

Talburt J, Wang R, Hess K, Kuo E. An algebraic approach to data quality metrics for entity resolution over large datasets. In: Al-Hakim L, ed. Information quality management: Theory and applications. Hershey, PA: Idea Group Publishing; 2007:1–22.

Talburt J, Zhou Y, Shivaiah S. SOG: A synthetic occupancy generator to support entity resolution instruction and research. 2009 International Conference on Information Quality. 2009 Potsdam, Germany, November 2009, pp. 91–105.

Talburt J.R. Entity resolution and information quality. Morgan Kaufmann; 2011.

Talburt J.R. Overview: The criticality of entity resolution in data and information quality. The ACM Journal of Data and Information Quality (JDIQ),. 2013;Vol. 4 No. 2, pp. 6:1–2.

Wang P, Pullen D, Talburt J.R, Wu N. Iterative approach to weight calculation in probabilistic entity resolution. 2014 International Conference on Information Quality. 2014 August 1–3, 2014, Xi'an, China.

Wang P, Pullen D, Talburt J.R, Wu N. Probabilistic matching compared to deterministic matching for student enrollment records. 2014 International Conference on Information Technology: New Generation. 2014 April 7–9, 2014, Las Vegas, NV, pp. 355–359.

Wang R.Y. A product perspective on total data quality management. Communications of the ACM. 1998;41(2):58–65.

Wang R.Y, Strong D.M. Beyond accuracy: What data quality means to consumers. Journal of Management Information Systems. 1996;12(4):5–34.

Winkler W.E. Using the EM algorithm for weight computation in the Fellegi–Sunter model of record linkage. Journal of the American Statistical Association, Proceedings of the Section on Survey Research Methods. 1988:667–671.

Winkler W.E. Methods for adjusting for lack of independence in an application of the Fellegi-Sunter Model of record linkage. Survey Methodology. 1989;15:101–117.

Winkler W.E. Near automatic weight computation in the Fellegi-Sunter Model of record linkage. Proceedings of the Fifth Census Bureau Annual Research Conference. 1989:145–155.

Winkler W.E. The state of record linkage and current research problems. 1999 Statistics of Income Division, Internal Revenue Service Publication R99/04.

Wu N, Talburt J, Heien C, Pippenger N, Chiang C, Pierce E, et al. A method for entity identification in open source documents with partially redacted attributes. The Journal of Computing Sciences in Colleges. 2007;22(5):138–144.

Yancey W. BigMatch: A program extracting possible matches from a large file. Research Report Series (Computing #2007-1). Washington, DC: Statistical Research Division, U.S. Census Bureau; 2007.

Yonke C.L, Walenta C, Talburt J.R. The job of the information/data quality professional. Industry Report from the International Association for Information and Data Quality. 2012 Retrieved from: http://iaidq.org/publications/yonke-2011-02.shtml.

Zhou Y, Talburt J.R. Entity identity information management. International Conference on Information Quality 2011. 2011 Adelaide, Australia, November 18–20, 2011, electronic proceedings at. http://iciq2011.unisa.edu.au/doc/ICIQ2011_Proceeding_Nov.zip.

Zhou Y, Talburt J. Staging a Realistic Entity Resolution Challenge for Students. Journal of Computing Sciences in Colleges. 2011;26(5):88–95.

Zhou Y, Talburt J. The role of asserted resolution in entity identity information management. Proceedings: 2011 Information and Knowledge Engineering Conference (IKE 2011). 2011 Las Vegas, NV, July 18–20, 2011, pp. 291–296.

Zhou Y, Talburt J.R. Strategies for large-scale entity resolution based on inverted index data partitioning. In: Talburt J, Yeoh W, Zhou Y, eds. Information Quality and Governance for Business Intelligence. IGI Global; 2014 pp. 329–151.

Zhou Y, Kooshesh A, Talburt J. Optimizing the accuracy of entity-based data integration of multiple data sources using genetic programming methods. International Journal of Business Intelligence Research. 2012;3(1):72–82.

Zhou Y, Nelson E.D, Kobayashi F, Talburt J.R. A graduate-level course on entity resolution and information quality: A step toward ER education. Journal of Data and Information Quality (JDIQ). 2013 Special Issue on Entity Resolution, Vol. 4, No. 2, March 2013, Article No. 10.

Zhou Y, Talburt J, Nelson E. The interaction of data, data structures, and software in entity resolution systems. Software Quality Professional. 2011;13(4):32–41.

Zhou Y, Talburt J, Su Y, Yin L. OYSTER: A tool for entity resolution in health information exchange. 5th International Conference on the Cooperation and Promotion of Information Resources in Science and Technology (COINFO’10). 2010 Beijing, China, November 27–29, 2010, pp. 356–362.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.217.147.193