Preface
This IBM® Redpaper publication provides a comprehensive overview of the IBM Spectrum® Discover metadata management software platform. We give a detailed explanation of how the product creates, collects, and analyzes metadata.
Several in-depth use cases show examples of analytics, governance, and optimization. We also provide step-by-step information to install and set up the IBM Spectrum Discover trial environment.
More than 80% of all data that is collected by organizations is not in a standard relational database. Instead, it is trapped in unstructured documents, social media posts, machine logs, and so on. Many organizations face significant challenges in managing this deluge of unstructured data, such as:
Pinpointing and activating relevant data for large-scale analytics
Lacking the fine-grained visibility that is needed to map data to business priorities
Removing redundant, obsolete, and trivial (ROT) data
Identifying and classifying sensitive data
IBM Spectrum Discover is modern metadata management software that provides data insight for petabyte-scale file and object storage, both on premises and in the cloud. This software enables organizations to make better business decisions and to gain and maintain a competitive advantage.
IBM Spectrum Discover provides a rich metadata layer that enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of unstructured data. It improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.
Authors
This paper was produced by a team of specialists from around the world working at IBM Redbooks, Tucson Center.
Joe Dain is a Senior Technical Staff Member and Master Inventor in the IBM Systems Storage organization in Tucson, Arizona. He is currently on his 26th invention plateau and has over 100 patents issued and pending worldwide. Joe joined IBM in 2003 with a BS in Electrical Engineering and is the Chief Architect for IBM Spectrum Discover.
Norman Bogard is a Senior Technical Sales Specialist with the IBM Washington Systems Center and a member of the IBM Spectrum Storage™ Technical Leadership Team. He began his career in information technology in 1984 with Intel Corporation and came into IBM through the acquisition of Sequent Computers. His areas of expertise through the years include Ethernet networking, Storage Area Networks, Network Attached Storage, and unstructured data.
Isom Crawford Jr. is a Subject Matter Expert for Software Defined Infrastructure at IBM Washington Systems Center. He has over 20 years of experience in computer software product architecture and development. He holds a PhD in Mathematical Sciences from the University of Texas at Dallas and an MS in Applied Mathematics from Oklahoma State University. He has developed and delivered multiple technical training courses, holds nine patents, and has authored multiple publications, including Software Optimization for High Performance Computers (ISBN 0130170089).
Mathias Defiebre is a leading IBM expert for Analytics, Object Storage, and Data Protection with over 20 years of storage experience. From IBM’s EMEA Storage Competence Centre (ESCC), he provides support to customers through the Advanced Technical Skills (pre-sales support) and Lab Services channels (Implementations, Migrations, Health checks, Proofs of Concept, and Workshops). He graduated from the University of Cooperative Education Mannheim with a German Diploma in Information Technology Management and a Bachelor of Science. Mathias is also a Master Certified IT Specialist and an IBM Certified Specialist for TotalStorage™ Networking and Virtualization Architectures. He is an IBM Redbooks® author for several Storage Redbooks publications, including the IBM Software-Defined Storage Guide Redpaper publication.
Larry Coyne is a Project Leader at the International Technical Support Organization, Tucson, Arizona Center. He has over 35 years of IBM experience, with 23 in IBM storage software management. He holds degrees in Software Engineering from the University of Texas at El Paso and Project Management from George Washington University. His areas of expertise include client relationship management, quality assurance, development management, and support management for IBM Storage Management Software.
Thanks to the following people for their contributions to this project:
Nilesh Bhosale
Scott Brewer
Stephen Edel
Denver Hopkins
Stephen Moffitt
Guillermo Nolasco
Daithi Ocuinn
IBM Systems
Now you can become a published author, too!
Here’s an opportunity to spotlight your skills, grow your career, and become a published author—all at the same time! Join an IBM Redbooks residency project and help write a book in your area of expertise, while honing your experience using leading-edge technologies. Your efforts will help to increase product acceptance and customer satisfaction, as you expand your network of technical contacts and relationships. Residencies run from two to six weeks in length, and you can participate either in person or as a remote resident working from your home base.
Find out more about the residency program, browse the residency index, and apply online at:
Comments welcome
Your comments are important to us!
We want our papers to be as helpful as possible. Send us your comments about this paper or other IBM Redbooks publications in one of the following ways:
Use the online Contact us review Redbooks form found at:
Send your comments in an email to:
Mail your comments to:
IBM Corporation, IBM Redbooks
Dept. HYTD Mail Station P099
2455 South Road
Poughkeepsie, NY 12601-5400
Stay connected to IBM Redbooks
Find us on Facebook:
Follow us on Twitter:
Look for us on LinkedIn:
Explore new Redbooks publications, residencies, and workshops with the IBM Redbooks weekly newsletter:
Stay current on recent Redbooks publications with RSS Feeds: