Data-Products-For-Dummiespdf (1)
Data-Products-For-Dummiespdf (1)
Data-Products-For-Dummiespdf (1)
by Vishal Singh,
Ryo Komatsuzaki, and
Andrew Mott, MBA
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data Products For Dummies®, Starburst Special Edition
Published by
John Wiley & Sons, Inc.
111 River St.
Hoboken, NJ 07030-5774
www.wiley.com
Copyright © 2023 by John Wiley & Sons, Inc., Hoboken, New Jersey
No part of this publication may be reproduced, stored in a retrieval system or transmitted in any
form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise,
except as permitted under Sections 107 or 108 of the 1976 United States Copyright Act, without the
prior written permission of the Publisher. Requests to the Publisher for permission should be
addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ
07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/permissions.
Trademarks: Wiley, For Dummies, the Dummies Man logo, The Dummies Way, Dummies.com,
Making Everything Easier, and related trade dress are trademarks or registered trademarks of
John Wiley & Sons, Inc. and/or its affiliates in the United States and other countries, and may not
be used without written permission. All other trademarks are the property of their respective
owners. John Wiley & Sons, Inc., is not associated with any product or vendor mentioned in this
book.
For general information on our other products and services, or how to create a custom For
Dummies book for your business or organization, please contact our Business Development
Department in the U.S. at 877-409-4177, contact info@dummies.biz, or visit www.wiley.com/go/
custompub. For information about licensing the For Dummies brand for products or services,
contact BrandedRights&Licenses@Wiley.com.
ISBN 978-1-394-21837-0 (pbk); ISBN 978-1-394-21838-7 (ebk)
Publisher’s Acknowledgments
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Applying product management principles
to data
Chapter 1
Defining Data Products
M
aximizing value from data is a persistent business chal-
lenge. Among the latest contributions to the field of data
is the concept of data mesh — a decentralized, distributed
approach to enterprise data management — which brings the
concept of the data product into the mainstream. In this chapter,
we lift the lid on the idea of a data product and what it can do to
modernize your data analytics strategy.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Looking at Data Products and Data Mesh
The data mesh perspective provides a more precise definition
of a data product. Within this framework, a data product is a
self-contained entity composed of code responsible for data
collection, transformation, metadata definition, and infrastruc-
ture required to run the code, with the output being a dataset that
is valuable in and of itself. While not every organization is mature
enough or has the immediate need to adopt a data mesh approach,
the concept of treating data as a product remains essential.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Designing for value and usability
Chapter 2
Creating, Managing,
and Optimizing Data
Products
I
n this chapter, we focus on key areas for your successful data
product program: design, usability, scalability, and technology.
Dive in!
Discoverability
Design data products in such a way as to promote data discover-
ability, ensuring that users can find and access the data they need
easily:
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Metadata: Data products that include information about the
data help users to understand its contents, context and find
more easily. This metadata might include information about
the data source, the date it was collected, business relevance,
and other details that aid in comprehension.
»» Search functionality: Use keywords and descriptions that
are commonly used across your organization that can be
indexed and searched via tools of your choice. If your
organization utilizes a business glossary, applying the terms
will help ensure consistent discovery. Many data products
enable users to find specific data sets or topics. Optimize this
search functionality to provide relevant results quickly and
make it easier for users to find the needed data.
»» Data categorization: Tag data with standard categories and
subcategories such as domains and subject matter that are
familiar across your organization to enable quick filtration and
drilling down to data of relevance. Organize data products
into categories and subcategories, such as topic, data source,
date, or other relevant factors, making it easier for users to
browse the available data sets and find their needs.
»» User feedback: Collect user feedback about their experiences
using the product so that developers can continuously
improve the discoverability of data products.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Data consumption: Add charts, code snippets, and
examples of consumption and output that can help the
consumer reduce time to utilization.
»» Collaboration and sharing: Inspire a data products culture
by enabling users to collaborate on analysis and share
insights. This might include features such as commenting,
sharing, and publishing.
»» Training and support: Provide training and support for data
products to help users get up to speed with the product and
learn how to use it effectively. This might include online
tutorials, help documentation, and user forums.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
product that provides weather data could be combined with
a data product that provides traffic data to create a new
application that helps users plan their routes based on
weather and traffic conditions.
»» Programmatic Access (API): Data products that offer
Application Programming Interfaces (APIs) can be easily
integrated with other data products to provide new insights or
create new applications. For example, an organization could
create a custom application that combines data from multiple
data products to provide a unique customer experience.
Governing at scale
Platforms to manage data products help administrators by pro-
viding a common framework to centrally manage and automate
various data governance processes, improve data quality, and
enable security and compliance across the organization. Here are
some ways in which these platforms assist data admins in gov-
erning data products at scale:
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Reducing the cost of ownership
Data products reduce the total cost of ownership (TCO) of data
management by improving data efficiency, reducing manual
work, and streamlining processes, which helps organizations to
save time and money while improving overall business perfor-
mance. Here are some ways in which data products can assist in
reducing the TCO:
(continued)
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
(continued)
While LLMs drive the adoption of data products, factors such as the
usefulness of the data products, value proposition, effective commu-
nication strategies, and addressing data privacy concerns also play
significant roles. LLMs enhance the accessibility, usability, and user
experience of data products, thereby promoting their adoption.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Meeting engineers, product managers,
and analysts
Chapter 3
People and Processes
D
ata products serve as the units of exchange between the data
producers and data consumers. In this chapter, we explore
the crucial people involved: data platform engineers, data
product producers, and data consumers. Data products need to be
built atop a data product creation and exchange platform managed
by a data platform engineering team, which we also cover.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
operate within. These may include treatment of personally iden-
tifiable information (PII) data, API standards, and data product
templates and design standards.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Continuous process improvement: The two roles need to
work together to identify areas for improvement in data
product processes.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Having centralized policies with
decentralized governance
It stands to reason that data products are often an output of mul-
tiple data sources abstracted via a centralized data infrastructure.
While data lakes and warehouses may constitute a single source
of truth, this is, more often than not, an amalgamation of analyti-
cal data from multiple data sources. However, when considering
agility, the various domain data teams either need to be empow-
ered to manage their own corner of a centralized data infrastruc-
ture and the satellite operational and analytical data sources, or
they need to have the ability to manage via a centralized data gov-
ernance layer.
»» Metadata management
»» Data discovery
»» Data integration
»» Data delivery
»» Security and access control
Modern solutions have started to realize the ability for centralized
governance across a decentralized data ecosystem. Built around
federated query engines, they realize the ability to create innova-
tive federated data products, while dramatically decreasing the
need for data movement and eliminating the need for centralized
data infrastructure. The need to repetitively implement data gov-
ernance across each data source and compute resources in each
cloud region in each cloud vendor, compounded by each tool in
the data stack, is starting to become a story of the past.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Chapter 4
Ten Tips for Adopting
Data Products
T ake inspiration from the following ten tips for smoothly and
successfully adopting data products:
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
practices. With robust governance in place, businesses can
make confident decisions based on reliable data, fostering a
culture of data-driven decision-making across the organization.
»» Design for use — with consumers in mind: The design of
data products for customers should be based on a deep
understanding of their unique needs, preferences, and
workflows, resulting in maximum value and fostering user
adoption and engagement.
»» Reuse the same data products without making copies:
Data products that prioritize reuse without making multiple
copies offer significant advantages in terms of cost savings,
data consistency, and improved productivity. By leveraging
technologies like data virtualization, data lakes, and data
APIs, organizations can unlock the full potential of their data
assets without the need for redundant copies.
»» Incentivize production and usage for data producers and
data consumers — avoid shadow IT: By aligning data products
with user needs, providing adequate support and education,
and ensuring the security and usability of authorized solutions,
organizations can minimize the risks associated with shadow IT
while maximizing the benefits of data-driven decision-making.
»» Invest in Data Product Owner/Manager Role: The data
product owner/manager is crucial in overseeing the entire
life cycle of a data product. They act as the primary point of
contact and advocate for optimizing the product, ensuring
alignment between business objectives and product develop-
ment. The data product owner collaborates with stakeholders
to gather requirements, prioritize features, and make
data-driven decisions.
»» Remember that iteration is key: Through continuous
feedback and iterative development around refinement,
user-centric design, adaptation, testing, risk mitigation,
continuous learning, and user adoption, data products can
evolve, improve, and remain aligned with user needs and
changing requirements, ultimately driving their effectiveness
and value within the organization.
»» Invest in your organization’s culture: Adopting a data product
mindset can build a culture of informed decision-making and
continuous growth in your organization by promoting data-
driven decision-making, increasing transparency, encouraging
experimentation and innovation, facilitating collaboration,
enabling self-service analytics, celebrating successes, and
fostering continuous learning and improvement.
These materials are © 2023 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
WILEY END USER LICENSE AGREEMENT
Go to www.wiley.com/go/eula to access Wiley’s ebook EULA.