White Paper:: Three Open Blueprints For Big Data Success
White Paper:: Three Open Blueprints For Big Data Success
White Paper:: Three Open Blueprints For Big Data Success
Paper:
Three
Open
Blueprints
For
Big
Data
Success
Inside:
• Leverage
open
framework
and
open
source
• Kickstart
your
efforts
with
repeatable
blueprints
• Tailor
these
use
cases
for
your
enterprise
1
About This Paper
Enterprises are awash in more data than they can make sense of. This has given rise to the
current “Big Data” phenomenon, where the opportunity for sensemaking over data calls for new
solutions.
About Pentaho
Pentaho delivers a business analytics framework based on open concepts and open source
software, and they support open exchange of lessons learned by a vibrant community of users.
Their capturing of successful use cases seen across industry segments led to their production
of the Big Data Blueprints presented here.
2
the amount of data being stored, but not by shoving it into the warehouse, but by adding
Hadoop to house the additional data. Once you have Hadoop in the mix, the open framework of
Pentaho makes it easy to move data into Hadoop from external sources, move data bi-
directionally between the warehouse and Hadoop, as well as makes it easy to process data in
Hadoop. Again, this is a great place to start. It’s not as transformative to your business as the
other use cases can be, but it will build expertise and save you money.
Pentaho simplifies offloading to Hadoop and speeds development and deployment time by as
much as 15x versus hand-coding approaches. Complete visual integration tools eliminate the
need for hand coding in SQL or java-based MapReduce jobs. The objective is to Save data
costs and boost analytics performance.
3
There is no quicker or more cost-effective way to immediately get value from data
through integrated reporting, dashboards, data discovery and predictive analytics.
You should expect up to 15x data cost improvement with this approach.
This can help you turn Hadoop into a Valuable Multi-source Business Information Hub,
Just waiting to be queried. Pentaho’s agile data integration and analytics platform allows
you to stream data through Hadoop for transformation processing and immediately push
the refined data to any analytic databases. For the end-user, a rich set of data
discovery, reports, dashboards and visualizations are immediately available.
4
• An electronic marketing firm created a refinery architecture for delivering
personalized offers.
• Online campaign, enrollment, and transaction data is ingested via Hadoop,
processed and then sent on to an analytic database.
• A business analytics front-end includes reporting and ad hoc analysis for
business users.
5
can be huge. Don’t worry too much about getting the full 360-degree view at first;
starting with even one small slice can drive huge positive changes.
Integrating diverse data sources is simplified with Pentaho’s broad support for both big
and traditional data sources allowing the 360-degree view to be extended to external
and internal customer related data. The Pentaho platform scales as business grows,
enabling routing of governed blended, time-sensitive streams of data to be distributed to
customer-facing teams – in real-time empowering more productive and profitable
decisions.
6
Figure
3:
Customer
360-‐Degree
Assessments
Concluding Thoughts
The open Big Data Blueprints presented here can help you accelerate your projects by
giving you repeatable frameworks you can tailor to meet your needs. We hope they
help accelerate your implementation of enhanced data analysis capabilities and believe
they can accelerate the use of data in support of your mission.
Please give us your thoughts on these approaches, we would love to have your
feedback.
7
More Reading
For more federal technology and policy issues visit:
• CTOvision.com- A blog for enterprise technologists with a special focus on Big Data.
• J.mp/ctonews - Sign up for the government technology newsletters including the Government
Big Data Weekly.
Contact:
Bob Gourley
bob.gourley@cognitiocorp.com
CTOlabs.com
8