SlideShare une entreprise Scribd logo
1  sur  6
Télécharger pour lire hors ligne
Data Lake-based Approaches to Regulatory-
Driven Technology Challenges
How a Data Lake Approach Improves Accuracy and Cost
Effectiveness in the Extract, Transform, and Load Process
for Business and Regulatory Purposes
The concept of big data offers financial institutions an opportunity to build capabilities that both reduce costs
and produce better insight. In the area of regulatory compliance, the work required to prepare the organization
typically involves modifications to systems, process, and data to allow Collection, Alignment, Aggregation, and
Analysis (CA3) to occur. For example, new rules, such as Dodd Frank, over-the-counter (OTC) collateral, and
risk management requirements, rely on the same legal entity and customer data infrastructure that need to
be upgraded for Anti Money Laundering/Bank Secrecy Act, Sanctions, and Foreign Account Tax Compliance Act
(FATCA). Linking the data while limiting the modifications to the systems that underpin both the business and
compliance requirements improves performance for customer-facing platforms and regulatory compliance
systems alike.
The potential is real, but the volume, variety, and velocity of the data is growing so fast that it is outpacing the
ability of current tools to take full advantage of it. Much of the problem lies in the need to extensively prepare
the data before it can be analyzed. In parallel, the technologies and techniques underpinning Big Data have
matured to the point where they can address the challenge. While early uses focused on deriving insights from
very large pools of unstructured data, recent deployments have harnessed multiple tools, including advanced data
management, pattern recognition, and adaptive analytics, to address large-scale, high-accuracy, low-latency CA3 of
diverse, dispersed data.
Applying Robust Financial Intelligence and
Analytics to Stay Ahead
The Extract, Transform, and Load Challenge
For the past 30 years, traditional approaches to sharing and transferring data have all involved some type of
Extract, Transform, and Load (ETL) capability that extracts information from one format (database, silo, file, etc.)
and transforms it into another data format. The process then loads the data into the target system for use in a
set of predetermined analyses. While these approaches to handling data have served some organizations well in
the past, they have some notable drawbacks, which become more significant as the volume, variety, and velocity
of the data expands.
First and foremost, the process is resource intensive and requires
investments in high-cost tools to access the data. For example, each
time a new regulation is issued that calls for a new type of analytically
derived report, banks must initiate a dedicated IT project, often focused
on solving the data ingest issue. This portfolio of projects results in a
very large number of data warehouses, each with their own ETL process.
To use the diverse data warehouses calls for the creation of customized
Point-to-Point (PtP) solutions. These PtP solutions can certainly meet
the short-term goal, but often fail to scale up to meet longer-term
organizational goals. As banks move into the era of big data, this PtP
approach becomes overly complex and difficult to manage.
The Data Lake-based Approach
In stark contrast to the challenges presented by a point-to-point ETL approach, Booz Allen Hamilton, a leading
strategy and technology consulting firm, has found that a data lake-based approach to CA3 requirements is
scalable, extensible, and improves the range and sophistication of analyses that can be supported while providing
higher levels of data control and security.
A data lake-based approach takes advantage of the most recent developments in large-scale distributed
computing hardware/software to create an innovative way to ingest, index, and analyze massive amounts of
data in batch and real time that can scale to exabytes—without compromising integrity, cost-effectiveness,
or performance. The Data Lake Approach embeds business rules, often the result of policy and procedure
documentation for regulatory compliance, in the cell level data, allowing alignment, aggregation, and analysis to
occur rapidly and with far less upfront work by IT departments. With the data lake, an organization’s repository of
information—including structured and unstructured data—is consolidated in a single, large “table.” Every inquiry
can use the entire body of information stored in the data lake—and it is all available at the same time.
This approach, also referred to as “schema on read,” has five core features that can help banks address
increasingly demanding, constantly evolving regulatory requirements. In a data lake-based approach:
1.	 ETL is not done en-masse prior to the analysis. Data is ingested
rapidly in “raw” form, and the indices and relationships to support
the analysis are derived, enriched, and overlaid as needed—or
even executed at the time of the analysis, reducing the time to
operationalize data.
2.	 Unified queries can be created quickly to allow access across all
information sources, reducing the time and complexity involved in
creating and federating queries across multiple databases.
3.	 Multiple data sources can be more quickly fused to enable a very
high degree of data agility to compose new reports that meet
emerging requirements (e.g., new regulations).
4.	 Operations and management (O&M) complexity is significantly
reduced, with a corresponding drop in O&M costs, while creating
the basis for improved security and data management posture.
ETL
Transactions
FEDERATED QUERY
ETL
Transactions
ETL
Transactions
Tailored
Reporting
Tools
Transactions Transactions Transactions
Lightweight
Security
Tagging
Runtime
Creation of
Views
Figure 2. Advanced Data Lake-based Approaches
Figure 1. Traditional Point-to-Point Solution
5.	 The low-cost, streamlined ingestion process can be performed in near real-time, making the Data Lake
Approach a viable alternative for some requirements that would typically be addressed by implementing
Straight Through Processing platforms—at far less cost and disruption to the revenue-generating operations
of the bank.
Putting the Data Lake to Work
With the Data Lake Approach, it now becomes practical—in terms of time, cost, and analytic ability—to turn big
data into a powerful tool to deal with escalating regulatory challenges while meeting business demands. We can
now ask more far-reaching and complex questions, and find the often hidden patterns and relationships that can
lead to game-changing knowledge and insight. The Data Lake Concept is particularly well suited for challenges
that have one or more of the following characteristics:
1.	 Streaming analytics are performed on large-scale data sets
2.	 PtP data mart solutions are involved
3.	 The ETL requirement is data, not process heavy
While applying a big data approach to financial regulatory requirements may be innovative, it would not
experimental—Booz Allen has created data lake-based systems for more than a dozen government clients. Each
time we addressed a new class of problem, (e.g., Homeland Security, Defense) we used a prototype approach to
build/test/tailor the Data Lake Approach. We are prepared to work with your leadership team in a similar manner
to introduce this capability.
To launch a prototype project, we work with clients to:
•	 Identify a small set of business and regulatory critical applications as the basis for the prototype—basically,
a subset of projects in process that can be executed quickly to yield results
•	 Set up design requirements for information reporting requirements for internal/external users
•	 Mirror a set of real-world scenarios to create an analytics platform (i.e., a data lake) that we will use to
demonstrate the schema on read process against the critical applications identified above
•	 Develop a results summary on multiple levels (speed, cost, accuracy) and test the data for internal validity
and defensibility
Booz Allen knows that a clean-sheet approach is not feasible; any viable solution approach must be able to deal
with a diverse base of legacy systems and select from the existing portfolio of regulatory IT project requirements.
While such conditions can be challenging, by creating an isolated, parallel analytics platform, we are be able to
work with live data with no risk to the bank’s production systems.
“With the Data Lake Approach, it now becomes practical—in terms of time, cost, and
analytic ability—to turn big data into a powerful tool to deal with escalating regulatory
challenges while meeting business demands.
”
www.boozallen.com
About Booz Allen
Booz Allen Hamilton has been at the forefront of strategy and technology consulting for nearly a century.
Today, Booz Allen is a leading provider of management and technology consulting services to the US government
in defense, intelligence, and civil markets, and to major corporations, institutions, and not-for-profit organizations.
In the commercial sector, the firm focuses on leveraging its existing expertise for clients in the financial services,
healthcare, and energy markets, and to international clients in the Middle East. Booz Allen offers clients
deep functional knowledge spanning strategy and organization, engineering and operations, technology, and
analytics—which it combines with specialized expertise in clients’ mission and domain areas to help solve
their toughest problems.
The firm’s management consulting heritage is the basis for its unique collaborative culture and operating model,
enabling Booz Allen to anticipate needs and opportunities, rapidly deploy talent and resources, and deliver
enduring results. By combining a consultant’s problem-solving orientation with deep technical knowledge and
strong execution, Booz Allen helps clients achieve success in their most critical missions—as evidenced by
the firm’s many client relationships that span decades. Booz Allen helps shape thinking and prepare for future
developments in areas of national importance, including cybersecurity, homeland security, healthcare, and
information technology.
Booz Allen is headquartered in McLean, Virginia, employs approximately 25,000 people, and had revenue of
$5.86 billion for the 12 months ended March 31, 2012. For over a decade, Booz Allen’s high standing as a
business and an employer has been recognized by dozens of organizations and publications, including Fortune,
Working Mother, G.I. Jobs, and DiversityInc. More information is available at www.boozallen.com. (NYSE: BAH)
For more information, contact
Thomas Sanzone
Senior Vice President
sanzone_thomas@bah.com
917-305-8003
James Newfrock
Vice President
newfrock_jim@bah.com
917-305-8037	 	
Joshua Sullivan
Vice President
sullivan_joshua@bah.com
301-543-4611	 	
Albert Belman
Principal
belman_albert@bah.com
917-305-8002	 	
Michael Delurey
Principal
delurey_mike@bah.com
703-902-6858	 	
03.078.13

Contenu connexe

Tendances

Big data security and privacy issues in the
Big data security and privacy issues in theBig data security and privacy issues in the
Big data security and privacy issues in theIJNSA Journal
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveKun Le
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPeter Wang
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Thingspateelhs
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big dataMark Albala
 
Dealing with Dark Data
Dealing with Dark DataDealing with Dark Data
Dealing with Dark DataKazoup
 
IABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspectiveIABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspectiveMateusz Maj
 
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Edureka!
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsDenodo
 
Accelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data InitiativesAccelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data Initiatives☁Jake Weaver ☁
 
Expanded top ten_big_data_security_and_privacy_challenges
Expanded top ten_big_data_security_and_privacy_challengesExpanded top ten_big_data_security_and_privacy_challenges
Expanded top ten_big_data_security_and_privacy_challengesTom Kirby
 
Data Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentData Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentDenodo
 
Big Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesBig Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesGregg Barrett
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data Shallote Dsouza
 
Real callenges in big data security
Real callenges in big data securityReal callenges in big data security
Real callenges in big data securitybalasahebcomp
 

Tendances (20)

Big data security and privacy issues in the
Big data security and privacy issues in theBig data security and privacy issues in the
Big data security and privacy issues in the
 
Big Data
Big DataBig Data
Big Data
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep dive
 
1
11
1
 
Big Data (security Issue)
Big Data (security Issue)Big Data (security Issue)
Big Data (security Issue)
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data Analysis
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Things
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big data
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Dealing with Dark Data
Dealing with Dark DataDealing with Dark Data
Dealing with Dark Data
 
IABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspectiveIABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspective
 
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural Components
 
Accelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data InitiativesAccelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data Initiatives
 
Expanded top ten_big_data_security_and_privacy_challenges
Expanded top ten_big_data_security_and_privacy_challengesExpanded top ten_big_data_security_and_privacy_challenges
Expanded top ten_big_data_security_and_privacy_challenges
 
Data Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentData Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data Environment
 
Big Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesBig Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and Challenges
 
Taming the data beast
Taming the data beastTaming the data beast
Taming the data beast
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data
 
Real callenges in big data security
Real callenges in big data securityReal callenges in big data security
Real callenges in big data security
 

En vedette

Why We're Hungry for Remarkable Content
Why We're Hungry for Remarkable ContentWhy We're Hungry for Remarkable Content
Why We're Hungry for Remarkable ContentBlack Marketing
 
Brandhome speaks at Cannes Lions 2015 on Social Storytelling
Brandhome speaks at Cannes Lions 2015 on Social StorytellingBrandhome speaks at Cannes Lions 2015 on Social Storytelling
Brandhome speaks at Cannes Lions 2015 on Social StorytellingBrandhome
 
How to School Your Mind to Think Laterally
How to School Your Mind to Think LaterallyHow to School Your Mind to Think Laterally
How to School Your Mind to Think LaterallyÈspresso1882 Australia
 
Wearable Tech - Trends for 2016
Wearable Tech - Trends for 2016Wearable Tech - Trends for 2016
Wearable Tech - Trends for 2016Scott Eggertsen
 
The Future of Medical Education - Top Trends Likely to Have an Impact on the ...
The Future of Medical Education - Top Trends Likely to Have an Impact on the ...The Future of Medical Education - Top Trends Likely to Have an Impact on the ...
The Future of Medical Education - Top Trends Likely to Have an Impact on the ...Ogilvy Health
 
Beyond the Gig Economy
Beyond the Gig EconomyBeyond the Gig Economy
Beyond the Gig EconomyJon Lieber
 
Does That Belong Here? A Facebook Versus LinkedIn Infographic
Does That Belong Here? A Facebook Versus LinkedIn InfographicDoes That Belong Here? A Facebook Versus LinkedIn Infographic
Does That Belong Here? A Facebook Versus LinkedIn InfographicBoston Interactive
 
Venture capital investment trends May 2014
Venture capital investment trends May 2014Venture capital investment trends May 2014
Venture capital investment trends May 2014JLL
 
Startup Turkey 2014 VitalCV Pitch Slides
Startup Turkey 2014 VitalCV Pitch SlidesStartup Turkey 2014 VitalCV Pitch Slides
Startup Turkey 2014 VitalCV Pitch SlidesJoshgun Karimov
 
Dove- Evolution of a brand
Dove- Evolution of a brandDove- Evolution of a brand
Dove- Evolution of a brandSameer Mathur
 
ontology based- data_integration.
ontology based- data_integration.ontology based- data_integration.
ontology based- data_integration.AliAlJadaa
 
TH TRUE HEAL - Brand Love Key
TH TRUE HEAL - Brand Love KeyTH TRUE HEAL - Brand Love Key
TH TRUE HEAL - Brand Love KeyLong Nguyen
 
What 1 Million Tweets Taught Us About Tweeting Successfully
What 1 Million Tweets Taught Us About Tweeting SuccessfullyWhat 1 Million Tweets Taught Us About Tweeting Successfully
What 1 Million Tweets Taught Us About Tweeting SuccessfullyBuffer
 
The Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.com
The Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.comThe Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.com
The Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.comSemrush
 
10 Inconvenient Truths About Creating Branded Content with Influencers
10 Inconvenient Truths About Creating Branded Content with Influencers10 Inconvenient Truths About Creating Branded Content with Influencers
10 Inconvenient Truths About Creating Branded Content with InfluencersErica Ehm
 

En vedette (16)

Why We're Hungry for Remarkable Content
Why We're Hungry for Remarkable ContentWhy We're Hungry for Remarkable Content
Why We're Hungry for Remarkable Content
 
Brandhome speaks at Cannes Lions 2015 on Social Storytelling
Brandhome speaks at Cannes Lions 2015 on Social StorytellingBrandhome speaks at Cannes Lions 2015 on Social Storytelling
Brandhome speaks at Cannes Lions 2015 on Social Storytelling
 
How to School Your Mind to Think Laterally
How to School Your Mind to Think LaterallyHow to School Your Mind to Think Laterally
How to School Your Mind to Think Laterally
 
SAP World Cup Insights
SAP World Cup InsightsSAP World Cup Insights
SAP World Cup Insights
 
Wearable Tech - Trends for 2016
Wearable Tech - Trends for 2016Wearable Tech - Trends for 2016
Wearable Tech - Trends for 2016
 
The Future of Medical Education - Top Trends Likely to Have an Impact on the ...
The Future of Medical Education - Top Trends Likely to Have an Impact on the ...The Future of Medical Education - Top Trends Likely to Have an Impact on the ...
The Future of Medical Education - Top Trends Likely to Have an Impact on the ...
 
Beyond the Gig Economy
Beyond the Gig EconomyBeyond the Gig Economy
Beyond the Gig Economy
 
Does That Belong Here? A Facebook Versus LinkedIn Infographic
Does That Belong Here? A Facebook Versus LinkedIn InfographicDoes That Belong Here? A Facebook Versus LinkedIn Infographic
Does That Belong Here? A Facebook Versus LinkedIn Infographic
 
Venture capital investment trends May 2014
Venture capital investment trends May 2014Venture capital investment trends May 2014
Venture capital investment trends May 2014
 
Startup Turkey 2014 VitalCV Pitch Slides
Startup Turkey 2014 VitalCV Pitch SlidesStartup Turkey 2014 VitalCV Pitch Slides
Startup Turkey 2014 VitalCV Pitch Slides
 
Dove- Evolution of a brand
Dove- Evolution of a brandDove- Evolution of a brand
Dove- Evolution of a brand
 
ontology based- data_integration.
ontology based- data_integration.ontology based- data_integration.
ontology based- data_integration.
 
TH TRUE HEAL - Brand Love Key
TH TRUE HEAL - Brand Love KeyTH TRUE HEAL - Brand Love Key
TH TRUE HEAL - Brand Love Key
 
What 1 Million Tweets Taught Us About Tweeting Successfully
What 1 Million Tweets Taught Us About Tweeting SuccessfullyWhat 1 Million Tweets Taught Us About Tweeting Successfully
What 1 Million Tweets Taught Us About Tweeting Successfully
 
The Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.com
The Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.comThe Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.com
The Death of Trial and Error, SMX London 2013 by Sean Malseed of SEMrush.com
 
10 Inconvenient Truths About Creating Branded Content with Influencers
10 Inconvenient Truths About Creating Branded Content with Influencers10 Inconvenient Truths About Creating Branded Content with Influencers
10 Inconvenient Truths About Creating Branded Content with Influencers
 

Similaire à Data Lake-based Approaches to Regulatory-Driven Technology Challenges

December 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWA
December 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWADecember 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWA
December 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWACarsten Roland
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesCindy Irby
 
Stream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White PaperStream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White PaperImpetus Technologies
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
Next generation Data Governance
Next generation Data GovernanceNext generation Data Governance
Next generation Data GovernanceVladimiro Borsi
 
Evolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to Life
Evolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to LifeEvolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to Life
Evolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to LifeSG Analytics
 
Emerging database landscape july 2011
Emerging database landscape july 2011Emerging database landscape july 2011
Emerging database landscape july 2011navaidkhan
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...DataScienceConferenc1
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)Syaifuddin Ismail
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Materialobieefans
 
Advances And Research Directions In Data-Warehousing Technology
Advances And Research Directions In Data-Warehousing TechnologyAdvances And Research Directions In Data-Warehousing Technology
Advances And Research Directions In Data-Warehousing TechnologyKate Campbell
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleVasu S
 
Solution Architecture US healthcare
Solution Architecture US healthcare Solution Architecture US healthcare
Solution Architecture US healthcare sumiteshkr
 
Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Andrey Akulov
 
Enhancing Data Staging as a Mechanism for Fast Data Access
Enhancing Data Staging as a Mechanism for Fast Data AccessEnhancing Data Staging as a Mechanism for Fast Data Access
Enhancing Data Staging as a Mechanism for Fast Data AccessEditor IJCATR
 

Similaire à Data Lake-based Approaches to Regulatory-Driven Technology Challenges (20)

Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
December 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWA
December 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWADecember 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWA
December 2015 - TDWI Checklist Report - Seven Best Practices for Adapting DWA
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_services
 
Stream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White PaperStream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White Paper
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Next generation Data Governance
Next generation Data GovernanceNext generation Data Governance
Next generation Data Governance
 
Evolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to Life
Evolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to LifeEvolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to Life
Evolving Big Data Strategies: Bringing Data Lake and Data Mesh Vision to Life
 
Emerging database landscape july 2011
Emerging database landscape july 2011Emerging database landscape july 2011
Emerging database landscape july 2011
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
 
Advances And Research Directions In Data-Warehousing Technology
Advances And Research Directions In Data-Warehousing TechnologyAdvances And Research Directions In Data-Warehousing Technology
Advances And Research Directions In Data-Warehousing Technology
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | Qubole
 
Solution Architecture US healthcare
Solution Architecture US healthcare Solution Architecture US healthcare
Solution Architecture US healthcare
 
Data warehouse presentation
Data warehouse presentationData warehouse presentation
Data warehouse presentation
 
Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.
 
Enhancing Data Staging as a Mechanism for Fast Data Access
Enhancing Data Staging as a Mechanism for Fast Data AccessEnhancing Data Staging as a Mechanism for Fast Data Access
Enhancing Data Staging as a Mechanism for Fast Data Access
 

Plus de Booz Allen Hamilton

You Can Hack That: How to Use Hackathons to Solve Your Toughest Challenges
You Can Hack That: How to Use Hackathons to Solve Your Toughest ChallengesYou Can Hack That: How to Use Hackathons to Solve Your Toughest Challenges
You Can Hack That: How to Use Hackathons to Solve Your Toughest ChallengesBooz Allen Hamilton
 
Examining Flexibility in the Workplace for Working Moms
Examining Flexibility in the Workplace for Working MomsExamining Flexibility in the Workplace for Working Moms
Examining Flexibility in the Workplace for Working MomsBooz Allen Hamilton
 
Booz Allen's 10 Cyber Priorities for Boards of Directors
Booz Allen's 10 Cyber Priorities for Boards of DirectorsBooz Allen's 10 Cyber Priorities for Boards of Directors
Booz Allen's 10 Cyber Priorities for Boards of DirectorsBooz Allen Hamilton
 
Homeland Threats: Today and Tomorrow
Homeland Threats: Today and TomorrowHomeland Threats: Today and Tomorrow
Homeland Threats: Today and TomorrowBooz Allen Hamilton
 
Preparing for New Healthcare Payment Models
Preparing for New Healthcare Payment ModelsPreparing for New Healthcare Payment Models
Preparing for New Healthcare Payment ModelsBooz Allen Hamilton
 
The Product Owner’s Universe: Agile Coaching
The Product Owner’s Universe: Agile CoachingThe Product Owner’s Universe: Agile Coaching
The Product Owner’s Universe: Agile CoachingBooz Allen Hamilton
 
Immersive Learning: The Future of Training is Here
Immersive Learning: The Future of Training is HereImmersive Learning: The Future of Training is Here
Immersive Learning: The Future of Training is HereBooz Allen Hamilton
 
Nuclear Promise: Reducing Cost While Improving Performance
Nuclear Promise: Reducing Cost While Improving PerformanceNuclear Promise: Reducing Cost While Improving Performance
Nuclear Promise: Reducing Cost While Improving PerformanceBooz Allen Hamilton
 
Frenemies – When Unlikely Partners Join Forces
Frenemies – When Unlikely Partners Join ForcesFrenemies – When Unlikely Partners Join Forces
Frenemies – When Unlikely Partners Join ForcesBooz Allen Hamilton
 
Booz Allen Secure Agile Development
Booz Allen Secure Agile DevelopmentBooz Allen Secure Agile Development
Booz Allen Secure Agile DevelopmentBooz Allen Hamilton
 
Booz Allen Industrial Cybersecurity Threat Briefing
Booz Allen Industrial Cybersecurity Threat BriefingBooz Allen Industrial Cybersecurity Threat Briefing
Booz Allen Industrial Cybersecurity Threat BriefingBooz Allen Hamilton
 
Booz Allen Hamilton and Market Connections: C4ISR Survey Report
Booz Allen Hamilton and Market Connections: C4ISR Survey ReportBooz Allen Hamilton and Market Connections: C4ISR Survey Report
Booz Allen Hamilton and Market Connections: C4ISR Survey ReportBooz Allen Hamilton
 
Modern C4ISR Integrates, Innovates and Secures Military Networks
Modern C4ISR Integrates, Innovates and Secures Military NetworksModern C4ISR Integrates, Innovates and Secures Military Networks
Modern C4ISR Integrates, Innovates and Secures Military NetworksBooz Allen Hamilton
 
Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...
Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...
Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...Booz Allen Hamilton
 
Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Hamilton
 

Plus de Booz Allen Hamilton (20)

You Can Hack That: How to Use Hackathons to Solve Your Toughest Challenges
You Can Hack That: How to Use Hackathons to Solve Your Toughest ChallengesYou Can Hack That: How to Use Hackathons to Solve Your Toughest Challenges
You Can Hack That: How to Use Hackathons to Solve Your Toughest Challenges
 
Examining Flexibility in the Workplace for Working Moms
Examining Flexibility in the Workplace for Working MomsExamining Flexibility in the Workplace for Working Moms
Examining Flexibility in the Workplace for Working Moms
 
The True Cost of Childcare
The True Cost of ChildcareThe True Cost of Childcare
The True Cost of Childcare
 
Booz Allen's 10 Cyber Priorities for Boards of Directors
Booz Allen's 10 Cyber Priorities for Boards of DirectorsBooz Allen's 10 Cyber Priorities for Boards of Directors
Booz Allen's 10 Cyber Priorities for Boards of Directors
 
Inaugural Addresses
Inaugural AddressesInaugural Addresses
Inaugural Addresses
 
Military Spouse Career Roadmap
Military Spouse Career Roadmap Military Spouse Career Roadmap
Military Spouse Career Roadmap
 
Homeland Threats: Today and Tomorrow
Homeland Threats: Today and TomorrowHomeland Threats: Today and Tomorrow
Homeland Threats: Today and Tomorrow
 
Preparing for New Healthcare Payment Models
Preparing for New Healthcare Payment ModelsPreparing for New Healthcare Payment Models
Preparing for New Healthcare Payment Models
 
The Product Owner’s Universe: Agile Coaching
The Product Owner’s Universe: Agile CoachingThe Product Owner’s Universe: Agile Coaching
The Product Owner’s Universe: Agile Coaching
 
Immersive Learning: The Future of Training is Here
Immersive Learning: The Future of Training is HereImmersive Learning: The Future of Training is Here
Immersive Learning: The Future of Training is Here
 
Nuclear Promise: Reducing Cost While Improving Performance
Nuclear Promise: Reducing Cost While Improving PerformanceNuclear Promise: Reducing Cost While Improving Performance
Nuclear Promise: Reducing Cost While Improving Performance
 
Frenemies – When Unlikely Partners Join Forces
Frenemies – When Unlikely Partners Join ForcesFrenemies – When Unlikely Partners Join Forces
Frenemies – When Unlikely Partners Join Forces
 
Booz Allen Secure Agile Development
Booz Allen Secure Agile DevelopmentBooz Allen Secure Agile Development
Booz Allen Secure Agile Development
 
Booz Allen Industrial Cybersecurity Threat Briefing
Booz Allen Industrial Cybersecurity Threat BriefingBooz Allen Industrial Cybersecurity Threat Briefing
Booz Allen Industrial Cybersecurity Threat Briefing
 
Booz Allen Hamilton and Market Connections: C4ISR Survey Report
Booz Allen Hamilton and Market Connections: C4ISR Survey ReportBooz Allen Hamilton and Market Connections: C4ISR Survey Report
Booz Allen Hamilton and Market Connections: C4ISR Survey Report
 
CITRIX IN AMAZON WEB SERVICES
CITRIX IN AMAZON WEB SERVICESCITRIX IN AMAZON WEB SERVICES
CITRIX IN AMAZON WEB SERVICES
 
Modern C4ISR Integrates, Innovates and Secures Military Networks
Modern C4ISR Integrates, Innovates and Secures Military NetworksModern C4ISR Integrates, Innovates and Secures Military Networks
Modern C4ISR Integrates, Innovates and Secures Military Networks
 
Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...
Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...
Agile and Open C4ISR Systems - Helping the Military Integrate, Innovate and S...
 
Women On The Leading Edge
Women On The Leading Edge Women On The Leading Edge
Women On The Leading Edge
 
Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science
 

Dernier

Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 

Dernier (20)

Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 

Data Lake-based Approaches to Regulatory-Driven Technology Challenges

  • 1. Data Lake-based Approaches to Regulatory- Driven Technology Challenges How a Data Lake Approach Improves Accuracy and Cost Effectiveness in the Extract, Transform, and Load Process for Business and Regulatory Purposes
  • 2. The concept of big data offers financial institutions an opportunity to build capabilities that both reduce costs and produce better insight. In the area of regulatory compliance, the work required to prepare the organization typically involves modifications to systems, process, and data to allow Collection, Alignment, Aggregation, and Analysis (CA3) to occur. For example, new rules, such as Dodd Frank, over-the-counter (OTC) collateral, and risk management requirements, rely on the same legal entity and customer data infrastructure that need to be upgraded for Anti Money Laundering/Bank Secrecy Act, Sanctions, and Foreign Account Tax Compliance Act (FATCA). Linking the data while limiting the modifications to the systems that underpin both the business and compliance requirements improves performance for customer-facing platforms and regulatory compliance systems alike. The potential is real, but the volume, variety, and velocity of the data is growing so fast that it is outpacing the ability of current tools to take full advantage of it. Much of the problem lies in the need to extensively prepare the data before it can be analyzed. In parallel, the technologies and techniques underpinning Big Data have matured to the point where they can address the challenge. While early uses focused on deriving insights from very large pools of unstructured data, recent deployments have harnessed multiple tools, including advanced data management, pattern recognition, and adaptive analytics, to address large-scale, high-accuracy, low-latency CA3 of diverse, dispersed data. Applying Robust Financial Intelligence and Analytics to Stay Ahead
  • 3. The Extract, Transform, and Load Challenge For the past 30 years, traditional approaches to sharing and transferring data have all involved some type of Extract, Transform, and Load (ETL) capability that extracts information from one format (database, silo, file, etc.) and transforms it into another data format. The process then loads the data into the target system for use in a set of predetermined analyses. While these approaches to handling data have served some organizations well in the past, they have some notable drawbacks, which become more significant as the volume, variety, and velocity of the data expands. First and foremost, the process is resource intensive and requires investments in high-cost tools to access the data. For example, each time a new regulation is issued that calls for a new type of analytically derived report, banks must initiate a dedicated IT project, often focused on solving the data ingest issue. This portfolio of projects results in a very large number of data warehouses, each with their own ETL process. To use the diverse data warehouses calls for the creation of customized Point-to-Point (PtP) solutions. These PtP solutions can certainly meet the short-term goal, but often fail to scale up to meet longer-term organizational goals. As banks move into the era of big data, this PtP approach becomes overly complex and difficult to manage. The Data Lake-based Approach In stark contrast to the challenges presented by a point-to-point ETL approach, Booz Allen Hamilton, a leading strategy and technology consulting firm, has found that a data lake-based approach to CA3 requirements is scalable, extensible, and improves the range and sophistication of analyses that can be supported while providing higher levels of data control and security. A data lake-based approach takes advantage of the most recent developments in large-scale distributed computing hardware/software to create an innovative way to ingest, index, and analyze massive amounts of data in batch and real time that can scale to exabytes—without compromising integrity, cost-effectiveness, or performance. The Data Lake Approach embeds business rules, often the result of policy and procedure documentation for regulatory compliance, in the cell level data, allowing alignment, aggregation, and analysis to occur rapidly and with far less upfront work by IT departments. With the data lake, an organization’s repository of information—including structured and unstructured data—is consolidated in a single, large “table.” Every inquiry can use the entire body of information stored in the data lake—and it is all available at the same time. This approach, also referred to as “schema on read,” has five core features that can help banks address increasingly demanding, constantly evolving regulatory requirements. In a data lake-based approach: 1. ETL is not done en-masse prior to the analysis. Data is ingested rapidly in “raw” form, and the indices and relationships to support the analysis are derived, enriched, and overlaid as needed—or even executed at the time of the analysis, reducing the time to operationalize data. 2. Unified queries can be created quickly to allow access across all information sources, reducing the time and complexity involved in creating and federating queries across multiple databases. 3. Multiple data sources can be more quickly fused to enable a very high degree of data agility to compose new reports that meet emerging requirements (e.g., new regulations). 4. Operations and management (O&M) complexity is significantly reduced, with a corresponding drop in O&M costs, while creating the basis for improved security and data management posture. ETL Transactions FEDERATED QUERY ETL Transactions ETL Transactions Tailored Reporting Tools Transactions Transactions Transactions Lightweight Security Tagging Runtime Creation of Views Figure 2. Advanced Data Lake-based Approaches Figure 1. Traditional Point-to-Point Solution
  • 4. 5. The low-cost, streamlined ingestion process can be performed in near real-time, making the Data Lake Approach a viable alternative for some requirements that would typically be addressed by implementing Straight Through Processing platforms—at far less cost and disruption to the revenue-generating operations of the bank. Putting the Data Lake to Work With the Data Lake Approach, it now becomes practical—in terms of time, cost, and analytic ability—to turn big data into a powerful tool to deal with escalating regulatory challenges while meeting business demands. We can now ask more far-reaching and complex questions, and find the often hidden patterns and relationships that can lead to game-changing knowledge and insight. The Data Lake Concept is particularly well suited for challenges that have one or more of the following characteristics: 1. Streaming analytics are performed on large-scale data sets 2. PtP data mart solutions are involved 3. The ETL requirement is data, not process heavy While applying a big data approach to financial regulatory requirements may be innovative, it would not experimental—Booz Allen has created data lake-based systems for more than a dozen government clients. Each time we addressed a new class of problem, (e.g., Homeland Security, Defense) we used a prototype approach to build/test/tailor the Data Lake Approach. We are prepared to work with your leadership team in a similar manner to introduce this capability. To launch a prototype project, we work with clients to: • Identify a small set of business and regulatory critical applications as the basis for the prototype—basically, a subset of projects in process that can be executed quickly to yield results • Set up design requirements for information reporting requirements for internal/external users • Mirror a set of real-world scenarios to create an analytics platform (i.e., a data lake) that we will use to demonstrate the schema on read process against the critical applications identified above • Develop a results summary on multiple levels (speed, cost, accuracy) and test the data for internal validity and defensibility Booz Allen knows that a clean-sheet approach is not feasible; any viable solution approach must be able to deal with a diverse base of legacy systems and select from the existing portfolio of regulatory IT project requirements. While such conditions can be challenging, by creating an isolated, parallel analytics platform, we are be able to work with live data with no risk to the bank’s production systems.
  • 5. “With the Data Lake Approach, it now becomes practical—in terms of time, cost, and analytic ability—to turn big data into a powerful tool to deal with escalating regulatory challenges while meeting business demands. ”
  • 6. www.boozallen.com About Booz Allen Booz Allen Hamilton has been at the forefront of strategy and technology consulting for nearly a century. Today, Booz Allen is a leading provider of management and technology consulting services to the US government in defense, intelligence, and civil markets, and to major corporations, institutions, and not-for-profit organizations. In the commercial sector, the firm focuses on leveraging its existing expertise for clients in the financial services, healthcare, and energy markets, and to international clients in the Middle East. Booz Allen offers clients deep functional knowledge spanning strategy and organization, engineering and operations, technology, and analytics—which it combines with specialized expertise in clients’ mission and domain areas to help solve their toughest problems. The firm’s management consulting heritage is the basis for its unique collaborative culture and operating model, enabling Booz Allen to anticipate needs and opportunities, rapidly deploy talent and resources, and deliver enduring results. By combining a consultant’s problem-solving orientation with deep technical knowledge and strong execution, Booz Allen helps clients achieve success in their most critical missions—as evidenced by the firm’s many client relationships that span decades. Booz Allen helps shape thinking and prepare for future developments in areas of national importance, including cybersecurity, homeland security, healthcare, and information technology. Booz Allen is headquartered in McLean, Virginia, employs approximately 25,000 people, and had revenue of $5.86 billion for the 12 months ended March 31, 2012. For over a decade, Booz Allen’s high standing as a business and an employer has been recognized by dozens of organizations and publications, including Fortune, Working Mother, G.I. Jobs, and DiversityInc. More information is available at www.boozallen.com. (NYSE: BAH) For more information, contact Thomas Sanzone Senior Vice President sanzone_thomas@bah.com 917-305-8003 James Newfrock Vice President newfrock_jim@bah.com 917-305-8037 Joshua Sullivan Vice President sullivan_joshua@bah.com 301-543-4611 Albert Belman Principal belman_albert@bah.com 917-305-8002 Michael Delurey Principal delurey_mike@bah.com 703-902-6858 03.078.13