SlideShare une entreprise Scribd logo
1  sur  45
http://www.virtus-it.com
A trusted partner
Software to model &
build Business
Intelligence &
Big Data Solutions
http://www.virtus-it.com
Table of Contents
• Big Data Overview………..………………………………………………………… 3 - 6
• Step by Step approach to building Big Data Solutions……………….. 7 - 10
• StreamCentral Introduction……………………………………..………….... 11 - 21
• Keeping it structured? –Extending current DW/BI investments………. 22 - 25
• An approach to building Operational Intelligence solutions………….... 26 - 28
• StreamCentral additional details……………………………………..………….... 29 - 37
• StreamCentral physical architecture……………………………..………….... 38 - 40
• How StreamCentral fits in an enterprise technology architecture…… 41 - 44
2
http://www.virtus-it.com
10101010101010101010101010101010101010101010101ABC010
1010101010101010101010100101010101010101ABCABCABCABC
1
0
1
0
1
0
1
0
1
0
1
0
3
BIGData
Today With Big Data
Custom Application
ERP
• Analysis of structured data from
internal applications
• Data sets updated using batch
processes
• Traditional BI & Data Warehousing
• Traditional BI and data
warehousing extended to
include structured and
unstructured data from internal
and external applications
processed in real-time or in
batch.
• Ability to predict events as well
as analyze historical
associations in wide sets of data
for patterns and trends.
Data Management Discovery & Analysis Event Detection
Big Data Solution
Modeling & Model
Deployment
Stream Processing &
Batch Data
Acquisition
Blocks for building Big Data Solutions
ERP
Internal & External Applications Data Stores
1
0
1
0
1
0
Real-Time + Batch Big Data Processing Layer
Real-time event data in
operational applications
Pattern, trend and
association analysis
Understand the connections in a wide variety of data that impact
business performance and use that knowledge to deliver exceptional
business results
http://www.virtus-it.com
Understanding business performance with Big
Data includes two distinct capabilities:
Managing performance by analyzing
internal and external, structured and
unstructured data for patterns and
associations collected over time
• Customer segmentation based on buying
history patterns and finding associations
with population, census and twitter data
to develop marketing strategy
• Web analytics to improve marketing
campaigns and relevant content
• Sales pipeline analysis compared to
industry data to understand the right
goals to set
• Cash flow analysis to make capital
investment decisions considering
external variables
Managing performance by analyzing real-
time data for day to day events –
Operational Intelligence
• What is the current workload?
• Is staff available and working on
high priority work?
• What factors are impacting
customer experience right now?
• What processes are taking longer
than expected?
4
A few example scenarios: A few example scenarios:
http://www.virtus-it.com
Telco’s Core IMS
Network Data
Data, Voice & Video
Performance Data
Data, Voice & Video
Performance Data
Data from
Telco Towers
Weather Data
Traffic
IncidentsPopulation
Data
Data Stream
weatherundergrou
nd
MapquestUSA Today Census
data
Sources of real
time streaming
data from
networks,
devices, services
and other
internal
applications
External sources
of data that add
understanding of
what’s happening
when events are
detected
Example Big Data Solutions: Telco
Network
Test
New
Service –
Investment
Planning
Adaptive
Bit Rate –
Video
Streaming
QoE
360o
Customer
QoE for
1st Level
customer
service
Video QoE
for IPTV
Business
Solutions
5
http://www.virtus-it.com
Telco’s Core IMS
Network Data
Data, Voice & Video
Performance Data
Data, Voice & Video
Performance Data
Data from
Telco Towers
Weather Data Traffic IncidentsPopulation Data
Data Stream
weatherunderground INRIXUSA Today Census data
Example Big Data Scenario : Utilities - Water
Sources of real
time data
relating to
your business
Sources of
BIG DATA
relevant to
your business
VIBRATION
SENSOR
ENERGY
HARVESTING
WATER MAIN
PRESSURE
SCADA
NETWORK
6
http://www.virtus-it.com 7
Steps it takes to build
powerful Big Data
solutions!
Solution
Modeling
Model
Deployment
Data
Acquisition
(streaming or batch, internal or
external, structured or unstructured)
Data
Management
Event
Detection
Discovery &
Analysis
Big Data Solution Lifecycle
Start here
http://www.virtus-it.com 8
Solution Modeling
•Logical Data Model
design
•Data standardization
& transformation
modeling
•Key Performance
Indicator modeling
via business rules
•Dimensional
modeling
•Historical Data Mart
Modeling
•Event detection
modeling via
business rules
•Real-time analytics
data mart modeling
Model Deployment
•Physical Design
Implementation
•Physical deployment
of dimensional
model
•Database
deployment
•Physical deployment
of data marts
•Rules deployment
Data Acquisition
•Data from internal
data sources
•Data from external
sources
•Streaming data
•Batch data
•Structured Data
•Unstructured Data
•Data transformation
•Data standardization
Data Management
•Structured Data
Storage
•Unstructured Data
Storage
•Scalability
•Performance
Event Detection
•Detecting events on
streaming data
•Alerting
•Integration with
operational
applications
Discovery & Analysis
•Information
Discovery
•Data Classification
•Analytics
•Querying
•Visualization
Solution
Modeling
Model
Deployment
Data
Acquisition
Data
Management
Event
Detection
Discovery&
Analysis
Big Data Solution Lifecycle – Tasks Detailed
http://www.virtus-it.com
9
Solution
Modeling
Model
Deployment
Data
Acquisition
Data
Management
Event
Detection
Discovery &
Analysis
1. Hadoop - MapReduce
2. MPP Columnar Databases like Neteeza, Vertica, ParStream
3. NoSQL – MongoDB, Cassandra
4. Evolution of traditional RDBMS to support column indexes
SQL Server
Big Data Innovations in Data Management
Big Data Innovations in
Discovery & Analysis
Where has the
innovation been
in Big Data?
The last few years have seen
lots of innovation in Data
Management as well as
Discovery and Analysis
http://www.virtus-it.com
Solution
Modeling
Model
Deployment
Data
Acquisition
Data
Management
Event
Detection
Discovery &
Analysis
Big Data Lifecycle
But, where is the innovation in
these areas?
• Fragmented, point use or lack of
industry strength technology to aid in
Design, Model Deployment, Data
Acquisition and Event Detection
makes it difficult, time consuming
and specialist resource intensive to
build Big Data Solutions
• What is the use of having scalable
platforms that can store and manage
this data and tools that can deliver
incredible visualizations when the
effort to get the data right is still a
problem as it has always been?
http://www.virtus-it.com 11
Introducing
StreamCentral
http://www.virtus-it.com
Solution
Modeling
Model
Deployment
Data
Acquisition
Data
Management
Event
Detection
Discovery &
Analysis
Big Data Solution Life cycle
1. StreamCentral Solutions Designer makes it easy to
model traditional BI/DW and Big Data solutions
2. Builds and deploys model on HP Vertica or Microsoft
SQL Server
3. Adds context by connecting all streaming and static
data to time, location and entities
4. StreamCentral Big Data Server, horizontally scalable,
executes the model definition in real-time
5. StreamCentral drastically reduces time to market, risk
and cost in building Big Data solutions!
Software to design &
build BI & Big Data
SolutionsStreamCentral enables you to quickly move from a blank
sheet of paper to a production system, comprehensive and
powerful that can be delivered without a large investment
in specialist skills.
http://www.virtus-it.com 13
1010101010101010
ABCABCABCABCABC
StreamCentral Workbench:
Solution Designer
StreamCentral
Workbench:
Model DeploymentData
Collection
Data
Processing
Correlation
Data
Publishing
Data
Security
StreamCentral Big Data Server
StreamCentral has three main components:
1. Use the Workbench Designer to define source data, entities,
rules for monitoring conditions, events and data correlation,
analytical models and knowledgebase
2. Workbench Model Deployment configures, builds and deploys
the model on top of HP Vertica or Microsoft SQL Server
3. Big Data Server executes the defined model in real-time
1
2
3
http://www.virtus-it.com
Database
REST/SOAP
API
LDAP
PUSH
API
Data Processing
Engine
Vertica SQL Server
Correlation EngineCollector
Data Publishing, Access
and Security
• Capture data
• Validate data
• Prepare data
• Apply transformations
• Perform calculations
• Determine conditions & KPIs
• Identify & build dimensions
• Identify alerts
• Correlate incoming data based
on defined rules
• Detect events based on
correlated data
• Update fact data
• Update entity & dimension data
• Update analysis collections
• Update event collections
• Manage data level security
Data Acquisition –
Push / Pull data from
variety of sources
Design data
transformations
Conditions & KPI
modeler via rules
builder
Real-time data
correlation
Event detection
via rules builder
Real-time data
mart designer
360o data mart
designer
Define entities and
Import Entity Data
Dimension
modeler
Data Security
designer
StreamCentral Big Data Server
StreamCentral Workbench: Big Data Solution Designer
Meta Data
Create Database
Structure
Add Context
StreamCentral Workbench: Big Data Solution Deployment
http://www.virtus-it.com 15
Model Pull
Data
Sources with
strong REST,
SOAP & DB
Support Push Data
API
Data
Transformat
-ion
Model
Entities &
import static
data
Dimension
modeler
Time &
Location
Standard-
ization
Conditions &
KPI modeler
Correlation
Modeler
Event
Detection
rules on
real-time
data
Real-time &
Historical
analytics
Data mart
modeler
• Software targeted to be used by IT and non IT people to
design and build Big Data solutions
• Can work with batch data (as in traditional Business
Intelligence) or real-time streams (as in Operational
Intelligence)
• Workbench lets analysts model all necessary steps in
building a Big Data Solution
• Data Pull/Push
• Model Transformations
• Model Entities (like customers, patients, products),
import static entity data and define entity relationships to
source data
• Shared dimensions across data
• Condition modeler via business rules to monitor specific
sets of conditions in batch or streaming data
• Evaluate different entities with different sets of
conditions as data flows in
• Specify rules to model how to correlate data streams in
real-time
• Event detection
• Model data marts that aggregate the right data for
association and pattern analysis
StreamCentral Workbench : Software to design
traditional BI/DW & Big Data Solutions
Workbench
http://www.virtus-it.com 16
Generating insights from data requires context to be
added to the data. This context is a continuous
thread that connects all types of data throughout the
Big Data Solution lifecycle. Four typical examples of
context..
Insight
Who (entities
like customer,
patient)
When (time)
Where
(location)
What
(streaming &
static data
correlation)
• StreamCentral automatically builds
and maintains time and location
dimensions
• Entities can be created and defined in
StreamCentral
• All data in StreamCentral is
continuously and automatically
connected to time, location and
defined entities
• Resultant real-time events and
analytical data marts automatically
inherit this context without need for
any programming or development
work
• This increases the impact and value
of collected data
Converting data to insights by continuously adding context
http://www.virtus-it.com 17
Auto build
and deploy
DB structure
based on
Workbench
Model
Continuous
Pull with
strong REST,
SOAP & DB
Support
Push Data
API for
streaming
sources
Time &
Location
Standard-
ization
Monitor
conditions
Event
detection
Build data
marts &
continuously
update new
data
In-Memory
Operations
Distributed
Architecture
MPP Support
StreamCentral Big Data Server: Software
that runs Big Data Solutions
• Extends your Business Intelligence strategy by
easily incorporating external data sets
• Introduces integration of real-time data for
event insight to your organization
• Auto-builds database schema (facts,
dimensions, entities, flat tables and more)
• By default, standardizes all incoming data by
connecting it to auto created time and location
dimensions
• Builds event data marts and continuously loads
data
• Builds real-time data marts to help in
understanding associations in data
Continuously loads these analysis data marts
• Deliver real-time event insights to new or
existing operational applications
• Significantly reduces IT overhead in building Big
Data solutions
Big Data Server
http://www.virtus-it.com
18
Solution
Modeling
Model
Deployment
Data Acquisition
Data
Management
Event Detection
Discovery &
Analysis
Bringing it together:
Building Big Data Solutions
with StreamCentral and
partner solutions
1. MPP Columnar Databases : Vertica,
ParStream
2. Microsoft SQL Server
StreamCentral Big Data
Server
StreamCentral
Workbench: Model
Deployment
StreamCentral Workbench:
Big Data Solutions Modeler
Tableau Software,
Microsoft PowerView
StreamCentral Big Data
Server
http://www.virtus-it.com
• Industrial strength, enterprise ready with web scale
characteristics - handles extremely large amounts of
data
• Uses in-memory processing for high speed
• Next generation distributed architecture – allows you to
run on any number of commodity hardware
• Built in redundancy at every layer for high availability
• Easy to use tools to monitor and manage StreamCentral
• Built on Microsoft technology that most enterprises
already have invested in
• Runs on best of breed and latest database technology
from Microsoft SQL Server and HP Vertica
Choose
database from:
19
http://www.virtus-it.com
Why StreamCentral?
• Roadmap to Big Data: StreamCentral is the only solution that enables the evolution
of current practices in Business Intelligence and Data warehousing to now include
external data, event monitoring and real-time insights
• No programming solution modeler: StreamCentral takes a solution approach –
designing and modeling shifts to analysts versus everything being done by
developers or programmed from scratch
• Continuously adds context to data: Any kind of data that is streamed to
StreamCentral, pulled in near real-time or imported via batch is continuously and
automatically connected to time, location and defined entities. This significantly
reduces risk, time and cost associated with building BI/DW and Big Data solutions
• Reduced dependency on specialist skills: No in-depth knowledge needed on HP
Vertica or SQL Server development as StreamCentral builds, deploys and maintains
all internal structures in those environments automatically
• Plays well: Is standards based and agnostic to existing enterprise technologies
• Adaptable: Everything created in StreamCentral can be modified. Makes it easy to
adapt the Big Data solution to changing needs of the business
20
http://www.virtus-it.com 21
Making a business case for leveraging Big Data
just got a whole lot easier with StreamCentral
70%
Time taken to build Big Data
solutions is drastically reduced
by using StreamCentral
60%
Cost of building Big Data
solutions is drastically reduced
by using StreamCentral
In addition, StreamCentral reduces risk, data
quality issues, specialist skillsets requirements and
complexity in building traditional Business
Intelligence/Data Warehousing or Big Data
solutions
http://www.virtus-it.com 22
No immediate plans to go Big on Data?
Planning to work primarily with
structured data?
But would like to deliver additional
insights by enhancing your existing
investments in Business Intelligence and
Data Warehousing?
http://www.virtus-it.com
Traditional Data Warehousing
Interrogation of historical data for trend
analysis. Business Intelligence applications
deliver analytics or reports to
management for performance analysis
On-Demand Business Intelligence
Update Data Warehouse continuously with
real-time data. Provides the ability to
analyze data updated in real-time
Operational Intelligence
Allows organizations to monitor fast
moving data for key indicators and events
and immediately act on these insights,
through manual or automated actions
Reporting:-
What did happen ?
Analysis:-
Why did it happen ?
Happens on previously
stored data
(data at rest)
Happens on real-time
streaming data
(data in-flight)
Solution value to businessLower Higher
PerceivedComplexityHigherLower
Event Monitoring:-
What is happening ?
Predictive Analytics:-
What will happen ?
Traditional Data Warehousing
Solutions
On-Demand
BI
Operational
Intelligence
23
Keeping it structured – A roadmap to
extend current investments in BI/DW
http://www.virtus-it.com
Reporting:-
What did happen ?
Analysis:-
Why did it happen ?
Happens on previously
stored data
(data at rest)
Happens on real-time
streaming data
(data in-flight)
Solution value to businessLower Higher
PerceivedComplexityHigherLower
Event Monitoring:-
What is happening ?
Predictive Analytics:-
What will happen ?
Traditional Data Warehousing
Solutions
On-Demand
BI
Operational
Intelligence
Most organizations have
traditionally invested in
this area
In most companies, the scope of
understanding business performance is
limited to historical analysis and rarely
includes real-time understanding of key
events that impact day to day
operational processes
Keeping it structured – A roadmap to
extend current investments in BI/DW
http://www.virtus-it.com
Reporting:-
What did happen ?
Analysis:-
Why did it happen ?
Happens on previously
stored data
(data at rest)
Happens on real-time
streaming data
(data in-flight)
Solution value to businessLower Higher
PerceivedComplexityHigherLower
Event Monitoring:-
What is happening ?
Predictive Analytics:-
What will happen ?
Traditional Data Warehousing
Solutions
On-Demand
BI
Operational
Intelligence
Most organizations have
traditionally invested in
this area
StreamCentral’s
area of focus
25
Keeping it structured – A roadmap to
extend current investments in BI/DW
http://www.virtus-it.com 26
An approach to working
with real-time data -
Operational Intelligence
http://www.virtus-it.com
Data Layer
Interfaces
Data Processing
Real-Time Insights
Business Solutions
Operational (User)
Internal
Applications and
Data Sets
External Data
Connections to existing
architecture for tapping
data & data streams
APIs
Databases
Enterprise Service Bus
Messages
Push Streaming Data |Pull Data |Format | Standardize | Transform |
Measure | Correlate | Event Detection | Rules Engine | In-Memory
Processing Real-Time Streaming Analytics
Real-Time Event Notification
Historical data that
supports pattern &trend
analytics. New insights are
added in real time
Customer
Experience
Continuous
Improvement
Day to day insights and actions
delivered in multiple mediums to many
users
KPIs
Complaints
Brand –
Protection
1
2
3
4
5
6
!
Access to right information at the right
time along with knowledge base of
actions to perform
Operational Intelligence practices are similar to traditional Data Warehousing practices
27
http://www.virtus-it.com
Data Layer
Interfaces
Data Processing
Real-Time Insights
Business Solutions
Operational (User)
Internal Data Sets External Data
Connections to existing
architecture for tapping
data & data streams
APIs
Databases
Enterprise Service Bus
Messages
Push Streaming Data |Pull Data |Format | Standardize | Transform |
Measure | Correlate | Event Detection | Rules Engine | In-Memory
Processing Real-Time Streaming Analytics
Real-Time Event Notification
Historical data supporting
pattern &trend analytics.
New insights added in real
time
Customer
Experience
Continuous
Improvement
Day to day insights and actions
delivered in multiple mediums to many
users
KPIs
Complaints
Brand –
Protection
1
2
3
4
5
6
!
Access to right information at the right
time along with knowledge base of
actions to perform
Focus of StreamCentral
28
http://www.virtus-it.com 29
More details on how
StreamCentral works
http://www.virtus-it.com
StreamCentral Workbench Big Data
Solutions Modeler - Inputs
• Data Sources
• Push/Pull
• Data transformations
• Define and import entity data
• Modeling
• Rules for monitoring conditions in data
• Correlation rules to identify related records across data sources in real-time
• Rules for detecting events
• Common dimension modeling
• Data Mart modelers
• Support for Real-time
• Correlation rules to identify related records across data sources in real-time
• Rules for detecting events
• Configure real-time data marts
• 360o data aggregation**
• Define data relationships across data sources
• Configure 360o data marts
• Data level security**
30** Coming Q3 2013
http://www.virtus-it.com
StreamCentral Big Data Server - Output
• Database structure automatically created, updated and managed in Big Data databases like HP
Vertica or SQL Server by StreamCentral.
• The StreamCentral database automatically builds time and location dimensions, fact tables, other
dimension tables, standardizes facts across data sources to the one time and location dimension
as well as connects facts to KPIs. StreamCentral also auto-loads this database from various data
sources into Big Data databases like HP Vertica or SQL Server
• Real-time event notification that can be consumed by operational applications via an API**
• Real-time event alerts
• Data marts that are automatically created, updated and managed by StreamCentral. The data
marts denormalize data into a single table facilitating faster querying and analysis of data
• Real-time analytical data marts built that aggregates events and data across data sources to better understand
conditions that influence events
• Real-time event data marts that bring together all relevant information for a single event
• 360o data marts for association and pattern analysis**
31** Coming Q3 2013
http://www.virtus-it.com 32
Sensors
Weather
Enterprise
Applications
Data Visualization
(Reporting, Analytics,
Dashboards)
Correlates
Data
Generates Key
Performance
Indicators
Uncovers
Events
Consumes real-
time or static data
OR Pulls data from
various data
sources and
applies
transformation and
standardization
rules
Model Deployment
Auto-builds database schema
Auto-loads database
Builds and continuous loads data to
event data marts
Builds and continuous loads
Analysis Collections
Publishes event data that can be
subscribed by Operational
Applications
Devices
Auto-build Database
Schema
360o Data marts and real-time data
marts
Event Data Marts for every event along
with its context as denormalized flat
tables
StreamCentral
Push
Push
Massively Parallel Processing Systems - Vertica
RDBMS – MS SQL Server
Publish event data
to operational
applications – Web,
mobile or desktop
StreamCentral Workbench – Big Data
Solutions Modeler
Collate Raw Data (Push/Pull) – Real-Time or Static
Model data standardization and transformation
rules
Define business entities and connect raw data to
business entities
Model Dimensions
Model conditions to monitor across data sources
Assign different conditions to different entities
Model Correlation Rules
Model events and specify context to add to events
Model analytical data marts auto built by
StreamCentral
StreamCentral Big Data Server
Enterprise
Applications
API
Traffic
API
API
API
http://www.virtus-it.com 33
builds two distinct types of analytical data marts
360o Data Marts** Real-Time Data Marts
• Defined: Easily bring together and aggregate data
across data sources to get 360o insight. Analyze
associations in data to determine patterns that
impact business performance
• Define data mart structure by choosing the right
set of attributes from data sources, KPIs, attributes
from entities, and dimensions in the Workbench
• . StreamCentral auto-builds the data mart
• Standardize data across time and location
• Update data mart at pre-defined intervals
StreamCentral Data Marts are denormalized flat tables – Why?
• Defined: Aggregate real-time events and bring
together data across data sources to analyze
conditions that existed when events are detected
• Standardize data across time and location
• Define data mart structure by choose the right set
of attributes from data sources, KPIs, events,
attributes from entities, and dimensions .
StreamCentral auto-builds the data mart
• Once data gets correlated in real-time data mart
gets updated with appropriate insights
• Technology advancements in columnar data stores, bit map indexes, column indexes make it
possible to scan and query large amounts of data in a single table
• Takes advantage of distributed architectures to scale out using commodity software
• Supports :
• SQL Server columnar indexes
• Vertica MPP
** Coming Q3 2013
http://www.virtus-it.com 34
StreamCentral Real-Time Operational Intelligence
• Data Sources
• Import initial data load
• Push data to StreamCentral API
• Pull data from data sources at defined intervals
• Apply transformations on the data in flight
• SQL Server, Oracle, My SQL, REST API, SOAP Web Service, LDAP
• Auto connects data to time and location dimension
• Model entities. Connect data sources to entities
• Model measures and KPIs
• Model standard dimensions
• Model real-time correlation rule (to identify related records across data sources in real time)
• Model Events
• Events based on real-time correlation rule
• Event Data Mart (automatically gets created when event is detected)
• Requires real-time correlation
• Brings together all data across data sources that were captured at the time the event was detected
• Model Real-Time Data Marts
• Requires a real-time correlation rule
• Update real-time data mart with streaming correlated data
• Define attributes that make up the real-time data mart definition. Select subsets of information from : specific attributes from data sources, KPIs, events, entity
attributes, dimensions, time and location
• Edit real-time data mart definition
http://www.virtus-it.com 35
StreamCentral 360o Data Aggregation**
• Data Sources
• Import initial data load
• Pull data from data sources at defined intervals
• Apply transformations on the data
• SQL Server, Oracle, My SQL, REST API, SOAP Web Service, LDAP
• Auto connects data to time and dimension location
• Model entities. Connect data sources to entities
• Model measures and KPIs
• Model standard dimensions
• Model 360o Data Marts
• Model 360 view query (define relationships across data sources to aggregate data)
• Schedule batch update interval (typically hours)
• Define attributes that make up the analysis collection. Select subsets of information from : specific
attributes from data sources, KPIs, entity attributes, dimensions, time and location
• Edit and update data mart definition
• Define data level security
** Coming Q3 2013
http://www.virtus-it.com 36
Data formats supported :
• XML
• JSON
• String
Data Sources supported :
• Database
• Microsoft SQL Server
• Oracle
• My SQL
• REST API
• SOAP API
• LDAP
• Specify transformation rules to data
that is applied to data in flight
• Specify parameters when calling APIs
• Auto fills location parameters based
on location data stored in the
database about entities
• Auto creates tables in the backend
database for data source data
Pull Data from Applications Push data to StreamCentral
• StreamCentral REST API available to
stream data to StreamCentral –
stream data from agents, sensors,
probes, devices
• Specify transformation rules to data
that are applied to the data in flight
• Auto creates the tables in the
backend database for source data
StreamCentral Databases
• Supports Microsoft SQL Server and
HP Vertica
• Auto creates data structures in the
database for source data
• Auto creates fact tables, dimensions,
flat tables for event analysis and flat
tables for pattern and association
analysis
• Data level security
StreamCentral Analytics
• Device friendly visualization
• Powerful portfolio of
visualization tools
• Ability to embed in custom
applications
• In-memory operations for fast
querying
StreamCentral Reports
• Role based security
• Subscribe to reports
• Ability to embed in custom
applications
• Export reports to various
formats
http://www.virtus-it.com 37
Transformation Description
LTRIM Removes all white spaces from the left
RTRIM Removes all white spaces from the right
Ignore Space Removes all white spaces from left, middle or right
Ignore Special Characters Returns string after ignoring all special characters
Contains Search for specific characters
Substring Extract a substring from a string
Left Removes the left part of a character string
Right Removes the right part of a character string
Replace Replaces specified string with another string
Startswith Search for a starting character
Endswith Search for an ending character
DoesNotContain Search for specific characters
Remove Remove specified characters or words from string
Range Search for a range
RoundOff Rounds off decimal value to a specific length
StreamCentral
Transformations
• Easy to use transformations
• Multiple transformations can
be executed on one attribute
http://www.virtus-it.com 38
StreamCentral
Physical Architecture
http://www.virtus-it.com 39
StreamCentral Collector
Windows Server 2012, .Net Framework 4.5, MSMQ
StreamCentral Stream Processing Engine
Windows Server 2012, .Net Framework 4.5,, MSMQ
StreamCentral Stream Correlation Engine
Windows Server 2012, .Net Framework 4.5,, MSMQ
StreamCentral Data Engine
Windows Server 2012, .Net Framework 4.5,, MSMQ
All components can run on
one machine
Every component can run
on more than one machine
StreamCentral Cache Cluster
Windows Server 2012, .Net Framework 4.5, AppFabric
StreamCentral Metadata database
Windows Server 2012, Microsoft SQL Server2008 R2
or Microsoft SQL Server 2012
StreamCentral Database and data marts
Option 1:
Windows Server 2012, Microsoft SQL Server2008 R2
or Microsoft SQL Server 2012 or SQL Server Parallel
Data Warehouse
Option 2
Linux, HP Vertica
StreamCentral Analytics
Windows Server 2012, Tableau Software
StreamCentral Physical Architecture and
Software Requirements
http://www.virtus-it.com 40
1 server for StreamCentral Components:
Collector, Stream Processing Engine, Correlation Engine, Data Engine
Characteristics of this server : Processor dependent therefore the higher
the number of cores the better, medium cache and low disk storage
Software: Windows Server 2012, .Net Framework 4.5, MSMQ
1 server for cache
Hardware characteristics: : Cache
dependent therefore more memory the
better. Medium CPU and low disk storage
Software : Windows Server 2012, .Net
Framework 4.5, AppFabric
1 server for StreamCentral Meta Data
Database, data mart storage and reporting
Hardware Characteristics:: High CPU, High
Memory and High Storage
Software : Windows Server 2012, SQL Server
1 server for StreamCentral Meta Data
Database and reporting
Hardware Characteristics:: Medium CPU,
Medium Memory and High Storage
Software : Windows Server 2012, SQL Server
OR
1 server for StreamCentral data marts
Hardware Characteristics:: High CPU, High
Memory and High Storage
Software : Linux, HP Vertica
+
StreamCentral suggested minimum system configuration
http://www.virtus-it.com
How does StreamCentral
fit within your enterprise
technology architecture?
41
http://www.virtus-it.com
Data Sources Method of Access
StreamCentral -
Read data from
Application
Application - Read data
by subscribing to
StreamCentral Real-Time
Event API
Application - Read data
by querying
StreamCentral database
Enterprise Applications X real-time X
Using Web Service or REST API X real-time
Using database query X
Enterprise Service Bus X real-time X
via Web Service or REST API
X real-time
via subscribing to messages
X real-time
Enterprise Data
Warehouse
via database query
X X
Point databases
via database query
X X
LDAP via database query X
External Data Sources via Web Service or REST API X real-time 42
http://www.virtus-it.com
Sensors
Weather
Devices
Traffic
Custom
ApplicationsMainframe
Business Services
Enterprise Service Bus - Messaging / Mediation / Orchestration / Security
Business
Process
Business
Process
Business
Process
Composite
Application
Composite
Application
Composite
Application
Auto-build Database
Schema
Analysis Collections – Data marts as
denormalized flat tables
Event Collections – Data Marts for
every event along with its context as
denormalized flat tables
StreamCentral Engine
StreamCentral Workbench
Collate Raw Data (Push/Pull)
Standardize Data
Define Business Rules
Define Correlation
Define events
Define analytical data marts
auto built by StreamCentral
Historical
Analysis
Real-time event data published
to operational applications
and dashboards
Massively Parallel Processing Systems - Vertica
Columnar databases with Bit Map indexes – ParStream
RDBMS – MS SQL Server
StreamCentral as part of an Enterprise Service Bus architecture
API
ERP
Push / Pull
43
http://www.virtus-it.com
Sensors
Weather
Devices
Traffic
ERP
Custom
ApplicationsMainframe
Business Services
Enterprise Service Bus - Messaging / Mediation / Orchestration / Security
Business
Process
Business
Process
Business
Process
Composite
Application
Composite
Application
Composite
Application
Auto-build Database
Schema
Analysis Collections – Data marts as
denormalized flat tables
Event Collections – Data Marts for
every event along with its context as
denormalized flat tables
StreamCentral Engine
StreamCentral Workbench
Collate Raw Data (Push/Pull)
Standardize Data
Define Business Rules
Define Correlation
Define events
Define analytical data marts
auto built by StreamCentral
Historical
AnalysisEnterprise Business Intelligence
System
Massively Parallel Processing Systems - Vertica
Columnar databases with Bit Map indexes – ParStream
RDBMS – MS SQL Server
StreamCentral and Enterprise BI as part of an Enterprise
Service Bus architecture
Real-time event data published
to operational applications
and dashboardsAPI
Push
Pull
Push / Pull
44
http://www.virtus-it.com
Thank you
for your time
Contact us for a demonstration
Stephen Wells
CEO - Virtus IT Ltd
E: stephen.wells@virtus-it.com
M: +44 77 111 30879
Raheel Retiwalla
CTO - Virtus IT Ltd
E: raheel.retiwalla@virtus-it.com
M: +1 617 901 8370
A trusted partner
45

Contenu connexe

Tendances

How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...
How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...
How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...Precisely
 
Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...
Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...
Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...DATAVERSITY
 
Deliver Big Data, Database and AI/ML as-a-Service anywhere
Deliver Big Data, Database and AI/ML as-a-Service anywhereDeliver Big Data, Database and AI/ML as-a-Service anywhere
Deliver Big Data, Database and AI/ML as-a-Service anywhereRavikumar Alluboyina
 
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?DATAVERSITY
 
Constant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneyConstant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneySeeling Cheung
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Performance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and morePerformance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and moreDenodo
 
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingAnalyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingDenodo
 
Big Data Analytics Webinar
Big Data Analytics WebinarBig Data Analytics Webinar
Big Data Analytics WebinarEckerson Group
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence ArchitecturePhilippe Julio
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesTony Pearson
 
Hadoop 2.0 - Solving the Data Quality Challenge
Hadoop 2.0 - Solving the Data Quality ChallengeHadoop 2.0 - Solving the Data Quality Challenge
Hadoop 2.0 - Solving the Data Quality ChallengeInside Analysis
 
Data Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsData Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsDenodo
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Denodo
 
Data Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
Data Warehouse Like a Tech Startup with Oracle Autonomous Data WarehouseData Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
Data Warehouse Like a Tech Startup with Oracle Autonomous Data WarehouseRittman Analytics
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Erik Fransen
 
Applying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcareApplying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcarePaul Boal
 
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
The Role of the Logical Data Fabric in a Unified Platform for Modern AnalyticsThe Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
The Role of the Logical Data Fabric in a Unified Platform for Modern AnalyticsDenodo
 
Change data capture
Change data captureChange data capture
Change data captureJames Deppen
 

Tendances (20)

How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...
How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...
How Precisely and Splunk Can Help You Better Manage Your IBM Z and IBM i Envi...
 
Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...
Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...
Slides: Proven Strategies for Hybrid Cloud Computing with Mainframes — From A...
 
Deliver Big Data, Database and AI/ML as-a-Service anywhere
Deliver Big Data, Database and AI/ML as-a-Service anywhereDeliver Big Data, Database and AI/ML as-a-Service anywhere
Deliver Big Data, Database and AI/ML as-a-Service anywhere
 
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
 
Constant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneyConstant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake Journey
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Performance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and morePerformance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and more
 
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingAnalyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
 
Big Data Analytics Webinar
Big Data Analytics WebinarBig Data Analytics Webinar
Big Data Analytics Webinar
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
Hadoop 2.0 - Solving the Data Quality Challenge
Hadoop 2.0 - Solving the Data Quality ChallengeHadoop 2.0 - Solving the Data Quality Challenge
Hadoop 2.0 - Solving the Data Quality Challenge
 
Data Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsData Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation Analytics
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Data Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
Data Warehouse Like a Tech Startup with Oracle Autonomous Data WarehouseData Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
Data Warehouse Like a Tech Startup with Oracle Autonomous Data Warehouse
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
 
Applying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcareApplying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to Healthcare
 
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
The Role of the Logical Data Fabric in a Unified Platform for Modern AnalyticsThe Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
 
Change data capture
Change data captureChange data capture
Change data capture
 

En vedette

Business model 2.0
Business model 2.0Business model 2.0
Business model 2.0Cenk Sezgin
 
How to create new business models with Big Data and Analytics
How to create new business models with Big Data and AnalyticsHow to create new business models with Big Data and Analytics
How to create new business models with Big Data and AnalyticsAki Balogh
 
BIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsBIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsSkillspeed
 
SC4 Workshop 1: Logistics and big data German herrero
SC4 Workshop 1: Logistics and big data  German herreroSC4 Workshop 1: Logistics and big data  German herrero
SC4 Workshop 1: Logistics and big data German herreroBigData_Europe
 
A Primer on Big Data for Business
A Primer on Big Data for BusinessA Primer on Big Data for Business
A Primer on Big Data for BusinessLeslie Bradshaw
 
Data-Driven Business Model Innovation Blueprint
Data-Driven Business Model Innovation BlueprintData-Driven Business Model Innovation Blueprint
Data-Driven Business Model Innovation BlueprintMohamed Zaki
 
Dental problems in kids dental clinic noida
Dental problems in kids   dental clinic noidaDental problems in kids   dental clinic noida
Dental problems in kids dental clinic noidaDr.Reena Gupta
 
Mercedes. Rychlost. tianDe.
Mercedes. Rychlost. tianDe.Mercedes. Rychlost. tianDe.
Mercedes. Rychlost. tianDe.Liza Alypova
 
Manual de Comunicacion IASD - DSA
Manual de Comunicacion IASD - DSAManual de Comunicacion IASD - DSA
Manual de Comunicacion IASD - DSALu Esqueche
 
Lessons learned v1b cmmaao pmi pmp
Lessons learned v1b cmmaao pmi pmpLessons learned v1b cmmaao pmi pmp
Lessons learned v1b cmmaao pmi pmpvishvasyadav45
 

En vedette (14)

Business model 2.0
Business model 2.0Business model 2.0
Business model 2.0
 
How to create new business models with Big Data and Analytics
How to create new business models with Big Data and AnalyticsHow to create new business models with Big Data and Analytics
How to create new business models with Big Data and Analytics
 
Big data3
Big data3Big data3
Big data3
 
BIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsBIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in Logistics
 
SC4 Workshop 1: Logistics and big data German herrero
SC4 Workshop 1: Logistics and big data  German herreroSC4 Workshop 1: Logistics and big data  German herrero
SC4 Workshop 1: Logistics and big data German herrero
 
A Primer on Big Data for Business
A Primer on Big Data for BusinessA Primer on Big Data for Business
A Primer on Big Data for Business
 
Data-Driven Business Model Innovation Blueprint
Data-Driven Business Model Innovation BlueprintData-Driven Business Model Innovation Blueprint
Data-Driven Business Model Innovation Blueprint
 
Dental problems in kids dental clinic noida
Dental problems in kids   dental clinic noidaDental problems in kids   dental clinic noida
Dental problems in kids dental clinic noida
 
Economy Matters, February 2014
Economy Matters, February 2014Economy Matters, February 2014
Economy Matters, February 2014
 
Mercedes. Rychlost. tianDe.
Mercedes. Rychlost. tianDe.Mercedes. Rychlost. tianDe.
Mercedes. Rychlost. tianDe.
 
Multilateral Newsletter February 2016
Multilateral Newsletter February 2016Multilateral Newsletter February 2016
Multilateral Newsletter February 2016
 
Manual de Comunicacion IASD - DSA
Manual de Comunicacion IASD - DSAManual de Comunicacion IASD - DSA
Manual de Comunicacion IASD - DSA
 
Slideshare 2
Slideshare 2Slideshare 2
Slideshare 2
 
Lessons learned v1b cmmaao pmi pmp
Lessons learned v1b cmmaao pmi pmpLessons learned v1b cmmaao pmi pmp
Lessons learned v1b cmmaao pmi pmp
 

Similaire à StreamCentral for the IT Professional

Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformArvind Sathi
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Confluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointConfluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointconfluent
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessInside Analysis
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsStreamsets Inc.
 
Real time data integration best practices and architecture
Real time data integration best practices and architectureReal time data integration best practices and architecture
Real time data integration best practices and architectureBui Kiet
 
StreamCentral Technical Overview
StreamCentral Technical OverviewStreamCentral Technical Overview
StreamCentral Technical OverviewRaheel Retiwalla
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester WebinarCloudera, Inc.
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesDATAVERSITY
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data editionMark Kerzner
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
Company Profile - NPC with TIBCO Spotfire solution
Company Profile - NPC with TIBCO Spotfire solution  Company Profile - NPC with TIBCO Spotfire solution
Company Profile - NPC with TIBCO Spotfire solution Sirinporn Setworaya
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionDenodo
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Inside Analysis
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)Denodo
 

Similaire à StreamCentral for the IT Professional (20)

Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Confluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointConfluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPoint
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven Business
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
 
Real time data integration best practices and architecture
Real time data integration best practices and architectureReal time data integration best practices and architecture
Real time data integration best practices and architecture
 
StreamCentral Technical Overview
StreamCentral Technical OverviewStreamCentral Technical Overview
StreamCentral Technical Overview
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data edition
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
Company Profile - NPC with TIBCO Spotfire solution
Company Profile - NPC with TIBCO Spotfire solution  Company Profile - NPC with TIBCO Spotfire solution
Company Profile - NPC with TIBCO Spotfire solution
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 

Dernier

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Dernier (20)

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

StreamCentral for the IT Professional

  • 1. http://www.virtus-it.com A trusted partner Software to model & build Business Intelligence & Big Data Solutions
  • 2. http://www.virtus-it.com Table of Contents • Big Data Overview………..………………………………………………………… 3 - 6 • Step by Step approach to building Big Data Solutions……………….. 7 - 10 • StreamCentral Introduction……………………………………..………….... 11 - 21 • Keeping it structured? –Extending current DW/BI investments………. 22 - 25 • An approach to building Operational Intelligence solutions………….... 26 - 28 • StreamCentral additional details……………………………………..………….... 29 - 37 • StreamCentral physical architecture……………………………..………….... 38 - 40 • How StreamCentral fits in an enterprise technology architecture…… 41 - 44 2
  • 3. http://www.virtus-it.com 10101010101010101010101010101010101010101010101ABC010 1010101010101010101010100101010101010101ABCABCABCABC 1 0 1 0 1 0 1 0 1 0 1 0 3 BIGData Today With Big Data Custom Application ERP • Analysis of structured data from internal applications • Data sets updated using batch processes • Traditional BI & Data Warehousing • Traditional BI and data warehousing extended to include structured and unstructured data from internal and external applications processed in real-time or in batch. • Ability to predict events as well as analyze historical associations in wide sets of data for patterns and trends. Data Management Discovery & Analysis Event Detection Big Data Solution Modeling & Model Deployment Stream Processing & Batch Data Acquisition Blocks for building Big Data Solutions ERP Internal & External Applications Data Stores 1 0 1 0 1 0 Real-Time + Batch Big Data Processing Layer Real-time event data in operational applications Pattern, trend and association analysis Understand the connections in a wide variety of data that impact business performance and use that knowledge to deliver exceptional business results
  • 4. http://www.virtus-it.com Understanding business performance with Big Data includes two distinct capabilities: Managing performance by analyzing internal and external, structured and unstructured data for patterns and associations collected over time • Customer segmentation based on buying history patterns and finding associations with population, census and twitter data to develop marketing strategy • Web analytics to improve marketing campaigns and relevant content • Sales pipeline analysis compared to industry data to understand the right goals to set • Cash flow analysis to make capital investment decisions considering external variables Managing performance by analyzing real- time data for day to day events – Operational Intelligence • What is the current workload? • Is staff available and working on high priority work? • What factors are impacting customer experience right now? • What processes are taking longer than expected? 4 A few example scenarios: A few example scenarios:
  • 5. http://www.virtus-it.com Telco’s Core IMS Network Data Data, Voice & Video Performance Data Data, Voice & Video Performance Data Data from Telco Towers Weather Data Traffic IncidentsPopulation Data Data Stream weatherundergrou nd MapquestUSA Today Census data Sources of real time streaming data from networks, devices, services and other internal applications External sources of data that add understanding of what’s happening when events are detected Example Big Data Solutions: Telco Network Test New Service – Investment Planning Adaptive Bit Rate – Video Streaming QoE 360o Customer QoE for 1st Level customer service Video QoE for IPTV Business Solutions 5
  • 6. http://www.virtus-it.com Telco’s Core IMS Network Data Data, Voice & Video Performance Data Data, Voice & Video Performance Data Data from Telco Towers Weather Data Traffic IncidentsPopulation Data Data Stream weatherunderground INRIXUSA Today Census data Example Big Data Scenario : Utilities - Water Sources of real time data relating to your business Sources of BIG DATA relevant to your business VIBRATION SENSOR ENERGY HARVESTING WATER MAIN PRESSURE SCADA NETWORK 6
  • 7. http://www.virtus-it.com 7 Steps it takes to build powerful Big Data solutions! Solution Modeling Model Deployment Data Acquisition (streaming or batch, internal or external, structured or unstructured) Data Management Event Detection Discovery & Analysis Big Data Solution Lifecycle Start here
  • 8. http://www.virtus-it.com 8 Solution Modeling •Logical Data Model design •Data standardization & transformation modeling •Key Performance Indicator modeling via business rules •Dimensional modeling •Historical Data Mart Modeling •Event detection modeling via business rules •Real-time analytics data mart modeling Model Deployment •Physical Design Implementation •Physical deployment of dimensional model •Database deployment •Physical deployment of data marts •Rules deployment Data Acquisition •Data from internal data sources •Data from external sources •Streaming data •Batch data •Structured Data •Unstructured Data •Data transformation •Data standardization Data Management •Structured Data Storage •Unstructured Data Storage •Scalability •Performance Event Detection •Detecting events on streaming data •Alerting •Integration with operational applications Discovery & Analysis •Information Discovery •Data Classification •Analytics •Querying •Visualization Solution Modeling Model Deployment Data Acquisition Data Management Event Detection Discovery& Analysis Big Data Solution Lifecycle – Tasks Detailed
  • 9. http://www.virtus-it.com 9 Solution Modeling Model Deployment Data Acquisition Data Management Event Detection Discovery & Analysis 1. Hadoop - MapReduce 2. MPP Columnar Databases like Neteeza, Vertica, ParStream 3. NoSQL – MongoDB, Cassandra 4. Evolution of traditional RDBMS to support column indexes SQL Server Big Data Innovations in Data Management Big Data Innovations in Discovery & Analysis Where has the innovation been in Big Data? The last few years have seen lots of innovation in Data Management as well as Discovery and Analysis
  • 10. http://www.virtus-it.com Solution Modeling Model Deployment Data Acquisition Data Management Event Detection Discovery & Analysis Big Data Lifecycle But, where is the innovation in these areas? • Fragmented, point use or lack of industry strength technology to aid in Design, Model Deployment, Data Acquisition and Event Detection makes it difficult, time consuming and specialist resource intensive to build Big Data Solutions • What is the use of having scalable platforms that can store and manage this data and tools that can deliver incredible visualizations when the effort to get the data right is still a problem as it has always been?
  • 12. http://www.virtus-it.com Solution Modeling Model Deployment Data Acquisition Data Management Event Detection Discovery & Analysis Big Data Solution Life cycle 1. StreamCentral Solutions Designer makes it easy to model traditional BI/DW and Big Data solutions 2. Builds and deploys model on HP Vertica or Microsoft SQL Server 3. Adds context by connecting all streaming and static data to time, location and entities 4. StreamCentral Big Data Server, horizontally scalable, executes the model definition in real-time 5. StreamCentral drastically reduces time to market, risk and cost in building Big Data solutions! Software to design & build BI & Big Data SolutionsStreamCentral enables you to quickly move from a blank sheet of paper to a production system, comprehensive and powerful that can be delivered without a large investment in specialist skills.
  • 13. http://www.virtus-it.com 13 1010101010101010 ABCABCABCABCABC StreamCentral Workbench: Solution Designer StreamCentral Workbench: Model DeploymentData Collection Data Processing Correlation Data Publishing Data Security StreamCentral Big Data Server StreamCentral has three main components: 1. Use the Workbench Designer to define source data, entities, rules for monitoring conditions, events and data correlation, analytical models and knowledgebase 2. Workbench Model Deployment configures, builds and deploys the model on top of HP Vertica or Microsoft SQL Server 3. Big Data Server executes the defined model in real-time 1 2 3
  • 14. http://www.virtus-it.com Database REST/SOAP API LDAP PUSH API Data Processing Engine Vertica SQL Server Correlation EngineCollector Data Publishing, Access and Security • Capture data • Validate data • Prepare data • Apply transformations • Perform calculations • Determine conditions & KPIs • Identify & build dimensions • Identify alerts • Correlate incoming data based on defined rules • Detect events based on correlated data • Update fact data • Update entity & dimension data • Update analysis collections • Update event collections • Manage data level security Data Acquisition – Push / Pull data from variety of sources Design data transformations Conditions & KPI modeler via rules builder Real-time data correlation Event detection via rules builder Real-time data mart designer 360o data mart designer Define entities and Import Entity Data Dimension modeler Data Security designer StreamCentral Big Data Server StreamCentral Workbench: Big Data Solution Designer Meta Data Create Database Structure Add Context StreamCentral Workbench: Big Data Solution Deployment
  • 15. http://www.virtus-it.com 15 Model Pull Data Sources with strong REST, SOAP & DB Support Push Data API Data Transformat -ion Model Entities & import static data Dimension modeler Time & Location Standard- ization Conditions & KPI modeler Correlation Modeler Event Detection rules on real-time data Real-time & Historical analytics Data mart modeler • Software targeted to be used by IT and non IT people to design and build Big Data solutions • Can work with batch data (as in traditional Business Intelligence) or real-time streams (as in Operational Intelligence) • Workbench lets analysts model all necessary steps in building a Big Data Solution • Data Pull/Push • Model Transformations • Model Entities (like customers, patients, products), import static entity data and define entity relationships to source data • Shared dimensions across data • Condition modeler via business rules to monitor specific sets of conditions in batch or streaming data • Evaluate different entities with different sets of conditions as data flows in • Specify rules to model how to correlate data streams in real-time • Event detection • Model data marts that aggregate the right data for association and pattern analysis StreamCentral Workbench : Software to design traditional BI/DW & Big Data Solutions Workbench
  • 16. http://www.virtus-it.com 16 Generating insights from data requires context to be added to the data. This context is a continuous thread that connects all types of data throughout the Big Data Solution lifecycle. Four typical examples of context.. Insight Who (entities like customer, patient) When (time) Where (location) What (streaming & static data correlation) • StreamCentral automatically builds and maintains time and location dimensions • Entities can be created and defined in StreamCentral • All data in StreamCentral is continuously and automatically connected to time, location and defined entities • Resultant real-time events and analytical data marts automatically inherit this context without need for any programming or development work • This increases the impact and value of collected data Converting data to insights by continuously adding context
  • 17. http://www.virtus-it.com 17 Auto build and deploy DB structure based on Workbench Model Continuous Pull with strong REST, SOAP & DB Support Push Data API for streaming sources Time & Location Standard- ization Monitor conditions Event detection Build data marts & continuously update new data In-Memory Operations Distributed Architecture MPP Support StreamCentral Big Data Server: Software that runs Big Data Solutions • Extends your Business Intelligence strategy by easily incorporating external data sets • Introduces integration of real-time data for event insight to your organization • Auto-builds database schema (facts, dimensions, entities, flat tables and more) • By default, standardizes all incoming data by connecting it to auto created time and location dimensions • Builds event data marts and continuously loads data • Builds real-time data marts to help in understanding associations in data Continuously loads these analysis data marts • Deliver real-time event insights to new or existing operational applications • Significantly reduces IT overhead in building Big Data solutions Big Data Server
  • 18. http://www.virtus-it.com 18 Solution Modeling Model Deployment Data Acquisition Data Management Event Detection Discovery & Analysis Bringing it together: Building Big Data Solutions with StreamCentral and partner solutions 1. MPP Columnar Databases : Vertica, ParStream 2. Microsoft SQL Server StreamCentral Big Data Server StreamCentral Workbench: Model Deployment StreamCentral Workbench: Big Data Solutions Modeler Tableau Software, Microsoft PowerView StreamCentral Big Data Server
  • 19. http://www.virtus-it.com • Industrial strength, enterprise ready with web scale characteristics - handles extremely large amounts of data • Uses in-memory processing for high speed • Next generation distributed architecture – allows you to run on any number of commodity hardware • Built in redundancy at every layer for high availability • Easy to use tools to monitor and manage StreamCentral • Built on Microsoft technology that most enterprises already have invested in • Runs on best of breed and latest database technology from Microsoft SQL Server and HP Vertica Choose database from: 19
  • 20. http://www.virtus-it.com Why StreamCentral? • Roadmap to Big Data: StreamCentral is the only solution that enables the evolution of current practices in Business Intelligence and Data warehousing to now include external data, event monitoring and real-time insights • No programming solution modeler: StreamCentral takes a solution approach – designing and modeling shifts to analysts versus everything being done by developers or programmed from scratch • Continuously adds context to data: Any kind of data that is streamed to StreamCentral, pulled in near real-time or imported via batch is continuously and automatically connected to time, location and defined entities. This significantly reduces risk, time and cost associated with building BI/DW and Big Data solutions • Reduced dependency on specialist skills: No in-depth knowledge needed on HP Vertica or SQL Server development as StreamCentral builds, deploys and maintains all internal structures in those environments automatically • Plays well: Is standards based and agnostic to existing enterprise technologies • Adaptable: Everything created in StreamCentral can be modified. Makes it easy to adapt the Big Data solution to changing needs of the business 20
  • 21. http://www.virtus-it.com 21 Making a business case for leveraging Big Data just got a whole lot easier with StreamCentral 70% Time taken to build Big Data solutions is drastically reduced by using StreamCentral 60% Cost of building Big Data solutions is drastically reduced by using StreamCentral In addition, StreamCentral reduces risk, data quality issues, specialist skillsets requirements and complexity in building traditional Business Intelligence/Data Warehousing or Big Data solutions
  • 22. http://www.virtus-it.com 22 No immediate plans to go Big on Data? Planning to work primarily with structured data? But would like to deliver additional insights by enhancing your existing investments in Business Intelligence and Data Warehousing?
  • 23. http://www.virtus-it.com Traditional Data Warehousing Interrogation of historical data for trend analysis. Business Intelligence applications deliver analytics or reports to management for performance analysis On-Demand Business Intelligence Update Data Warehouse continuously with real-time data. Provides the ability to analyze data updated in real-time Operational Intelligence Allows organizations to monitor fast moving data for key indicators and events and immediately act on these insights, through manual or automated actions Reporting:- What did happen ? Analysis:- Why did it happen ? Happens on previously stored data (data at rest) Happens on real-time streaming data (data in-flight) Solution value to businessLower Higher PerceivedComplexityHigherLower Event Monitoring:- What is happening ? Predictive Analytics:- What will happen ? Traditional Data Warehousing Solutions On-Demand BI Operational Intelligence 23 Keeping it structured – A roadmap to extend current investments in BI/DW
  • 24. http://www.virtus-it.com Reporting:- What did happen ? Analysis:- Why did it happen ? Happens on previously stored data (data at rest) Happens on real-time streaming data (data in-flight) Solution value to businessLower Higher PerceivedComplexityHigherLower Event Monitoring:- What is happening ? Predictive Analytics:- What will happen ? Traditional Data Warehousing Solutions On-Demand BI Operational Intelligence Most organizations have traditionally invested in this area In most companies, the scope of understanding business performance is limited to historical analysis and rarely includes real-time understanding of key events that impact day to day operational processes Keeping it structured – A roadmap to extend current investments in BI/DW
  • 25. http://www.virtus-it.com Reporting:- What did happen ? Analysis:- Why did it happen ? Happens on previously stored data (data at rest) Happens on real-time streaming data (data in-flight) Solution value to businessLower Higher PerceivedComplexityHigherLower Event Monitoring:- What is happening ? Predictive Analytics:- What will happen ? Traditional Data Warehousing Solutions On-Demand BI Operational Intelligence Most organizations have traditionally invested in this area StreamCentral’s area of focus 25 Keeping it structured – A roadmap to extend current investments in BI/DW
  • 26. http://www.virtus-it.com 26 An approach to working with real-time data - Operational Intelligence
  • 27. http://www.virtus-it.com Data Layer Interfaces Data Processing Real-Time Insights Business Solutions Operational (User) Internal Applications and Data Sets External Data Connections to existing architecture for tapping data & data streams APIs Databases Enterprise Service Bus Messages Push Streaming Data |Pull Data |Format | Standardize | Transform | Measure | Correlate | Event Detection | Rules Engine | In-Memory Processing Real-Time Streaming Analytics Real-Time Event Notification Historical data that supports pattern &trend analytics. New insights are added in real time Customer Experience Continuous Improvement Day to day insights and actions delivered in multiple mediums to many users KPIs Complaints Brand – Protection 1 2 3 4 5 6 ! Access to right information at the right time along with knowledge base of actions to perform Operational Intelligence practices are similar to traditional Data Warehousing practices 27
  • 28. http://www.virtus-it.com Data Layer Interfaces Data Processing Real-Time Insights Business Solutions Operational (User) Internal Data Sets External Data Connections to existing architecture for tapping data & data streams APIs Databases Enterprise Service Bus Messages Push Streaming Data |Pull Data |Format | Standardize | Transform | Measure | Correlate | Event Detection | Rules Engine | In-Memory Processing Real-Time Streaming Analytics Real-Time Event Notification Historical data supporting pattern &trend analytics. New insights added in real time Customer Experience Continuous Improvement Day to day insights and actions delivered in multiple mediums to many users KPIs Complaints Brand – Protection 1 2 3 4 5 6 ! Access to right information at the right time along with knowledge base of actions to perform Focus of StreamCentral 28
  • 29. http://www.virtus-it.com 29 More details on how StreamCentral works
  • 30. http://www.virtus-it.com StreamCentral Workbench Big Data Solutions Modeler - Inputs • Data Sources • Push/Pull • Data transformations • Define and import entity data • Modeling • Rules for monitoring conditions in data • Correlation rules to identify related records across data sources in real-time • Rules for detecting events • Common dimension modeling • Data Mart modelers • Support for Real-time • Correlation rules to identify related records across data sources in real-time • Rules for detecting events • Configure real-time data marts • 360o data aggregation** • Define data relationships across data sources • Configure 360o data marts • Data level security** 30** Coming Q3 2013
  • 31. http://www.virtus-it.com StreamCentral Big Data Server - Output • Database structure automatically created, updated and managed in Big Data databases like HP Vertica or SQL Server by StreamCentral. • The StreamCentral database automatically builds time and location dimensions, fact tables, other dimension tables, standardizes facts across data sources to the one time and location dimension as well as connects facts to KPIs. StreamCentral also auto-loads this database from various data sources into Big Data databases like HP Vertica or SQL Server • Real-time event notification that can be consumed by operational applications via an API** • Real-time event alerts • Data marts that are automatically created, updated and managed by StreamCentral. The data marts denormalize data into a single table facilitating faster querying and analysis of data • Real-time analytical data marts built that aggregates events and data across data sources to better understand conditions that influence events • Real-time event data marts that bring together all relevant information for a single event • 360o data marts for association and pattern analysis** 31** Coming Q3 2013
  • 32. http://www.virtus-it.com 32 Sensors Weather Enterprise Applications Data Visualization (Reporting, Analytics, Dashboards) Correlates Data Generates Key Performance Indicators Uncovers Events Consumes real- time or static data OR Pulls data from various data sources and applies transformation and standardization rules Model Deployment Auto-builds database schema Auto-loads database Builds and continuous loads data to event data marts Builds and continuous loads Analysis Collections Publishes event data that can be subscribed by Operational Applications Devices Auto-build Database Schema 360o Data marts and real-time data marts Event Data Marts for every event along with its context as denormalized flat tables StreamCentral Push Push Massively Parallel Processing Systems - Vertica RDBMS – MS SQL Server Publish event data to operational applications – Web, mobile or desktop StreamCentral Workbench – Big Data Solutions Modeler Collate Raw Data (Push/Pull) – Real-Time or Static Model data standardization and transformation rules Define business entities and connect raw data to business entities Model Dimensions Model conditions to monitor across data sources Assign different conditions to different entities Model Correlation Rules Model events and specify context to add to events Model analytical data marts auto built by StreamCentral StreamCentral Big Data Server Enterprise Applications API Traffic API API API
  • 33. http://www.virtus-it.com 33 builds two distinct types of analytical data marts 360o Data Marts** Real-Time Data Marts • Defined: Easily bring together and aggregate data across data sources to get 360o insight. Analyze associations in data to determine patterns that impact business performance • Define data mart structure by choosing the right set of attributes from data sources, KPIs, attributes from entities, and dimensions in the Workbench • . StreamCentral auto-builds the data mart • Standardize data across time and location • Update data mart at pre-defined intervals StreamCentral Data Marts are denormalized flat tables – Why? • Defined: Aggregate real-time events and bring together data across data sources to analyze conditions that existed when events are detected • Standardize data across time and location • Define data mart structure by choose the right set of attributes from data sources, KPIs, events, attributes from entities, and dimensions . StreamCentral auto-builds the data mart • Once data gets correlated in real-time data mart gets updated with appropriate insights • Technology advancements in columnar data stores, bit map indexes, column indexes make it possible to scan and query large amounts of data in a single table • Takes advantage of distributed architectures to scale out using commodity software • Supports : • SQL Server columnar indexes • Vertica MPP ** Coming Q3 2013
  • 34. http://www.virtus-it.com 34 StreamCentral Real-Time Operational Intelligence • Data Sources • Import initial data load • Push data to StreamCentral API • Pull data from data sources at defined intervals • Apply transformations on the data in flight • SQL Server, Oracle, My SQL, REST API, SOAP Web Service, LDAP • Auto connects data to time and location dimension • Model entities. Connect data sources to entities • Model measures and KPIs • Model standard dimensions • Model real-time correlation rule (to identify related records across data sources in real time) • Model Events • Events based on real-time correlation rule • Event Data Mart (automatically gets created when event is detected) • Requires real-time correlation • Brings together all data across data sources that were captured at the time the event was detected • Model Real-Time Data Marts • Requires a real-time correlation rule • Update real-time data mart with streaming correlated data • Define attributes that make up the real-time data mart definition. Select subsets of information from : specific attributes from data sources, KPIs, events, entity attributes, dimensions, time and location • Edit real-time data mart definition
  • 35. http://www.virtus-it.com 35 StreamCentral 360o Data Aggregation** • Data Sources • Import initial data load • Pull data from data sources at defined intervals • Apply transformations on the data • SQL Server, Oracle, My SQL, REST API, SOAP Web Service, LDAP • Auto connects data to time and dimension location • Model entities. Connect data sources to entities • Model measures and KPIs • Model standard dimensions • Model 360o Data Marts • Model 360 view query (define relationships across data sources to aggregate data) • Schedule batch update interval (typically hours) • Define attributes that make up the analysis collection. Select subsets of information from : specific attributes from data sources, KPIs, entity attributes, dimensions, time and location • Edit and update data mart definition • Define data level security ** Coming Q3 2013
  • 36. http://www.virtus-it.com 36 Data formats supported : • XML • JSON • String Data Sources supported : • Database • Microsoft SQL Server • Oracle • My SQL • REST API • SOAP API • LDAP • Specify transformation rules to data that is applied to data in flight • Specify parameters when calling APIs • Auto fills location parameters based on location data stored in the database about entities • Auto creates tables in the backend database for data source data Pull Data from Applications Push data to StreamCentral • StreamCentral REST API available to stream data to StreamCentral – stream data from agents, sensors, probes, devices • Specify transformation rules to data that are applied to the data in flight • Auto creates the tables in the backend database for source data StreamCentral Databases • Supports Microsoft SQL Server and HP Vertica • Auto creates data structures in the database for source data • Auto creates fact tables, dimensions, flat tables for event analysis and flat tables for pattern and association analysis • Data level security StreamCentral Analytics • Device friendly visualization • Powerful portfolio of visualization tools • Ability to embed in custom applications • In-memory operations for fast querying StreamCentral Reports • Role based security • Subscribe to reports • Ability to embed in custom applications • Export reports to various formats
  • 37. http://www.virtus-it.com 37 Transformation Description LTRIM Removes all white spaces from the left RTRIM Removes all white spaces from the right Ignore Space Removes all white spaces from left, middle or right Ignore Special Characters Returns string after ignoring all special characters Contains Search for specific characters Substring Extract a substring from a string Left Removes the left part of a character string Right Removes the right part of a character string Replace Replaces specified string with another string Startswith Search for a starting character Endswith Search for an ending character DoesNotContain Search for specific characters Remove Remove specified characters or words from string Range Search for a range RoundOff Rounds off decimal value to a specific length StreamCentral Transformations • Easy to use transformations • Multiple transformations can be executed on one attribute
  • 39. http://www.virtus-it.com 39 StreamCentral Collector Windows Server 2012, .Net Framework 4.5, MSMQ StreamCentral Stream Processing Engine Windows Server 2012, .Net Framework 4.5,, MSMQ StreamCentral Stream Correlation Engine Windows Server 2012, .Net Framework 4.5,, MSMQ StreamCentral Data Engine Windows Server 2012, .Net Framework 4.5,, MSMQ All components can run on one machine Every component can run on more than one machine StreamCentral Cache Cluster Windows Server 2012, .Net Framework 4.5, AppFabric StreamCentral Metadata database Windows Server 2012, Microsoft SQL Server2008 R2 or Microsoft SQL Server 2012 StreamCentral Database and data marts Option 1: Windows Server 2012, Microsoft SQL Server2008 R2 or Microsoft SQL Server 2012 or SQL Server Parallel Data Warehouse Option 2 Linux, HP Vertica StreamCentral Analytics Windows Server 2012, Tableau Software StreamCentral Physical Architecture and Software Requirements
  • 40. http://www.virtus-it.com 40 1 server for StreamCentral Components: Collector, Stream Processing Engine, Correlation Engine, Data Engine Characteristics of this server : Processor dependent therefore the higher the number of cores the better, medium cache and low disk storage Software: Windows Server 2012, .Net Framework 4.5, MSMQ 1 server for cache Hardware characteristics: : Cache dependent therefore more memory the better. Medium CPU and low disk storage Software : Windows Server 2012, .Net Framework 4.5, AppFabric 1 server for StreamCentral Meta Data Database, data mart storage and reporting Hardware Characteristics:: High CPU, High Memory and High Storage Software : Windows Server 2012, SQL Server 1 server for StreamCentral Meta Data Database and reporting Hardware Characteristics:: Medium CPU, Medium Memory and High Storage Software : Windows Server 2012, SQL Server OR 1 server for StreamCentral data marts Hardware Characteristics:: High CPU, High Memory and High Storage Software : Linux, HP Vertica + StreamCentral suggested minimum system configuration
  • 41. http://www.virtus-it.com How does StreamCentral fit within your enterprise technology architecture? 41
  • 42. http://www.virtus-it.com Data Sources Method of Access StreamCentral - Read data from Application Application - Read data by subscribing to StreamCentral Real-Time Event API Application - Read data by querying StreamCentral database Enterprise Applications X real-time X Using Web Service or REST API X real-time Using database query X Enterprise Service Bus X real-time X via Web Service or REST API X real-time via subscribing to messages X real-time Enterprise Data Warehouse via database query X X Point databases via database query X X LDAP via database query X External Data Sources via Web Service or REST API X real-time 42
  • 43. http://www.virtus-it.com Sensors Weather Devices Traffic Custom ApplicationsMainframe Business Services Enterprise Service Bus - Messaging / Mediation / Orchestration / Security Business Process Business Process Business Process Composite Application Composite Application Composite Application Auto-build Database Schema Analysis Collections – Data marts as denormalized flat tables Event Collections – Data Marts for every event along with its context as denormalized flat tables StreamCentral Engine StreamCentral Workbench Collate Raw Data (Push/Pull) Standardize Data Define Business Rules Define Correlation Define events Define analytical data marts auto built by StreamCentral Historical Analysis Real-time event data published to operational applications and dashboards Massively Parallel Processing Systems - Vertica Columnar databases with Bit Map indexes – ParStream RDBMS – MS SQL Server StreamCentral as part of an Enterprise Service Bus architecture API ERP Push / Pull 43
  • 44. http://www.virtus-it.com Sensors Weather Devices Traffic ERP Custom ApplicationsMainframe Business Services Enterprise Service Bus - Messaging / Mediation / Orchestration / Security Business Process Business Process Business Process Composite Application Composite Application Composite Application Auto-build Database Schema Analysis Collections – Data marts as denormalized flat tables Event Collections – Data Marts for every event along with its context as denormalized flat tables StreamCentral Engine StreamCentral Workbench Collate Raw Data (Push/Pull) Standardize Data Define Business Rules Define Correlation Define events Define analytical data marts auto built by StreamCentral Historical AnalysisEnterprise Business Intelligence System Massively Parallel Processing Systems - Vertica Columnar databases with Bit Map indexes – ParStream RDBMS – MS SQL Server StreamCentral and Enterprise BI as part of an Enterprise Service Bus architecture Real-time event data published to operational applications and dashboardsAPI Push Pull Push / Pull 44
  • 45. http://www.virtus-it.com Thank you for your time Contact us for a demonstration Stephen Wells CEO - Virtus IT Ltd E: stephen.wells@virtus-it.com M: +44 77 111 30879 Raheel Retiwalla CTO - Virtus IT Ltd E: raheel.retiwalla@virtus-it.com M: +1 617 901 8370 A trusted partner 45