Have you ever tried to debug a production outage in a system made up of apps your team has written and third-party apps your team runs, with logs going into one system, application performance metrics into another, and cloud platform metrics somewhere else? Did you find yourself switching tabs, trying to correlate metrics with logs and alerts, and ending up in a huge tangle? It is a nightmare. In the data world, we talk about aggregating all our data so we can derive new insights quickly, but what about our operational data? Observability is your ability to ask questions of your system without having to write new code or grab new data. When you've got an observable system, it feels like you have debugging superpowers, but it can be challenging to even know where to start. And even if you can convince your colleagues to start, finding the right tools can be hard. In this talk, Inny and Andrew will cover why monitoring and logging are not sufficient anymore (if they ever were) and the basics of observability, and will demo an observability platform that you can use to start your observability journey today.
3. A system is observable if the behaviour of the entire system can be determined by only looking at its inputs and outputs. (Kalman, 1961, General theory of control systems)
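As background to the definition above, here is the standard textbook formalization of observability for a linear time-invariant system. It is not taken from the slides; the matrices and the two-state example are illustrative assumptions chosen only to show what "determined by only looking at its inputs and outputs" means in the control-theory sense.

```latex
% Linear time-invariant system with state x in R^n, input u, output y:
\dot{x} = A x + B u, \qquad y = C x + D u

% Rank condition: the system is observable if and only if
\operatorname{rank}\,\mathcal{O} = n,
\qquad
\mathcal{O} =
\begin{bmatrix} C \\ CA \\ \vdots \\ CA^{n-1} \end{bmatrix}

% Illustrative example (assumed): position/velocity state, n = 2
A = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}

% Measuring position only: observable
C = \begin{bmatrix} 1 & 0 \end{bmatrix}
\;\Rightarrow\;
\mathcal{O} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix},
\quad \operatorname{rank}\,\mathcal{O} = 2

% Measuring velocity only: not observable (position cannot be recovered)
C = \begin{bmatrix} 0 & 1 \end{bmatrix}
\;\Rightarrow\;
\mathcal{O} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix},
\quad \operatorname{rank}\,\mathcal{O} = 1
```

The second case is the analogy to watch for in operations: the outputs you collect may simply not contain enough information to reconstruct what the system was doing internally.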
20. [Diagram labels: k8s, The host, The app, RDS, The db, Logs, ms, md, Cloudwatch, New Relic, Pagerduty, Prometheus, Sumologic]
58. [Diagram labels: k8s, The host, The app, RDS, The db, Logs, ms, md, Cloudwatch, New Relic, Pagerduty, Prometheus, Sumologic]
62. [Diagram labels: k8s, The host, The app, RDS, The db, Logs, ms, md, Kafka, Events]
63. [Diagram labels: k8s, The host, The app, RDS, The db, Logs, ms, md, Kafka, Events, Datalake, BigQuery]
64. [Diagram labels: k8s, The host, The app, RDS, The db, Logs, ms, md, Kafka, Events, Datalake, BigQuery, Consumer]
65. [Diagram labels: k8s, The host, The app, RDS, The db, Logs, ms, md, Kafka, Events, Datalake, BigQuery, Consumer, Pagerduty]