SlideShare une entreprise Scribd logo
1  sur  54
Télécharger pour lire hors ligne
Searching for dark matter in CERN's
Large Hadron Collider dataset
Lavanya Shukla
What's
on the
menu
today?
The Standard Model
Sneak Peak into the Large
Hadron Collider
Machine Learning Challenges
Searching for Dark Matter
Particle Identification
How Can You Get Involved
you can play
with datasets
from CERN's
LHC experiments
http://opendata.cern.ch
Particle Physics and the
Standard Model
All matter is made up of particles.
These particles interact with each other
by exchanging other particles
associated with the fundamental forces.
The Standard Model describes what the
universe is made up of and how it holds
together.
The Standard Model
Pretty much everything we see in the
universe is made of up quarks, down
quarks and electrons.
All the other things exist only for a short
amount of time before decaying into other
particles.
The Standard Model
The 12 fermions are the building blocks of
matter.
Quarks group of 6 particles, organize
together to make protons (2 up, 1 down)
and neutrons (1 up, 2 down).
Dependent particles, have partial
charge so are never seen alone in the
wild.
Need to combine with other quarks
using strong nuclear force to form
particles, most commonly in the nucleus.
Fermions
The 12 fermions are the building blocks of
matter.
Leptons group of 6 particles, with
electrons being the best known.
Independent, have whole units of
charge.
Not seen in the nucleus.
Fermions
Err..
Fermions interact with each other through
fundamental forces. The 5 bosons are the
building blocks of fundamental forces.
Strong force (very short range, typically
smaller than that of an atomic nucleus)
gluons – bind the quarks in protons
and neutrons
photons – carry electromagnetic
force in light
Role: Holds atomic nuclei and quarks
together.
Bosons
Fermions interact with each other through
fundamental forces. The 5 bosons are the
building blocks of fundamental forces.
Bosons
Weak force (very long range, but million
times weaker than strong force)
Carried by Z and W bosons
Higgs boson
Role: responsible for radioactive decay
Behind the curtains of the
Large Hadron Collider
16 mile long tunnel
accelerates protons to almost the
speed of light (0.9999999c) and
smashes them into each other
4 big detectors
ALICE, ATLAS, CMS and LHCb
detect particles resulting from
proton-proton bunch collisions
40 million
bunch collisions per second
10 petabytes
of data per year
The detector measures the energy and momentum of every particle
flying out of the collision event and identify the types of those
particles by calculating their mass.
The Standard Model describes what the
universe is made up of and how it holds
together.
The Standard Model
Every particle leaves different traces in
different sub-detectors of the experiment.
Simulating the
LHCb's output
http://clangenb.web.cern.ch/clangenb/
It's time for some
Machine Learning!
Anomaly detection (data quality and
infrastructure monitoring)
Detector design optimization (using
bayesian optimization, surrogate
modeling etc.)
Precise and fast particle tracking
(single tracks, shower, jets etc.)
Fast and accurate data processing, and
design of triggers
Particle Identification
Some Machine Learning
Challenges in Particle Physics
Searching for
dark matter
If we add up all the stars, planets,
galaxies, comets, black holes, dark
clouds - everything out there and the
gravity doesn't add up.
There is something holding the universe
and our galaxy together, without which
our galaxy would fall apart
Dark 'matter' is a misnomer
85% of the gravity in the
universe has an unknown
source
Hi! I'm Fred!
There is something.
It interacts with gravity.
Mass and gravity go together
so it has mass, maybe?
Probably is invisible
has no interaction with light or
electromagnetic force, doesn't emit
or reflect light
There is a lot of it.
So.. Not much really.
What do we know for sure?
I like playing
hard to get
WIMP = weakly interacting massive
particle
heavy like a neutrino, has 100 - 1000
times the mass of a proton
a particle interacts with other matter
like the neutrino
weak interaction = force between
subatomic particles responsible for
radioactive decay
WIMP theory of Dark Matter
Neither absorb nor emit light
Don't interact strongly with other particles.
But when they encounter each other, they
annihilate and make gamma rays, which we can
detect.
THE CHALLENGE
WE DON'T KNOW WHAT WE'RE
LOOKING FOR
There are plenty of theories that try to
predict presence of dark matter, but we
have no inkling of the nature of effect we
should be looking for.
LACK OF CLEAN, OBSERVABLE
DATA
The direct and indirect targets that might
predict dark matter’s presence are laced
with noise and background phenomenon
that might lead to misleading results.
We'll use ML to tackle
the problem of signal
extraction from
background noise
Eliminate background noise
from base tracks
1. 2. Cluster tracks into neutrino
and dark matter interactions.
The scale of the problem is massive.
10,000 out of 10 million basetracks are results of electromagnetic showers, rest are noise.
Feynman diagram of a dark matter particle X
scattering an electron of lead nuclei.
Neutrinos produce similar
showers
The problem is that when a neutrino
interacts with a nucleus, it also produces
an electron that get boosted and similar
showers are produced.
One of the key distinguishing qualities
between them is the energy-angle
correlation.
We apply clustering to distinguish the two
electromagnetic showers.
The Dataset
The OPERA dataset has the
following features
15 million base tracks
Each base track contains:
X, Y, Z co-ordinates
angles from origin (TX, TY)
Signal
Background consists of base tracks scattered
randomly in space, and has a label=0
Signals consist of base tracks forming a cone shape,
has a label=1
In addition we compute the
following features
distance from the origin (dX, dY, dZ, dTX, dTY)
alpha, the angle between directions
d, the projection of base track1
Signal has a defined geometric structure
Noise is largely random vectors
Eliminate background
noise from base tracks
1.
Feature Engineering: Add
particle track info to dataset
Detect particle track
Signal base tracks are part of a larger vector of movement of
particles.
This particle moves across many layers in a straight line path.
Detect parent in case of decay
Look for other children particles in the same layer.
Train a model: to classify
signal from noise
Define Neural Network
Train Neural Network
Make Predictions
Train classifier: Neural
Network to classify signal
from noise
Output of classifer:
Probability of each id being
a signal (higher is better)
Define and Train XGBoost Train a model: to classify
signal from noise
Eliminate background
noise from base tracks
1.
Eliminate background
noise from base tracks
1.
2. Cluster tracks into
electromagnetic showers
that are good candidates for dark matter interactions.
Clustering: 2 Models
DBSCAN K-Means Our dataset
DBSCAN relies on euclidean distance, which won’t work for our case because we
care about relative angles between base tracks, in addition of relative positions.
Clustering: K-Means
Find the optimal number of clusters
The inertia's rate of decline flattens around k=6
clusters. So we'll train K-Means with 6 clusters.
Clustering: K-Means
Train K-Means with 6 clusters
Visualize clusters
Excellent! From all that background noise,
we’ve extracted the signal base tracks
that are the best candidates for dark
matter particle interactions! w00t!
2. Cluster tracks into
electromagnetic showers
that are good candidates for dark matter interactions.
2. Cluster tracks into
electromagnetic showers
that are good candidates for dark matter interactions.
Looking ahead to SHiP
Search for Hidden Particles
(SHiP)
Expected to launch in 2025
Designed more specifically to search for dark matter
Would employ the same techniques we used above to
reconstruct dark matter particle trajectories from data
Hopefully, the new data will allow us to determine with
greater confidence and accuracy which of these signal
clusters are a result of dark matter interactions!
When SHiP starts up, I’ll hope you'll use some of the
techniques we've used today and join me in exploring
the dataset!
Particle Identification
The Goal
Identify the type of a particle associated
with a track using responses from different
detector systems.
There are five particle types: Electron,
Proton, Kaon, Pion and Muon.
Therefore Particle Identification is a multi-
class classification Problem
The Problem
The inputs to the classifier are the particle
track responses from the 5 sub-detector
systems:
Tracking system
Ring Imaging Cherenkov detector
Electromagnetic calorimeter
Hadron calorimeter
Muon Chambers
The outputs of the classifier are six labels,
five of them correspond to five different
particle types and Ghost is a catch-all for
noisy tracks and other particle types.
Getting a little in the weeds
Each particle has a certain energy,
momentum and mass.
Energy and momentum are determined by
the speed of the particle, and mass by its
type.
From the laws of conservations of energy
and momentum, we know that when a
particle decays:
the energy and momentum of the mother
particle = the sum of energies and
momentums of the daughter particles.
Getting a little in the weeds
The sub-detectors estimate the daughter
particle’s trajectory, momentum, energy
and type.
From this we can reconstruct mother
particle’s parameters (e.g. the mass) and
therefor detect the particle.
Training a neural net
and a boosting model
The Dataset
1.2 million data-points, each
representing a particle track.
50 features
measurements from the sub-
detectors
derived features from these
measurements
AdaBoost Classifier
Neural Net Classifier
Neural Net Classifier
Here’s a plot of the ROC curves for all
particle classifiers.
AdaBoost model performs slightly better than the neural network in this case.
AdaBoost Neural Network
Now it's your turn!http://kaggle.com
Thank you!
lavanyaai
lavanya.ai
Twitter.com/
http://

Contenu connexe

Tendances

The grid extra dimensions theory
The grid extra dimensions theoryThe grid extra dimensions theory
The grid extra dimensions theoryEran Sinbar
 
23 3 Quantum Mechanics
23 3 Quantum Mechanics23 3 Quantum Mechanics
23 3 Quantum Mechanicsfysteach
 
Chapter 4 atomic structure
Chapter 4 atomic structureChapter 4 atomic structure
Chapter 4 atomic structureMahbub Alwathoni
 
Mathematical tour of em field
Mathematical tour of em fieldMathematical tour of em field
Mathematical tour of em fieldGlen Alleman
 
Physical world
Physical worldPhysical world
Physical worldVIDYAGAUDE
 
Final presentation on antimatter
Final presentation on antimatterFinal presentation on antimatter
Final presentation on antimatterArpit Saxena
 
Basic and fundamental of quantum mechanics (Theory)
Basic and fundamental of quantum mechanics (Theory)Basic and fundamental of quantum mechanics (Theory)
Basic and fundamental of quantum mechanics (Theory)Halavath Ramesh
 
The wave-particle duality and the double slit experiment
The wave-particle duality and the double slit experimentThe wave-particle duality and the double slit experiment
The wave-particle duality and the double slit experimentSatyavan65
 
Chapter 4 atomic structure
Chapter 4 atomic structureChapter 4 atomic structure
Chapter 4 atomic structureAshwani Kumar
 
Animatter Explained
Animatter ExplainedAnimatter Explained
Animatter Explainedcospic
 
Lecture on classical and quantum information
Lecture on classical and quantum informationLecture on classical and quantum information
Lecture on classical and quantum informationKrzysztof Pomorski
 

Tendances (20)

The grid extra dimensions theory
The grid extra dimensions theoryThe grid extra dimensions theory
The grid extra dimensions theory
 
The duality of the light
The duality of the lightThe duality of the light
The duality of the light
 
Ieee lecture
Ieee lectureIeee lecture
Ieee lecture
 
23 3 Quantum Mechanics
23 3 Quantum Mechanics23 3 Quantum Mechanics
23 3 Quantum Mechanics
 
Quantum physics
Quantum physicsQuantum physics
Quantum physics
 
Chapter 4 atomic structure
Chapter 4 atomic structureChapter 4 atomic structure
Chapter 4 atomic structure
 
Particle physics 2010
Particle physics 2010Particle physics 2010
Particle physics 2010
 
Antimatter ppt
Antimatter pptAntimatter ppt
Antimatter ppt
 
Mathematical tour of em field
Mathematical tour of em fieldMathematical tour of em field
Mathematical tour of em field
 
Physical world
Physical worldPhysical world
Physical world
 
Final presentation on antimatter
Final presentation on antimatterFinal presentation on antimatter
Final presentation on antimatter
 
1538545
15385451538545
1538545
 
Quantum mechanics
Quantum mechanicsQuantum mechanics
Quantum mechanics
 
Matter and antimatter
Matter and antimatterMatter and antimatter
Matter and antimatter
 
Basic and fundamental of quantum mechanics (Theory)
Basic and fundamental of quantum mechanics (Theory)Basic and fundamental of quantum mechanics (Theory)
Basic and fundamental of quantum mechanics (Theory)
 
Wave Properties of Particles
Wave Properties of ParticlesWave Properties of Particles
Wave Properties of Particles
 
The wave-particle duality and the double slit experiment
The wave-particle duality and the double slit experimentThe wave-particle duality and the double slit experiment
The wave-particle duality and the double slit experiment
 
Chapter 4 atomic structure
Chapter 4 atomic structureChapter 4 atomic structure
Chapter 4 atomic structure
 
Animatter Explained
Animatter ExplainedAnimatter Explained
Animatter Explained
 
Lecture on classical and quantum information
Lecture on classical and quantum informationLecture on classical and quantum information
Lecture on classical and quantum information
 

Similaire à Searching for dark matter in CERN's Large Hadron Collider dataset

High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...
High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...
High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...Chad Orzel
 
Quantum-phenomena.ppt
Quantum-phenomena.pptQuantum-phenomena.ppt
Quantum-phenomena.pptPritamjit3
 
X-ray Crystallography & Its Applications in Proteomics
X-ray Crystallography & Its Applications in Proteomics X-ray Crystallography & Its Applications in Proteomics
X-ray Crystallography & Its Applications in Proteomics Akash Arora
 
Dark Matter: Hints and Signals from Astrophysics
Dark Matter: Hints and Signals from AstrophysicsDark Matter: Hints and Signals from Astrophysics
Dark Matter: Hints and Signals from Astrophysicsveenuadi
 
Matter antimatter - an accentuation-attrition model
Matter antimatter - an accentuation-attrition modelMatter antimatter - an accentuation-attrition model
Matter antimatter - an accentuation-attrition modelAlexander Decker
 
An article written by Prashant Pokhrel.pdf
An article written by Prashant Pokhrel.pdfAn article written by Prashant Pokhrel.pdf
An article written by Prashant Pokhrel.pdfPrashant Pokhrel
 
Radiation Interaction.pdf
Radiation Interaction.pdfRadiation Interaction.pdf
Radiation Interaction.pdfZahra742286
 
Quantum Theory
Quantum TheoryQuantum Theory
Quantum Theorylallen
 
The Standard Model and the LHC in the Higgs Boson Era
The Standard Model and the LHC in the Higgs Boson EraThe Standard Model and the LHC in the Higgs Boson Era
The Standard Model and the LHC in the Higgs Boson Erajuanrojochacon
 
Chapter 4 and 5
Chapter 4 and 5Chapter 4 and 5
Chapter 4 and 5smithdk
 
Particle physics article
Particle physics articleParticle physics article
Particle physics articleDevakumar Jain
 
Chapter2_Nature_of_light-1.ppt
Chapter2_Nature_of_light-1.pptChapter2_Nature_of_light-1.ppt
Chapter2_Nature_of_light-1.pptTwagitayezuagy
 

Similaire à Searching for dark matter in CERN's Large Hadron Collider dataset (20)

High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...
High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...
High Precision, Not High Energy: Using Atomic Physics to Look Beyond the Stan...
 
Quantum-phenomena.ppt
Quantum-phenomena.pptQuantum-phenomena.ppt
Quantum-phenomena.ppt
 
Quantum-phenomena.ppt
Quantum-phenomena.pptQuantum-phenomena.ppt
Quantum-phenomena.ppt
 
X-ray Crystallography & Its Applications in Proteomics
X-ray Crystallography & Its Applications in Proteomics X-ray Crystallography & Its Applications in Proteomics
X-ray Crystallography & Its Applications in Proteomics
 
Dark Matter: Hints and Signals from Astrophysics
Dark Matter: Hints and Signals from AstrophysicsDark Matter: Hints and Signals from Astrophysics
Dark Matter: Hints and Signals from Astrophysics
 
X ray
X ray X ray
X ray
 
Matter antimatter - an accentuation-attrition model
Matter antimatter - an accentuation-attrition modelMatter antimatter - an accentuation-attrition model
Matter antimatter - an accentuation-attrition model
 
An article written by Prashant Pokhrel.pdf
An article written by Prashant Pokhrel.pdfAn article written by Prashant Pokhrel.pdf
An article written by Prashant Pokhrel.pdf
 
Radiation Interaction.pdf
Radiation Interaction.pdfRadiation Interaction.pdf
Radiation Interaction.pdf
 
Quantum Theory
Quantum TheoryQuantum Theory
Quantum Theory
 
The Standard Model and the LHC in the Higgs Boson Era
The Standard Model and the LHC in the Higgs Boson EraThe Standard Model and the LHC in the Higgs Boson Era
The Standard Model and the LHC in the Higgs Boson Era
 
Synchrotron absas
Synchrotron absasSynchrotron absas
Synchrotron absas
 
Chapter 4 and 5
Chapter 4 and 5Chapter 4 and 5
Chapter 4 and 5
 
Lister_report
Lister_reportLister_report
Lister_report
 
CHAPTER 3 The Experimental Basis of Quantum Theory
CHAPTER 3The Experimental Basis of Quantum TheoryCHAPTER 3The Experimental Basis of Quantum Theory
CHAPTER 3 The Experimental Basis of Quantum Theory
 
Particle physics article
Particle physics articleParticle physics article
Particle physics article
 
Chapter2_Nature_of_light-1.ppt
Chapter2_Nature_of_light-1.pptChapter2_Nature_of_light-1.ppt
Chapter2_Nature_of_light-1.ppt
 
C H6
C H6C H6
C H6
 
Cern
CernCern
Cern
 
ATOMS
ATOMSATOMS
ATOMS
 

Dernier

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Dernier (20)

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Searching for dark matter in CERN's Large Hadron Collider dataset

  • 1. Searching for dark matter in CERN's Large Hadron Collider dataset Lavanya Shukla
  • 2. What's on the menu today? The Standard Model Sneak Peak into the Large Hadron Collider Machine Learning Challenges Searching for Dark Matter Particle Identification How Can You Get Involved
  • 3. you can play with datasets from CERN's LHC experiments http://opendata.cern.ch
  • 4. Particle Physics and the Standard Model
  • 5. All matter is made up of particles. These particles interact with each other by exchanging other particles associated with the fundamental forces. The Standard Model describes what the universe is made up of and how it holds together. The Standard Model
  • 6. Pretty much everything we see in the universe is made of up quarks, down quarks and electrons. All the other things exist only for a short amount of time before decaying into other particles. The Standard Model
  • 7. The 12 fermions are the building blocks of matter. Quarks group of 6 particles, organize together to make protons (2 up, 1 down) and neutrons (1 up, 2 down). Dependent particles, have partial charge so are never seen alone in the wild. Need to combine with other quarks using strong nuclear force to form particles, most commonly in the nucleus. Fermions
  • 8. The 12 fermions are the building blocks of matter. Leptons group of 6 particles, with electrons being the best known. Independent, have whole units of charge. Not seen in the nucleus. Fermions Err..
  • 9. Fermions interact with each other through fundamental forces. The 5 bosons are the building blocks of fundamental forces. Strong force (very short range, typically smaller than that of an atomic nucleus) gluons – bind the quarks in protons and neutrons photons – carry electromagnetic force in light Role: Holds atomic nuclei and quarks together. Bosons
  • 10. Fermions interact with each other through fundamental forces. The 5 bosons are the building blocks of fundamental forces. Bosons Weak force (very long range, but million times weaker than strong force) Carried by Z and W bosons Higgs boson Role: responsible for radioactive decay
  • 11. Behind the curtains of the Large Hadron Collider
  • 12. 16 mile long tunnel accelerates protons to almost the speed of light (0.9999999c) and smashes them into each other 4 big detectors ALICE, ATLAS, CMS and LHCb detect particles resulting from proton-proton bunch collisions 40 million bunch collisions per second 10 petabytes of data per year The detector measures the energy and momentum of every particle flying out of the collision event and identify the types of those particles by calculating their mass.
  • 13.
  • 14. The Standard Model describes what the universe is made up of and how it holds together. The Standard Model Every particle leaves different traces in different sub-detectors of the experiment.
  • 16. It's time for some Machine Learning!
  • 17. Anomaly detection (data quality and infrastructure monitoring) Detector design optimization (using bayesian optimization, surrogate modeling etc.) Precise and fast particle tracking (single tracks, shower, jets etc.) Fast and accurate data processing, and design of triggers Particle Identification Some Machine Learning Challenges in Particle Physics
  • 19. If we add up all the stars, planets, galaxies, comets, black holes, dark clouds - everything out there and the gravity doesn't add up. There is something holding the universe and our galaxy together, without which our galaxy would fall apart Dark 'matter' is a misnomer 85% of the gravity in the universe has an unknown source Hi! I'm Fred!
  • 20. There is something. It interacts with gravity. Mass and gravity go together so it has mass, maybe? Probably is invisible has no interaction with light or electromagnetic force, doesn't emit or reflect light There is a lot of it. So.. Not much really. What do we know for sure? I like playing hard to get
  • 21. WIMP = weakly interacting massive particle heavy like a neutrino, has 100 - 1000 times the mass of a proton a particle interacts with other matter like the neutrino weak interaction = force between subatomic particles responsible for radioactive decay WIMP theory of Dark Matter Neither absorb nor emit light Don't interact strongly with other particles. But when they encounter each other, they annihilate and make gamma rays, which we can detect.
  • 22. THE CHALLENGE WE DON'T KNOW WHAT WE'RE LOOKING FOR There are plenty of theories that try to predict presence of dark matter, but we have no inkling of the nature of effect we should be looking for. LACK OF CLEAN, OBSERVABLE DATA The direct and indirect targets that might predict dark matter’s presence are laced with noise and background phenomenon that might lead to misleading results.
  • 23. We'll use ML to tackle the problem of signal extraction from background noise
  • 24. Eliminate background noise from base tracks 1. 2. Cluster tracks into neutrino and dark matter interactions. The scale of the problem is massive. 10,000 out of 10 million basetracks are results of electromagnetic showers, rest are noise.
  • 25. Feynman diagram of a dark matter particle X scattering an electron of lead nuclei. Neutrinos produce similar showers The problem is that when a neutrino interacts with a nucleus, it also produces an electron that get boosted and similar showers are produced. One of the key distinguishing qualities between them is the energy-angle correlation. We apply clustering to distinguish the two electromagnetic showers.
  • 27. The OPERA dataset has the following features 15 million base tracks Each base track contains: X, Y, Z co-ordinates angles from origin (TX, TY) Signal Background consists of base tracks scattered randomly in space, and has a label=0 Signals consist of base tracks forming a cone shape, has a label=1
  • 28. In addition we compute the following features distance from the origin (dX, dY, dZ, dTX, dTY) alpha, the angle between directions d, the projection of base track1 Signal has a defined geometric structure Noise is largely random vectors
  • 30. Feature Engineering: Add particle track info to dataset Detect particle track Signal base tracks are part of a larger vector of movement of particles. This particle moves across many layers in a straight line path. Detect parent in case of decay Look for other children particles in the same layer.
  • 31. Train a model: to classify signal from noise Define Neural Network Train Neural Network Make Predictions
  • 32. Train classifier: Neural Network to classify signal from noise Output of classifer: Probability of each id being a signal (higher is better) Define and Train XGBoost Train a model: to classify signal from noise
  • 35. 2. Cluster tracks into electromagnetic showers that are good candidates for dark matter interactions.
  • 36. Clustering: 2 Models DBSCAN K-Means Our dataset DBSCAN relies on euclidean distance, which won’t work for our case because we care about relative angles between base tracks, in addition of relative positions.
  • 37. Clustering: K-Means Find the optimal number of clusters The inertia's rate of decline flattens around k=6 clusters. So we'll train K-Means with 6 clusters.
  • 38. Clustering: K-Means Train K-Means with 6 clusters Visualize clusters Excellent! From all that background noise, we’ve extracted the signal base tracks that are the best candidates for dark matter particle interactions! w00t!
  • 39. 2. Cluster tracks into electromagnetic showers that are good candidates for dark matter interactions.
  • 40. 2. Cluster tracks into electromagnetic showers that are good candidates for dark matter interactions.
  • 42. Search for Hidden Particles (SHiP) Expected to launch in 2025 Designed more specifically to search for dark matter Would employ the same techniques we used above to reconstruct dark matter particle trajectories from data Hopefully, the new data will allow us to determine with greater confidence and accuracy which of these signal clusters are a result of dark matter interactions! When SHiP starts up, I’ll hope you'll use some of the techniques we've used today and join me in exploring the dataset!
  • 44. The Goal Identify the type of a particle associated with a track using responses from different detector systems. There are five particle types: Electron, Proton, Kaon, Pion and Muon. Therefore Particle Identification is a multi- class classification Problem
  • 45. The Problem The inputs to the classifier are the particle track responses from the 5 sub-detector systems: Tracking system Ring Imaging Cherenkov detector Electromagnetic calorimeter Hadron calorimeter Muon Chambers The outputs of the classifier are six labels, five of them correspond to five different particle types and Ghost is a catch-all for noisy tracks and other particle types.
  • 46. Getting a little in the weeds Each particle has a certain energy, momentum and mass. Energy and momentum are determined by the speed of the particle, and mass by its type. From the laws of conservations of energy and momentum, we know that when a particle decays: the energy and momentum of the mother particle = the sum of energies and momentums of the daughter particles.
  • 47. Getting a little in the weeds The sub-detectors estimate the daughter particle’s trajectory, momentum, energy and type. From this we can reconstruct mother particle’s parameters (e.g. the mass) and therefor detect the particle.
  • 48. Training a neural net and a boosting model
  • 49. The Dataset 1.2 million data-points, each representing a particle track. 50 features measurements from the sub- detectors derived features from these measurements
  • 52. Neural Net Classifier Here’s a plot of the ROC curves for all particle classifiers. AdaBoost model performs slightly better than the neural network in this case. AdaBoost Neural Network
  • 53. Now it's your turn!http://kaggle.com