SlideShare une entreprise Scribd logo
1  sur  41
Using Cheminformatics Approaches to
Develop a Structure Searchable
Database of Analytical Methods
Antony Williams, Greg Janesch, Sakuntala
Sivasupramaniam, Brian Meyer and Erik Carr
Center for Computational Toxicology & Exposure, U.S. Environmental Protection Agency
http://www.orcid.org/0000-0002-2668-4821
May 2023: American Oil Chemists Society meeting, Denver, CO.
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Disclaimer
• The views expressed in this presentation are those of the
author and do not necessarily reflect the views or policies of
the U.S. EPA
• This presentation is on a proof-of-concept tool in development
– NOT yet publicly available
1
Building a Methods Database
• Simple Vision: I want to find the best method(s) associated with a
chemical and/or class of chemicals
• Answer the question “I cannot find a method for my chemical” - HELP
• The Approach:
– Aggregate MS method documents (and adjust the definition of “what is a useful method”)
– Extract chemistry (mostly CASRN and Names)
– Map CASRN and Names to structures
– Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys
and ultimately structure
2
Most people start with Google
• People have many places to search for methods, and
there is no one integration hub, except for search engines
• Search engines can return so many hits – then you filter
by analytes, matrix, analytical methodology, so many
synonyms and abbreviations for so many chemicals
3
Synonyms, Abbreviations and Chemicals
4
• CAS Numbers, Names
and Abbreviations can
limit what’s possible…
Might this be a better view?
5
Might this be a better view?
6
When methods are mapped to chemistry…
• The advantages of mapping chemicals directly to methods
– When chemicals are mapped it opens access to many other tools
– Chemical structures allow for QSAR modeling
7
…and what if we could then profile toxicity?
8
…and what if we could then profile toxicity?
9
…or simply harvest data from the
CompTox Chemicals Dashboard
10
What data would you like???
11
Introducing AMOS
Analytical Methods and Spectra Database
• Three types of data in the database:
– Methods (regulatory, lab manuals and SOPs, publications, tech notes)
– Spectra (from public domain and our own laboratories)
– Monographs (harvested from SWGDRUG and other sites)
• Some methods have associated spectra
• Some data are just externally linked
• Currently contains around 135,000 spectra, 600,000
external links, 650 monographs, and ~2000 methods
• ALL data are growing in number
12
Where are there methods?
• Agency-based methods
– EPA
– USGS
– USDA
– CDC
– FDA
• Vendor application notes – Thermo, Waters, Agilent, Sciex,
Shimadzu, LECO, ….
• Peer-reviewed articles
• Laboratory Documents – lab manuals, SOPs
13
A view of the methods list
14
Filtering the list for interests…
• Look for pesticides studied in water, by GC/MS, after 1990
15
Where are there methods?
• 900 method documents from the EPA harvested
16
Many Scanned Documents!!!
17
• Methods generally developed by
the agrochemical companies
• Include parents plus degradation
products
• Lots of scanned, old, documents
but the historical records are still
of significant use
• Electronic document forms of old
documents still of benefit
Embedding the old Method PDFs
18
Embedding New Method PDFs
19
When Methods are OPEN Access
20
When Methods are PubMed OPEN Access
21
Proprietary Methods for INTERNAL Access
22
If there is no method for your chemical
• Use “Chemical Similarity Searching” so that you can find
chemicals that are similar in structure space
• Use the “Tanimoto Similarity Search
23
Searching for a chemical – CASRN, Name
Direct structure searching coming
24
When Methods are Not Enough
• EPA is highly active in the field of non-targeted analysis
• We have been applying lots of cheminformatics approaches
25
Building a spectrum library to search against
26
Linking to actual spectra
27
Linking to actual spectra
28
There are errors EVERYWHERE: 110-75-8
It can be challenging
9/365 chemicals…
There are MANY errors in public
spectral databases
31
There are MANY errors in public
spectral databases
32
There are MANY errors in public
spectral databases
33
There are MANY errors in public
spectral databases
34
• Chemical structure representations would ideally
be standardized…consider tautomeric forms
• Not all substances are explicit and can be
ambiguous representations
My first time at AOCS…what did I learn?
• Some of your chemicals of interest: TAGs, PAHs, Toxins
of different types
• Methods for PAHs, Aflatoxins and Microsystins, and tri-
acyl glycerols all extracted and added to database
35
My first time at AOCS…what did I learn?
• Index Mark Collison’s methods 
36
My first time at AOCS…what did I learn?
37
General Comments
• It is possible we can extract chemicals from AOCS methods
and structure enable the method, WITHOUT making the
method itself publicly accessible – like ASTM and ISO
• Our focus right now is on mass spectrometry methods but
can support many other methods
38
Conclusions
• We have built (I think) the first chemical structure indexed
database of “methods”
• Methods are not just “approved methods” but also
standard operating procedures, application notes, lab
manuals, regulatory methods etc.
• Integrating methods to experimental spectral data will
serve our non-targeted analysis efforts
39
If you want to help…
• Send information regarding analytical methods and method
articles to williams.antony@epa.gov
• If anyone is interested in a live demo I can do one in the break
40

Contenu connexe

Tendances

AI in Clinical Trials: From Big Sky to Practical Application
AI in Clinical Trials: From Big Sky to Practical ApplicationAI in Clinical Trials: From Big Sky to Practical Application
AI in Clinical Trials: From Big Sky to Practical ApplicationVeeva Systems
 
Ethical Concerns Regarding Research on Human Subjects
Ethical Concerns Regarding Research on Human SubjectsEthical Concerns Regarding Research on Human Subjects
Ethical Concerns Regarding Research on Human SubjectsClinosolIndia
 
Improve Clinical Trial Processes With a Unified Platform
Improve Clinical Trial Processes With a Unified PlatformImprove Clinical Trial Processes With a Unified Platform
Improve Clinical Trial Processes With a Unified PlatformVeeva Systems
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management PlanKristin Briney
 
Information classification
Information classificationInformation classification
Information classificationJyothsna Sridhar
 
CLINICAL TRIAL PROJECT MANAGEMENT
CLINICAL TRIAL PROJECT MANAGEMENTCLINICAL TRIAL PROJECT MANAGEMENT
CLINICAL TRIAL PROJECT MANAGEMENTRohit K.
 
Lecture 19 research ethics (2)
Lecture 19 research ethics (2)Lecture 19 research ethics (2)
Lecture 19 research ethics (2)Dr Ghaiath Hussein
 
CIOMS ethical guidelines for Biomedical Research. What is in for patients?
CIOMS ethical guidelines for Biomedical Research. What is in for patients?CIOMS ethical guidelines for Biomedical Research. What is in for patients?
CIOMS ethical guidelines for Biomedical Research. What is in for patients?Hibernia College
 
Indian gcp ppt cr by ann
Indian gcp ppt cr by annIndian gcp ppt cr by ann
Indian gcp ppt cr by annAnjali Rarichan
 
The FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDM
The FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDMThe FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDM
The FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDMwww.datatrak.com
 
Advance Information Management Through eTMF and CTMS Convergence
Advance Information Management Through eTMF and CTMS ConvergenceAdvance Information Management Through eTMF and CTMS Convergence
Advance Information Management Through eTMF and CTMS ConvergenceWingspan Technology
 
Preparing For Sponsor Audits Rpci 4 10
Preparing For Sponsor Audits Rpci 4 10Preparing For Sponsor Audits Rpci 4 10
Preparing For Sponsor Audits Rpci 4 10Jessica
 

Tendances (20)

AI in Clinical Trials: From Big Sky to Practical Application
AI in Clinical Trials: From Big Sky to Practical ApplicationAI in Clinical Trials: From Big Sky to Practical Application
AI in Clinical Trials: From Big Sky to Practical Application
 
Ethical Concerns Regarding Research on Human Subjects
Ethical Concerns Regarding Research on Human SubjectsEthical Concerns Regarding Research on Human Subjects
Ethical Concerns Regarding Research on Human Subjects
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Clinical Trials 101
Clinical Trials 101Clinical Trials 101
Clinical Trials 101
 
HIPAA Privacy & Security
HIPAA Privacy & SecurityHIPAA Privacy & Security
HIPAA Privacy & Security
 
Improve Clinical Trial Processes With a Unified Platform
Improve Clinical Trial Processes With a Unified PlatformImprove Clinical Trial Processes With a Unified Platform
Improve Clinical Trial Processes With a Unified Platform
 
Data science and_analytics_for_ordinary_people_ebook
Data science and_analytics_for_ordinary_people_ebookData science and_analytics_for_ordinary_people_ebook
Data science and_analytics_for_ordinary_people_ebook
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
Information classification
Information classificationInformation classification
Information classification
 
Introduction to Data Management and Sharing
Introduction to Data Management and SharingIntroduction to Data Management and Sharing
Introduction to Data Management and Sharing
 
CLINICAL TRIAL PROJECT MANAGEMENT
CLINICAL TRIAL PROJECT MANAGEMENTCLINICAL TRIAL PROJECT MANAGEMENT
CLINICAL TRIAL PROJECT MANAGEMENT
 
Apache Atlas: Governance for your Data
Apache Atlas: Governance for your DataApache Atlas: Governance for your Data
Apache Atlas: Governance for your Data
 
Lecture 19 research ethics (2)
Lecture 19 research ethics (2)Lecture 19 research ethics (2)
Lecture 19 research ethics (2)
 
CIOMS ethical guidelines for Biomedical Research. What is in for patients?
CIOMS ethical guidelines for Biomedical Research. What is in for patients?CIOMS ethical guidelines for Biomedical Research. What is in for patients?
CIOMS ethical guidelines for Biomedical Research. What is in for patients?
 
RISK BASED MONITORING
RISK BASED MONITORINGRISK BASED MONITORING
RISK BASED MONITORING
 
Indian gcp ppt cr by ann
Indian gcp ppt cr by annIndian gcp ppt cr by ann
Indian gcp ppt cr by ann
 
The FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDM
The FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDMThe FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDM
The FDA Guidance of a Risk-Based Approach to Monitoring as Viewed By CDM
 
Advance Information Management Through eTMF and CTMS Convergence
Advance Information Management Through eTMF and CTMS ConvergenceAdvance Information Management Through eTMF and CTMS Convergence
Advance Information Management Through eTMF and CTMS Convergence
 
Preparing For Sponsor Audits Rpci 4 10
Preparing For Sponsor Audits Rpci 4 10Preparing For Sponsor Audits Rpci 4 10
Preparing For Sponsor Audits Rpci 4 10
 
Gcp 2013
Gcp 2013Gcp 2013
Gcp 2013
 

Similaire à Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods

Similaire à Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods (20)

Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
 
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
 
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data DashboardsAccessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Applications of the US EPA’s CompTox chemicals dashboard to support structure...
Applications of the US EPA’s CompTox chemicals dashboard to support structure...Applications of the US EPA’s CompTox chemicals dashboard to support structure...
Applications of the US EPA’s CompTox chemicals dashboard to support structure...
 
Delivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applicationsDelivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applications
 

Dernier

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 

Dernier (20)

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 

Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods

  • 1. Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods Antony Williams, Greg Janesch, Sakuntala Sivasupramaniam, Brian Meyer and Erik Carr Center for Computational Toxicology & Exposure, U.S. Environmental Protection Agency http://www.orcid.org/0000-0002-2668-4821 May 2023: American Oil Chemists Society meeting, Denver, CO. The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
  • 2. Disclaimer • The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA • This presentation is on a proof-of-concept tool in development – NOT yet publicly available 1
  • 3. Building a Methods Database • Simple Vision: I want to find the best method(s) associated with a chemical and/or class of chemicals • Answer the question “I cannot find a method for my chemical” - HELP • The Approach: – Aggregate MS method documents (and adjust the definition of “what is a useful method”) – Extract chemistry (mostly CASRN and Names) – Map CASRN and Names to structures – Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys and ultimately structure 2
  • 4. Most people start with Google • People have many places to search for methods, and there is no one integration hub, except for search engines • Search engines can return so many hits – then you filter by analytes, matrix, analytical methodology, so many synonyms and abbreviations for so many chemicals 3
  • 5. Synonyms, Abbreviations and Chemicals 4 • CAS Numbers, Names and Abbreviations can limit what’s possible…
  • 6. Might this be a better view? 5
  • 7. Might this be a better view? 6
  • 8. When methods are mapped to chemistry… • The advantages of mapping chemicals directly to methods – When chemicals are mapped it opens access to many other tools – Chemical structures allow for QSAR modeling 7
  • 9. …and what if we could then profile toxicity? 8
  • 10. …and what if we could then profile toxicity? 9
  • 11. …or simply harvest data from the CompTox Chemicals Dashboard 10
  • 12. What data would you like??? 11
  • 13. Introducing AMOS Analytical Methods and Spectra Database • Three types of data in the database: – Methods (regulatory, lab manuals and SOPs, publications, tech notes) – Spectra (from public domain and our own laboratories) – Monographs (harvested from SWGDRUG and other sites) • Some methods have associated spectra • Some data are just externally linked • Currently contains around 135,000 spectra, 600,000 external links, 650 monographs, and ~2000 methods • ALL data are growing in number 12
  • 14. Where are there methods? • Agency-based methods – EPA – USGS – USDA – CDC – FDA • Vendor application notes – Thermo, Waters, Agilent, Sciex, Shimadzu, LECO, …. • Peer-reviewed articles • Laboratory Documents – lab manuals, SOPs 13
  • 15. A view of the methods list 14
  • 16. Filtering the list for interests… • Look for pesticides studied in water, by GC/MS, after 1990 15
  • 17. Where are there methods? • 900 method documents from the EPA harvested 16
  • 18. Many Scanned Documents!!! 17 • Methods generally developed by the agrochemical companies • Include parents plus degradation products • Lots of scanned, old, documents but the historical records are still of significant use • Electronic document forms of old documents still of benefit
  • 19. Embedding the old Method PDFs 18
  • 21. When Methods are OPEN Access 20
  • 22. When Methods are PubMed OPEN Access 21
  • 23. Proprietary Methods for INTERNAL Access 22
  • 24. If there is no method for your chemical • Use “Chemical Similarity Searching” so that you can find chemicals that are similar in structure space • Use the “Tanimoto Similarity Search 23
  • 25. Searching for a chemical – CASRN, Name Direct structure searching coming 24
  • 26. When Methods are Not Enough • EPA is highly active in the field of non-targeted analysis • We have been applying lots of cheminformatics approaches 25
  • 27. Building a spectrum library to search against 26
  • 28. Linking to actual spectra 27
  • 29. Linking to actual spectra 28
  • 30. There are errors EVERYWHERE: 110-75-8
  • 31. It can be challenging 9/365 chemicals…
  • 32. There are MANY errors in public spectral databases 31
  • 33. There are MANY errors in public spectral databases 32
  • 34. There are MANY errors in public spectral databases 33
  • 35. There are MANY errors in public spectral databases 34 • Chemical structure representations would ideally be standardized…consider tautomeric forms • Not all substances are explicit and can be ambiguous representations
  • 36. My first time at AOCS…what did I learn? • Some of your chemicals of interest: TAGs, PAHs, Toxins of different types • Methods for PAHs, Aflatoxins and Microsystins, and tri- acyl glycerols all extracted and added to database 35
  • 37. My first time at AOCS…what did I learn? • Index Mark Collison’s methods  36
  • 38. My first time at AOCS…what did I learn? 37
  • 39. General Comments • It is possible we can extract chemicals from AOCS methods and structure enable the method, WITHOUT making the method itself publicly accessible – like ASTM and ISO • Our focus right now is on mass spectrometry methods but can support many other methods 38
  • 40. Conclusions • We have built (I think) the first chemical structure indexed database of “methods” • Methods are not just “approved methods” but also standard operating procedures, application notes, lab manuals, regulatory methods etc. • Integrating methods to experimental spectral data will serve our non-targeted analysis efforts 39
  • 41. If you want to help… • Send information regarding analytical methods and method articles to williams.antony@epa.gov • If anyone is interested in a live demo I can do one in the break 40