SlideShare une entreprise Scribd logo
1  sur  38
Using Cheminformatics Approaches to Develop
a Structure Searchable Database
of Analytical Methods
Antony Williams1, Greg Janesch2, Tyler Carr2,
Sakuntala Sivasupramaniam3, and Nancy Baker4
1. Center for Computational Toxicology and Exposure, US-EPA, RTP, NC
2. ORAU Student Services Contractor
3. Senior Environmental Employment (SEE) Program
4. ParlezChem Inc.
http://www.orcid.org/0000-0002-2668-4821
March 2024: Spring Fall Meeting, New Orleans, LA
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Building a Methods Database
• Simple Vision: I want to find the best method(s) associated with a
chemical and/or class of chemicals
• Answer the question “I cannot find a method for my chemical” - HELP
• The Approach:
– Aggregate MS method documents (and adjust the definition of “what is a useful method”)
– Extract chemistry (mostly CASRN and Names)
– Map CASRN and Names to structures
– Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys
and ultimately structure
1
Most people start with Google
• People have many places to search for methods, and
there is no one integration hub, except for search engines
• Search engines can return so many hits – then you filter
by analytes, matrix, analytical methodology, so many
synonyms and abbreviations for so many chemicals
2
Synonyms, Abbreviations and Chemicals
3
• CAS Numbers, Names
and Abbreviations can
limit what’s possible…
Might this be a better view?
4
Might this be a better view?
5
When methods are mapped to chemistry…
• The advantages of mapping chemicals directly to methods
– When chemicals are mapped it opens access to many other tools
– Chemical structures allow for QSAR modeling
6
…and what if we could then profile toxicity?
7
…and what if we could then profile toxicity?
8
…or simply harvest data from the
CompTox Chemicals Dashboard
9
What data would you like???
10
Introducing AMOS
Analytical Methods and Spectra Database
• Three types of data in the database:
– Methods (regulatory, lab manuals and SOPs, publications, tech notes)
– Spectra (from public domain and our own laboratories)
– Monographs (harvested from SWGDRUG and other sites)
• Some methods have associated spectra
• Some data are just externally linked
• Currently contains around 263,000 spectra, 600,000 external
links, 4800 “Fact Sheets” and ~5000 methods
• Spectra – LC-MS, GC-MS, NMR
• ALL data are growing in number
11
Where are there methods?
• Agency-based methods
– EPA, USGS, USDA, CDC, FDA, OSHA, DEA, ATSDR, NIOSH …
• ASTM and ISO methods
• Vendor application notes – Bruker, Waters, Agilent, Sciex,
Shimadzu, LECO, Thermo
• Peer-reviewed articles
• Laboratory Documents – lab manuals, SOPs
12
A view of the methods list
13
Filtering the list for interests…
• Look for pesticides studied in water, by GC/MS, after 1990
14
Where are there methods?
• 900 method documents from the EPA harvested
15
Many Scanned Documents!!!
16
• Methods generally developed by
the agrochemical companies
• Include parents plus degradation
products
• Lots of scanned, old, documents
but the historical records are still
of significant use
• Electronic document forms of old
documents still of benefit
Embedding the old Method PDFs
17
Embedding New Method PDFs
18
When Methods are OPEN Access
19
When Methods are PubMed OPEN Access
20
Proprietary Methods for INTERNAL Access
21
If there is no method for your chemical
• Use “Chemical Similarity Searching” so that you can find
chemicals that are similar in structure space
• Use the “Tanimoto Similarity Search
22
Searching for a chemical – CASRN, Name
Direct structure searching coming
23
When Methods are Not Enough
• EPA is highly active in the field of non-targeted analysis
• We have been applying lots of cheminformatics approaches
24
Building a spectrum library to search against
25
Linking to actual spectra
26
Linking to actual spectra
27
There are errors EVERYWHERE: 110-75-8
It can be challenging
9/365 chemicals…
There are MANY errors in public
spectral databases
30
There are MANY errors in public
spectral databases
31
There are MANY errors in public
spectral databases
32
There are MANY errors in public
spectral databases
33
• Chemical structure representations would ideally
be standardized…consider tautomeric forms
• Not all substances are explicit and can be
ambiguous representations
Fact Sheets – 4800 of them and growing
34
>40,000 AnalyticalQC spectra
35
Conclusions
• We have built the first chemical structure indexed open
database of “methods”.
• Methods are not just “approved methods” but also
standard operating procedures, application notes, lab
manuals, regulatory methods etc.
• Integrating methods to experimental spectral data will
serve our non-targeted analysis efforts
• Work is underway to make it public
36
If you want to help…
• Send information regarding analytical methods and method
articles to williams.antony@epa.gov
• If you want to add methods or spectral data please contact me
37

Contenu connexe

Similaire à Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods

Similaire à Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods (20)

Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Consensus ranking and fragmentation prediction for identification of unknowns...
Consensus ranking and fragmentation prediction for identification of unknowns...Consensus ranking and fragmentation prediction for identification of unknowns...
Consensus ranking and fragmentation prediction for identification of unknowns...
 
Structure standardization approaches for mass spectrometry data integration
Structure standardization approaches for  mass spectrometry data integrationStructure standardization approaches for  mass spectrometry data integration
Structure standardization approaches for mass spectrometry data integration
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
Cheminformatics approaches to support chemical identification delivered via t...
Cheminformatics approaches to support chemical identification delivered via t...Cheminformatics approaches to support chemical identification delivered via t...
Cheminformatics approaches to support chemical identification delivered via t...
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
Applications of the US EPA’s CompTox chemicals dashboard to support structure...
Applications of the US EPA’s CompTox chemicals dashboard to support structure...Applications of the US EPA’s CompTox chemicals dashboard to support structure...
Applications of the US EPA’s CompTox chemicals dashboard to support structure...
 
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Toxico...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Toxico...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Toxico...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Toxico...
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 

Dernier

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 

Dernier (20)

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods

  • 1. Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods Antony Williams1, Greg Janesch2, Tyler Carr2, Sakuntala Sivasupramaniam3, and Nancy Baker4 1. Center for Computational Toxicology and Exposure, US-EPA, RTP, NC 2. ORAU Student Services Contractor 3. Senior Environmental Employment (SEE) Program 4. ParlezChem Inc. http://www.orcid.org/0000-0002-2668-4821 March 2024: Spring Fall Meeting, New Orleans, LA The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
  • 2. Building a Methods Database • Simple Vision: I want to find the best method(s) associated with a chemical and/or class of chemicals • Answer the question “I cannot find a method for my chemical” - HELP • The Approach: – Aggregate MS method documents (and adjust the definition of “what is a useful method”) – Extract chemistry (mostly CASRN and Names) – Map CASRN and Names to structures – Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys and ultimately structure 1
  • 3. Most people start with Google • People have many places to search for methods, and there is no one integration hub, except for search engines • Search engines can return so many hits – then you filter by analytes, matrix, analytical methodology, so many synonyms and abbreviations for so many chemicals 2
  • 4. Synonyms, Abbreviations and Chemicals 3 • CAS Numbers, Names and Abbreviations can limit what’s possible…
  • 5. Might this be a better view? 4
  • 6. Might this be a better view? 5
  • 7. When methods are mapped to chemistry… • The advantages of mapping chemicals directly to methods – When chemicals are mapped it opens access to many other tools – Chemical structures allow for QSAR modeling 6
  • 8. …and what if we could then profile toxicity? 7
  • 9. …and what if we could then profile toxicity? 8
  • 10. …or simply harvest data from the CompTox Chemicals Dashboard 9
  • 11. What data would you like??? 10
  • 12. Introducing AMOS Analytical Methods and Spectra Database • Three types of data in the database: – Methods (regulatory, lab manuals and SOPs, publications, tech notes) – Spectra (from public domain and our own laboratories) – Monographs (harvested from SWGDRUG and other sites) • Some methods have associated spectra • Some data are just externally linked • Currently contains around 263,000 spectra, 600,000 external links, 4800 “Fact Sheets” and ~5000 methods • Spectra – LC-MS, GC-MS, NMR • ALL data are growing in number 11
  • 13. Where are there methods? • Agency-based methods – EPA, USGS, USDA, CDC, FDA, OSHA, DEA, ATSDR, NIOSH … • ASTM and ISO methods • Vendor application notes – Bruker, Waters, Agilent, Sciex, Shimadzu, LECO, Thermo • Peer-reviewed articles • Laboratory Documents – lab manuals, SOPs 12
  • 14. A view of the methods list 13
  • 15. Filtering the list for interests… • Look for pesticides studied in water, by GC/MS, after 1990 14
  • 16. Where are there methods? • 900 method documents from the EPA harvested 15
  • 17. Many Scanned Documents!!! 16 • Methods generally developed by the agrochemical companies • Include parents plus degradation products • Lots of scanned, old, documents but the historical records are still of significant use • Electronic document forms of old documents still of benefit
  • 18. Embedding the old Method PDFs 17
  • 20. When Methods are OPEN Access 19
  • 21. When Methods are PubMed OPEN Access 20
  • 22. Proprietary Methods for INTERNAL Access 21
  • 23. If there is no method for your chemical • Use “Chemical Similarity Searching” so that you can find chemicals that are similar in structure space • Use the “Tanimoto Similarity Search 22
  • 24. Searching for a chemical – CASRN, Name Direct structure searching coming 23
  • 25. When Methods are Not Enough • EPA is highly active in the field of non-targeted analysis • We have been applying lots of cheminformatics approaches 24
  • 26. Building a spectrum library to search against 25
  • 27. Linking to actual spectra 26
  • 28. Linking to actual spectra 27
  • 29. There are errors EVERYWHERE: 110-75-8
  • 30. It can be challenging 9/365 chemicals…
  • 31. There are MANY errors in public spectral databases 30
  • 32. There are MANY errors in public spectral databases 31
  • 33. There are MANY errors in public spectral databases 32
  • 34. There are MANY errors in public spectral databases 33 • Chemical structure representations would ideally be standardized…consider tautomeric forms • Not all substances are explicit and can be ambiguous representations
  • 35. Fact Sheets – 4800 of them and growing 34
  • 37. Conclusions • We have built the first chemical structure indexed open database of “methods”. • Methods are not just “approved methods” but also standard operating procedures, application notes, lab manuals, regulatory methods etc. • Integrating methods to experimental spectral data will serve our non-targeted analysis efforts • Work is underway to make it public 36
  • 38. If you want to help… • Send information regarding analytical methods and method articles to williams.antony@epa.gov • If you want to add methods or spectral data please contact me 37