SlideShare une entreprise Scribd logo
1  sur  39
Real-world e-science use-cases:
embedded development and
financial engineering
Annette Strauch | 15th November 2012 |
1
18.03.2022
2
Image credit: http://en.wikipedia.org/wiki/Big_data
#digitalpreservation #usecase
#usecases
#openplanetsfoundation
#emulation
#OPF
#hackathon
#escience
#digitalpreservationFR12
CONTENT
RESEARCH DATA
SCIENTIFIC WORKFLOWS
METHOLOGY OF INTERVIEWING THE SCIENTISTS AT ULM UNIVERSITY
DESCRIPTION OF THE SCIENTIFIC WORKFLOW(S) (QUESTIONS)
USE-CASES
a) EMBEDDED SYSTEMS
b) FINANCIAL ENGINEERING
18.03.2022
3
RESEARCH DATA
The DFG understands research data as digital and
electronically storable data that results from a research
project, for example through experiments,
measurements, surveys or interviews.
DFG paper relating to Research Data
http://staatsbibliothekberlin.de/fileadmin/user_upload/zentrale_Seiten/faecher/pdf/Winkle
r_Nees_VDB_26NOV2011.pdf
18.03.2022
4
Just an example relating to research data and digital
preservation:
The latest APARSEN survey (Alliance for Permanent Access to
the Records of Science in Europe Network) wanted to know about
the state of digital preservation in libraries, archives and research
organizations performed by individuals or the organizations
themselves.
An in-topic!
Long-term preservation & how to do it (maybe
emulation??!!!) is a subject of BIG IMPORTANCE NOW!
18.03.2022
5
18.03.2022
6
Image credit: http://blog.nationalarchives.gov.uk/blog/emulations-what-you-need/
 We do not want that valuable digital research data gets
lost!
“The essential idea behind emulation is to be able to
access or run original data/software on a new/current
platform by running software on the new/current
platform that emulates the original platform.” – (Stewart
Granger)
SCIENTIFIC WORKFLOWS
Most of today’s scientific workflows are based solely on
digital data.
Input, output and intermediate results are pure digital
objects > Keeping scientific results reproducible,
accessible and meaningful is challenging!
Emulation is useful because by using today’s emulators an
authentic reproduction of current digital workflows can be
verified and long-term access is guaranteed for a defined
feature-set.
18.03.2022
7
18.03.2022
8
Here, I present results from the scientific domain!
 embedded development & financial engineering
METHOLOGY OF ASKING THE SCIENTISTS AT ULM
UNIVERSITY
I conducted a two-tier survey to find out what would be
relevant for emulation and if it was useful to ask further
questions in order to identify a real-world use-case each time:
I. Characterization of scientific background
II. Scientific process and work environment
18.03.2022
9
The first part dealt with the characterization of their scientific
background, user expectations and potential future-users with regards
to the preservation processes.
 Object of research: abstract description of the data and the
(expected) results of current research activities.
 Aims and objectives of the preservation process: specifically
asks which objects, results, concepts should be kept available and
for which period of time (necessary, relevant, desirable).
18.03.2022
10
18.03.2022
11
 Re-use of data: what kind of access is desired, which
material available in what kind of form, data still useful
without its software environment, re-use of data targeted
at what kind of audience?
 Data protection: Is the data subject to data protection
laws or non disclosure agreement?
Image Credit: http://commons.wikimedia.org/wiki/File:Cc-by-nc-nd.svg?uselang=de
 Formal requirements: Long-term archival requirement posed by
external or internal requirements (funding organizations that support
the research)?
Image credit: http://upload.wikimedia.org/wikipedia/commons/b/be/Euro_banknotes.png
 Existing archiving processes: Are preservation processes
deployed and used already?
18.03.2022
12
The second part of the survey (in a second meeting in the example of
embedded systems!) was focused on details of the scientific
process and work environment :
Data formats (primary data objects):
 Does the research data (primary digital objects) use a
standardized format representation?
 Is proprietary / restricted software necessary to render these
objects? Are alternative open source solutions available?
 What amounts of data are expected and format migration options
available and planned?
 What does the scientist know about the longevity of the
software / data format ?
18.03.2022
13
DESCRIPTION OF THE SCIENTIFIC WORKFLOW(S):
(in the embedded systems example, this was done in a second
meeting!)
 Identifying what happens to the raw data technically? How is the
data being processed?
 Input data? Processing steps? Intermediate results?
 Questions regarding the software environment: operating
system, used tools and the software necessary for the
processing of the primary research data?
18.03.2022
14
 In order to gain greater knowledge of hardware dependencies, we
were asking the researchers about dongles, sensors and special
hardware.
 Were the researchers aware of any external dependencies, e.g.
Cloud, licence server, external services or network drives? Which
of these can be controlled, internalized, substituted?
 Were risk-assessments on external dependencies already done
by the academic staff?
18.03.2022
15
 The users were also asked if they had already worked with
virtualized systems such as workstations based on VMware or
VirtualBox?
 The last point was the question about licences.
 How can we improve our questionnaire? Can we develop a
standard questionnaire?
18.03.2022
16
USE-CASE a: EMBEDDED SYSTEMS
 Object of research: abstract description of the data and the
(expected) results of current research activities.
This use-case deals with the scientific research on the
development of embedded devices.
“Design and implementation of a FPGA based sonar system
for an autonomous underwater vehicle” (title of research paper)
A field-programmable gate array (FPGA) is an integrated circuit
designed to be configured by a customer or a designer after
manufacturing.
The subjects of the main research in this field are experiments
of simulation data.
18.03.2022
17
An autonomous underwater vehicle (AUV) is a robot which travels
underwater without requiring input from an operator.
18.03.2022
18
18.03.2022
19
A focus of research and teaching at the Institute for
Embedded Systems at the University of Ulm is the
construction of an autonomous underwater vehicle (AUV)
suitable for cave exploration!
In a study like this students can learn all that is asked of them
later on in the industry, everything from design processes,
software techniques, computer architecture, building
computers, all that is included in the terms embedded systems
and real-time systems!
Image credit https://www.uni-ulm.de/fileadmin/website_uni_ulm/kiz/intranet/fotos/betriebsausflug2011/CIMG7912.html
18.03.2022
20
In the first meeting, I have found out the following:
 For the embedded services no proper preservation and
accessibility period could be specified by the researchers.
18.03.2022
21
 However, scenarios for (result) verification and embedded
development as museum artifacts in the future were
mentioned.
 No explicit preservation processes are employed currently but
standard backups are done frequently.
18.03.2022
22
Aims and objectives of the preservation process:
specifically asks which objects, results, concepts should be
kept available and for which period of time (necessary,
relevant, desirable).
 The preservation targets were both compiler and toolchains
and the access primarily for students and researchers.
 Hardware-in-the-Loop
18.03.2022
23
 The data is experimental data which means the design of
this sonar system covers the hardware needed for signal
transmission and reception and also the software used in
the interpretation of the data.
 This hardware has been used to test the software running
on the FPGA, which controls signal generation, data
acquisition and processing.
 The simulation tests the important aspects of the software
and hardware.
18.03.2022
24
 Complex tool-chains were used. These included the following:
18.03.2022
25
In the second meeting I learnt about the use of the following
(and the compilers have to be kept functional in the DP
process):
Eagle (CAD Software)
Quartus (Hardware)
GNU (Compiler)
SPICE (for simulations)
18.03.2022
26
What can we achieve next in order to provide a practical
implementation of archival workflows for the rendering of
digital objects in its original environment? This is our bwFLA
question!
It was important to identify the relevant software in order to
work on the primary research data.
This first use-case is important because of the software and
the hardware environments.
We always need to identify the necessary workflows.
We have to do tests with the proper hardware components
of the AUV or with electrical parts of the AUV.
18.03.2022
27
USE-CASE b: FINANCIAL DATA ANALYSIS
 Object of research: abstract description of the data and the
(expected) results of current research activities
This use-case deals with the scientific research on the Jarrow-
Yildirim model which is a very general inflation model.
“Calibration of the Jarrow-Yildirim Model
for German and French Inflation-Linked
Government Bonds“ (title of research paper)
18.03.2022
28
Image credit:
Kalibrierung des Jarrow-Yildirim-Modells
für deutsche und französische inflationsabhängige
Staatsanleihen
Manuel Gröger
Workshop "Modellierung von Inflation"
Universität Ulm
19.03.2012
18.03.2022
29
During anxious market periods, at latest the "Euro Crisis" in
2011/2012 or the "financial crisis" in 2007, investors seek other
investment possibilities to avoid traps of the occurring problems.
Image credit:http://www.boerse-express.com/w/images/4/49/Goldbarren.JPG
Some market participants like insurance companies or pension
funds, who invest most of their assets in treasury bonds, face the
problem of low interest rates for some years, whereas the liabilities
are stable or even growing.
18.03.2022
30
Health insurance companies for instance have to deal with the
problem of high medical inflation on the liabilities side and low
interest rates on the asset side of their balance sheet.
In order to close this budget gap they can either spend more
money for risky asset classes like stocks or private equity, or
they invest more in bond products like treasury inflation-
protected securities (TIPS) or high yield bonds.
The standard model for pricing TIPS is the Jarrow-Yildirim
model.
18.03.2022
31
 Valuation of financial products linked to inflation
 Mathematical models
 Interest is modelled by certain equations
 Prices, bonds, costs – shown graphically
 Bloomberg data
 Visual Basic is used for data preparation
 All MATLAB documents
 The data is needed for calibration
Image credit: http://upload.wikimedia.org/wikipedia/commons/f/f3/Bills_and_coins.png
18.03.2022
32
 The data should be kept for future-use for researchers
and for businesses
 Data can be used without the software environment
 However, the data needs the soft- and hardware
environments in order to interpret the data so emulation
would play an important role here
 No VMware
Image credit: http://bic.library.nd.edu/about/images/image011.gife
18.03.2022
33
18.03.2022
34
Much of the data in this use-case also comes from the
Bloomberg Terminal which is a computer system provided by
Bloomberg L.P. that enables professionals in finance like in
this example here to access the Bloomberg Professional
service.
Users can monitor and analyze real-time financial market
data movements and place trades on the electronic trading
platform.
This fact is interesting for us because of the external
dependencies from a data provider.
18.03.2022
35
So, why does all this matter again as a whole?
 e-Science is about an enabling infrastructure – tools,
technologies, computing power in order to support research
 addressing needs for virtual research environments
 looking at dynamic, complex objects in e-science
Overall:
 Helping to define and to provide a practical implementation
of archival workflows for the rendering of digital objects
 AND: TO MAKE EMULATION WORK!
18.03.2022
36
PUBLICATIONS (a selection)
Tzitzikas, Yannis, Marketakis, Yannis, Kargagis, Yannis, 2012. Conversion and Emulation-
aware Dependency Reasoning for Curation Services.
http://www.ics.forth.gr/~tzitzik/publications/Tzitzikas_2012_iPres_DepMgmtForCovertersEm
ulators.pdf
APA and APARSEN website for excellence in Digital Preservation.
http://www.alliancepermanentaccess.org/
Neuroth, Heike, Strathmann, Stefan, Oßwald, Achim, Scheffel, Regine, Klump, Jens,
Ludwig, Jens, 2012. Langzeitarchivierung von Forschungsdaten.
http://www.reticon.de/nachrichten/langzeitarchivierung-von-forschungsdaten_2984.html
Büttner, Stephan, Hobohm, Hans-Christoph, Müller, Lars, 2011. Handbuch
Forschungsdatenmanagement.
http://www.forschungsdatenmanagement.de/
Winkler-Nees, 2012. Informationsinfrastrukturen für Forschungsdaten Ansätze und
Strategien der DFG.
http://open-access.net/fileadmin/downloads/OAT12/OAT12_Winkler-Nees.pdf
18.03.2022
37
Discuss!
18.03.2022
38
Acknowledgements:
Thanks to a good team in Ulm at the kiz: Simon Streib,
Karin Zwiesler, Dr. Pia Schmücker and Franziska
Ackermann!
Thanks to Prof. Dr. Frank Slomka and Jan-Philip Schmidt
for their help with both use-cases.
18.03.2022
39
Thank you!
annette.strauch@uni-ulm.de
Annette Strauch, M.A.
bwFLA Project: URL = http://bw-fla.uni-freiburg.de/wordpress/?page_id=7

Contenu connexe

Tendances

OSLC KM: Elevating the meaning of data and operations within the toolchain
OSLC KM: Elevating the meaning of data and operations within the toolchainOSLC KM: Elevating the meaning of data and operations within the toolchain
OSLC KM: Elevating the meaning of data and operations within the toolchainCARLOS III UNIVERSITY OF MADRID
 
LOTAR-PDES: Engineering digitalization through task automation and reuse in t...
LOTAR-PDES: Engineering digitalization through task automation and reuse in t...LOTAR-PDES: Engineering digitalization through task automation and reuse in t...
LOTAR-PDES: Engineering digitalization through task automation and reuse in t...CARLOS III UNIVERSITY OF MADRID
 
Sailing the V: Engineering digitalization through task automation and reuse i...
Sailing the V: Engineering digitalization through task automation and reuse i...Sailing the V: Engineering digitalization through task automation and reuse i...
Sailing the V: Engineering digitalization through task automation and reuse i...CARLOS III UNIVERSITY OF MADRID
 
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...Amélie Gyrard
 
Preserving Computer-Aided Design, Digital Preservation Coalition Report
Preserving Computer-Aided Design, Digital Preservation Coalition ReportPreserving Computer-Aided Design, Digital Preservation Coalition Report
Preserving Computer-Aided Design, Digital Preservation Coalition ReportRuggero Lancia
 
Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019dgarijo
 
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...Denis Parra Santander
 
Feature based similarity search in 3 d object databases
Feature based similarity search in 3 d object databasesFeature based similarity search in 3 d object databases
Feature based similarity search in 3 d object databasesunyil96
 
CIB W78 2015 - Semantic Rule-checking for Regulation Compliance Checking
CIB W78 2015 - Semantic Rule-checking for Regulation Compliance CheckingCIB W78 2015 - Semantic Rule-checking for Regulation Compliance Checking
CIB W78 2015 - Semantic Rule-checking for Regulation Compliance CheckingPieter Pauwels
 
International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)
International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)
International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)ijasuc
 
Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...
Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...
Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...Amélie Gyrard
 
2_presFriday_ontologydevelopment
2_presFriday_ontologydevelopment2_presFriday_ontologydevelopment
2_presFriday_ontologydevelopmentPieter Pauwels
 
Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Analytics India Magazine
 
HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...
HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...
HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...HPCC Systems
 
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...Universita della Calabria,
 
DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...
DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...
DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...lindlar
 
Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....
Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....
Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....Amélie Gyrard
 

Tendances (20)

OSLC KM: Elevating the meaning of data and operations within the toolchain
OSLC KM: Elevating the meaning of data and operations within the toolchainOSLC KM: Elevating the meaning of data and operations within the toolchain
OSLC KM: Elevating the meaning of data and operations within the toolchain
 
LOTAR-PDES: Engineering digitalization through task automation and reuse in t...
LOTAR-PDES: Engineering digitalization through task automation and reuse in t...LOTAR-PDES: Engineering digitalization through task automation and reuse in t...
LOTAR-PDES: Engineering digitalization through task automation and reuse in t...
 
Sailing the V: Engineering digitalization through task automation and reuse i...
Sailing the V: Engineering digitalization through task automation and reuse i...Sailing the V: Engineering digitalization through task automation and reuse i...
Sailing the V: Engineering digitalization through task automation and reuse i...
 
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
Keynote WFIoT2019 - Data Graph, Knowledge Graphs Ontologies, Internet of Thin...
 
Preserving Computer-Aided Design, Digital Preservation Coalition Report
Preserving Computer-Aided Design, Digital Preservation Coalition ReportPreserving Computer-Aided Design, Digital Preservation Coalition Report
Preserving Computer-Aided Design, Digital Preservation Coalition Report
 
Ipen2018
Ipen2018Ipen2018
Ipen2018
 
Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019
 
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
 
Feature based similarity search in 3 d object databases
Feature based similarity search in 3 d object databasesFeature based similarity search in 3 d object databases
Feature based similarity search in 3 d object databases
 
CIB W78 2015 - Semantic Rule-checking for Regulation Compliance Checking
CIB W78 2015 - Semantic Rule-checking for Regulation Compliance CheckingCIB W78 2015 - Semantic Rule-checking for Regulation Compliance Checking
CIB W78 2015 - Semantic Rule-checking for Regulation Compliance Checking
 
International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)
International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)
International Journal of Ad hoc, Sensor & Ubiquitous Computing (IJASUC)
 
Granular or holistic approaches 210126 Alejandra Ruiz
Granular or holistic approaches 210126 Alejandra RuizGranular or holistic approaches 210126 Alejandra Ruiz
Granular or holistic approaches 210126 Alejandra Ruiz
 
Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...
Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...
Knowledge Extraction for the Web of Things (KE4WoT) Challenge: Co-located wit...
 
2_presFriday_ontologydevelopment
2_presFriday_ontologydevelopment2_presFriday_ontologydevelopment
2_presFriday_ontologydevelopment
 
Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...
 
HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...
HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...
HPCC Systems Engineering Summit Presentation - Collaborative Research with FA...
 
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
Ph.D. Thesis: A Methodology for the Development of Autonomic and Cognitive In...
 
new_kitching_cv
new_kitching_cvnew_kitching_cv
new_kitching_cv
 
DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...
DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...
DURAARK presentation at DEDICATE final seminar, October 21st 2013, Michelle L...
 
Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....
Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....
Internet of Robotic Things Ontology catalog, knowledge extraction IEEE P1872....
 

En vedette

Sekilas
SekilasSekilas
Sekilasyuswar
 
Tenerlochiquito
TenerlochiquitoTenerlochiquito
Tenerlochiquitojfcaman
 
Pristinecareers Visualizefrm
Pristinecareers VisualizefrmPristinecareers Visualizefrm
Pristinecareers Visualizefrmparamdeep
 
URC Based Architecture for Next Generation Inclusive Television
URC Based Architecture for Next Generation  Inclusive TelevisionURC Based Architecture for Next Generation  Inclusive Television
URC Based Architecture for Next Generation Inclusive TelevisionGorka Epelde Unanue
 
Mindy Koch SMAcademy Week 1
Mindy Koch SMAcademy Week 1Mindy Koch SMAcademy Week 1
Mindy Koch SMAcademy Week 1Mindy Koch
 
High impacttrainingandamazingpresentation introduction
High impacttrainingandamazingpresentation introductionHigh impacttrainingandamazingpresentation introduction
High impacttrainingandamazingpresentation introductionkangazul
 
Belfast
BelfastBelfast
Belfastyuswar
 
Testslides
TestslidesTestslides
Testslidesrorlan
 
Mindy Koch SMAcademy
Mindy Koch SMAcademyMindy Koch SMAcademy
Mindy Koch SMAcademyMindy Koch
 
Selayang Pandang
Selayang PandangSelayang Pandang
Selayang Pandangyuswar
 
Korean Major Portal History
Korean Major Portal HistoryKorean Major Portal History
Korean Major Portal HistoryAsadal Lee
 
How Wiki Changes the World?
How Wiki Changes the World?How Wiki Changes the World?
How Wiki Changes the World?Asadal Lee
 
Apple-Samsung Patent Trial
Apple-Samsung Patent TrialApple-Samsung Patent Trial
Apple-Samsung Patent TrialAsadal Lee
 
6 Frames for Thinking About Information
6 Frames for Thinking About Information6 Frames for Thinking About Information
6 Frames for Thinking About Informationkangazul
 
Reproduction In Flowering Plants
Reproduction In Flowering PlantsReproduction In Flowering Plants
Reproduction In Flowering Plantscpisola
 
Leading Change: Why transformation efforts fail
Leading Change: Why transformation efforts failLeading Change: Why transformation efforts fail
Leading Change: Why transformation efforts failkangazul
 

En vedette (19)

Sekilas
SekilasSekilas
Sekilas
 
Tenerlochiquito
TenerlochiquitoTenerlochiquito
Tenerlochiquito
 
Pristinecareers Visualizefrm
Pristinecareers VisualizefrmPristinecareers Visualizefrm
Pristinecareers Visualizefrm
 
Ianire-Gorka Ezkontza
Ianire-Gorka EzkontzaIanire-Gorka Ezkontza
Ianire-Gorka Ezkontza
 
URC Based Architecture for Next Generation Inclusive Television
URC Based Architecture for Next Generation  Inclusive TelevisionURC Based Architecture for Next Generation  Inclusive Television
URC Based Architecture for Next Generation Inclusive Television
 
Mindy Koch SMAcademy Week 1
Mindy Koch SMAcademy Week 1Mindy Koch SMAcademy Week 1
Mindy Koch SMAcademy Week 1
 
High impacttrainingandamazingpresentation introduction
High impacttrainingandamazingpresentation introductionHigh impacttrainingandamazingpresentation introduction
High impacttrainingandamazingpresentation introduction
 
Urc Based Accessible Tv
Urc Based Accessible TvUrc Based Accessible Tv
Urc Based Accessible Tv
 
Belfast
BelfastBelfast
Belfast
 
Testslides
TestslidesTestslides
Testslides
 
Mindy Koch SMAcademy
Mindy Koch SMAcademyMindy Koch SMAcademy
Mindy Koch SMAcademy
 
Selayang Pandang
Selayang PandangSelayang Pandang
Selayang Pandang
 
Korean Major Portal History
Korean Major Portal HistoryKorean Major Portal History
Korean Major Portal History
 
How Wiki Changes the World?
How Wiki Changes the World?How Wiki Changes the World?
How Wiki Changes the World?
 
Dingbats
DingbatsDingbats
Dingbats
 
Apple-Samsung Patent Trial
Apple-Samsung Patent TrialApple-Samsung Patent Trial
Apple-Samsung Patent Trial
 
6 Frames for Thinking About Information
6 Frames for Thinking About Information6 Frames for Thinking About Information
6 Frames for Thinking About Information
 
Reproduction In Flowering Plants
Reproduction In Flowering PlantsReproduction In Flowering Plants
Reproduction In Flowering Plants
 
Leading Change: Why transformation efforts fail
Leading Change: Why transformation efforts failLeading Change: Why transformation efforts fail
Leading Change: Why transformation efforts fail
 

Similaire à Real-world e-science use-cases: embedded development and financial engineering workflows

ML for blind people.pptx
ML for blind people.pptxML for blind people.pptx
ML for blind people.pptxJAYKUMARKABRA
 
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATIONMtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATIONNEERAJ BAGHEL
 
Hardware Design Practices For Modern Hardware
Hardware Design Practices For Modern HardwareHardware Design Practices For Modern Hardware
Hardware Design Practices For Modern HardwareWinstina Kennedy
 
Blockchain & Cloud-based, Digital Object Identifier System for better Environ...
Blockchain & Cloud-based, Digital Object Identifier System for better Environ...Blockchain & Cloud-based, Digital Object Identifier System for better Environ...
Blockchain & Cloud-based, Digital Object Identifier System for better Environ...IRJET Journal
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringuniversity of sust.
 
Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture Ruggero Lancia
 
Grid Computing in a Commodity World (KCCMG, 2005)
Grid Computing in a Commodity World (KCCMG, 2005)Grid Computing in a Commodity World (KCCMG, 2005)
Grid Computing in a Commodity World (KCCMG, 2005)Lorin Olsen
 
DURAARK Preserving Architectural Knowledge
DURAARK Preserving Architectural KnowledgeDURAARK Preserving Architectural Knowledge
DURAARK Preserving Architectural KnowledgeRuggero Lancia
 
ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...
ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...
ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...Farid Mokhtar Noriega
 
3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)gerogepatton
 
Dynamic Semantics for the Internet of Things
Dynamic Semantics for the Internet of Things Dynamic Semantics for the Internet of Things
Dynamic Semantics for the Internet of Things PayamBarnaghi
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)EUDAT
 
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETIRJET Journal
 
Open Research Data Pilot in Horizon 2020
Open Research Data Pilot in Horizon 2020Open Research Data Pilot in Horizon 2020
Open Research Data Pilot in Horizon 2020Elena Simukovic
 
Sample CS Senior Capstone Projects
Sample CS Senior Capstone ProjectsSample CS Senior Capstone Projects
Sample CS Senior Capstone ProjectsFred Annexstein
 
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGUSING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGIRJET Journal
 
Precaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensorPrecaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensorIRJET Journal
 
UberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for BeginnersUberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for Beginnershpcexperiment
 
3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)ijscai
 

Similaire à Real-world e-science use-cases: embedded development and financial engineering workflows (20)

ML for blind people.pptx
ML for blind people.pptxML for blind people.pptx
ML for blind people.pptx
 
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATIONMtech First progress PRESENTATION ON VIDEO SUMMARIZATION
Mtech First progress PRESENTATION ON VIDEO SUMMARIZATION
 
GRID COMPUTING.ppt
GRID COMPUTING.pptGRID COMPUTING.ppt
GRID COMPUTING.ppt
 
Hardware Design Practices For Modern Hardware
Hardware Design Practices For Modern HardwareHardware Design Practices For Modern Hardware
Hardware Design Practices For Modern Hardware
 
Blockchain & Cloud-based, Digital Object Identifier System for better Environ...
Blockchain & Cloud-based, Digital Object Identifier System for better Environ...Blockchain & Cloud-based, Digital Object Identifier System for better Environ...
Blockchain & Cloud-based, Digital Object Identifier System for better Environ...
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineering
 
Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture
 
Grid Computing in a Commodity World (KCCMG, 2005)
Grid Computing in a Commodity World (KCCMG, 2005)Grid Computing in a Commodity World (KCCMG, 2005)
Grid Computing in a Commodity World (KCCMG, 2005)
 
DURAARK Preserving Architectural Knowledge
DURAARK Preserving Architectural KnowledgeDURAARK Preserving Architectural Knowledge
DURAARK Preserving Architectural Knowledge
 
ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...
ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...
ArchiInspection Project: Integrated Non Destructive Testing, A Building Infor...
 
3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)
 
Dynamic Semantics for the Internet of Things
Dynamic Semantics for the Internet of Things Dynamic Semantics for the Internet of Things
Dynamic Semantics for the Internet of Things
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
 
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNETROAD POTHOLE DETECTION USING YOLOV4 DARKNET
ROAD POTHOLE DETECTION USING YOLOV4 DARKNET
 
Open Research Data Pilot in Horizon 2020
Open Research Data Pilot in Horizon 2020Open Research Data Pilot in Horizon 2020
Open Research Data Pilot in Horizon 2020
 
Sample CS Senior Capstone Projects
Sample CS Senior Capstone ProjectsSample CS Senior Capstone Projects
Sample CS Senior Capstone Projects
 
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGUSING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
 
Precaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensorPrecaution for Covid-19 based on Mask detection and sensor
Precaution for Covid-19 based on Mask detection and sensor
 
UberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for BeginnersUberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for Beginners
 
3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)3 rd International Conference on Data Science and Machine Learning (DSML 2022)
3 rd International Conference on Data Science and Machine Learning (DSML 2022)
 

Real-world e-science use-cases: embedded development and financial engineering workflows

  • 1. Real-world e-science use-cases: embedded development and financial engineering Annette Strauch | 15th November 2012 | 1
  • 2. 18.03.2022 2 Image credit: http://en.wikipedia.org/wiki/Big_data #digitalpreservation #usecase #usecases #openplanetsfoundation #emulation #OPF #hackathon #escience #digitalpreservationFR12
  • 3. CONTENT RESEARCH DATA SCIENTIFIC WORKFLOWS METHOLOGY OF INTERVIEWING THE SCIENTISTS AT ULM UNIVERSITY DESCRIPTION OF THE SCIENTIFIC WORKFLOW(S) (QUESTIONS) USE-CASES a) EMBEDDED SYSTEMS b) FINANCIAL ENGINEERING 18.03.2022 3
  • 4. RESEARCH DATA The DFG understands research data as digital and electronically storable data that results from a research project, for example through experiments, measurements, surveys or interviews. DFG paper relating to Research Data http://staatsbibliothekberlin.de/fileadmin/user_upload/zentrale_Seiten/faecher/pdf/Winkle r_Nees_VDB_26NOV2011.pdf 18.03.2022 4
  • 5. Just an example relating to research data and digital preservation: The latest APARSEN survey (Alliance for Permanent Access to the Records of Science in Europe Network) wanted to know about the state of digital preservation in libraries, archives and research organizations performed by individuals or the organizations themselves. An in-topic! Long-term preservation & how to do it (maybe emulation??!!!) is a subject of BIG IMPORTANCE NOW! 18.03.2022 5
  • 6. 18.03.2022 6 Image credit: http://blog.nationalarchives.gov.uk/blog/emulations-what-you-need/  We do not want that valuable digital research data gets lost! “The essential idea behind emulation is to be able to access or run original data/software on a new/current platform by running software on the new/current platform that emulates the original platform.” – (Stewart Granger)
  • 7. SCIENTIFIC WORKFLOWS Most of today’s scientific workflows are based solely on digital data. Input, output and intermediate results are pure digital objects > Keeping scientific results reproducible, accessible and meaningful is challenging! Emulation is useful because by using today’s emulators an authentic reproduction of current digital workflows can be verified and long-term access is guaranteed for a defined feature-set. 18.03.2022 7
  • 8. 18.03.2022 8 Here, I present results from the scientific domain!  embedded development & financial engineering
  • 9. METHOLOGY OF ASKING THE SCIENTISTS AT ULM UNIVERSITY I conducted a two-tier survey to find out what would be relevant for emulation and if it was useful to ask further questions in order to identify a real-world use-case each time: I. Characterization of scientific background II. Scientific process and work environment 18.03.2022 9
  • 10. The first part dealt with the characterization of their scientific background, user expectations and potential future-users with regards to the preservation processes.  Object of research: abstract description of the data and the (expected) results of current research activities.  Aims and objectives of the preservation process: specifically asks which objects, results, concepts should be kept available and for which period of time (necessary, relevant, desirable). 18.03.2022 10
  • 11. 18.03.2022 11  Re-use of data: what kind of access is desired, which material available in what kind of form, data still useful without its software environment, re-use of data targeted at what kind of audience?  Data protection: Is the data subject to data protection laws or non disclosure agreement? Image Credit: http://commons.wikimedia.org/wiki/File:Cc-by-nc-nd.svg?uselang=de
  • 12.  Formal requirements: Long-term archival requirement posed by external or internal requirements (funding organizations that support the research)? Image credit: http://upload.wikimedia.org/wikipedia/commons/b/be/Euro_banknotes.png  Existing archiving processes: Are preservation processes deployed and used already? 18.03.2022 12
  • 13. The second part of the survey (in a second meeting in the example of embedded systems!) was focused on details of the scientific process and work environment : Data formats (primary data objects):  Does the research data (primary digital objects) use a standardized format representation?  Is proprietary / restricted software necessary to render these objects? Are alternative open source solutions available?  What amounts of data are expected and format migration options available and planned?  What does the scientist know about the longevity of the software / data format ? 18.03.2022 13
  • 14. DESCRIPTION OF THE SCIENTIFIC WORKFLOW(S): (in the embedded systems example, this was done in a second meeting!)  Identifying what happens to the raw data technically? How is the data being processed?  Input data? Processing steps? Intermediate results?  Questions regarding the software environment: operating system, used tools and the software necessary for the processing of the primary research data? 18.03.2022 14
  • 15.  In order to gain greater knowledge of hardware dependencies, we were asking the researchers about dongles, sensors and special hardware.  Were the researchers aware of any external dependencies, e.g. Cloud, licence server, external services or network drives? Which of these can be controlled, internalized, substituted?  Were risk-assessments on external dependencies already done by the academic staff? 18.03.2022 15
  • 16.  The users were also asked if they had already worked with virtualized systems such as workstations based on VMware or VirtualBox?  The last point was the question about licences.  How can we improve our questionnaire? Can we develop a standard questionnaire? 18.03.2022 16
  • 17. USE-CASE a: EMBEDDED SYSTEMS  Object of research: abstract description of the data and the (expected) results of current research activities. This use-case deals with the scientific research on the development of embedded devices. “Design and implementation of a FPGA based sonar system for an autonomous underwater vehicle” (title of research paper) A field-programmable gate array (FPGA) is an integrated circuit designed to be configured by a customer or a designer after manufacturing. The subjects of the main research in this field are experiments of simulation data. 18.03.2022 17
  • 18. An autonomous underwater vehicle (AUV) is a robot which travels underwater without requiring input from an operator. 18.03.2022 18
  • 19. 18.03.2022 19 A focus of research and teaching at the Institute for Embedded Systems at the University of Ulm is the construction of an autonomous underwater vehicle (AUV) suitable for cave exploration! In a study like this students can learn all that is asked of them later on in the industry, everything from design processes, software techniques, computer architecture, building computers, all that is included in the terms embedded systems and real-time systems! Image credit https://www.uni-ulm.de/fileadmin/website_uni_ulm/kiz/intranet/fotos/betriebsausflug2011/CIMG7912.html
  • 21. In the first meeting, I have found out the following:  For the embedded services no proper preservation and accessibility period could be specified by the researchers. 18.03.2022 21  However, scenarios for (result) verification and embedded development as museum artifacts in the future were mentioned.  No explicit preservation processes are employed currently but standard backups are done frequently.
  • 22. 18.03.2022 22 Aims and objectives of the preservation process: specifically asks which objects, results, concepts should be kept available and for which period of time (necessary, relevant, desirable).  The preservation targets were both compiler and toolchains and the access primarily for students and researchers.  Hardware-in-the-Loop
  • 23. 18.03.2022 23  The data is experimental data which means the design of this sonar system covers the hardware needed for signal transmission and reception and also the software used in the interpretation of the data.  This hardware has been used to test the software running on the FPGA, which controls signal generation, data acquisition and processing.  The simulation tests the important aspects of the software and hardware.
  • 24. 18.03.2022 24  Complex tool-chains were used. These included the following:
  • 25. 18.03.2022 25 In the second meeting I learnt about the use of the following (and the compilers have to be kept functional in the DP process): Eagle (CAD Software) Quartus (Hardware) GNU (Compiler) SPICE (for simulations)
  • 26. 18.03.2022 26 What can we achieve next in order to provide a practical implementation of archival workflows for the rendering of digital objects in its original environment? This is our bwFLA question! It was important to identify the relevant software in order to work on the primary research data. This first use-case is important because of the software and the hardware environments. We always need to identify the necessary workflows. We have to do tests with the proper hardware components of the AUV or with electrical parts of the AUV.
  • 27. 18.03.2022 27 USE-CASE b: FINANCIAL DATA ANALYSIS  Object of research: abstract description of the data and the (expected) results of current research activities This use-case deals with the scientific research on the Jarrow- Yildirim model which is a very general inflation model. “Calibration of the Jarrow-Yildirim Model for German and French Inflation-Linked Government Bonds“ (title of research paper)
  • 28. 18.03.2022 28 Image credit: Kalibrierung des Jarrow-Yildirim-Modells für deutsche und französische inflationsabhängige Staatsanleihen Manuel Gröger Workshop "Modellierung von Inflation" Universität Ulm 19.03.2012
  • 29. 18.03.2022 29 During anxious market periods, at latest the "Euro Crisis" in 2011/2012 or the "financial crisis" in 2007, investors seek other investment possibilities to avoid traps of the occurring problems. Image credit:http://www.boerse-express.com/w/images/4/49/Goldbarren.JPG Some market participants like insurance companies or pension funds, who invest most of their assets in treasury bonds, face the problem of low interest rates for some years, whereas the liabilities are stable or even growing.
  • 30. 18.03.2022 30 Health insurance companies for instance have to deal with the problem of high medical inflation on the liabilities side and low interest rates on the asset side of their balance sheet. In order to close this budget gap they can either spend more money for risky asset classes like stocks or private equity, or they invest more in bond products like treasury inflation- protected securities (TIPS) or high yield bonds. The standard model for pricing TIPS is the Jarrow-Yildirim model.
  • 31. 18.03.2022 31  Valuation of financial products linked to inflation  Mathematical models  Interest is modelled by certain equations  Prices, bonds, costs – shown graphically  Bloomberg data  Visual Basic is used for data preparation  All MATLAB documents  The data is needed for calibration Image credit: http://upload.wikimedia.org/wikipedia/commons/f/f3/Bills_and_coins.png
  • 32. 18.03.2022 32  The data should be kept for future-use for researchers and for businesses  Data can be used without the software environment  However, the data needs the soft- and hardware environments in order to interpret the data so emulation would play an important role here  No VMware Image credit: http://bic.library.nd.edu/about/images/image011.gife
  • 34. 18.03.2022 34 Much of the data in this use-case also comes from the Bloomberg Terminal which is a computer system provided by Bloomberg L.P. that enables professionals in finance like in this example here to access the Bloomberg Professional service. Users can monitor and analyze real-time financial market data movements and place trades on the electronic trading platform. This fact is interesting for us because of the external dependencies from a data provider.
  • 35. 18.03.2022 35 So, why does all this matter again as a whole?  e-Science is about an enabling infrastructure – tools, technologies, computing power in order to support research  addressing needs for virtual research environments  looking at dynamic, complex objects in e-science Overall:  Helping to define and to provide a practical implementation of archival workflows for the rendering of digital objects  AND: TO MAKE EMULATION WORK!
  • 36. 18.03.2022 36 PUBLICATIONS (a selection) Tzitzikas, Yannis, Marketakis, Yannis, Kargagis, Yannis, 2012. Conversion and Emulation- aware Dependency Reasoning for Curation Services. http://www.ics.forth.gr/~tzitzik/publications/Tzitzikas_2012_iPres_DepMgmtForCovertersEm ulators.pdf APA and APARSEN website for excellence in Digital Preservation. http://www.alliancepermanentaccess.org/ Neuroth, Heike, Strathmann, Stefan, Oßwald, Achim, Scheffel, Regine, Klump, Jens, Ludwig, Jens, 2012. Langzeitarchivierung von Forschungsdaten. http://www.reticon.de/nachrichten/langzeitarchivierung-von-forschungsdaten_2984.html Büttner, Stephan, Hobohm, Hans-Christoph, Müller, Lars, 2011. Handbuch Forschungsdatenmanagement. http://www.forschungsdatenmanagement.de/ Winkler-Nees, 2012. Informationsinfrastrukturen für Forschungsdaten Ansätze und Strategien der DFG. http://open-access.net/fileadmin/downloads/OAT12/OAT12_Winkler-Nees.pdf
  • 38. 18.03.2022 38 Acknowledgements: Thanks to a good team in Ulm at the kiz: Simon Streib, Karin Zwiesler, Dr. Pia Schmücker and Franziska Ackermann! Thanks to Prof. Dr. Frank Slomka and Jan-Philip Schmidt for their help with both use-cases.
  • 39. 18.03.2022 39 Thank you! annette.strauch@uni-ulm.de Annette Strauch, M.A. bwFLA Project: URL = http://bw-fla.uni-freiburg.de/wordpress/?page_id=7