SlideShare une entreprise Scribd logo
1  sur  38
Data Handling &
Network Literacy
Data movement and network know-how
A train the trainer workshop
2
Talking Points
© AARNet Pty Ltd |
● Advanced research networks
● Network services
● Researcher data movement problems
3
Talking Points
© AARNet Pty Ltd |
● Advanced research networks
● Network services
● Researcher data movement problems
4
Typical Researcher Data File Sizes
© AARNet Pty Ltd |
Gigabyte (GB)
files:
4GB: DVD
movie.
5GB: modest
USB stick.
20GB: Blu-ray
movie.
100GB: 4K
movie.
300GB: laptop
backup.
500GB: high-
end USB stick.
Megabyte (MB)
files:
1MB: scholarly
paper.
2MB: e-Book.
3MB: HD photo.
5MB: 1 song
(MP3).
10MB: 1-minute
Youtube movie.
100MB: album of
songs.
750MB: CD-ROM.
Kilobyte (KB)
files:
1KB: email.
4KB: page of
text.
30KB: page of
formatted text.
25KB: 1-page
spreadsheet.
40KB: simple
Web page.
Terabyte (TB)
files:
2TB: laptop
backup storage
HDD.
5TB: typical
Humanities
data
collection.
20TB: typical
Medical data
collection.
100TB: typical
Genomics data
collection.
Petabyte (PB)
files:
2PB: all Medical
data collections
on RDS.
2PB: all genomic
data collections
on RDS.
10PB: Climate
data.
20PB: ASKAP data
per year.
1000PB (1EB): SKA
data per year.
5
Research Data Services (RDS) Collections
https://www.rds.edu.au/
6
Data Collection Sizes (Research Data Services RDS)
© AARNet Pty Ltd |
90 data collections = 2200TB
(average = 24TB/collection)
49 data collections = 359TB
(average = 7TB/collection)
31 data collections = 2237TB
(average = 72TB/collection)
See RDS Research Community Projects
1TB Transfer
See Tech of the Internets File
Transfer Time Calculator:
https://techinternets.com/copy_calc
8
Australian NREN
© AARNet Pty Ltd |
● Advanced research network infrastructure
● Fast - 10 Gbps > 40 Gbps > 100 Gbps
● High capacity - 1 million + users
● Tailored for research, teaching and learning
● Low latency - consistent connectivity and response time
● Designed to have “head room”
National Network
International Network
Network
Speed
11
Australian NREN
© AARNet Pty Ltd |
Download
10 Gbps
40 Gbps
100 Gbps
Upload
10 Gbps
40 Gbps
100 Gbps
12
Australian NBN
© AARNet Pty Ltd |
Download
100 Mbps
1 Gbps
Upload
40 Mbps
400 Mbps
Network
Speed
See NBN Fact Sheet:
Traffic Class 4 for data
https://www.nbnco.com.au/cont
ent/dam/nbnco2/documents/nbn
-business-fact-sheets/nbn-
business-fact-sheet-tc4.pdf
1TB Transfer
See Tech of the Internets File
Transfer Time Calculator:
https://techinternets.com/copy_calc
WHAT CAN PREVENT THROUGHPUT = BANDWIDTH?
Need to Know True End-End Network Connectivity & Characteristics.
• Contention/Congestion, Firewalls, PC Speed, Applications, …
• For very large file transfers, buffer sizes and other network tuning might be required.
• Some simple things researchers can do themselves to try to get a handle on file transfer
issues.
● Speedtest 🚀
● Ping 🔊
● Traceroute 🔎
14© AARNet Pty Ltd |
Exercises
15
Speed Test 🚀
© AARNet Pty Ltd |
http://www.speedtest.net/
Test on wifi and a mobile phone
Pay attention to: upload vs
download speeds
16
Bandwidth & Throughput
© AARNet Pty Ltd |
Network Bandwidth (bps)
measures the maximum
speed of the data that can
be transferred.
Network Throughput (bps)
measures the actual speed
of transfer across an end-
end network.
17
Ping 🔊
© AARNet Pty Ltd |
Type “ping” and the web
address:
eg ping aarnet.edu.au
See WikiHow
https://m.wikihow.com/Ping-an-IP-
Address
Windows: open up the
Command Prompt using
“cmd”, type
ping aarnet.edu.au
Linux: open a terminal
window, type
ping aarnet.edu.au
(press CTRL and C to stop the
command)
Mac: open up Network
Utilities (using Spotlight) and
select ping menu and type in
aarnet.edu.au
Identifying issues with end to end data transfers
18
Traceroute 🔎
© AARNet Pty Ltd |
Type “tracert” or “traceroute”
and the web address e.g.
tracert aarnet.edu.au
Windows: open up the
Command Prompt using
“cmd”, type tracert
aarnet.edu.au
Linux: open a terminal
window, type traceroute
aarnet.edu.au (press CTRL and
C to stop the command)
Mac: open up Network
Utilities (using Spotlight) and
select traceroute menu and
type in aarnet.edu.au
Identifying failure point of data transfers
19
Latency
© AARNet Pty Ltd |
• Normally, determined by the distance from one
end to the other.
• Also can be affected by the speed of switches
through which the signal travels.
• Signals only move along wires/fibres at ~half the
speed of light.
• Satellite connections have very long latency
because of the distances involved (2x36,000km).
• Latency is important for real-time applications, like
videoconferencing or gaming.
• Use Ping to measure latency on a particular route.
20
Elephants
© AARNet Pty Ltd |
CC-BY 2.0 Brian Ralphs
21
Elephant Flows
© AARNet Pty Ltd |
Spot the large blocks of traffic (elephants) moving in and out of the network.
22
Network Architecture - Current State
© AARNet Pty Ltd |
International
Uni B
Uni B
Uni A
International
Uni A
University Campus
Email
Finance
Student
Research
Etc ...
Campus
Data
100GbpsAARNet
NREN
FIREWALL
1-10Gbps
23
Science DMZ
© AARNet Pty Ltd |
Network Connection tuned for large scientific / research data traffic:
• Irregular, but very disruptive.
• Optimised for big data science, elephant flows.
• Science DMZ (“demilitarised zone”) diverts large data flows from/to specific sites (eg 10TB
of climate data that NCI imports from the USA every week).
• Improves overall network performance – for regular users as well as researchers.
• Reduces need to upgrade corporate Firewalls.
• Developed by ESnet in the USA.
24
Network Architecture - Future State (Science DMZ)
© AARNet Pty Ltd |
AARNet
NREN
International
Uni B
Uni B
Uni A
International
Uni A
Email
Finance
Student
etc ...
Campus
Data
FIREWALL
Campus
RESEARCH
DATA
University Campus Science DMZ
switch
25
Talking Points
© AARNet Pty Ltd |
● Advanced research networks
● Network services
● Researcher data movement problems
26
Network Services
● CloudStor
● FileSender
● Zoom
● Zoom Webinars
● Discipline-specific Virtual Labs
● eduroam WiFi international roaming
● etc
27© AARNet Pty Ltd |
28© AARNet Pty Ltd |
CloudStor
Functions
- Store
- Upload
- Share
- Send
- Sync
- Secure
- Package
- Describe
CloudStor with other
Applications
- WebDAV e.g. Cyberduck/Transmit
- FileSender API
- S3 gateways e.g. AWS/Azure
- Rocket
- Jupyter Notebook
- Kaltura
- Storage services
CloudStor in
Research
- Move data
- Change data
- Store data
- Describe data
- Share data
Manual and automated data workflows that support data intensive research
29© AARNet Pty Ltd |
CloudStor: Upload
When? All through the research lifecycle.
How? Browser, Sync, WebDAV, Rocket, FileSender API, S3
What else? File number, sizes and types, equipment, and programming.
30© AARNet Pty Ltd |
Syncing data
● Just a gentle reminder with Syncing data.
● To think a moment before you delete.
● Sometimes storage points for data capture are also data clearing points.
● Think about developing a workflow to copy files to another folder if data is being
moved from the point of capture to the point of processing.
● On the bright side, if you Whoops! delete your data, it’s in a deleted folder for 30
days.
31© AARNet Pty Ltd |
CloudStor: Share & Send
When? All through the research lifecycle.
How? Group Allocation, FileSender.
What else? Institutional allocation, vouchers, and notifications.
32© AARNet Pty Ltd |
Sending or Sharing data?
● Send lasts for 2 weeks
● has file encryption,
● and can be two-way (vouchers).
● Share includes access control (CRUD)
● and utilises web links.
33© AARNet Pty Ltd |
CloudStor: Secure
When? All through the research lifecycle.
How? Group Drive, FileSender, Backup.
What else? User access, public/private links and controls, and encryption.
34© AARNet Pty Ltd |
Securing data
● Files need to reside on Cloudstor for 24 hours to enter the backup cycle.
● Encryption at rest (automatic) and in transit of data (automatic).
● FileSender supports end-end file encryption (in beta).
35© AARNet Pty Ltd |
36
Talking Points
© AARNet Pty Ltd |
● Advanced research networks
● Network services
● Researcher data movement problems
37
Researcher data movement problems
File too big to attach to Email:
Solution: Send using FileSender.
Slow data transferring via desktop/laptop:
Solution: Identify issue location (Ping, Traceroute) & set up direct data transfers via a
tailored network (eg Science DMZ).
Putting sensitive data at risk by sharing via email, dropbox:
Solution: Send sensitive data via encryption on a network (eg FileSender).
Too many files or chunky files 100TB (eg 25,000 characterisation images or
333 videos):
Solution: Use FileSender group file transfer facility; or use FileSender API to connect direct
with the Application generating/capturing the files.
etc…
THANK YOU
Alex Reid, eResearch Advisor, AARNet.
alex.reid@aarnet.edu.au
support@aarnet.edu.au

Contenu connexe

Similaire à A reid ands_ttt2_perth_network-literacy 17_may18

Clouds, Grids and Data
Clouds, Grids and DataClouds, Grids and Data
Clouds, Grids and DataGuy Coates
 
Janet Network R&D Innovation - HEAnet / Juniper Innovation Day
Janet Network R&D Innovation - HEAnet / Juniper Innovation DayJanet Network R&D Innovation - HEAnet / Juniper Innovation Day
Janet Network R&D Innovation - HEAnet / Juniper Innovation DayMartin Hamilton
 
Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012Xing Xu
 
Future services on Janet
Future services on JanetFuture services on Janet
Future services on JanetJisc
 
Research and education
Research and educationResearch and education
Research and educationJisc
 
Data management for Quantitative Biology -Basics and challenges in biomedical...
Data management for Quantitative Biology -Basics and challenges in biomedical...Data management for Quantitative Biology -Basics and challenges in biomedical...
Data management for Quantitative Biology -Basics and challenges in biomedical...QBiC_Tue
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014aceas13tern
 
Archiving data from Durham to RAL using the File Transfer Service (FTS)
Archiving data from Durham to RAL using the File Transfer Service (FTS)Archiving data from Durham to RAL using the File Transfer Service (FTS)
Archiving data from Durham to RAL using the File Transfer Service (FTS)Jisc
 
Tech 2 Tech: Network performance
Tech 2 Tech: Network performanceTech 2 Tech: Network performance
Tech 2 Tech: Network performanceJisc
 
Data Mobility Exhibition
Data Mobility ExhibitionData Mobility Exhibition
Data Mobility ExhibitionGlobus
 
Tech 2 tech low latency networking on Janet presentation
Tech 2 tech low latency networking on Janet presentationTech 2 tech low latency networking on Janet presentation
Tech 2 tech low latency networking on Janet presentationJisc
 
Yield Improvement Through Data Analysis using TIBCO Spotfire
Yield Improvement Through Data Analysis using TIBCO SpotfireYield Improvement Through Data Analysis using TIBCO Spotfire
Yield Improvement Through Data Analysis using TIBCO SpotfireTIBCO Spotfire
 
Spectra Logic BlackPearl Developer Summit 2015
Spectra Logic BlackPearl Developer Summit 2015Spectra Logic BlackPearl Developer Summit 2015
Spectra Logic BlackPearl Developer Summit 2015spectralogic
 
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...SURFnet
 

Similaire à A reid ands_ttt2_perth_network-literacy 17_may18 (20)

Clouds, Grids and Data
Clouds, Grids and DataClouds, Grids and Data
Clouds, Grids and Data
 
Janet Network R&D Innovation - HEAnet / Juniper Innovation Day
Janet Network R&D Innovation - HEAnet / Juniper Innovation DayJanet Network R&D Innovation - HEAnet / Juniper Innovation Day
Janet Network R&D Innovation - HEAnet / Juniper Innovation Day
 
Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012
 
Final Ucat Ppt
Final Ucat PptFinal Ucat Ppt
Final Ucat Ppt
 
Future services on Janet
Future services on JanetFuture services on Janet
Future services on Janet
 
Mis chapter 5
Mis  chapter 5Mis  chapter 5
Mis chapter 5
 
Thoughts on Cybersecurity
Thoughts on CybersecurityThoughts on Cybersecurity
Thoughts on Cybersecurity
 
Research and education
Research and educationResearch and education
Research and education
 
Data management for Quantitative Biology -Basics and challenges in biomedical...
Data management for Quantitative Biology -Basics and challenges in biomedical...Data management for Quantitative Biology -Basics and challenges in biomedical...
Data management for Quantitative Biology -Basics and challenges in biomedical...
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014
 
Big Data and OSS at IBM
Big Data and OSS at IBMBig Data and OSS at IBM
Big Data and OSS at IBM
 
DNA Storage
DNA StorageDNA Storage
DNA Storage
 
Archiving data from Durham to RAL using the File Transfer Service (FTS)
Archiving data from Durham to RAL using the File Transfer Service (FTS)Archiving data from Durham to RAL using the File Transfer Service (FTS)
Archiving data from Durham to RAL using the File Transfer Service (FTS)
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
Tech 2 Tech: Network performance
Tech 2 Tech: Network performanceTech 2 Tech: Network performance
Tech 2 Tech: Network performance
 
Data Mobility Exhibition
Data Mobility ExhibitionData Mobility Exhibition
Data Mobility Exhibition
 
Tech 2 tech low latency networking on Janet presentation
Tech 2 tech low latency networking on Janet presentationTech 2 tech low latency networking on Janet presentation
Tech 2 tech low latency networking on Janet presentation
 
Yield Improvement Through Data Analysis using TIBCO Spotfire
Yield Improvement Through Data Analysis using TIBCO SpotfireYield Improvement Through Data Analysis using TIBCO Spotfire
Yield Improvement Through Data Analysis using TIBCO Spotfire
 
Spectra Logic BlackPearl Developer Summit 2015
Spectra Logic BlackPearl Developer Summit 2015Spectra Logic BlackPearl Developer Summit 2015
Spectra Logic BlackPearl Developer Summit 2015
 
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
 

Plus de ARDC

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADAARDC
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and StandardsARDC
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation ARDC
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)ARDC
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveARDC
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domainARDC
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataARDC
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharingARDC
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studiesARDC
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scopeARDC
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things dataARDC
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128ARDC
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical dataARDC
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataARDC
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesARDC
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018ARDC
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintARDC
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataARDC
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018ARDC
 

Plus de ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 

Dernier

Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 

Dernier (20)

Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 

A reid ands_ttt2_perth_network-literacy 17_may18

  • 1. Data Handling & Network Literacy Data movement and network know-how A train the trainer workshop
  • 2. 2 Talking Points © AARNet Pty Ltd | ● Advanced research networks ● Network services ● Researcher data movement problems
  • 3. 3 Talking Points © AARNet Pty Ltd | ● Advanced research networks ● Network services ● Researcher data movement problems
  • 4. 4 Typical Researcher Data File Sizes © AARNet Pty Ltd | Gigabyte (GB) files: 4GB: DVD movie. 5GB: modest USB stick. 20GB: Blu-ray movie. 100GB: 4K movie. 300GB: laptop backup. 500GB: high- end USB stick. Megabyte (MB) files: 1MB: scholarly paper. 2MB: e-Book. 3MB: HD photo. 5MB: 1 song (MP3). 10MB: 1-minute Youtube movie. 100MB: album of songs. 750MB: CD-ROM. Kilobyte (KB) files: 1KB: email. 4KB: page of text. 30KB: page of formatted text. 25KB: 1-page spreadsheet. 40KB: simple Web page. Terabyte (TB) files: 2TB: laptop backup storage HDD. 5TB: typical Humanities data collection. 20TB: typical Medical data collection. 100TB: typical Genomics data collection. Petabyte (PB) files: 2PB: all Medical data collections on RDS. 2PB: all genomic data collections on RDS. 10PB: Climate data. 20PB: ASKAP data per year. 1000PB (1EB): SKA data per year.
  • 5. 5 Research Data Services (RDS) Collections https://www.rds.edu.au/
  • 6. 6 Data Collection Sizes (Research Data Services RDS) © AARNet Pty Ltd | 90 data collections = 2200TB (average = 24TB/collection) 49 data collections = 359TB (average = 7TB/collection) 31 data collections = 2237TB (average = 72TB/collection) See RDS Research Community Projects
  • 7. 1TB Transfer See Tech of the Internets File Transfer Time Calculator: https://techinternets.com/copy_calc
  • 8. 8 Australian NREN © AARNet Pty Ltd | ● Advanced research network infrastructure ● Fast - 10 Gbps > 40 Gbps > 100 Gbps ● High capacity - 1 million + users ● Tailored for research, teaching and learning ● Low latency - consistent connectivity and response time ● Designed to have “head room”
  • 11. Network Speed 11 Australian NREN © AARNet Pty Ltd | Download 10 Gbps 40 Gbps 100 Gbps Upload 10 Gbps 40 Gbps 100 Gbps
  • 12. 12 Australian NBN © AARNet Pty Ltd | Download 100 Mbps 1 Gbps Upload 40 Mbps 400 Mbps Network Speed See NBN Fact Sheet: Traffic Class 4 for data https://www.nbnco.com.au/cont ent/dam/nbnco2/documents/nbn -business-fact-sheets/nbn- business-fact-sheet-tc4.pdf
  • 13. 1TB Transfer See Tech of the Internets File Transfer Time Calculator: https://techinternets.com/copy_calc
  • 14. WHAT CAN PREVENT THROUGHPUT = BANDWIDTH? Need to Know True End-End Network Connectivity & Characteristics. • Contention/Congestion, Firewalls, PC Speed, Applications, … • For very large file transfers, buffer sizes and other network tuning might be required. • Some simple things researchers can do themselves to try to get a handle on file transfer issues. ● Speedtest 🚀 ● Ping 🔊 ● Traceroute 🔎 14© AARNet Pty Ltd | Exercises
  • 15. 15 Speed Test 🚀 © AARNet Pty Ltd | http://www.speedtest.net/ Test on wifi and a mobile phone Pay attention to: upload vs download speeds
  • 16. 16 Bandwidth & Throughput © AARNet Pty Ltd | Network Bandwidth (bps) measures the maximum speed of the data that can be transferred. Network Throughput (bps) measures the actual speed of transfer across an end- end network.
  • 17. 17 Ping 🔊 © AARNet Pty Ltd | Type “ping” and the web address: eg ping aarnet.edu.au See WikiHow https://m.wikihow.com/Ping-an-IP- Address Windows: open up the Command Prompt using “cmd”, type ping aarnet.edu.au Linux: open a terminal window, type ping aarnet.edu.au (press CTRL and C to stop the command) Mac: open up Network Utilities (using Spotlight) and select ping menu and type in aarnet.edu.au Identifying issues with end to end data transfers
  • 18. 18 Traceroute 🔎 © AARNet Pty Ltd | Type “tracert” or “traceroute” and the web address e.g. tracert aarnet.edu.au Windows: open up the Command Prompt using “cmd”, type tracert aarnet.edu.au Linux: open a terminal window, type traceroute aarnet.edu.au (press CTRL and C to stop the command) Mac: open up Network Utilities (using Spotlight) and select traceroute menu and type in aarnet.edu.au Identifying failure point of data transfers
  • 19. 19 Latency © AARNet Pty Ltd | • Normally, determined by the distance from one end to the other. • Also can be affected by the speed of switches through which the signal travels. • Signals only move along wires/fibres at ~half the speed of light. • Satellite connections have very long latency because of the distances involved (2x36,000km). • Latency is important for real-time applications, like videoconferencing or gaming. • Use Ping to measure latency on a particular route.
  • 20. 20 Elephants © AARNet Pty Ltd | CC-BY 2.0 Brian Ralphs
  • 21. 21 Elephant Flows © AARNet Pty Ltd | Spot the large blocks of traffic (elephants) moving in and out of the network.
  • 22. 22 Network Architecture - Current State © AARNet Pty Ltd | International Uni B Uni B Uni A International Uni A University Campus Email Finance Student Research Etc ... Campus Data 100GbpsAARNet NREN FIREWALL 1-10Gbps
  • 23. 23 Science DMZ © AARNet Pty Ltd | Network Connection tuned for large scientific / research data traffic: • Irregular, but very disruptive. • Optimised for big data science, elephant flows. • Science DMZ (“demilitarised zone”) diverts large data flows from/to specific sites (eg 10TB of climate data that NCI imports from the USA every week). • Improves overall network performance – for regular users as well as researchers. • Reduces need to upgrade corporate Firewalls. • Developed by ESnet in the USA.
  • 24. 24 Network Architecture - Future State (Science DMZ) © AARNet Pty Ltd | AARNet NREN International Uni B Uni B Uni A International Uni A Email Finance Student etc ... Campus Data FIREWALL Campus RESEARCH DATA University Campus Science DMZ switch
  • 25. 25 Talking Points © AARNet Pty Ltd | ● Advanced research networks ● Network services ● Researcher data movement problems
  • 26. 26 Network Services ● CloudStor ● FileSender ● Zoom ● Zoom Webinars ● Discipline-specific Virtual Labs ● eduroam WiFi international roaming ● etc
  • 29. CloudStor Functions - Store - Upload - Share - Send - Sync - Secure - Package - Describe CloudStor with other Applications - WebDAV e.g. Cyberduck/Transmit - FileSender API - S3 gateways e.g. AWS/Azure - Rocket - Jupyter Notebook - Kaltura - Storage services CloudStor in Research - Move data - Change data - Store data - Describe data - Share data Manual and automated data workflows that support data intensive research 29© AARNet Pty Ltd |
  • 30. CloudStor: Upload When? All through the research lifecycle. How? Browser, Sync, WebDAV, Rocket, FileSender API, S3 What else? File number, sizes and types, equipment, and programming. 30© AARNet Pty Ltd |
  • 31. Syncing data ● Just a gentle reminder with Syncing data. ● To think a moment before you delete. ● Sometimes storage points for data capture are also data clearing points. ● Think about developing a workflow to copy files to another folder if data is being moved from the point of capture to the point of processing. ● On the bright side, if you Whoops! delete your data, it’s in a deleted folder for 30 days. 31© AARNet Pty Ltd |
  • 32. CloudStor: Share & Send When? All through the research lifecycle. How? Group Allocation, FileSender. What else? Institutional allocation, vouchers, and notifications. 32© AARNet Pty Ltd |
  • 33. Sending or Sharing data? ● Send lasts for 2 weeks ● has file encryption, ● and can be two-way (vouchers). ● Share includes access control (CRUD) ● and utilises web links. 33© AARNet Pty Ltd |
  • 34. CloudStor: Secure When? All through the research lifecycle. How? Group Drive, FileSender, Backup. What else? User access, public/private links and controls, and encryption. 34© AARNet Pty Ltd |
  • 35. Securing data ● Files need to reside on Cloudstor for 24 hours to enter the backup cycle. ● Encryption at rest (automatic) and in transit of data (automatic). ● FileSender supports end-end file encryption (in beta). 35© AARNet Pty Ltd |
  • 36. 36 Talking Points © AARNet Pty Ltd | ● Advanced research networks ● Network services ● Researcher data movement problems
  • 37. 37 Researcher data movement problems File too big to attach to Email: Solution: Send using FileSender. Slow data transferring via desktop/laptop: Solution: Identify issue location (Ping, Traceroute) & set up direct data transfers via a tailored network (eg Science DMZ). Putting sensitive data at risk by sharing via email, dropbox: Solution: Send sensitive data via encryption on a network (eg FileSender). Too many files or chunky files 100TB (eg 25,000 characterisation images or 333 videos): Solution: Use FileSender group file transfer facility; or use FileSender API to connect direct with the Application generating/capturing the files. etc…
  • 38. THANK YOU Alex Reid, eResearch Advisor, AARNet. alex.reid@aarnet.edu.au support@aarnet.edu.au