SlideShare a Scribd company logo
1 of 40
Taxonomy:
Do I need one?


                     Leigh White
            ElementalSource, LLC
Yes
What I’ll talk about

•   What happens without a taxonomy
•   What a taxonomy is and does
•   Why a taxonomy is important
•   A few first development steps
What I won’t talk about

• All the different kinds of taxonomies
• Details about development
• Tools for development
  – except DITA subjectScheme (briefly!)
A little history
What the he** IS that???
Oh, let’s call it a…

• Use the native name
• Name it after something familiar
  that it’s kind of “like”
• “Like” is murky; you have to define
  “like”
  – How it looks? Shape? Color? Size?
  – How it tastes?
  – How it acts?
Earth apples, anyone?

• aardappel (Dutch)
• pomme de terre (French)
*not apples
We know this because

• We have a taxonomy (Linnean
  classification) that specifies degrees
  of relationship between living things
Distant cousins, at best

          apple          potato

Kingdom   Plantae        Plantae

Phylum    Anthophyta     Anthophyta

Class     Eudicots       Eudicots

Order     Rosales        Solanales

Family    Rosaceae       Solanaceae

Genus     Malus          Solanum

Species   M. domestica   S. tuberosum
So, a taxonomy is

• A way of defining “like”
• A way of expressing relationships
  between things
  – We might already be instinctively
    aware of these relationships but need
    to formalize them
• A way of discovering relationships
  between things
• An information model
Taxonomies are

• typically organized by parent-child
  relationships
• typically indicated by the phrase 'is
  a kind of' or 'is a subtype of'
• the subtype has the same
  properties, behaviors, and
  constraints as the supertype plus
  one or more additional properties,
  behaviors, or constraints
Uhh…what?

• For example: car is a kind of
  vehicle, so any car is also a vehicle,
  but not every vehicle is a car
• The level “car” is more constrained
  than the level “vehicle”
• A car has all the properties of a
  vehicle plus some other properties
  specific to a car
Taxonomies are all around us

• It’s our nature to classify
• Many of these taxonomies are
  internal, arbitrary and personal
• A true taxonomy must be uniform
  and unambiguous
Other familiar taxonomies

• Dewey Decimal System
• Library of Congress System
• ICD-9/10 codes
• computer folder system
  – probably most common
    taxonomy in tech comm
And one I especially dig

• A taxonomy of wrongness!
  – www.fallacyfiles.org/taxonomy.html
We have metadata…why do we need
a taxonomy too?

• Where did that metadata come
  from?
  – You must have had some idea of how
    your content should be classified
  – If so, then you already have the
    beginnings of a taxonomy, at least in
    your head
  – So take it a step further
Metadata compliments taxonomy
and vice-versa
• Metadata describes an individual piece of
  content but doesn’t capture relationships
  very well.
• Metadata is part of content so updates
  can be unwieldy; better to maintain the
  model outside the content
• A taxonomy serves as a roadmap…it both
  describes current content and predicts
  future content
• A taxonomy highlights similarities (and
  differences) across products
• Metadata can pick up where taxonomy
  leaves off
What else are taxonomies good for?

• Controlled vocabularies
  – indexing
  – keywords
  – glossaries

• Searching/browsing/filtering
  – Faceted search
  – Filtering for custom doc publishing

• Content reuse
Amazon.com
So far…

• we’ve looked at hierarchical
  taxonomies
When hierarchy isn’t enough

 A Cockapoo is a kind of dog. It’s the
  product of a poodle and a Cocker
  Spaniel. A hierarchy cannot capture
  all these relationships.
There’s an alternative (polyarchical)
Purists might say…

• that you need different notations to
  express different kinds of
  relationships
• or that you must express the
  relationships uniformly
Maybe, maybe not

• You need what you need to capture
  the relationships you need to
  express
• No more, no less - KISS
• The relationships already exist; you
  are just using the taxonomy to
  express them
Decisions to make

• What kind of taxonomy:
  – hierarchical, polyarchical, something
    else?
• If hierarchical, how many levels?
• If polyarchical, what kinds of
  relationships and how designated?
• Tool to use? (meh)
• How to associate content with
  taxonomy?
Questions to ask
• What will the taxonomy be used for?
  – indexing, search, etc.
• Who are the users?
  – content creators, clients, SMEs, support, etc.
• What content will the taxonomy cover?
  – topics, images, demos, videos, etc.
• What are the scope and limits?
  – handling off-topic content—what to
    include/exclude
• What are the resources and constraints?
  – skills/expertise, timing, technology, funding,
    stakeholder roles, etc.
More questions to ask

• Who is responsible for development?
• What are secondary/contributor
  roles?
• How does taxonomy fit in with other
  metadata?
• How to handle ongoing support and
  maintenance?
Some first steps
• Start small—maybe just one small product
• Do content audit of everything the
  taxonomy will categorize
• Compare TOCs of existing deliverables
  – Find commonalities, differences
• Compare indexes of existing deliverables
  – Discover terms already in use
• Use folder structure
More first steps
• Assemble starting list of categories
  that cover existing content based on
  TOC, index and content audit
• Place existing content within
  taxonomy (on paper)
• Create taxonomy task force to
  review and refine
  – Avoid too many cooks
DITA Classification and Subject
Scheme
• Subject scheme
  – Defines controlled values (“buckets”)
    for classifying content
  – Defines relationships between those
    buckets
• Classification
  – Groups content into appropriate
    buckets
Subject classification scheme
subjectScheme map
<subjectScheme>
   <hasInstance>
      <subjectdef keys="product">
         <subjectdef keys="Widget"/>
            <subjectdef keys="module">
               <subjectdef keys="Meds"/>
               <subjectdef keys="AdminW"/>
            </subjectdef>
         </subjectdef>
         <subjectdef keys="Gadget"/>
            <subjectdef keys="module">
               <subjectdef keys="AdminG"/>
               <subjectdef keys="Labs"/>
            </subjectdef>
         </subjectdef>
      </subjectdef>
   </hasInstance>
</subjectScheme>
Associate topics with subjects
<map>
   <topicref href="t_configure_med.xml">
      <topicsubject>
         <subjectref keys="Meds"/>
         <subjectref keys="AdminW"/>
         <subjectref keys="AdminG"/>
      </topicsubject>
   </topicref>
</map>
Recommended reading/viewing

• The Accidental Taxonomist, Heather
  Hedden
• Organising Knowledge: Taxonomies,
  Knowledge, and Organisational
  Effectiveness, Patrick Lambe
• Joe Gelb’s presentation on
  subjectScheme:
  http://svdig.ditamap.com/videos/sv
  dig-2011-05-11.htm
Contact me



               Leigh White
      ElementalSource, LLC

elementalsource@gmail.com
              678.467.7706

More Related Content

Similar to Taxonomy

Some thoughts on social tagging
Some thoughts on social taggingSome thoughts on social tagging
Some thoughts on social taggingmarti_hearst
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker AIIM International
 
System Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptxSystem Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptxbarrettoleisabel
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of ReuseIXIASOFT
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of ReuseLeigh White
 
How To Go About Researching
How To Go About ResearchingHow To Go About Researching
How To Go About ResearchingSudhira H. S.
 
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYINFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYChris Okiki
 
Card Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful MetadataCard Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful MetadataRob Bogue
 
Designing an effective information architecture (
Designing an effective information architecture (Designing an effective information architecture (
Designing an effective information architecture (Vickey Bird
 
Library Research for Human Rights Guide
Library Research for Human Rights GuideLibrary Research for Human Rights Guide
Library Research for Human Rights GuideAnnelise Sklar
 
Analysing Qualitative Data
Analysing Qualitative DataAnalysing Qualitative Data
Analysing Qualitative DataMike Crabb
 
Michael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems RapidlyMichael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems RapidlyTEST Huddle
 
Library research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSDLibrary research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSDAnnelise Sklar
 
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics HackathonxAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics HackathonRussell Duhon
 
Thematic content analysis in psychology
Thematic content analysis in psychologyThematic content analysis in psychology
Thematic content analysis in psychologyDr. Chinchu C
 
Taxonomies & folksonomies
Taxonomies  & folksonomiesTaxonomies  & folksonomies
Taxonomies & folksonomiesAparna Sane
 
Enc1101 drafting unit 3
Enc1101 drafting unit 3Enc1101 drafting unit 3
Enc1101 drafting unit 3Heather Wayne
 
Chapter 4 common features of qualitative data analysis
Chapter 4 common features of qualitative data analysisChapter 4 common features of qualitative data analysis
Chapter 4 common features of qualitative data analysisMohd. Noor Abdul Hamid
 

Similar to Taxonomy (20)

Some thoughts on social tagging
Some thoughts on social taggingSome thoughts on social tagging
Some thoughts on social tagging
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker
 
System Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptxSystem Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptx
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
How To Go About Researching
How To Go About ResearchingHow To Go About Researching
How To Go About Researching
 
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYINFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
 
Card Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful MetadataCard Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful Metadata
 
Designing an effective information architecture (
Designing an effective information architecture (Designing an effective information architecture (
Designing an effective information architecture (
 
Library Research for Human Rights Guide
Library Research for Human Rights GuideLibrary Research for Human Rights Guide
Library Research for Human Rights Guide
 
Analysing Qualitative Data
Analysing Qualitative DataAnalysing Qualitative Data
Analysing Qualitative Data
 
Taxonomy Fundamentals Workshop
Taxonomy Fundamentals WorkshopTaxonomy Fundamentals Workshop
Taxonomy Fundamentals Workshop
 
Michael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems RapidlyMichael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems Rapidly
 
Library research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSDLibrary research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSD
 
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics HackathonxAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
 
Thematic content analysis in psychology
Thematic content analysis in psychologyThematic content analysis in psychology
Thematic content analysis in psychology
 
Taxonomies & folksonomies
Taxonomies  & folksonomiesTaxonomies  & folksonomies
Taxonomies & folksonomies
 
Week 2
Week 2Week 2
Week 2
 
Enc1101 drafting unit 3
Enc1101 drafting unit 3Enc1101 drafting unit 3
Enc1101 drafting unit 3
 
Chapter 4 common features of qualitative data analysis
Chapter 4 common features of qualitative data analysisChapter 4 common features of qualitative data analysis
Chapter 4 common features of qualitative data analysis
 

More from Leigh White

DITA 1.3 Keyscopes
DITA 1.3 KeyscopesDITA 1.3 Keyscopes
DITA 1.3 KeyscopesLeigh White
 
The Content Revolution: Managing the Move to a CCMS
The Content Revolution: Managing the Move to a CCMSThe Content Revolution: Managing the Move to a CCMS
The Content Revolution: Managing the Move to a CCMSLeigh White
 
To CMS or Not to CMS?
To CMS or Not to CMS?To CMS or Not to CMS?
To CMS or Not to CMS?Leigh White
 
Making the Mental Shift to Topic-Based Authoring and a CMS
Making the Mental Shift to Topic-Based Authoring and a CMSMaking the Mental Shift to Topic-Based Authoring and a CMS
Making the Mental Shift to Topic-Based Authoring and a CMSLeigh White
 
Information Architecture Explained
Information Architecture ExplainedInformation Architecture Explained
Information Architecture ExplainedLeigh White
 
Estimating time-tracking
Estimating time-trackingEstimating time-tracking
Estimating time-trackingLeigh White
 
Custom PDFs from the DITA OT
Custom PDFs from the DITA OTCustom PDFs from the DITA OT
Custom PDFs from the DITA OTLeigh White
 

More from Leigh White (7)

DITA 1.3 Keyscopes
DITA 1.3 KeyscopesDITA 1.3 Keyscopes
DITA 1.3 Keyscopes
 
The Content Revolution: Managing the Move to a CCMS
The Content Revolution: Managing the Move to a CCMSThe Content Revolution: Managing the Move to a CCMS
The Content Revolution: Managing the Move to a CCMS
 
To CMS or Not to CMS?
To CMS or Not to CMS?To CMS or Not to CMS?
To CMS or Not to CMS?
 
Making the Mental Shift to Topic-Based Authoring and a CMS
Making the Mental Shift to Topic-Based Authoring and a CMSMaking the Mental Shift to Topic-Based Authoring and a CMS
Making the Mental Shift to Topic-Based Authoring and a CMS
 
Information Architecture Explained
Information Architecture ExplainedInformation Architecture Explained
Information Architecture Explained
 
Estimating time-tracking
Estimating time-trackingEstimating time-tracking
Estimating time-tracking
 
Custom PDFs from the DITA OT
Custom PDFs from the DITA OTCustom PDFs from the DITA OT
Custom PDFs from the DITA OT
 

Recently uploaded

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 

Taxonomy

  • 1. Taxonomy: Do I need one? Leigh White ElementalSource, LLC
  • 2. Yes
  • 3. What I’ll talk about • What happens without a taxonomy • What a taxonomy is and does • Why a taxonomy is important • A few first development steps
  • 4. What I won’t talk about • All the different kinds of taxonomies • Details about development • Tools for development – except DITA subjectScheme (briefly!)
  • 6. What the he** IS that???
  • 7. Oh, let’s call it a… • Use the native name • Name it after something familiar that it’s kind of “like” • “Like” is murky; you have to define “like” – How it looks? Shape? Color? Size? – How it tastes? – How it acts?
  • 8. Earth apples, anyone? • aardappel (Dutch) • pomme de terre (French)
  • 10. We know this because • We have a taxonomy (Linnean classification) that specifies degrees of relationship between living things
  • 11. Distant cousins, at best apple potato Kingdom Plantae Plantae Phylum Anthophyta Anthophyta Class Eudicots Eudicots Order Rosales Solanales Family Rosaceae Solanaceae Genus Malus Solanum Species M. domestica S. tuberosum
  • 12. So, a taxonomy is • A way of defining “like” • A way of expressing relationships between things – We might already be instinctively aware of these relationships but need to formalize them • A way of discovering relationships between things • An information model
  • 13. Taxonomies are • typically organized by parent-child relationships • typically indicated by the phrase 'is a kind of' or 'is a subtype of' • the subtype has the same properties, behaviors, and constraints as the supertype plus one or more additional properties, behaviors, or constraints
  • 14. Uhh…what? • For example: car is a kind of vehicle, so any car is also a vehicle, but not every vehicle is a car • The level “car” is more constrained than the level “vehicle” • A car has all the properties of a vehicle plus some other properties specific to a car
  • 15. Taxonomies are all around us • It’s our nature to classify • Many of these taxonomies are internal, arbitrary and personal • A true taxonomy must be uniform and unambiguous
  • 16. Other familiar taxonomies • Dewey Decimal System • Library of Congress System • ICD-9/10 codes • computer folder system – probably most common taxonomy in tech comm
  • 17. And one I especially dig • A taxonomy of wrongness! – www.fallacyfiles.org/taxonomy.html
  • 18. We have metadata…why do we need a taxonomy too? • Where did that metadata come from? – You must have had some idea of how your content should be classified – If so, then you already have the beginnings of a taxonomy, at least in your head – So take it a step further
  • 19. Metadata compliments taxonomy and vice-versa • Metadata describes an individual piece of content but doesn’t capture relationships very well. • Metadata is part of content so updates can be unwieldy; better to maintain the model outside the content • A taxonomy serves as a roadmap…it both describes current content and predicts future content • A taxonomy highlights similarities (and differences) across products • Metadata can pick up where taxonomy leaves off
  • 20. What else are taxonomies good for? • Controlled vocabularies – indexing – keywords – glossaries • Searching/browsing/filtering – Faceted search – Filtering for custom doc publishing • Content reuse
  • 22.
  • 23.
  • 24.
  • 25. So far… • we’ve looked at hierarchical taxonomies
  • 26. When hierarchy isn’t enough  A Cockapoo is a kind of dog. It’s the product of a poodle and a Cocker Spaniel. A hierarchy cannot capture all these relationships.
  • 27. There’s an alternative (polyarchical)
  • 28. Purists might say… • that you need different notations to express different kinds of relationships • or that you must express the relationships uniformly
  • 29. Maybe, maybe not • You need what you need to capture the relationships you need to express • No more, no less - KISS • The relationships already exist; you are just using the taxonomy to express them
  • 30. Decisions to make • What kind of taxonomy: – hierarchical, polyarchical, something else? • If hierarchical, how many levels? • If polyarchical, what kinds of relationships and how designated? • Tool to use? (meh) • How to associate content with taxonomy?
  • 31. Questions to ask • What will the taxonomy be used for? – indexing, search, etc. • Who are the users? – content creators, clients, SMEs, support, etc. • What content will the taxonomy cover? – topics, images, demos, videos, etc. • What are the scope and limits? – handling off-topic content—what to include/exclude • What are the resources and constraints? – skills/expertise, timing, technology, funding, stakeholder roles, etc.
  • 32. More questions to ask • Who is responsible for development? • What are secondary/contributor roles? • How does taxonomy fit in with other metadata? • How to handle ongoing support and maintenance?
  • 33. Some first steps • Start small—maybe just one small product • Do content audit of everything the taxonomy will categorize • Compare TOCs of existing deliverables – Find commonalities, differences • Compare indexes of existing deliverables – Discover terms already in use • Use folder structure
  • 34. More first steps • Assemble starting list of categories that cover existing content based on TOC, index and content audit • Place existing content within taxonomy (on paper) • Create taxonomy task force to review and refine – Avoid too many cooks
  • 35. DITA Classification and Subject Scheme • Subject scheme – Defines controlled values (“buckets”) for classifying content – Defines relationships between those buckets • Classification – Groups content into appropriate buckets
  • 37. subjectScheme map <subjectScheme> <hasInstance> <subjectdef keys="product"> <subjectdef keys="Widget"/> <subjectdef keys="module"> <subjectdef keys="Meds"/> <subjectdef keys="AdminW"/> </subjectdef> </subjectdef> <subjectdef keys="Gadget"/> <subjectdef keys="module"> <subjectdef keys="AdminG"/> <subjectdef keys="Labs"/> </subjectdef> </subjectdef> </subjectdef> </hasInstance> </subjectScheme>
  • 38. Associate topics with subjects <map> <topicref href="t_configure_med.xml"> <topicsubject> <subjectref keys="Meds"/> <subjectref keys="AdminW"/> <subjectref keys="AdminG"/> </topicsubject> </topicref> </map>
  • 39. Recommended reading/viewing • The Accidental Taxonomist, Heather Hedden • Organising Knowledge: Taxonomies, Knowledge, and Organisational Effectiveness, Patrick Lambe • Joe Gelb’s presentation on subjectScheme: http://svdig.ditamap.com/videos/sv dig-2011-05-11.htm
  • 40. Contact me Leigh White ElementalSource, LLC elementalsource@gmail.com 678.467.7706