SlideShare une entreprise Scribd logo
1  sur  39
Télécharger pour lire hors ligne
HyperMembrane Structures for
Open Source Cognitive Computing
Japanese Agency for Science and Technology
Tokyo, Japan
3 March, 2015
Jack Park
© 2015 TopicQuests Foundation CC by SA 4.0
The Present Situation
Upon this gifted age, in its dark hour,
Rains from the sky a meteoric shower
Of facts . . . they lie unquestioned,
uncombined.
Wisdom enough to leech us of our ill
Is daily spun; but there exists no loom
To weave it into fabric
Edna St. Vincent Millay, 1939
2
Topics To Cover
• Discovery, learning, problem solving
• Topic Maps
• OpenSherlock
• HyperMembranes
• Open Source
• Key reasons for building open source cognitive
systems
3
Cognitive Computing: My View
• Cognitive Computing is:
– Far less about what a computer knows
– Far more about how computers can
augment human cognitive capabilities
– Based on the J.C.R Licklider and
Douglas Engelbart augmentation work
J.C.R. Licklider
Douglas Engelbart
4Imgs: Wikipedia
A Domain-specific Problem Statement
• An Example:
– Do these two sentences say the same thing?
• CO2 is a causal factor in climate change.
• Climate change is caused by carbon dioxide.
• Problem Statement
– Software agents need elegant methods for
reading, representing, organizing, and modeling
information resources to support discovery and
answering questions.
5
A Framing Thought
• From [1]
– The understanding of global brain organization
and its large-scale integration remains a challenge
for modern neurosciences.
• To
– The understanding of global conversations about
topics that matter and their large-scale federation
remain a challenge for modern information
technology.
[1] Petri G, Expert P, Turkheimer F, Carhart-Harris R, Nutt D, Hellyer PJ, Vaccarino F. (2014)
Homological scaffolds of brain functional networks. J. R. Soc. Interface 11: 20140873.
6
Our Goals
• Improve Human-Tool Capabilities
• Augment existing analytic methods
– Increase opportunities for discovery
– Improve already sophisticated methods
• Build Looms
– Read documents
– Map and model topics read
– Weave information fabrics
Douglas Engelbart
7
Discovery
• Is it really possible for people to see
everything?
– Part of discovery is connecting dots not
yet connected.
– “Cognitive Agents” can help increase
chances of serendipity.
“Discovery consists of
seeing what everybody
has seen and thinking
what nobody has
thought.”
–Albert Szent-Györgyi
8
Related Work
• Commercial
– IBM Watson
– Wolfram Alpha
– Viv
– Saffron 10
– Clueda
– Siri
– Google Now
– Cortana
– …
• Open Source
– OAQA
– DeepDive
– OpenCog
– OpenNARS
– Watsonsim
– YodaQA
– AKSW OpenQA
– AKSW QA
– AquaLog
– OpenSherlock
– OpenIRIS (CALO)
– …
• Research
– Project Aristo
– Project Halo
– FREyA
– CASIA
– NLP-Reduce
– EIS Sina
– WDAqua ITM
– Intui2
– …
9
Biologically Inspired Design
• Humans are blessed with:
– Memory to keep concepts organized and
connected
– Internal mechanisms which map sensor data into
memory for processing and storage
– The abilities of complex, adaptive, anticipatory
systems
10
Memory: Introducing Topic Maps
• A Topic Map is like a library without all the books*
– A Topic Map is indexical
• Like a card catalog
– Each topic has its own representation
• Improving on a card catalog, a topic can be identified many
different ways
• Captures metadata and optionally content
– A Topic Map is relational
• Like a good road map
– Topics are connected by associations (relations)
– Topics point to their occurrences in the territory
– A Topic Map is organized
• Multiple records on the same topic are co-located (stored as one
topic) in the map
*a map is not its territory
11
TopicMap Structure
•Topics as Actors
•Topics as Relations
•Topics as Types
•Topics as Biographies
12
Processing Mechanisms
• Typically, software processes take the form of
variants of NLP (natural language processing)
– Parsers
– Cluster analysis
– Entity recognition
– Relation detection
– Role recognition
– Probabilistic methods
13
A Key Question in My Research
• Can a Topic Map learn (construct itself) by “reading” literature?
– Relevant issues:
• Bootstrapping
• Machine reading
– NLP
– Linguistics
– Statistics
– Analogy & Metaphor
– …
• Knowledge representation
• Model building
– Anticipation
• Weaving information fabrics
• Literature-based discovery
• Deep Question Answering
14
A Simple Example
• Read this sentence:
– Gene expression is caused by insoluble hormones
binding to a plasma membrane hormone receptor
• Topic Map recognizes:
– Gene expression  GeneExpression
– insoluble hormones  InsolubleHormone
– plasma membrane hormone receptor 
PlasmaMembraneReceptor
• Software agents transform:
– is caused by  Cause
– binding to  Binds
• Final semantic structure:
• { {InsolubleHormone, Binds, PlasmaMembraneReceptor},
Cause, GeneExpression }
15
Introducing OpenSherlock
• OpenSherlock is:
– A Topic Map for information resource identity and organization
– A HyperMembrane information fabric structure
– A society of agents system which can
• Read documents
• Process information resources
– Maintain the topic map
– Maintain the HyperMembrane
– Build and maintain models
– Perform discovery tasks
– Answer questions
– Agents are coordinated by:
• A blackboard system
• A dynamic task-based agenda
• Event propagation and handling
16
Observations 1
• A Topic Map is central to the key question, and
therefore to a thesis entailed by this research
– It serves as a kind of memory for social processes
– It provides a robust platform for subject identity
– It can also serve as a repository for domain-
specific vocabularies (ontologies, taxonomies,
naming conventions,…)
17
Observations 2
• A Topic Map is necessary but not sufficient to support
discovery, learning, or problem solving
– It really only provides a powerful indexical structure related to
the key artifacts in any universe of discourse:
• Actors
• Their relations
• Their states
• Rules, laws, theories,…
• To model those key artifacts, other representation
strategies are required
– Conceptual Graphs
– Qualitative Process Theory
– Belief Networks
– …
18
A Research Question
• What processes are available which, if
performed while harvesting (reading)
documents, can reduce the amount of
processing required later during question
answering?
– The question entails
• Synthesis of ontology
• Co-reference resolution
• Re-representation during question lifting
• …
19
A Working Hypothesis
• Process
– Build and maintain a content-addressable memory
of questions, claims, arguments, and evidence
fields.
• We call that a HyperMembrane
– Note:
• Every text object passed into the system is processed by
the same algorithms
– Sentences harvested from text
– Questions and responses posed by humans
20
Key Concept: HyperMembrane
• HyperMembrane is a key concept in the
working hypothesis that OpenSherlock seeks
to explore and demonstrate
– A growing graph as a collection of woven and
intersecting fabrics
• constructed from normalized tuples (n-tuples) which
are designed to reduce the amount of NLP required to
read documents
• such that intersections of fabrics occur where named
entities in the graph of n-tuples are the same
– Inspired by Ted Nelson’s ZigZag Architecture
21
Machine Reading in OpenSherlock
• Goals:
– Grow the topic map
• Topic Map then serves to support fabrication of higher-order
knowledge structures
– Conceptual Graphs
– Belief Networks
– QP Theory Models
– HyperMembrane
– …
• Process Loop:
– For a given document
• For every paragraph in that document
– For every sentence in each paragraph
» Read the sentence
22
Sentence Reading
• First Step:
– Process sentence into word grams*
• Second Step:
– Where possible
• Transform word grams into n-tuples**
• n-tuples form the HyperMembrane
* A container of words, from 1 to 8 words per container
** A container of symbols based on words in word grams
23
Process Sentence into WordGrams
• Approach
– Break sentence into word grams*
• WordGram objects are shared across sentences
– Count of sentence identifiers associated with each object
serves as basis for probabilistic models
– Either
• TopicMap recognizes terms
– Or
• Sentence is parsed by Link-Grammar Parser**
• TopicMap learns from parse results
*http://en.wikipedia.org/wiki/W-shingling **http://www.link.cs.cmu.edu/link/
24
Transform WordGrams to N-Tuples
• Normalized tuple (N-Tuple)
– A structure where the subject, predicate, and object are normalized
• Nouns and verbs transformed
– CO2, Carbon Dioxide, …  CO2
– causes, is caused by, …  cause
• Two sentence example
– CO2 is a cause of climate change.
– Climate change is caused by carbon dioxide.
– Result:
» { CO2, cause, climate change }
– Normalization processes include general and domain specific lenses
• Rule-based interpreters which detect structures
– Taxonomy
– Causality
– Biomedical
– Geophysical
– …
• Process models
– Built and maintained while reading
– Predict while reading – Anticipatory Reading
25
About N-Tuples
• An N-Tuple is a structured record of
– Topics in the topic map
– Those topics are harvested from text
• An N-Tuple takes the form:
– { Subject, Predicate, Object }
– Where
• Subject and/or Object can be one of:
– A topic from the topic map
– Another N-Tuple
• An N-Tuple is identified by the identities of the terms it contains
– When thinking in terms of terms (words) read from documents, the identities
(numeric representations) of those terms form the identity of the N-Tuple
object.
• N-Tuples are content addressable
• Disambiguation of subjects is a topic mapping process
– Learning means continuous refinement of subject identity
– Ambiguities can also be solved through human intervention
26
N-Tuples as HyperMembrane
Tuples
{A, Bind, B}
{{A, Bind, B}, Cause, X}
{X, Bind, D}
{{X, Bind, D}, Cause, Y}
27
A B
Bind
X
Y
D
Cause
Bind
Cause
Current State of OpenSherlock
ElasticSearch
Titan
or
Blazegraph
Ontology
Importer
Ontologies
PubMed
Reader
PubMed
Abstracts
HyperMembrane
Engine
TellAsk
UMLS
Importer
UMLS
28
Observations 3
• HyperMembrane is a reminding system
– HyperMembrane is a record of federated human
conversation
• Harvested from books, papers, and recorded
conversation
• Includes statistical properties of recorded utterances
– HyperMembrane records:
• That which is common
• That which is novel
– Possibly wrong
– Possibly game changing
29
TellAsk Interface
Conversation Tree
User can click a
node to select as
parent for any
user response
Response Type Selectors.
Selection required before
response.
User types here
Linear
conversation flow
Entry Forms Selector List
Map starts a new conversation
with entered topic
30
The Open Source Stack
• Persistence
– ElasticSearch
– Considering Titan
– Considering Blazegraph (Bigdata™ RDF Store)
• Libraries
– Many from Apache Foundation and others
– LinkGrammarParser (Java version)
– XML PullParser
– Simple JSON Parser
• Tools
– Eclipse
31
Summary
32
Current State of Development
• Aim to answer simple questions about
casuality
– Current focus on biomedical domain
– Current focus on two lenses
• Taxonomy
• Casuality
– No Conceptual Graphs
– No Process Models
– No Probabilistic Models
33
Future Work
• Aim to complete an anticipatory system
– Process models for anticipation
– Conceptual graphs
– Probabilistic models
– More lenses
• Pluggable lenses
• Adaptive lenses
– More domains
34
Why Do This?
• Augment human capabilities in problem
solving
• Participate in Open Science
35
Augmenting Social Sensemaking
1
2
3
Creating Ideas
Refining Connections
Connecting Ideas
Cancer patient
36
Participate in Open Science
37
Key Context for Open Science
• A planet-wide, collaborative quest for Global
Thrivability*.
– Issues include
• Sociological events
– Health, epidemics, wars,…
• Geophysical events
– Climate change, earthquakes, volcanoes, …
• Astrophysical events
– Asteroids, our Sun. …
* Let’s call the quest: EarthMoonshot
38
Completed Representation
antioxidants
kill
free radicals
Contraindicates
macrophages use
free radicals to
kill bacteria
Bacterial Infection Antioxidants
Because
Appropriate For
Compromised Host
Let us co-create Cognitive Agents for Discovery
jackpark@topicquests.org
OpenSherlock documents at: http://debategraph.org/OpenSherlock
Code emerging at: https://github.com/opensherlock/
Slides online at http://slideshare.net/jackpark/
Acknowledgments:
Bob Gleichauf
David Alexander Price
Arun Majumdar
Robert S. Stephenson
Mark Szpakowski
Martin Radley
Sherry Jones
Alexander Wenzowski
Ted Kahn
Patrick Durusau
39

Contenu connexe

Tendances

Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupJames Hendler
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebJames Hendler
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataJames Hendler
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide WebJames Hendler
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)James Hendler
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's PerspectiveJames Hendler
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for RealJames Hendler
 
Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)James Hendler
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic WebJames Hendler
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep LearningJames Hendler
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...James Hendler
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...James Hendler
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIJames Hendler
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update James Hendler
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)James Hendler
 
5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)Scott Lowe
 

Tendances (20)

Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic Markup
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the Web
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's Perspective
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for Real
 
Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep Learning
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
 
Towards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service InnovationTowards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service Innovation
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AI
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)
 

Similaire à HyperMembrane Structures for Open Source Cognitive Computing

Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Deep Kayal
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...Angelo Salatino
 
Recommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoRecommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoAshok Venkatesan
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxahmadbhattim005
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxUneezaRajpoot
 
Introduction to Object Oriented Programming
Introduction to Object Oriented ProgrammingIntroduction to Object Oriented Programming
Introduction to Object Oriented ProgrammingMoutaz Haddara
 
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkThe Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkKrishna Sankar
 
Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Bramesha B
 
Artificial Intelligent introduction or history
Artificial Intelligent introduction or historyArtificial Intelligent introduction or history
Artificial Intelligent introduction or historyArslan Sattar
 
Knowledge base system appl. p 1,2-ver1
Knowledge base system appl.  p 1,2-ver1Knowledge base system appl.  p 1,2-ver1
Knowledge base system appl. p 1,2-ver1Taymoor Nazmy
 
The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...Anubhav Jain
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagementSTIinnsbruck
 

Similaire à HyperMembrane Structures for Open Source Cognitive Computing (20)

Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Topic Pages. From articles to answers.
Topic Pages. From articles to answers.
 
Ngsp
NgspNgsp
Ngsp
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
 
Recommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoRecommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and Dato
 
DatoConference2015
DatoConference2015DatoConference2015
DatoConference2015
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptx
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptx
 
lec01.ppt
lec01.pptlec01.ppt
lec01.ppt
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
Introduction to Object Oriented Programming
Introduction to Object Oriented ProgrammingIntroduction to Object Oriented Programming
Introduction to Object Oriented Programming
 
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkThe Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
 
Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Searching of Web and Electronic Resources
Searching of Web and Electronic Resources
 
Lec-1.pdf
Lec-1.pdfLec-1.pdf
Lec-1.pdf
 
Artificial Intelligent introduction or history
Artificial Intelligent introduction or historyArtificial Intelligent introduction or history
Artificial Intelligent introduction or history
 
Knowledge base system appl. p 1,2-ver1
Knowledge base system appl.  p 1,2-ver1Knowledge base system appl.  p 1,2-ver1
Knowledge base system appl. p 1,2-ver1
 
The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
 
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
 

Plus de Jack Park

Topic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation TopicsTopic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation TopicsJack Park
 
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview Jack Park
 
Game-based Learning Landscape and Opportunities
Game-based Learning Landscape and OpportunitiesGame-based Learning Landscape and Opportunities
Game-based Learning Landscape and OpportunitiesJack Park
 
OpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence AugmentationOpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence AugmentationJack Park
 
Finding Meaning Across Boundaries
Finding Meaning Across BoundariesFinding Meaning Across Boundaries
Finding Meaning Across BoundariesJack Park
 
On Augmenting Patients
On Augmenting PatientsOn Augmenting Patients
On Augmenting PatientsJack Park
 
Open Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and ProspectsOpen Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and ProspectsJack Park
 
Knowledge Garden Overview
Knowledge Garden OverviewKnowledge Garden Overview
Knowledge Garden OverviewJack Park
 
Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...Jack Park
 
Questing in the age of Complex Systems
Questing in the age of Complex SystemsQuesting in the age of Complex Systems
Questing in the age of Complex SystemsJack Park
 
FutureOfText2015
FutureOfText2015FutureOfText2015
FutureOfText2015Jack Park
 
Towards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive ComputingTowards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive ComputingJack Park
 
Towards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData DiscoveryTowards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData DiscoveryJack Park
 
Feedback Loops and Knowledge Gardens
Feedback Loops and Knowledge GardensFeedback Loops and Knowledge Gardens
Feedback Loops and Knowledge GardensJack Park
 
A Future for Education: Some Core Thoughts
A Future for Education: Some Core ThoughtsA Future for Education: Some Core Thoughts
A Future for Education: Some Core ThoughtsJack Park
 
Game-based Learning
Game-based LearningGame-based Learning
Game-based LearningJack Park
 
Learning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling OpportunitiesLearning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling OpportunitiesJack Park
 
Condos in a Politics Knowledge Garden
Condos in a Politics Knowledge GardenCondos in a Politics Knowledge Garden
Condos in a Politics Knowledge GardenJack Park
 

Plus de Jack Park (20)

Tm keynote
Tm keynoteTm keynote
Tm keynote
 
Topic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation TopicsTopic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation Topics
 
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
 
Game-based Learning Landscape and Opportunities
Game-based Learning Landscape and OpportunitiesGame-based Learning Landscape and Opportunities
Game-based Learning Landscape and Opportunities
 
OpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence AugmentationOpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence Augmentation
 
Finding Meaning Across Boundaries
Finding Meaning Across BoundariesFinding Meaning Across Boundaries
Finding Meaning Across Boundaries
 
On Augmenting Patients
On Augmenting PatientsOn Augmenting Patients
On Augmenting Patients
 
Open Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and ProspectsOpen Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and Prospects
 
Knowledge Garden Overview
Knowledge Garden OverviewKnowledge Garden Overview
Knowledge Garden Overview
 
Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...
 
Questing in the age of Complex Systems
Questing in the age of Complex SystemsQuesting in the age of Complex Systems
Questing in the age of Complex Systems
 
Why?
Why?Why?
Why?
 
FutureOfText2015
FutureOfText2015FutureOfText2015
FutureOfText2015
 
Towards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive ComputingTowards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive Computing
 
Towards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData DiscoveryTowards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData Discovery
 
Feedback Loops and Knowledge Gardens
Feedback Loops and Knowledge GardensFeedback Loops and Knowledge Gardens
Feedback Loops and Knowledge Gardens
 
A Future for Education: Some Core Thoughts
A Future for Education: Some Core ThoughtsA Future for Education: Some Core Thoughts
A Future for Education: Some Core Thoughts
 
Game-based Learning
Game-based LearningGame-based Learning
Game-based Learning
 
Learning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling OpportunitiesLearning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling Opportunities
 
Condos in a Politics Knowledge Garden
Condos in a Politics Knowledge GardenCondos in a Politics Knowledge Garden
Condos in a Politics Knowledge Garden
 

Dernier

Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Dernier (20)

Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

HyperMembrane Structures for Open Source Cognitive Computing

  • 1. HyperMembrane Structures for Open Source Cognitive Computing Japanese Agency for Science and Technology Tokyo, Japan 3 March, 2015 Jack Park © 2015 TopicQuests Foundation CC by SA 4.0
  • 2. The Present Situation Upon this gifted age, in its dark hour, Rains from the sky a meteoric shower Of facts . . . they lie unquestioned, uncombined. Wisdom enough to leech us of our ill Is daily spun; but there exists no loom To weave it into fabric Edna St. Vincent Millay, 1939 2
  • 3. Topics To Cover • Discovery, learning, problem solving • Topic Maps • OpenSherlock • HyperMembranes • Open Source • Key reasons for building open source cognitive systems 3
  • 4. Cognitive Computing: My View • Cognitive Computing is: – Far less about what a computer knows – Far more about how computers can augment human cognitive capabilities – Based on the J.C.R Licklider and Douglas Engelbart augmentation work J.C.R. Licklider Douglas Engelbart 4Imgs: Wikipedia
  • 5. A Domain-specific Problem Statement • An Example: – Do these two sentences say the same thing? • CO2 is a causal factor in climate change. • Climate change is caused by carbon dioxide. • Problem Statement – Software agents need elegant methods for reading, representing, organizing, and modeling information resources to support discovery and answering questions. 5
  • 6. A Framing Thought • From [1] – The understanding of global brain organization and its large-scale integration remains a challenge for modern neurosciences. • To – The understanding of global conversations about topics that matter and their large-scale federation remain a challenge for modern information technology. [1] Petri G, Expert P, Turkheimer F, Carhart-Harris R, Nutt D, Hellyer PJ, Vaccarino F. (2014) Homological scaffolds of brain functional networks. J. R. Soc. Interface 11: 20140873. 6
  • 7. Our Goals • Improve Human-Tool Capabilities • Augment existing analytic methods – Increase opportunities for discovery – Improve already sophisticated methods • Build Looms – Read documents – Map and model topics read – Weave information fabrics Douglas Engelbart 7
  • 8. Discovery • Is it really possible for people to see everything? – Part of discovery is connecting dots not yet connected. – “Cognitive Agents” can help increase chances of serendipity. “Discovery consists of seeing what everybody has seen and thinking what nobody has thought.” –Albert Szent-Györgyi 8
  • 9. Related Work • Commercial – IBM Watson – Wolfram Alpha – Viv – Saffron 10 – Clueda – Siri – Google Now – Cortana – … • Open Source – OAQA – DeepDive – OpenCog – OpenNARS – Watsonsim – YodaQA – AKSW OpenQA – AKSW QA – AquaLog – OpenSherlock – OpenIRIS (CALO) – … • Research – Project Aristo – Project Halo – FREyA – CASIA – NLP-Reduce – EIS Sina – WDAqua ITM – Intui2 – … 9
  • 10. Biologically Inspired Design • Humans are blessed with: – Memory to keep concepts organized and connected – Internal mechanisms which map sensor data into memory for processing and storage – The abilities of complex, adaptive, anticipatory systems 10
  • 11. Memory: Introducing Topic Maps • A Topic Map is like a library without all the books* – A Topic Map is indexical • Like a card catalog – Each topic has its own representation • Improving on a card catalog, a topic can be identified many different ways • Captures metadata and optionally content – A Topic Map is relational • Like a good road map – Topics are connected by associations (relations) – Topics point to their occurrences in the territory – A Topic Map is organized • Multiple records on the same topic are co-located (stored as one topic) in the map *a map is not its territory 11
  • 12. TopicMap Structure •Topics as Actors •Topics as Relations •Topics as Types •Topics as Biographies 12
  • 13. Processing Mechanisms • Typically, software processes take the form of variants of NLP (natural language processing) – Parsers – Cluster analysis – Entity recognition – Relation detection – Role recognition – Probabilistic methods 13
  • 14. A Key Question in My Research • Can a Topic Map learn (construct itself) by “reading” literature? – Relevant issues: • Bootstrapping • Machine reading – NLP – Linguistics – Statistics – Analogy & Metaphor – … • Knowledge representation • Model building – Anticipation • Weaving information fabrics • Literature-based discovery • Deep Question Answering 14
  • 15. A Simple Example • Read this sentence: – Gene expression is caused by insoluble hormones binding to a plasma membrane hormone receptor • Topic Map recognizes: – Gene expression  GeneExpression – insoluble hormones  InsolubleHormone – plasma membrane hormone receptor  PlasmaMembraneReceptor • Software agents transform: – is caused by  Cause – binding to  Binds • Final semantic structure: • { {InsolubleHormone, Binds, PlasmaMembraneReceptor}, Cause, GeneExpression } 15
  • 16. Introducing OpenSherlock • OpenSherlock is: – A Topic Map for information resource identity and organization – A HyperMembrane information fabric structure – A society of agents system which can • Read documents • Process information resources – Maintain the topic map – Maintain the HyperMembrane – Build and maintain models – Perform discovery tasks – Answer questions – Agents are coordinated by: • A blackboard system • A dynamic task-based agenda • Event propagation and handling 16
  • 17. Observations 1 • A Topic Map is central to the key question, and therefore to a thesis entailed by this research – It serves as a kind of memory for social processes – It provides a robust platform for subject identity – It can also serve as a repository for domain- specific vocabularies (ontologies, taxonomies, naming conventions,…) 17
  • 18. Observations 2 • A Topic Map is necessary but not sufficient to support discovery, learning, or problem solving – It really only provides a powerful indexical structure related to the key artifacts in any universe of discourse: • Actors • Their relations • Their states • Rules, laws, theories,… • To model those key artifacts, other representation strategies are required – Conceptual Graphs – Qualitative Process Theory – Belief Networks – … 18
  • 19. A Research Question • What processes are available which, if performed while harvesting (reading) documents, can reduce the amount of processing required later during question answering? – The question entails • Synthesis of ontology • Co-reference resolution • Re-representation during question lifting • … 19
  • 20. A Working Hypothesis • Process – Build and maintain a content-addressable memory of questions, claims, arguments, and evidence fields. • We call that a HyperMembrane – Note: • Every text object passed into the system is processed by the same algorithms – Sentences harvested from text – Questions and responses posed by humans 20
  • 21. Key Concept: HyperMembrane • HyperMembrane is a key concept in the working hypothesis that OpenSherlock seeks to explore and demonstrate – A growing graph as a collection of woven and intersecting fabrics • constructed from normalized tuples (n-tuples) which are designed to reduce the amount of NLP required to read documents • such that intersections of fabrics occur where named entities in the graph of n-tuples are the same – Inspired by Ted Nelson’s ZigZag Architecture 21
  • 22. Machine Reading in OpenSherlock • Goals: – Grow the topic map • Topic Map then serves to support fabrication of higher-order knowledge structures – Conceptual Graphs – Belief Networks – QP Theory Models – HyperMembrane – … • Process Loop: – For a given document • For every paragraph in that document – For every sentence in each paragraph » Read the sentence 22
  • 23. Sentence Reading • First Step: – Process sentence into word grams* • Second Step: – Where possible • Transform word grams into n-tuples** • n-tuples form the HyperMembrane * A container of words, from 1 to 8 words per container ** A container of symbols based on words in word grams 23
  • 24. Process Sentence into WordGrams • Approach – Break sentence into word grams* • WordGram objects are shared across sentences – Count of sentence identifiers associated with each object serves as basis for probabilistic models – Either • TopicMap recognizes terms – Or • Sentence is parsed by Link-Grammar Parser** • TopicMap learns from parse results *http://en.wikipedia.org/wiki/W-shingling **http://www.link.cs.cmu.edu/link/ 24
  • 25. Transform WordGrams to N-Tuples • Normalized tuple (N-Tuple) – A structure where the subject, predicate, and object are normalized • Nouns and verbs transformed – CO2, Carbon Dioxide, …  CO2 – causes, is caused by, …  cause • Two sentence example – CO2 is a cause of climate change. – Climate change is caused by carbon dioxide. – Result: » { CO2, cause, climate change } – Normalization processes include general and domain specific lenses • Rule-based interpreters which detect structures – Taxonomy – Causality – Biomedical – Geophysical – … • Process models – Built and maintained while reading – Predict while reading – Anticipatory Reading 25
  • 26. About N-Tuples • An N-Tuple is a structured record of – Topics in the topic map – Those topics are harvested from text • An N-Tuple takes the form: – { Subject, Predicate, Object } – Where • Subject and/or Object can be one of: – A topic from the topic map – Another N-Tuple • An N-Tuple is identified by the identities of the terms it contains – When thinking in terms of terms (words) read from documents, the identities (numeric representations) of those terms form the identity of the N-Tuple object. • N-Tuples are content addressable • Disambiguation of subjects is a topic mapping process – Learning means continuous refinement of subject identity – Ambiguities can also be solved through human intervention 26
  • 27. N-Tuples as HyperMembrane Tuples {A, Bind, B} {{A, Bind, B}, Cause, X} {X, Bind, D} {{X, Bind, D}, Cause, Y} 27 A B Bind X Y D Cause Bind Cause
  • 28. Current State of OpenSherlock ElasticSearch Titan or Blazegraph Ontology Importer Ontologies PubMed Reader PubMed Abstracts HyperMembrane Engine TellAsk UMLS Importer UMLS 28
  • 29. Observations 3 • HyperMembrane is a reminding system – HyperMembrane is a record of federated human conversation • Harvested from books, papers, and recorded conversation • Includes statistical properties of recorded utterances – HyperMembrane records: • That which is common • That which is novel – Possibly wrong – Possibly game changing 29
  • 30. TellAsk Interface Conversation Tree User can click a node to select as parent for any user response Response Type Selectors. Selection required before response. User types here Linear conversation flow Entry Forms Selector List Map starts a new conversation with entered topic 30
  • 31. The Open Source Stack • Persistence – ElasticSearch – Considering Titan – Considering Blazegraph (Bigdata™ RDF Store) • Libraries – Many from Apache Foundation and others – LinkGrammarParser (Java version) – XML PullParser – Simple JSON Parser • Tools – Eclipse 31
  • 33. Current State of Development • Aim to answer simple questions about casuality – Current focus on biomedical domain – Current focus on two lenses • Taxonomy • Casuality – No Conceptual Graphs – No Process Models – No Probabilistic Models 33
  • 34. Future Work • Aim to complete an anticipatory system – Process models for anticipation – Conceptual graphs – Probabilistic models – More lenses • Pluggable lenses • Adaptive lenses – More domains 34
  • 35. Why Do This? • Augment human capabilities in problem solving • Participate in Open Science 35
  • 36. Augmenting Social Sensemaking 1 2 3 Creating Ideas Refining Connections Connecting Ideas Cancer patient 36
  • 37. Participate in Open Science 37
  • 38. Key Context for Open Science • A planet-wide, collaborative quest for Global Thrivability*. – Issues include • Sociological events – Health, epidemics, wars,… • Geophysical events – Climate change, earthquakes, volcanoes, … • Astrophysical events – Asteroids, our Sun. … * Let’s call the quest: EarthMoonshot 38
  • 39. Completed Representation antioxidants kill free radicals Contraindicates macrophages use free radicals to kill bacteria Bacterial Infection Antioxidants Because Appropriate For Compromised Host Let us co-create Cognitive Agents for Discovery jackpark@topicquests.org OpenSherlock documents at: http://debategraph.org/OpenSherlock Code emerging at: https://github.com/opensherlock/ Slides online at http://slideshare.net/jackpark/ Acknowledgments: Bob Gleichauf David Alexander Price Arun Majumdar Robert S. Stephenson Mark Szpakowski Martin Radley Sherry Jones Alexander Wenzowski Ted Kahn Patrick Durusau 39