Structuring Collective Knowledge, v2 : Interpreting Layered Knowledge
Collaborative notes — Edit these notes
Background examples:
Human Cell Atlas (2019) [aligned multi-omic taxo; origin & data needs]
1632 (essay, authors’ manual) [collectively authored sci-fi]
Life - Slime molds; Íslendingabók; Addgene
World - Geograph (ideas); Google Sky; iNaturalist; Zooniverse
Research - WikiData + data underlays; PubPub; CERN + Zenodo
Synthesis - Roam + Codex + mind maps;
…add yours here!
[[flancian]] took notes here, will transcribe later:
anagora.org/node/iap21 | agora for this agenda
[[metasj]]
[[agnes cameron]] works with [[metasj]], part of knowledge collaboration innovation data. In [[uk]].
[[samir ghosh]] runs a [[maker space]]. Does [[vr]].
[[sarah s]] [[neuroscience]] and [[machine learning]]. [[crowdsourcing]] to probe [[human perception]].
[[amr kayid]] [[munich]] [[research engineer]] at [[open mind]]. [[reinforcement learning]].
[[thomas renkert]] [[religious studies]] [[knowledge representation]].
[[kevin]] (pending)
[[favour kelvin]]
Jose Labra
Add your name + current projects / tidbits below
Codex Editor : paper
Go-links : can this be made global? A global authority file
Coronavirus : collecting data on those who have passed away
In Austria — the lack of such initiatives is itself a topic of discussion
US: the NYT list of names; historical parallel: the Yale AIDS memorial
ImageNet : human annotations of human images; linked to WordNet
WordNet: grouped words into synsets (concepts)
Images associated to 20k synsets, via URLs (imgs not owned), tagged
~> used to train ResNet, SENet, BigGAN —> inception
Thomas: want to see tools that can be used to represent text in context.
Suggestions: Quotebacks / ObjectNet / OpenMined / Dall-E / Are.na
Visual concept vocabulary (like a “map” of GAN latent space) — SS, SG
How to structure data from collective labelling —> Toward a Topology of Visual Experience
Annotated Hub of Science — Amr, SJ, SG, EI
(approaches to hypertexts + research)
Connector Library (electrical + physical - UCK ) — AC, SG
(visual, geometric, parametric descriptors; HTMAE)
-> some kind of nodal graph? (connect X to Y with Z properties)
Recipes + Food sustainability — EI, SJ
(important + good model, clear requirements)
Movebank + eBird? — FK, AC?
(mix of researchers + crowd contributions)
Meta - wikilinks + query language (start at root, apply constraints)
(what do we need to streamline this collab)
SG: Specs for network graphs or … hierarchical datasets? When you click on something: be able to render arbitrary doc-set or html snippet? I’ve been thinking about latent space for a while; in the uni, people talk about getting meaningful info out of a neural net / that’s network data of its own, that could use complex viz.
AC: that’s really interesting — kg problems are often sth that sits b/t a standard network graph and a more structured thing.
Choosing viz and hierarchy models?
Keyword branching out into other ways.
How to specify a choice of graph viz along w/ what you render on mouseover/clicking on nodes?
Let users shove away information / bring relevant info to the front.
Neo4j : can we improve on both the defaults and how we interact with them?
New proposal: capturing graph-viz approaches + styles?
are.na : structuring a set of X (viz types?)
web apps can sit on top of a. channels. [YT chan -> a.chan -> player]
scoby (a.chan) —> website
code : a list of awesome graph viz? [awesome lists as meta-layer]
HUDs and GUIs: annotated reviews of designs in fiction (film)
“Let’s do better than force-directed graphs”
Perpendicular Institute (and its performance hacks)
[Splatoon 2 getting shut down vs Ring Fit + proprietary hardware…]
Working w/ public knowledge graphs, building new visualizations + overlays
Room: Andra, Blake, FK, ET, SG, AC, SS, SK
You don’t always know what you mean to ask until you start seeing some answers. Example of that — Google Translate (showing potential results, letting you refine the result based on possible alternatives for each part of a paragraph)
Postive questions to answer:
What Q’s / queries do you want to answering?
What datasets are you using?
What audiences might use this, or contribute to it?
What results in the world (here: regulations, food guidance) might need it?
Are there natural focused subsets to develop along the way?
Think about a single Overlay, a few Underlay collections, and an Interlay spec that could link them / handle any interchange + interfacing needed.
Obstacles:
What does it take for data currently under license / not being shared to become usable in this way?
Maintenance:
What curation and updating are happening? are needed?
Iteration, persistence:
What steps can be iterated? Where there are obstacles, is there a related step that is easier to start with (at another level of abstraction or access, w/ a similar source or topic)? Clearly describing an obstacle is a first step towards working around it.
Network: How does this fit into other popular / existing / needed sources?
Comments:
To what degree can we use half-closed data to make tools for the user?
You can always get metadata or analyze closed data and share that.
Countering the zero-sum game narrative: Using a model to derive the optimal health outcomes is actually very similar to a model that optimises economic outcomes: you have an optimisation question where you can minimise the summation of costs due to both cases and restrictions
source — Oxford tool: http://epidemicforecasting.com allows you to modulate different aspects that affect the reproduction rate
More difficult to find data on what different COVID measures will cost. Can also account for vaccination models, including staging etc.
Where is the data?
‘Towards an optimal policy dashboard’ — want an idea of costs and cases caused by different policies.
SJ: Q: How much do your choice of data sources, derivatives (these visuals), and audiences affect how effective this work is? [how effective it feels; empirically the feedback you get, if appropriate]