Annotation
TaxonWorks excels in annotating data. Tags, notes, custom data-attributes, citations, alternative value, confidence levels, attribution and more are all customizable by the user then usable to enrich and manage their projects.
TaxonWorks provides a nearly universally applicable set of "annotation" types that can be used on core and supporting data.
Use cases
Annotations have many potential uses in TaxonWorks. Some examples to get you thinking.
My data don't fit in TaxonWorks!
You're checking out TaxonWorks, it seems to have most of what you need, but the UI seems to be missing fields for key data that are critical to your project. See Data attributes
Export to a particular DarwinCore field
TaxonWorks maps its internal model to well over 50 Darwin Core fields automatically. In addition, users can add Controlled Vocabulary Terms that reference DarwinCore fields to further extend the data that can be exported. See Darwin core exportable URIs.
I need to make a group of records
As part of your SOP you are constantly needing to make temporary small groups of data to review and modify.
In TaxonWorks you can filter and select objects by Tags (and many other similar things). Tags can be batch created, destroyed in mass, and used to dynamically grow and shrink matrices, group your data into stages of a workflow, indicate things "TODO", filter records, add objects to loans, and much more. See Tags.
I need to annotate this data with how it came to be
In your project it's important to know (record a provenance trail) that some Observations came from one SOP, while others came from another. See Protocols.
User interface
Annotations are implemented in the UI in many different ways, but there are several key interfaces that are perhaps the most important.
Radial annotator
The radial annotator is used to call up and access the different types of annotations that are available for the object in question.
Manage controlled vocabulary terms task
The Manage controlled vocabular terms
task is used to customize the types of annotations you want to use. Think of it as creating the Keywords that your linking to your records, or the new column that you're adding to your table of data (e.g. names, collection objects, or collecting events).
"Custom fields"
UI inputs for data attributes.
If you are a project administrator you can make new fields (see Data attributes below) appear in the UI in various places.
- Create the "fields" (see Data attributes below) you need
- Click
Project
(top right), thenPreferences
(top left) - Select the model you want the new fields to appear for (e.g.
CollectionObject
) - Check all fields that you want to appear as input fields. Note that this creates new form-fields, you can always access and create data-attributes through the radial annotator.
Fields will now appear in the UI where pertinent, notably the Comprehensive digitization
task, the OTU quick forms
radial, and the New collecting event
task.
Annotation types
Controlled vocabulary terms
Many annotation types start with the creation of a controlled vocabulary term (CVT). This is done in the Manage controlled vocabulary terms
task.
All CVTs must have 2 parts:
- A
name
, or label - used to select the CVT in the interface, for display, etc. - A
definition
- don't assume that your label is understandable, now, or over time! Provide a definition so that others may interpret and apply this CVT term as your project intends it to be used.
A third attribute is the Uri
. This attribute is used to assert that the CVT in your project means the same things as data used in others projects.
Data attributes
Think of DataAttributes as custom columns on your table. There are two types in TaxonWorks, "Import", and "Internal". The latter are created in the user-interface, the former are used in the documentation of bulk-data import, typically for bespoke datasets. DataAttributes are key-value pairs that reference a record (or object. They reference the object the attribute is being attached to, a type of CVT called a Predicate
, and the value.
Customizing models
For example you might add a DataAttribute referencing a Predicate
you created called "TownshipAndRange" with the value "Smithville" to a CollectingEvent.
Darwin Core exportable URIs
TaxonWorks "knows" about certain DarwinCore fields that are not 1:1 mappable to its model. To add data to these fields, and have them export 1:1 (i.e. typing "Stuff; Things" results in "Stuff; Things" being exported into the field) to your DarwinCore exports do this:
- Determine if the DwC attribute you want to use is "known" by TaxonWorks. At present it's best to do this by looking at the code itself, here is the list. If you find your attribute (e.g.
asssociatedTaxa
) then note (cpy to your clipboard) the URI it corresponds to (in our examplehttp://rs.tdwg.org/dwc/terms/associatedTaxa
). - In the
Manage controlled vocabulary terms
task create a newPredicate
, name it typically 1:1 with the name of the DwC field, but this is required, the magic is in the Uri field. - Add the URI to your Predicate that you found in from the file above by pasting or typing it in.
You may want to add the field to directly to the UI, for example the New collecting event
task or Comprehensive specimen digitization
, see above for how to do this.
At this point you have a new field in TaxonWorks that will be exported to DwC when requested.
Note that on export no processing or validation is done on the export of these fields, you get in the export what you type in the field.
Tags
Tags are used an arbitrary label with your object. Think of tags as data attributes with no value. To create a type of Tag create a Keyword
in the CVT manager. You can apply tags in the radial annotator, in batch in filter interfaces, and in various other ways.
Note that technically a Tag is the link (or join) between a Keyword (CVT) and an Object, not the Keyword itself.
Protocols
Tagging an object with a Protocol asserts "some or all of the data in here came from this process. Create new protocols via Data
->Protocols
. You can attach PDFs describing your protocol to your Protocol to expand its description.
Alternate values
Alternate values reference a TaxonWorks field and provide a way to include a Translation, Abbreviation, Misspelling or Alternate spelling. To create one:
- Open the radial annotator for the record in question
- Select
Alternate values
- Select the type of alternate value (e.g. Abbreviation)
- Select the target field (e.g.
title
of a source) - Add the value, and click Create.
Alternate values are not allowed on the name
of a TaxonName, don't even think about requesting this.
Identifiers
Identifiers are broader subject (i.e. we need further discussion here). Briefly, there are two types in TaxonWorks, "Global", and "Local". The latter are in essence those types of identifiers that people thought really hard about, and tried to make universally applicable, the former, not so much.
In the real-world there are no-such-things as "unique" identifiers per object. TaxonWorks embraces this philosophy by allowing you to add as many identifiers to your data as you want. Identifiers can be used to say and do things like:
- Conceptually, my digital thing is the
same_as
your digital thing - If you find my identifier you'll likely be able to find my thing (e.g. a
Identifier::Local::CatalogNumber
that represents a physical labellocalized_to
my specimen) - The same things has two different "unique" identifiers, and that thing is my thing
- I'm tracking (integrating/referencing/relating) this thing in my project, but with minimal detail, you can get more details on this thing if you can find other things "out there" with the same Identifier
- My thing is the Person with this ORCiD (Identifier)
- My thing historically could be found (your results may vary now) with this Identifier
- I want to make labels for my specimens that reference my trip-codes so that I can quickly call them up, print them, assign them to Specimens, etc., and darn it my (Matt's) "MJY-YYYY" codes do not mean "Mark Jupiter Yahoo", but I need their (Mark's) "MJY" codes too. You can keep them the same, but different, using TaxonWorks' Namespaces.
- Print labels and barcodes in TaxonWorks
Notes
Notes are the most unrestricted annotation type, you're simply adding a block of text (Markdown is allowed) to a record. You should not assume that all notes of are of some type. You should also not use Notes because you want something to appear somewhere, it's much better to do this with a more defined annotation such as a data attribute.
Verifiers
A Verifier is a very generic way of saying "This Person verified this record". It links a Person object with a role "verifier" to your data. There are no sub-types of verification, for that functionality see Confidences.
Documentation
Some classes of data can be "documented", this simply means attaching a link to a binary document (e.g. pdf) to the record. The assertion being made is "this binary file contains the content of that thing", as in "This is the PDF of that Source".
Confidences
Projects can create their own arbitrary levels of Confidence (think data-quality) that they can use to annotate there data with. For example you might have a ConfidenceLevel "1 - Origin unknonw" or "B - Spelling check required".