TaxonWorks excels at annotating data. Tags, notes, custom data attributes, citations, alternate values, confidence levels, attribution, and more are all customizable by the user and then usable to enrich and manage their projects.
TaxonWorks provides a nearly universally applicable set of "annotation" types that can be used on core and supporting data.
Annotations have many potential uses in TaxonWorks. Here are some examples to get you thinking.
My data don't fit in TaxonWorks!
Export to a particular DarwinCore field
TaxonWorks maps its internal model to well over 50 Darwin Core fields automatically. In addition, users can add Controlled Vocabulary Terms that reference Darwin Core fields to further extend the data that can be exported. See Darwin Core exportable URIs.
I need to make a group of records
As part of your SOP you constantly need to make small, temporary groups of data to review and modify.
In TaxonWorks you can filter and select objects by Tags (and many other similar things). Tags can be batch created, destroyed en masse, and used to dynamically grow and shrink matrices, group your data into stages of a workflow, mark things as "TODO", filter records, add objects to loans, and much more. See Tags.
I need to annotate this data with how it came to be
Annotations are implemented in the UI in many different ways, but there are several key interfaces that are perhaps the most important.
The radial annotator is used to call up and access the different types of annotations that are available for the object in question.
Manage controlled vocabulary terms task
The Manage controlled vocabulary terms task is used to customize the types of annotations you want to use. Think of it as creating the Keywords that you're linking to your records, or the new column that you're adding to your table of data (e.g. names, collection objects, or collecting events).
UI inputs for data attributes.
If you are a project administrator you can make new fields (see Data attributes below) appear in the UI in various places.
- Create the "fields" (see Data attributes below) you need
Project (top right), then
- Select the model you want the new fields to appear for (e.g.
- Check all fields that you want to appear as input fields. Note that this only creates new form fields; you can always access and create data attributes through the radial annotator.
Fields will now appear in the UI where pertinent, notably the
Comprehensive digitization task, the
OTU quick forms radial, and the
New collecting event task.
Controlled vocabulary terms
Many annotation types start with the creation of a controlled vocabulary term (CVT). This is done in the
Manage controlled vocabulary terms task.
All CVTs must have 2 parts:
name, or label - used to select the CVT in the interface, for display, etc.
definition - don't assume that your label is understandable, now or over time! Provide a definition so that others can interpret and apply this CVT as your project intends it to be used.
A third attribute is the
Uri. This attribute is used to assert that the CVT in your project means the same thing as data used in other projects.
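As a rough illustration of the three CVT attributes described above, here is a minimal Python sketch. The class and function names are hypothetical and are not the TaxonWorks implementation; the sketch only assumes that name and definition are required, that the Uri is optional, and that a shared Uri is what asserts two terms mean the same thing.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ControlledVocabularyTerm:
    """Hypothetical stand-in for a CVT, not the TaxonWorks class."""

    name: str
    definition: str
    uri: Optional[str] = None  # optional third attribute

    def __post_init__(self) -> None:
        # Both required parts must be present and non-blank.
        if not self.name.strip():
            raise ValueError("a CVT needs a name")
        if not self.definition.strip():
            raise ValueError("a CVT needs a definition")


def same_concept(a: ControlledVocabularyTerm, b: ControlledVocabularyTerm) -> bool:
    """Treat two terms as equivalent only when both carry the same URI."""
    return a.uri is not None and a.uri == b.uri


# The URI below is a placeholder, not a real vocabulary term.
mine = ControlledVocabularyTerm(
    "habitat", "Where the specimen was found.", "http://example.org/terms/habitat"
)
theirs = ControlledVocabularyTerm(
    "Habitat type", "Collecting habitat.", "http://example.org/terms/habitat"
)
print(same_concept(mine, theirs))  # True
```

Note that the labels differ between the two terms; in this sketch only the Uri carries the cross-project assertion.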
Think of DataAttributes as custom columns on your table. There are two types in TaxonWorks, "Import" and "Internal". The latter are created in the user interface; the former are used to document bulk-data imports, typically for bespoke datasets. DataAttributes are key-value pairs that reference a record (or object). They reference the object the attribute is being attached to, a type of CVT called a
Predicate, and the value.
For example you might add a DataAttribute referencing a
Predicate you created called "TownshipAndRange" with the value "Smithville" to a CollectingEvent.
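The "custom column" idea can be sketched in a few lines of Python. This is a conceptual model only: the class names, the `object_id` value, and the `as_columns` helper are assumptions made for illustration and do not reflect the TaxonWorks schema or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Predicate:
    """Hypothetical CVT of type Predicate: the 'column name'."""

    name: str
    definition: str


@dataclass(frozen=True)
class DataAttribute:
    """Hypothetical triple: the annotated object, a Predicate, and a value."""

    object_type: str  # e.g. "CollectingEvent"
    object_id: int
    predicate: Predicate
    value: str


def as_columns(attributes, object_type, object_id):
    """Collect the attributes on one record into a {predicate name: value} row.

    If a predicate repeats on the same record, the last value wins here.
    """
    return {
        a.predicate.name: a.value
        for a in attributes
        if a.object_type == object_type and a.object_id == object_id
    }


township = Predicate("TownshipAndRange", "Township and range of the locality.")
attributes = [DataAttribute("CollectingEvent", 1, township, "Smithville")]
print(as_columns(attributes, "CollectingEvent", 1))  # {'TownshipAndRange': 'Smithville'}
```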
Darwin Core exportable URIs
TaxonWorks "knows" about certain DarwinCore fields that are not 1:1 mappable to its model. To add data to these fields, and have them export 1:1 (i.e. typing "Stuff; Things" results in "Stuff; Things" being exported into the field) to your DarwinCore exports do this:
- Determine if the DwC attribute you want to use is "known" by TaxonWorks. At present it's best to do this by looking at the code itself, here is the list. If you find your attribute (e.g.
associatedTaxa) then note (copy to your clipboard) the URI it corresponds to (in our example
- In the
Manage controlled vocabulary terms task create a new
Predicate. It is typical to name it 1:1 with the name of the DwC field, but this is not required; the magic is in the Uri field.
- Add the URI that you found in the file above to your Predicate by pasting or typing it in.
You may want to add the field directly to the UI, for example the
New collecting event task or
Comprehensive specimen digitization, see above for how to do this.
At this point you have a new field in TaxonWorks that will be exported to DwC when requested.
Note that no processing or validation is done on these fields at export: what you type in the field is exactly what you get in the export.
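The pass-through behaviour described above can be pictured with a small Python sketch. It is not how TaxonWorks implements its export: the lookup table, the function name, and the `example.org` URIs are placeholders invented for illustration. Substitute the URI you copied from the TaxonWorks list.

```python
# Hypothetical lookup from a Predicate's URI to a Darwin Core column name.
# The URI is a placeholder, not the value TaxonWorks actually recognizes.
KNOWN_DWC_URIS = {
    "http://example.org/terms/associatedTaxa": "associatedTaxa",
}


def export_row(data_attributes):
    """Build one export row from (predicate_uri, value) pairs.

    Values attached to a known URI are copied verbatim: no trimming,
    splitting, or validation. Unknown URIs are skipped.
    """
    row = {}
    for uri, value in data_attributes:
        column = KNOWN_DWC_URIS.get(uri)
        if column is not None:
            row[column] = value
    return row


row = export_row([
    ("http://example.org/terms/associatedTaxa", "Stuff; Things"),
    ("http://example.org/terms/notExported", "ignored"),
])
print(row)  # {'associatedTaxa': 'Stuff; Things'}
```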
Tags are used to associate an arbitrary label with your object. Think of tags as data attributes with no value. To create a type of Tag create a
Keyword in the CVT manager. You can apply tags in the radial annotator, in batch in filter interfaces, and in various other ways.
Note that technically a Tag is the link (or join) between a Keyword (CVT) and an Object, not the Keyword itself.
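To make the Keyword/Tag distinction concrete, here is a minimal Python sketch in which a Tag is just the join between a Keyword and an object. All names are hypothetical and chosen for illustration; this is not the TaxonWorks data model or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Keyword:
    """Hypothetical CVT of type Keyword: defined once, reused many times."""

    name: str
    definition: str


@dataclass(frozen=True)
class Tag:
    """Hypothetical join row linking one Keyword to one object."""

    keyword: Keyword
    object_type: str
    object_id: int


def tagged_ids(tags, keyword, object_type):
    """Return the sorted ids of objects of one type carrying the keyword."""
    return sorted(
        {t.object_id for t in tags if t.keyword == keyword and t.object_type == object_type}
    )


todo = Keyword("TODO", "Record still needs review.")

# Batch-create tags for a temporary working group of records.
tags = [Tag(todo, "CollectionObject", i) for i in (3, 1, 2)]
print(tagged_ids(tags, todo, "CollectionObject"))  # [1, 2, 3]

# Destroying the tags empties the group; the Keyword itself is untouched.
tags = [t for t in tags if t.keyword != todo]
print(tagged_ids(tags, todo, "CollectionObject"))  # []
```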
Tagging an object with a Protocol asserts "some or all of the data in here came from this process". Create new protocols via
Protocols. You can attach PDFs describing your protocol to your Protocol to expand its description.
Alternate values reference a TaxonWorks field and provide a way to include a Translation, Abbreviation, Misspelling or Alternate spelling. To create one:
- Open the radial annotator for the record in question
- Select the type of alternate value (e.g. Abbreviation)
- Select the target field (e.g.
title of a source)
- Add the value, and click Create.
Alternate values are not allowed on the
name of a TaxonName, don't even think about requesting this.
Identifiers are a broader subject (i.e. we need further discussion here). Briefly, there are two types in TaxonWorks, "Global" and "Local". The former are in essence those types of identifiers that people thought really hard about and tried to make universally applicable; the latter, not so much.
In the real world there is no such thing as a single "unique" identifier per object. TaxonWorks embraces this philosophy by allowing you to add as many identifiers to your data as you want. Identifiers can be used to say and do things like:
- Conceptually, my digital thing is the
same_as your digital thing
- If you find my identifier you'll likely be able to find my thing (e.g. an
Identifier::Local::CatalogNumber that represents a physical label)
- The same thing has two different "unique" identifiers, and that thing is my thing
- I'm tracking (integrating/referencing/relating) this thing in my project, but with minimal detail, you can get more details on this thing if you can find other things "out there" with the same Identifier
- My thing is the Person with this ORCiD (Identifier)
- My thing historically could be found (your results may vary now) with this Identifier
- I want to make labels for my specimens that reference my trip-codes so that I can quickly call them up, print them, assign them to Specimens, etc., and darn it my (Matt's) "MJY-YYYY" codes do not mean "Mark Jupiter Yahoo", but I need their (Mark's) "MJY" codes too. You can keep them the same, but different, using TaxonWorks' Namespaces.
- Print labels and barcodes in TaxonWorks
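Two of the ideas in the list above, many identifiers per object and identical codes kept apart by Namespaces, can be sketched in Python as follows. The `IdentifierIndex` class, the namespace names, and the object labels are all hypothetical and exist only for illustration; they are not the TaxonWorks implementation.

```python
class IdentifierIndex:
    """Hypothetical lookup keyed by (namespace, value)."""

    def __init__(self):
        self._by_key = {}     # (namespace, value) -> object label
        self._by_object = {}  # object label -> list of (namespace, value)

    def add(self, namespace, value, obj):
        key = (namespace, value)
        existing = self._by_key.get(key)
        if existing is not None:
            if existing != obj:
                raise ValueError(f"{key} already identifies {existing}")
            return  # same identifier on the same object: nothing to do
        self._by_key[key] = obj
        self._by_object.setdefault(obj, []).append(key)

    def find(self, namespace, value):
        """Return the object with this identifier, or None."""
        return self._by_key.get((namespace, value))

    def identifiers_for(self, obj):
        """Return every identifier attached to one object."""
        return list(self._by_object.get(obj, []))


index = IdentifierIndex()
# The same printed code, kept distinct by its namespace.
index.add("Matt trip codes", "MJY-2020", "CollectingEvent 1")
index.add("Mark trip codes", "MJY-2020", "CollectingEvent 2")
# One object may carry several identifiers.
index.add("Old field notebook", "17", "CollectingEvent 1")

print(index.find("Mark trip codes", "MJY-2020"))  # CollectingEvent 2
print(len(index.identifiers_for("CollectingEvent 1")))  # 2
```

In this sketch uniqueness is enforced only within a namespace, which is why the two "MJY-2020" codes can coexist.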
Notes are the most unrestricted annotation type: you're simply adding a block of text (Markdown is allowed) to a record. You should not assume that all notes are of some type. You should also not use Notes just because you want something to appear somewhere; it's much better to do this with a more defined annotation such as a data attribute.
A Verifier is a very generic way of saying "This Person verified this record". It links a Person object with a role "verifier" to your data. There are no sub-types of verification, for that functionality see Confidences.
Some classes of data can be "documented". This simply means attaching a link to a binary document (e.g. a PDF) to the record. The assertion being made is "this binary file contains the content of that thing", as in "This is the PDF of that Source".
Projects can create their own arbitrary levels of Confidence (think data quality) that they can use to annotate their data. For example you might have a ConfidenceLevel "1 - Origin unknown" or "B - Spelling check required".