Samples of groundwater were collected from sites across the southern Murray Darling Basin on 2 occasions and analysed to characterise the microbial communities in the water. Water quality data were also collected.
Microbial assemblage data derived from Illumina sequencing of 16S rDNA amplicons. Sequence data were processed using Greenfield Hybrid Analysis Pipeline (GHAP) v2.2, created by CSIRO Australia (available at https://cloudstor.aarnet.edu.au/plus/s/ pUvwME6fMeCcX - zL). Taxonomic assignment is based on the Ribosomal Database Project (RDP) Classifier (Wang, Garrity et al. 2007). OTUs were assigned using a 97% similarity threshold). Low abundant OTUs (<10), organelles and unassigned taxa at kingdom level were removed.
This text has been generated from a tool that has been adapted from the ARDC FAIR Assessment Tool
Findable
--------
Does the dataset have any identifiers assigned?
Global
Is the dataset identifier included in all metadata records/files describing the data?
Yes
How is the data described with metadata?
Comprehensively (see suggestion) using a recognised formal machine-readable metadata schema
What type of repository or registry is the metadata record in?
Data is in one place but discoverable through several registries
Accessible
----------
How accessible is the data?
Publicly accessible
Is the data available online without requiring specialised protocols or tools once access has been approved?
Standard web service API (e.g. OGC)
Will the metadata record be available even if the data is no longer available?
Yes
Interoperable
-------------
What (file) format(s) is the data available in?
Mostly in a proprietary format
What best describes the types of vocabularies/ontologies/tagging schemas used to define the data elements?
Standardised vocabularies/ontologies/schema without global identifiers
How is the metadata linked to other data and metadata (to enhance context and clearly indicate relationships)?
The metadata record includes URI links to related metadata, data and definitions
Reusable
--------
Which of the following best describes the license/usage rights attached to the data?
Standard machine-readable license (e.g. Creative Commons)
How much provenance information has been captured to facilitate data reuse?
Fully recorded in a machine-readable format