Macquarie University
Browse

International Corpus of English (ICE)

Download (2.8 MB)
dataset
posted on 2023-12-21, 02:11 authored by Adam SmithAdam Smith, Pam PetersPam Peters

The Australian component of the International Corpus of English (ICE-AUS) is an approximately one million word corpus of transcribed spoken and written Australian English from 1992-1995. It consists of 500 samples of Australian English (60% speech, 40% writing) that matches the structure of other ICE corpora (associated with the International corpus of English). The spoken data includes transcriptions of face-to face spoken conversations, telephone conversations, monologues, broadcast dialogues, and scripted speech. The written texts include samples of unpublished letters (personal and professional), student essays, newspaper writing, popular nonfiction, academic writing, and fiction.This collection was previously accessible online via the Australian National Corpus (AusNC), an initiative managed by Griffith University between 2012 and 2023.

History

Q/A Log

  • Institutional review completed
  • FAIR assessment completed

FAIR Self Assessment Summary

This text has been generated from a tool that has been adapted from the ARDC FAIR Assessment Tool Findable -------- Does the dataset have any identifiers assigned? Global Is the dataset identifier included in all metadata records/files describing the data? No How is the data described with metadata? Comprehensively (see suggestion) using a recognised formal machine-readable metadata schema What type of repository or registry is the metadata record in? Data is in one place but discoverable through several registries Accessible ---------- How accessible is the data? Publicly accessible Is the data available online without requiring specialised protocols or tools once access has been approved? Standard web service API (e.g. OGC) Will the metadata record be available even if the data is no longer available? Yes Interoperable ------------- What (file) format(s) is the data available in? In a structured, open standard, machine-readable format What best describes the types of vocabularies/ontologies/tagging schemas used to define the data elements? No standards have been applied in the description of data elements How is the metadata linked to other data and metadata (to enhance context and clearly indicate relationships)? The metadata record includes URI links to related metadata, data and definitions Reusable -------- Which of the following best describes the license/usage rights attached to the data? Standard machine-readable license (e.g. Creative Commons) How much provenance information has been captured to facilitate data reuse? Fully recorded in a machine-readable format

FAIR Self Assessment Rating

  • 4 Stars

Data Sensitivity

  • General

Usage metrics

    Macquarie University Research Data Repository

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC