Macquarie University
10 files

Bilingual speech in Jaru–Kriol conversations: Codeswitching, codemixing, and grammatical fusion / Supplementary video files

posted on 2021-12-24, 00:38 authored by Josua DahmenJosua Dahmen
This dataset comprises ten supplementary video files to the research article "Bilingual speech in Jaru–Kriol conversations: Codeswitching, codemixing, and grammatical fusion". The study explores natural conversations in the Yaruman community of Western Australia, where language contact between the endangered language Jaru (Pama-Nyungan, Ngumpin-Yapa) and the English-lexified creole language Kriol has led to prevalent bilingual practices. The linguistic practices in these extracts illustrate different patterns along the continuum of bilingual speech, i.e., codeswitching, codemixing, and grammatical fusion.

The videos are subtitled with a vernacular Jaru–Kriol tier and a free English translation tier. In accordance with the ethics approval and participants' preferences, some name tags represent pseudonyms or subsection names, some faces have been blurred, and some spoken names and words have been bleeped out. The video clips are intended to be watched alongside the published article, where readers will find a Jeffersonian transcription of each extract as well as morphemic glosses. Words and morphemes that are derived from Jaru are represented in bold typeface; words and morphemes from Kriol, non-lexical vocalisations and proper nouns are represented in regular typeface.

The supplementary videos have been downsized and converted to MP4 format. The complete Jaru corpus with the original high-quality recordings will be accessible under certain conditions through the Australian Institute for Aboriginal and Torres Strait Islander Studies (AIATSIS) Collection.


Australian Aboriginal conversational style

Australian Research Council

Find out more...

International Macquarie University Research Excellence Scholarship (iMQRES)


Research Project ID


Q/A Log

  • Peer review completed
  • FAIR assessment completed

FAIR Self Assessment Summary

Findable -------- Does the dataset have any identifiers assigned? Global Is the dataset identifier included in all metadata records/files describing the data? Yes How is the data described with metadata? Comprehensively, but in a text-based, non-standard format What type of repository or registry is the metadata record in? Generalist public repository Accessible ---------- How accessible is the data? Publicly accessible Is the data available online without requiring specialised protocols or tools once access has been approved? File download from online location Will the metadata record be available even if the data is no longer available? Yes Interoperable ------------- What (file) format(s) is the data available in? In a structured, open standard, machine-readable format What best describes the types of vocabularies/ontologies/tagging schemas used to define the data elements? No standards have been applied in the description of data elements How is the metadata linked to other data and metadata (to enhance context and clearly indicate relationships)? Metadata is represented in a machine readable format, e.g. in a linked data format such as Resource Description Framework (RDF). Reusable -------- Which of the following best describes the license/usage rights attached to the data? Standard machine-readable license (e.g. Creative Commons) How much provenance information has been captured to facilitate data reuse? Fully recorded in a text format

FAIR Self Assessment Rating

  • 4 Stars

Data Sensitivity

  • General