This dataset comprises ten supplementary video files to the research article "Bilingual speech in Jaru–Kriol conversations: Codeswitching, codemixing, and grammatical fusion". The study explores natural conversations in the Yaruman community of Western Australia, where language contact between the endangered language Jaru (Pama-Nyungan, Ngumpin-Yapa) and the English-lexified creole language Kriol has led to prevalent bilingual practices. The linguistic practices in these extracts illustrate different patterns along the continuum of bilingual speech, i.e., codeswitching, codemixing, and grammatical fusion.
The videos are subtitled with a vernacular Jaru–Kriol tier and a free English translation tier. In accordance with the ethics approval and participants' preferences, some name tags represent pseudonyms or subsection names, some faces have been blurred, and some spoken names and words have been bleeped out. The video clips are intended to be watched alongside the published article, where readers will find a Jeffersonian transcription of each extract as well as morphemic glosses. Words and morphemes that are derived from Jaru are represented in bold typeface; words and morphemes from Kriol, non-lexical vocalisations and proper nouns are represented in regular typeface.
The supplementary videos have been downsized and converted to MP4 format. The complete Jaru corpus with the original high-quality recordings will be accessible under certain conditions through the Australian Institute for Aboriginal and Torres Strait Islander Studies (AIATSIS) Collection.
Findable
--------
Does the dataset have any identifiers assigned?
Global
Is the dataset identifier included in all metadata records/files describing the data?
Yes
How is the data described with metadata?
Comprehensively, but in a text-based, non-standard format
What type of repository or registry is the metadata record in?
Generalist public repository
Accessible
----------
How accessible is the data?
Publicly accessible
Is the data available online without requiring specialised protocols or tools once access has been approved?
File download from online location
Will the metadata record be available even if the data is no longer available?
Yes
Interoperable
-------------
What (file) format(s) is the data available in?
In a structured, open standard, machine-readable format
What best describes the types of vocabularies/ontologies/tagging schemas used to define the data elements?
No standards have been applied in the description of data elements
How is the metadata linked to other data and metadata (to enhance context and clearly indicate relationships)?
Metadata is represented in a machine readable format, e.g. in a linked data format such as Resource Description Framework (RDF).
Reusable
--------
Which of the following best describes the license/usage rights attached to the data?
Standard machine-readable license (e.g. Creative Commons)
How much provenance information has been captured to facilitate data reuse?
Fully recorded in a text format