The Mbay simulation is described in Section 3.1.2 of the paper. The corpus comprises 4,046 distinct lexical items and is from Keegan's (1996) Mbay-English dictionary. The dictionary was hand-entered into a text file and filtered to exclude proper nouns and loanwords. Mbay is unusual among the world's languages in that it has a heterorganic prenasalized stop [nɟ], as well as homorganic [mb], [nd], and [ŋg]. For more information on Mbay phonology as well as arguments that nasal-stop sequences function as single segments, see Keegan's (1997) grammar of the language.
Simulation data at a glance
Click on simulation name to view additional simulation details.
|Simulation name||Initial state Learning Data||Initial state features|