Complex Segment Learner

Complex Segment Learner

Navajo (Athabaskan)

Navajo is mentioned in our paper in passing in the discussion of the nature of the learning data. We wanted to look at the language because it has a large inventory of affricates, both strident and non-strident, with laryngeal contrasts. The inevitable frequency differences between these sounds make it a tricky case for our learner, and indeed the results vary depending on the dataset we use. The learner comes the closest to the traditional phonological analyses of Navajo (e.g., Harry Hoijer's) when trained on the most curated dataset is the roots/stems list.

Simulation data at a glance

Click on simulation name to view additional simulation details.

Simulation nameInitial state Learning DataInitial state features
Stems LearningData.txt Features.txt
Web_Words LearningData.txt Features.txt
Young_Morgan_Dictionary LearningData.txt Features.txt

Simulation details for Navajo stems

Input:

This data set was prepared by Gillian Gallagher on the basis of a stem list provided by David Eddington.

LearningData.txt | Features.txt

Summary of iterations:

IterationLearning Data producedFeatures producedInseparabilityNew Segments addedSegments removed
1 LearningData.txt Features.txt [download] [view] dʒ, tʃ', ts', tɬ', ts, tʃ ɟ, p, c, ʃ', s', ɬ'
2 LearningData.txt Features.txt [download] [view] dɮ, dz, tɬ None
3 No new learning data No new features [download] [view] None None

Summary of inventory changes

StageConsonant set
Inputb d ɟ g ɮ p t c k t' k' ʃ' s' ɬ' z s ʃ ʒ h x ɣ ɮ ɬ m n w j ʔ
Outputb d g ɮ t k t' k' z s ʃ ʒ h x ɣ ɮ ɬ m n w j ʔ dʒ tʃ' ts' tɬ' ts tʃ dɮ dz tɬ

Simulation Plots

/media/navajo/stems/simulation/insep_plots.png


Simulation details for Navajo web_words

Input:

This data set was created by Gillian Gallagher from the An Crúbadán corpus for Navajo. English words were excluded.

LearningData.txt | Features.txt

Summary of iterations:

IterationLearning Data producedFeatures producedInseparabilityNew Segments addedSegments removed
1 LearningData.txt Features.txt [download] [view] dʒ, ts' s'
2 No new learning data No new features [download] [view] None None

Summary of inventory changes

StageConsonant set
Inputb d g ɮ p t k t' k' ʃ' s' ɬ' z s ʃ ʒ h x ɣ ɮ ɬ m n w j ʔ
Outputb d g ɮ p t k t' k' ʃ' ɬ' z s ʃ ʒ h x ɣ ɮ ɬ m n w j ʔ dʒ ts'

Simulation Plots

/media/navajo/web_words/simulation/insep_plots.png


Simulation details for Navajo young_morgan_dictionary

Input:

This data set was prepared by Gillian Gallagher from Young and Morgan's (1972/80) dictionary, by including inflected headwords and excluding stems.
Young, Robert W., and William Morgan. The Navajo language: A grammar and colloquial dictionary. Vol. 3. University of New Mexico Press, 1980.

LearningData.txt | Features.txt

Summary of iterations:

IterationLearning Data producedFeatures producedInseparabilityNew Segments addedSegments removed
1 LearningData.txt Features.txt [download] [view] ɟʒ, ts', tɬ', cʃ' ɟ, p, ʃ', s', ɬ'
2 LearningData.txt Features.txt [download] [view] ts, cʃ c
3 No new learning data No new features [download] [view] None None

Summary of inventory changes

StageConsonant set
Inputb d ɟ g ɮ p t c k t' k' ʃ' s' ɬ' z s ʃ ʒ h x ɣ ɮ ɬ m n w j ʔ
Outputb d g ɮ t k t' k' z s ʃ ʒ h x ɣ ɮ ɬ m n w j ʔ ɟʒ ts' tɬ' cʃ' ts cʃ

Simulation Plots

/media/navajo/young_morgan_dictionary/simulation/insep_plots.png