Synatical Analysis of Myanmar Language


Myanmar Language is made up of different glyphs, derived from Brahmic Scripts. Each seperable syllable constitues different combination of glyphs, following the well-defined syllablic rules (Maung & Mikami, 2008).

Naturally, grammatical analysis starts with validation of individual syllable, which comprises of different glyphs. Invalid glyphs will always result in invalid syllables.

However, the difficulites actually come from correction of Myanmar Spelling as there are many different possible combinations to form a possible Burmese Word. How do we know what might be the most correct word, given the partial syllabic combinations, really fast?.

Of course, it could be generally solved with Machine Learning which could predict the possible combinations of syllables. However, a lot of more simpler method exists if we can utilize a commonly known phrases of Burmese Words and their relative frequency.


Burmese Glyphs

The most fundamental unit of Burmese Syllables are Burmese Glyphs. However, Burmese Glyphs are different from English Alphabets in that some Burmese Glyphs cannot exist independently.



Burmese Syllabic Order

Each Burmese Syllable is a combination of different Burmese Glyphs in which the primary component is the Consonant (C).





Common Occurance of Myanmar Phrases


Actually, I have collected around 2 million commonly used Myanmar Phrases and their relative frequency. And the accuracy rate is quite satisfactory. Dataset is available on my github.