-
Notifications
You must be signed in to change notification settings - Fork 123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Possibility of using regex for generating geminates #105
Comments
Here are some comments: There are two ways of writing geminate consonants in the IPA:
For reasons of parseability with PanPhon, the second solution is the approved Epitran solution (so <އް> could simply be mapped to /ː/). If you need doubling instead, you can achieve this with a regular expression and named groups, e.g.:
will change The prefixed |
Thank you for the comments. First on the On the geminates, at first I gave this a try, which did not seem to work (I'm not sure if I'm writing that rule wrong or if the
but
Does this have an impact on affricates? We have two Also, I'm hesitant to simply map އް to : - it would complicate the post processing rules since the language uses a lot long vowels, and އް can also cause pre-nasalisation or serve as a glottal stop depending on context. |
I'm trying to write post processing rules for
div-Thaa
over on this fork.The rules dicatate that for occurrences of certain graphemes
އް
would have effect of having the next consonant be a geminate in some situations. I can't seem to figure out if this can be done with a single regex rule with a match group or not.For the time being I've added the cases as individual rules here. The rules in question are the ones with
<AS>
in them.TL;DR;
Apologies in advance if this is a redundant question and I missed something in the docs.
The text was updated successfully, but these errors were encountered: