Pronunciation variants #751

stannam · 2021-02-04T07:39:32Z

An example csv file (csv_pron_var.txt) is in 'csv_sample'.

Unwanted transcription column (only 'Canonical' is expected but another column 'Transcription' gets added when creating a corpus) -- edit: solved
PCT doesn't autogenerate the inventory chart (all segments go under 'uncategorized') -- edit: this issue doesn't arise in the most recent version of 'master'.
Question: Phonotactic probability doesn't allow '... as separate entry' as variant options? (cf. this)
Error when calculating MI, FL, PrOD, etc by pronunciation variants. -- edit: solved?

Traceback (most recent call last):
File "D:\PycharmProjects\CorpusTools\corpustools\gui\migui.py", line 44, in run
call_back = kwargs['call_back'])
File "D:\PycharmProjects\CorpusTools\corpustools\mutualinfo\mutual_information.py", line 197, in pointwise_mi
probability=True, need_wb=need_wd)
File "D:\PycharmProjects\CorpusTools\corpustools\contextmanagers.py", line 98, in get_frequency_base
tier = getattr(word, self.sequence_type)
AttributeError: 'Word' object has no attribute 'Transcription'

Need to confirm the change in the Word class (lexicon.py) does not raise any error elsewhere. ('create corpus', and 'add word' functions work properly)

YuHsiangLo · 2021-04-18T07:19:25Z

Hmmm I think this is caused by the mixing of attribute _transcription, Transcription, _transcription_name, and the transcription getter and setter... This issue is related to #756, and we'll need to do some fundamental refactoring of the Word class to solve this type of problems once and for all.

stannam · 2021-04-19T04:08:00Z

cf. we have a separate branch for this: 'pronunciation_variants'

stannam · 2021-04-26T18:21:27Z

The recent commit, the one that forces the column name (92113c5) gets rid of the 'Unwanted transcription column' issue (i.e., no 'canonical' column to start with). However, for an independent reason, "List pronunciation variants" is acting out again 😳.
I have added an example file csv_pron_var.txt to the 'csv_sample' folder.

stannam added the bug label Feb 4, 2021

stannam self-assigned this Feb 4, 2021

stannam changed the title ~~Mutual Information on Buckeye Corpus~~ Buckeye Corpus pronunciation variants Feb 4, 2021

stannam changed the title ~~Buckeye Corpus pronunciation variants~~ Pronunciation variants Feb 15, 2021

stannam added a commit that referenced this issue Feb 16, 2021

fixed errors in calculation with pronunciation variants (issue #751)

73b73aa

Need to confirm the change in the Word class (lexicon.py) does not raise any error elsewhere. ('create corpus', and 'add word' functions work properly)

stannam mentioned this issue Mar 8, 2021

For PCT 1.5.0 #754

Closed

25 tasks

YuHsiangLo added enhancement and removed bug labels May 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pronunciation variants #751

Pronunciation variants #751

stannam commented Feb 4, 2021 •

edited

Loading

YuHsiangLo commented Apr 18, 2021

stannam commented Apr 19, 2021

stannam commented Apr 26, 2021 •

edited

Loading

Pronunciation variants #751

Pronunciation variants #751

Comments

stannam commented Feb 4, 2021 • edited Loading

YuHsiangLo commented Apr 18, 2021

stannam commented Apr 19, 2021

stannam commented Apr 26, 2021 • edited Loading

stannam commented Feb 4, 2021 •

edited

Loading

stannam commented Apr 26, 2021 •

edited

Loading