Task #3941
Task #3672 (closed): RA1d - Automatic cleaning of speech corpora
Task #3704: Detection and correction of prosodic structures
Task #4155: Prosodic word parsing
Create synthesis logs for corpora with altered prosodic words
100%
Description
Can you create synthesis logs for corpora with altered prosodic words? All ASFs are placed in artic/Experiments/pros_words (2 speakers x 3 versions).
Since we only need sentences suitable for a listening test, is it possible to limit the sentence length to a maximum of 20 words? This could also shorten the synthesis process significantly. If the limitation is complicated to implement, it is not necessary.
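The requested length limit could be sketched as a simple word-count filter. This is only a minimal illustration: it assumes sentences are available as plain strings and that words are separated by whitespace, and the names MAX_WORDS, is_short_enough and filter_sentences are hypothetical, not part of the existing scripting framework.

```python
MAX_WORDS = 20  # limit requested in the task description


def is_short_enough(sentence, max_words=MAX_WORDS):
    """Return True if the sentence has at most max_words whitespace-separated tokens."""
    return len(sentence.split()) <= max_words


def filter_sentences(sentences, max_words=MAX_WORDS):
    """Keep only the sentences that fit the word limit (assumed plain-text input)."""
    return [s for s in sentences if is_short_enough(s, max_words)]
```

Punctuation attached to a word is counted as part of that word here, so a real corpus may need its own tokenization.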
Updated by Tihelka Dan over 8 years ago
- Due date set to 19.06.2016
- Status changed from New to Feedback
- Assignee changed from Tihelka Dan to Hanzlíček Zdeněk
I will need a Python script that assigns new PWord tags to a sequence of phones (or preferably ASF units). Then I can put it into the scripting framework. Reducing the sentences to 20 words should not be a big problem.
Updated by Hanzlíček Zdeněk over 8 years ago
- File asflight.py added
- File pros_words.py added
- Assignee changed from Hanzlíček Zdeněk to Tihelka Dan
File pros_words.py contains the function set_pwords( unit_list, ver=1 ), where
- unit_list is a list of units, i.e. dicts with the same keys as in ASF files (phone, pwordBoundPos, pphrsBoundPos, prosodeme and word should be set; pwordBoundPos and prosodeme are updated)
- ver is the version of prosodic words
  - 1 .. joins monosyllabic prepositions, the conjunctions a, i and the enclitic pronouns se, si
  - 2 .. also joins monosyllabic relative pronouns
File asflight.py is just for testing; pros_words.py can be used directly for processing ASF files.
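To illustrate the idea behind version 1, here is a simplified word-level sketch. It is not the code from pros_words.py: it works on plain words instead of ASF unit dicts, it does not touch pwordBoundPos or prosodeme, and the word sets PROCLITICS and ENCLITICS are small, incomplete examples chosen for illustration. It also assumes that prepositions and the conjunctions a, i attach to the following word, while se, si attach to the preceding one.

```python
# Illustrative, incomplete word lists (assumptions, not taken from pros_words.py).
PROCLITICS = {"na", "do", "od", "po", "za", "ve", "ze", "ke", "pro", "a", "i"}
ENCLITICS = {"se", "si"}


def group_pwords(words):
    """Group a word sequence into prosodic words, each returned as a list of words."""
    groups = []
    attach_next = False  # True while the last group ends with a proclitic awaiting its host
    for word in words:
        lowered = word.lower()
        if attach_next:
            # The previous word was a proclitic: join this word to it.
            groups[-1].append(word)
        elif lowered in ENCLITICS and groups:
            # An enclitic joins the preceding prosodic word.
            groups[-1].append(word)
        else:
            groups.append([word])
        attach_next = lowered in PROCLITICS
    return groups


print(group_pwords("Ptal se na cenu a datum".split()))
```

The sketch always treats "se" as the enclitic pronoun, although the same form can also be a preposition; resolving that ambiguity would need information that this word-level example does not have.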
Updated by Tihelka Dan over 8 years ago
- Status changed from Feedback to Resolved
- Assignee changed from Tihelka Dan to Hanzlíček Zdeněk
- % Done changed from 0 to 90
After a week of trouble, I have finally managed to get the experiment working.
The resulting logs are stored in the Experiments/pros_words/out_${spkr}_${ver} directories, where ver = 0 is the baseline system and ver = 1,2 are the individual versions described in #3941#note-2.
Please check the logs to verify that the position in prosodic words (namely the begWeight, midWeight and endWeight attributes) and the prosodemes are assigned correctly to units!
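For checking, the six log directories (2 speakers x 3 versions) can be enumerated from the naming pattern. This is a sketch only: the speaker identifiers in SPEAKERS are hypothetical placeholders, since the real speaker names are not given here, and log_dirs is an illustrative helper.

```python
from itertools import product
from pathlib import Path

BASE = Path("Experiments/pros_words")
SPEAKERS = ["spkrA", "spkrB"]  # hypothetical speaker IDs, replace with the real ones
VERSIONS = [0, 1, 2]  # 0 = baseline, 1 and 2 = altered prosodic words


def log_dirs(base=BASE, speakers=SPEAKERS, versions=VERSIONS):
    """Return the out_<spkr>_<ver> directory paths for every speaker/version pair."""
    return [base / f"out_{spkr}_{ver}" for spkr, ver in product(speakers, versions)]


for path in log_dirs():
    print(path)
```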
Updated by Tihelka Dan about 8 years ago
- Status changed from Resolved to Assigned
After fixing the prosodeme type handling in the TC computation, there are new logs in the Experiments/pros_words/out_${spkr}_${ver} directories (same as before).
Updated by Matoušek Jindřich almost 8 years ago
- Parent task changed from #3704 to #4155
Updated by Matoušek Jindřich almost 8 years ago
- Status changed from Assigned to Closed