Improvement for french pronunciation lexicon

Hi !
I work to improve the french pronunciation lexicon which is extract from www.wiktionary.org based on the https://github.com/Kyubyong/pron_dictionaries script.

I already see two problems:

  • It does not differentiate proper noun and normal noun. Example: ‘train’ is both the vehicle \tʁɛ̃\ and a city in germany \tʁajn\ .
  • It extracts old french pronunciation. Example: ‘de’ was pronounce \dam\ in old french, but it should be pronunces \də\

For proper noun, we could either not extract proper noun pronunciation at all, or match the case, and keep the capital letter.

I found this repository:
https://github.com/Remiphilius/PoemesProfonds.
I forked it: https://github.com/tjiho/PoemesProfonds
Maybe, it could be an idea to improve french pronunciation.