Skip to content

[Question] How to associate pronunciation to senses/synsets/definitions #56

@sevagh

Description

@sevagh

Hello - using this library (or the cmudict directly, same information), we can get multiple pronunciations. Some of these correspond to a different part of speech (e.g. PRO-ject noun vs. pro-JECT verb). Some are homographs with the same part of speech (e.g. bow).

Here's an example of bow:

>>> import pronouncing
>>> pronouncing.phones_for_word('bow')
['B AW1', 'B OW1']
>>>
>>> from nltk.corpus import wordnet
>>> [ss.definition() for ss in wordnet.synsets('bow')[:2]]
['a knot with two loops and loose ends; used to tie shoelaces', 'a slightly curved piece of resilient wood with taut horsehair strands; used in playing certain stringed instruments']

Does anybody have suggestions for how I could create relations from the pronunciations to the senses/synsets?

One potential path I'm looking at is:

  1. arpabet to IPA
  2. look up definitions/senses by IPA (I don't know where, just yet)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions