Liking cljdoc? Tell your friends :D

com.owoga.phonetics


cmu-lexiconclj

The CMULexicon can get phones for words that aren't in the CMU Pronouncing Dictionary. But the phones are slightly different. The AH sound, as in allow, is returned as ax from the CMULexicon. Also, unstressed vowels don't have a 0 suffix. Instead, the CMULexicon just returns unstressed vowels as the vowel itself with no suffix.

The above is important to note if you want clean interplay between these two different ways of getting phonemes.

The CMULexicon can get phones for words that aren't in the
CMU Pronouncing Dictionary. But the phones are slightly different.
The `AH` sound, as in `allow`, is returned as `ax` from the CMULexicon.
Also, unstressed vowels don't have a `0` suffix. Instead, the CMULexicon
just returns unstressed vowels as the vowel itself with no suffix.

The above is important to note if you want clean interplay between these
two different ways of getting phonemes.
raw docstring

cmu-lexicon->cmu-pronouncing-dictclj

(cmu-lexicon->cmu-pronouncing-dict phonemes)

The CMULexicon returns the AH sound, as in allow, as ax. The Sphinx dictionary treates that sound as AH. This converts ax to AH. It also adds 0 to phonemes that are unstressed, which CMULexicon returns as the plain phoneme with no stress marker.

The CMULexicon returns the `AH` sound, as in `allow`, as `ax`.
The Sphinx dictionary treates that sound as `AH`. This
converts `ax` to `AH`. It also adds `0` to phonemes that are
unstressed, which CMULexicon returns as the plain phoneme with
no stress marker.
raw docstring

cmu-word-alternativesclj

For words with multiple pronunciations in the CMU dictionary, this maps from the word to its variations. reputed -> reputed, reputed(1), reputed(2).

Not particularly useful itself since reputed(1) doesn't tell you how it's different from reputed. But it's useful to look up the pronunciations in the CMU dictionary.

For words with multiple pronunciations in the CMU dictionary,
this maps from the word to its variations.
reputed -> reputed, reputed(1), reputed(2).

Not particularly useful itself since reputed(1) doesn't tell you how it's
different from reputed. But it's useful to look up the pronunciations in the
CMU dictionary.
raw docstring

cmu-word-to-stressed-phones-mapclj

Map of lowercase English words to their phonetic sounding based on the CMU Pronouncing Dictionary at http://www.speech.cs.cmu.edu/cgi-bin/cmudict/

Includes words with apostrophes, like possessive aaronson's.

Words with multiple pronunciations have keys with a (1) or (2) after their duplicates, like [aaronsons(1) (AA1 R AH0 N S AH0 N Z)]

Primary stress is indicated by a 1 after the phoneme. Secondary stress with a 2. Unstressed with a 0.

Map of lowercase English words to their phonetic sounding based on
the CMU Pronouncing Dictionary at http://www.speech.cs.cmu.edu/cgi-bin/cmudict/

Includes words with apostrophes, like possessive aaronson's.

Words with multiple pronunciations have keys with a `(1)` or `(2)` after their
duplicates, like [aaronsons(1) (AA1 R AH0 N S AH0 N Z)]

Primary stress is indicated by a `1` after the phoneme. Secondary stress with a `2`.
Unstressed with a `0`.
raw docstring

cmu-word-to-unstressed-phones-mapclj


consonantclj


get-phonesclj

(get-phones word)

Tries to get phones first from the CMU Pronouncing Dictionary and falls back to the CMULexicon if the word doesn't exist in the dictionary.

Input must be lower-case.

Returns a vector of all possible pronunciations.

Tries to get phones first from the CMU Pronouncing Dictionary
and falls back to the CMULexicon if the word doesn't exist in
the dictionary.

Input must be lower-case.

Returns a vector of all possible pronunciations.
raw docstring

get-wordclj

(get-word phones)

long-vowelclj


phonemapclj


remove-stressclj

(remove-stress phonemes)

short-vowelclj


single-sound-bigramclj


stressed-phones-to-cmu-word-mapclj

The same sequence of phones can map to multiple words.

The same sequence of phones can map to multiple words.
raw docstring

syllable-endclj


unstressed-phones-to-cmu-word-mapclj

There might be unstressed phones that can map to two different pronunciations when stress is added, so this maps unstressed phones to a vector of words that can be looked up in the CMU Pronouncing dictionary to see what their stressed phones are.

Another example, look at how many words map to [N IY S]. [[N IY S] [neice neece niece nice kneece kniess neiss neace niess]]

There might be unstressed phones that can map
to two different pronunciations when stress is added,
so this maps unstressed phones to a vector of words that
can be looked up in the CMU Pronouncing dictionary to
see what their stressed phones are.

Another example, look at how many words map to [N IY S].
[[N IY S]
[neice neece niece nice kneece kniess neiss neace niess]]
raw docstring

vowelclj


word-alternativesclj

(word-alternatives word)

For words with multiple pronunciations in the CMU dictionary, this maps from the word to its variations. reputed -> reputed, reputed(1), reputed(2).

Not particularly useful itself since reputed(1) doesn't tell you how it's different from reputed. But it's useful to look up the pronunciations in the CMU dictionary.

For words with multiple pronunciations in the CMU dictionary,
this maps from the word to its variations.
reputed -> reputed, reputed(1), reputed(2).

Not particularly useful itself since reputed(1) doesn't tell you how it's
different from reputed. But it's useful to look up the pronunciations in the
CMU dictionary.
raw docstring

cljdoc is a website building & hosting documentation for Clojure/Script libraries

× close