BiblicalHebrew.codept_split - Method

Split string s into substrings at each occurrence of character c. If keep is true, each occurrence of c is also kept in the resulting list as a one-character string.
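
The splitting behaviour described above might look roughly like the following. This is a minimal sketch, not the package's implementation: the name `split_on_char`, the keyword default, and the decision to drop empty pieces are all assumptions made for illustration.

```julia
# Hypothetical helper (not part of BiblicalHebrew): split `s` on `c`,
# optionally keeping each separator as its own one-character string.
function split_on_char(s::AbstractString, c::Char; keep::Bool = true)
    pieces = String[]
    buf = IOBuffer()
    for ch in s
        if ch == c
            piece = String(take!(buf))          # text collected since the last separator
            isempty(piece) || push!(pieces, piece)
            keep && push!(pieces, string(c))    # keep the separator as a string
        else
            print(buf, ch)
        end
    end
    last_piece = String(take!(buf))             # trailing text after the final separator
    isempty(last_piece) || push!(pieces, last_piece)
    return pieces
end

maqaf = '\u05be'                                # HEBREW PUNCTUATION MAQAF
joined = "\u05e2\u05dc" * maqaf * "\u05e4\u05e0\u05d9"
split_on_char(joined, maqaf)                    # three pieces: word, maqaf, word
```

With an ASCII input, `split_on_char("a-b-c", '-')` returns `["a", "-", "b", "-", "c"]`, and passing `keep = false` returns `["a", "b", "c"]`.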

BiblicalHebrew.hebrew_chars - Method

Compose a list of all defined characters in the Hebrew range of Unicode as Chars, sorted in Unicode order.
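
One way to enumerate the defined characters of the Hebrew range is to filter the block by whether each code point is assigned. The sketch below assumes the block bounds U+0590 to U+05FF and uses `Unicode.isassigned` from Julia's standard library. The names `hebrew_codepoints` and `hebrew_characters` are hypothetical, and the exact result depends on the Unicode version bundled with the running Julia.

```julia
using Unicode

# Assumed bounds of the Unicode Hebrew block.
const HEBREW_FIRST = 0x0590
const HEBREW_LAST = 0x05ff

# Code points in the block that this Julia's Unicode tables mark as assigned.
# Iterating the range in order keeps the result sorted by code point.
hebrew_codepoints() =
    [cp for cp in HEBREW_FIRST:HEBREW_LAST if Unicode.isassigned(Char(cp))]

# The same list expressed as Chars.
hebrew_characters() = Char.(hebrew_codepoints())
```

Unassigned positions such as U+0590 are skipped, while letters such as alef (U+05D0) are included.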

BiblicalHebrew.is_alphabetic - Method

True if a Char is a vowel point, a consonant, or one of the Unicode combining characters that are part of writing consonantal values.

is_alphabetic(c)

BiblicalHebrew.rm_accents - Method

Remove all accents and cantillation marks from a string, leaving only consonants, vowel points and dagesh/mappiq (the same code point in Unicode).

rm_accents(s)
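
As a rough illustration of accent removal, the sketch below filters out characters in the range U+0591 to U+05AF, where Unicode places the Hebrew cantillation accents. The names `is_accent` and `strip_accents` are hypothetical, and the package's own function may remove additional marks that this sketch leaves in place.

```julia
# Assumption: cantillation accents occupy U+0591 through U+05AF.
is_accent(c::Char) = '\u0591' <= c <= '\u05af'

# Keep every character that is not an accent.
strip_accents(s::AbstractString) = filter(!is_accent, s)

# Bet, sheva, dagesh, resh, tsere, alef, shin, shin dot, hiriq,
# tipcha (accent, U+0596), yod, tav.
word = "\u05d1\u05b0\u05bc\u05e8\u05b5\u05d0\u05e9\u05b4\u05c1\u0596\u05d9\u05ea"
stripped = strip_accents(word)   # same word without the tipcha
```

The vowel points and the dagesh survive because they lie outside the assumed accent range.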

BiblicalHebrew.tokenize_string - Function

Tokenize a string s in which the token separators maqaf and soph pasuq have already been isolated. This can be accomplished by feeding s to the tokenize function with a HebrewOrthography.

BiblicalHebrew.unpointed - Method

Reduce a string to consonantal representation by removing all pointing and accents.

unpointed(s)
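
A consonantal reduction can be sketched by dropping every Hebrew combining mark. The predicate below lists the nonspacing marks of the Hebrew block (accents, vowel points, dagesh, rafe, shin and sin dots, and similar). The names `is_point_or_accent` and `strip_pointing` are hypothetical, and this sketch deliberately keeps punctuation such as maqaf, which the package's function may treat differently.

```julia
# Nonspacing marks in the Hebrew block: U+0591..U+05BD, U+05BF,
# U+05C1..U+05C2, U+05C4..U+05C5, and U+05C7.
is_point_or_accent(c::Char) =
    '\u0591' <= c <= '\u05bd' ||
    c in ('\u05bf', '\u05c1', '\u05c2', '\u05c4', '\u05c5', '\u05c7')

# Keep everything that is not a pointing or accent mark.
strip_pointing(s::AbstractString) = filter(!is_point_or_accent, s)

pointed = "\u05d1\u05b0\u05bc\u05e8\u05b5\u05d0\u05e9\u05b4\u05c1\u0596\u05d9\u05ea"
bare = strip_pointing(pointed)   # bet, resh, alef, shin, yod, tav
```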

BiblicalHebrew.validcps - Method

Compose a list of all defined codepoints in the Hebrew range of Unicode sorted in codepoint order.

Orthography.codepoints - Method

Implement Orthography's codepoints function for HebrewOrthography.

codepoints(ortho)

Orthography.tokenize - Method

Implement Orthography's tokenize function for HebrewOrthography.

tokenize(s, o)

Orthography.tokentypes - Method

Implement Orthography's tokentypes function for HebrewOrthography.

tokentypes(ortho)

Orthography.validstring - Method

Implement Orthography's validstring function for HebrewOrthography. True if every code point in s appears in the HebrewOrthography's values for codepoints.
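
The membership test described above can be sketched as a single `all` over the characters of the string. Here `is_valid` is a hypothetical name and `allowed` is a stand-in for the orthography's code point collection; the real collection is whatever codepoints returns for a HebrewOrthography.

```julia
# Hypothetical check: every character of `s` must belong to `allowed`.
is_valid(s::AbstractString, allowed) = all(c -> c in allowed, s)

# Stand-in collection for illustration: only the letters alef through tav.
allowed = Set('\u05d0':'\u05ea')

is_valid("\u05d0\u05d1", allowed)   # true: alef and bet are both allowed
is_valid("\u05d0x", allowed)        # false: 'x' is not in the collection
```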