CitableParserBuilder.AnalysisType

Citable analysis of a string value.

An Analysis has seven members: string values for the orthographic token and the morphological token, four abbreviated URNs, one each for the lexeme, form, rule and stem, and a sequence ID for the morphological token.

Base.:==Method

Override Base.== for Analysis.

==(a1, a2)
Base.:==Method

Override Base.== for AnalyzedToken.

==(atoken1, atoken2)
Base.:==Method

Override Base.== for AnalyzedTokenCollection.

==(at1, at2)
Base.:==Method

Override Base.== for AbbreviatedUrn.

==(au1, au2)
Base.eltypeMethod

Implement base element type for AnalyzedTokenCollection.

eltype(atc)
Base.iterateMethod

Implement iteration with state for AnalyzedTokenCollection.

iterate(atc, state)
Base.iterateMethod

Implement iteration for AnalyzedTokenCollection.

iterate(atc)
Base.objectidMethod

Default implementation of function to find the object identifier of AbbreviatedUrn.

objectid(au)
Base.showMethod

Override Base.show for AnalyzedTokenCollection.

show(io, atc)
Base.showMethod

Override Base.show for AnalyzedToken.

show(io, atoken)
Base.showMethod

Override Base.show for AbbreviatedUrn.

show(io, au)
CitableBase.cexMethod

Implementation of cex function for an Analysis.

cex(a; delim, registry)
CitableBase.cexMethod

Format an AnalyzedTokenCollection collection as a delimited-text string.

cex(atc; delimiter, registry)

Required function for Citable abstraction.

CitableBase.cexMethod

Serialize an AnalyzedToken as delimited text (required for Citable interface).

cex(at; delimiter, registry)

Uses abbreviated URNs. These can be expanded to full CITE2 URNs when read back with a URN registry, or the delimited function can be used with a URN registry to write full CITE2 URNs.

CitableBase.cexMethod

Compose delimited text string for a DFParser.

cex(dfp; delimiter)
CitableBase.fromcexMethod

Parse a delimited-text string into an AnalyzedTokenCollection collection.

fromcex(trait, s, ; delimiter, configuration, strict)
CitableBase.fromcexMethod

Implementation of fromcex function for an Analysis.

fromcex(
    traitvalue,
    cexsrc,
    T;
    delimiter,
    configuration,
    strict
)
CitableBase.fromcexMethod

Parse a one-line delimited-text representation into an AnalyzedToken, using abbreviated URNs for identifiers. Note that for a sigle CEX line, the AnalyzedToken will have a single Analysis in its vector of analyses.

fromcex(
    traitvalue,
    cexsrc,
    T;
    delimiter,
    configuration,
    strict
)
CitableBase.fromcexMethod

Create a DFParser from a delimited-text source.

fromcex(trait, cexsrc, T; delimiter, configuration, strict)
CitableBase.labelMethod

Label for analyses.

label(atc)

Required function for Citable abstraction.

CitableBase.labelMethod

Label for AnalyzedToken (required for Citable interface).

label(at)
CitableBase.urnMethod

Unique identifier for AnalyzedToken (required for Citable interface).

urn(at)
CitableBase.urntypeMethod

Typeof URN identifying analyses in an an AnalyzedTokenCollection collection.

urntype(analyses)

Required function for Citable abstraction.

CitableBase.urntypeMethod

Identify URN type for an AnalyzedToken as CtsUrn.

urntype(at)

Required function for Citable abstraction.

CitableParserBuilder.abbreviateMethod

Constructs an AbbreviatedUrn string from a Cite2Urn.

abbreviate(urn)

Example:

julia> abbreviate(Cite2Urn("urn:cite2:kanones:lsj.v1:n123"))
"lsj.n123"

Example: a pipeline abbreviating a Cite2Urn and forming a LexemeUrn from the abbreviated string value.

julia> Cite2Urn("urn:cite2:kanones:lsj.v1:n123") |> abbreviate |> LexemeUrn
LexemeUrn("lsj", "n123")
CitableParserBuilder.dfParserFunction

Create a DFParser from delimited text file.

dfParser(delimitedfile; ...)
dfParser(delimitedfile, ortho; delimiter)
CitableParserBuilder.expandMethod

Constructs a Cite2Urn from an AbbreviatedUrn and a dictionary mapping collection identifiers in AbbreviatedUrns's to full Cite2Urns for a versioned collection.

CitableParserBuilder.fstsafeMethod

Compose SFST representation of an AbbreviatedUrn.

fstsafe(au)

Example:

julia> LexemeUrn("lexicon.lex123") |> fstsafe
"<u>lexicon\.lex123</u>"
CitableParserBuilder.generateMethod

Generate all possible morphological analyses for a given lexeme and form.

generate(lex, mform, parser; delim)
CitableParserBuilder.lexemedictionaryMethod

From a vector of AnalyzedTokens and an index of tokens in a corpus, construct a dictionary keyed by lexemes, mapping to a further dictionary of surface forms to passages.

lexemedictionary(parses, tokenindex)
CitableParserBuilder.lexemehistoMethod

Compute histogram of lexemes in AnalyzedTokenCollection.

lexemehisto(parses)

All distinct lexemes for a token are counted; there is no weighting of counts for lexically ambiguous tokens.

CitableParserBuilder.parsecorpusMethod

Use a CitableParser to parse a CitableTextCorpus with each citable node containing containg a single token of type LexicalToken.

parsecorpus(c, p; data, countinterval)

Returns anAnalyzedTokenCollection object.

CitableParserBuilder.parselistMethod

Parse a list of tokens with a CitableParser.

parselist(vocablist, p; countinterval)

Returns a Dict mapping strings to a (possibly empty) vector of Analysis objects. Blank lines in input are silently ignored.

CitableParserBuilder.parselistMethod

Read a list of tokens from file f and parse with p.

parselist(f, p, reader; countinterval)

Returns a Dict mapping strings to a (possibly empty) vector of Analysis objects.

CitableParserBuilder.parselistMethod

Read a list of tokens from URL u and parse with p.

parselist(u, p, reader; countinterval)

Returns a Dict mapping strings to a (possibly empty) vector of Analysis objects.

CitableParserBuilder.parsepassageMethod

Parse a CitablePassage with text for a single token with a CitableParser.

parsepassage(cn, p; data)

Returns a single AnalyzedToken.

CitableParserBuilder.parsepassageMethod

Parse a CitablePassage with text for a single token with a CitableParser.

parsepassage(ct, p; data)

Returns a single AnalyzedToken.

CitableParserBuilder.readfstMethod

Read SFST output from file f, and parse into a dictionary keying tokens to a (possibly empty) array of SFST strings.

readfst(f)
CitableParserBuilder.relationsblockFunction

Compose a CEX relationset block for a set of analyses.

relationsblock(urn, label, v; ...)
relationsblock(urn, label, v, delim; registry)
CitableParserBuilder.stringParserFunction

Construct a string-backed parser from a dataframe.

stringParser(df)
stringParser(df, ortho)
stringParser(df, ortho, delim)