if input is XML and an element contains the attribute "xml:lang" this value is used for this element
default: "eng"
normalization (optional)
"none" (no normalization)
"reg" (regularized)
"norm" (regularized + normalized)
default: "norm"
normalizationType (optional)
"dictionary"
"display"
default: "dictionary"
elements (optional)
list of xml element names which should be tokenized (e.g. "s head")
default: empty list (which means: all elements are tokenized)
stopElements (optional)
list of xml element names which are stop elements(e.g. "var emph"): stop elements: its tokens should not get word tags (when output format is "xml") or its tokens should be removed (if output format is "string")
default: empty list
highlightTerms (optional)
list of word forms which should be highlighted. Each matched word form is surrounded by <hi></hi>. The matching function is dependent of the normalization. E.g. if normalization = "norm" then the normalized word form is fetched and highlighted.
default: empty list
outputFormat (optional)
"xml"
"string"
default: "xml"
outputOptions (optional)
output options separated by blanks (e.g. "withForms withLemmas")
"withForms"
"withLemmas"
default: empty list
Response output
outputFormat=xml
tokenized inputString or document (enriched by element <w>)