Package: morphemepiece 1.2.3

Jonathan Bratt

morphemepiece: Morpheme Tokenization

Tokenize text into morphemes. The morphemepiece algorithm uses a lookup table to determine the morpheme breakdown of words, and falls back on a modified wordpiece tokenization algorithm for words not found in the lookup table.

Authors:Jonathan Bratt [aut, cre], Jon Harmon [aut], Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning [cph]

morphemepiece_1.2.3.tar.gz
morphemepiece_1.2.3.zip(r-4.7)morphemepiece_1.2.3.zip(r-4.6)morphemepiece_1.2.3.zip(r-4.5)
morphemepiece_1.2.3.tgz(r-4.6-any)morphemepiece_1.2.3.tgz(r-4.5-any)
morphemepiece_1.2.3.tar.gz(r-4.7-any)morphemepiece_1.2.3.tar.gz(r-4.6-any)
morphemepiece_1.2.3.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
morphemepiece/json (API)
NEWS

# Install 'morphemepiece' in R:
install.packages('morphemepiece', repos = c('https://jonthegeek.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/macmillancontentscience/morphemepiece/issues

On CRAN:

Conda:

5.04 score 11 stars 9 scripts 243 downloads 10 exports 37 dependencies

Last updated from:bc071b1a03. Checks:7 NOTE, 2 OK. Indexed: no.

TargetResultTimeFilesSyslog
linux-devel-x86_64NOTE128
source / vignettesOK193
linux-release-x86_64NOTE157
macos-release-arm64NOTE77
macos-oldrel-arm64NOTE102
windows-develNOTE80
windows-releaseNOTE76
windows-oldrelNOTE78
wasm-releaseOK117

Exports:load_lookupload_or_retrieve_lookupload_or_retrieve_vocabload_vocabmorphemepiece_cache_dirmorphemepiece_lookupmorphemepiece_tokenizemorphemepiece_vocabprepare_vocabset_morphemepiece_cache_dir

Dependencies:bitbit64cachemclicliprcpp11crayondigestdlrfastmapfastmatchfsgluehmslifecyclemagrittrmemoisemorphemepiece.datapiecemakerpillarpkgconfigprettyunitsprogresspurrrR6rappdirsreadrrlangstringistringrtibbletidyselecttzdbutf8vctrsvroomwithr

Generating a Vocabulary and Lookup

Rendered fromgenerating_vocab.Rmdusingknitr::rmarkdownon May 26 2026.

Last update: 2021-10-26
Started: 2021-07-29

Testing the fall-through algorithm

Rendered fromalgorithm_test.Rmdusingknitr::rmarkdownon May 26 2026.

Last update: 2021-09-06
Started: 2021-07-29