Package: morphemepiece 1.2.3
morphemepiece: Morpheme Tokenization
Tokenize text into morphemes. The morphemepiece algorithm uses a lookup table to determine the morpheme breakdown of words, and falls back on a modified wordpiece tokenization algorithm for words not found in the lookup table.
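The lookup-then-fallback flow described above can be sketched in base R. This is a toy illustration only, not the package's implementation: `toy_lookup`, `toy_vocab`, `toy_wordpiece()`, and `toy_tokenize()` are hypothetical names, the entries are made up, the `##` continuation marker is an assumption borrowed from wordpiece, and the fallback is plain greedy longest-match wordpiece rather than the modified variant the package uses.

```r
# Toy lookup table: word -> morpheme breakdown (hypothetical entries).
toy_lookup <- list(
  walked = c("walk", "##ed"),
  cats = c("cat", "##s")
)

# Toy vocabulary used only by the fallback step (hypothetical entries).
toy_vocab <- c("re", "play", "##play", "##able", "##s")

# Greedy longest-match-first wordpiece split.
# Returns the unknown token if some part of the word cannot be matched.
toy_wordpiece <- function(word, vocab, unk = "[UNK]") {
  pieces <- character(0)
  start <- 1L
  n <- nchar(word)
  while (start <= n) {
    end <- n
    found <- NULL
    while (end >= start) {
      candidate <- substr(word, start, end)
      # Non-initial pieces carry the continuation marker.
      if (start > 1L) candidate <- paste0("##", candidate)
      if (candidate %in% vocab) {
        found <- candidate
        break
      }
      end <- end - 1L
    }
    if (is.null(found)) return(unk)
    pieces <- c(pieces, found)
    start <- end + 1L
  }
  pieces
}

# Try the lookup table first; fall back to wordpiece for unlisted words.
toy_tokenize <- function(word, lookup = toy_lookup, vocab = toy_vocab) {
  hit <- lookup[[word]]
  if (!is.null(hit)) return(hit)
  toy_wordpiece(word, vocab)
}

toy_tokenize("walked")      # found in the lookup table
toy_tokenize("replayable")  # not in the table, split by the fallback
```

In the package itself, the exported `morphemepiece_tokenize()` function performs the tokenization, with the vocabulary and lookup table supplied by `morphemepiece_vocab()` and `morphemepiece_lookup()`.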
Authors:
morphemepiece_1.2.3.tar.gz
morphemepiece_1.2.3.zip (r-4.7) | morphemepiece_1.2.3.zip (r-4.6) | morphemepiece_1.2.3.zip (r-4.5)
morphemepiece_1.2.3.tgz (r-4.6-any) | morphemepiece_1.2.3.tgz (r-4.5-any)
morphemepiece_1.2.3.tar.gz (r-4.7-any) | morphemepiece_1.2.3.tar.gz (r-4.6-any)
morphemepiece_1.2.3.tgz (r-4.6-emscripten)
manual.pdf | manual.html
card.svg | card.png
morphemepiece/json (API)
NEWS
| # Install 'morphemepiece' in R: |
| install.packages('morphemepiece', repos = c('https://jonthegeek.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker: https://github.com/macmillancontentscience/morphemepiece/issues
Last updated from: bc071b1a03. Checks: 7 NOTE, 2 OK. Indexed: no.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | NOTE | 128 | | |
| source / vignettes | OK | 193 | | |
| linux-release-x86_64 | NOTE | 157 | | |
| macos-release-arm64 | NOTE | 77 | | |
| macos-oldrel-arm64 | NOTE | 102 | | |
| windows-devel | NOTE | 80 | | |
| windows-release | NOTE | 76 | | |
| windows-oldrel | NOTE | 78 | | |
| wasm-release | OK | 117 | | |
Exports: load_lookup, load_or_retrieve_lookup, load_or_retrieve_vocab, load_vocab, morphemepiece_cache_dir, morphemepiece_lookup, morphemepiece_tokenize, morphemepiece_vocab, prepare_vocab, set_morphemepiece_cache_dir
Dependencies: bit, bit64, cachem, cli, clipr, cpp11, crayon, digest, dlr, fastmap, fastmatch, fs, glue, hms, lifecycle, magrittr, memoise, morphemepiece.data, piecemaker, pillar, pkgconfig, prettyunits, progress, purrr, R6, rappdirs, readr, rlang, stringi, stringr, tibble, tidyselect, tzdb, utf8, vctrs, vroom, withr
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| morphemepiece: Morpheme Tokenization | morphemepiece-package |
| Load a morphemepiece lookup file | load_lookup |
| Load a lookup file, or retrieve from cache | load_or_retrieve_lookup |
| Load a vocabulary file, or retrieve from cache | load_or_retrieve_vocab |
| Load a vocabulary file | load_vocab |
| Retrieve Directory for Morphemepiece Cache | morphemepiece_cache_dir |
| Tokenize Sequence with Morpheme Pieces | morphemepiece_tokenize |
| Format a Token List as a Vocabulary | prepare_vocab |
| Set a Cache Directory for Morphemepiece | set_morphemepiece_cache_dir |
