commit | f1b61e748b6d6dffd39c3ebd391c55d6e0851449 | |
---|---|---|
author | Xin Li <delphij@google.com> | Sat Aug 14 06:30:58 2021 +0000 |
committer | Xin Li <delphij@google.com> | Sat Aug 14 06:30:58 2021 +0000 |
tree | f36e0f9da5855a1abb42f107a4e0733d79080e65 | |
parent | 189dd28362838fa19da8e2af7d34ada320cbccba | |
parent | 202ccc7cdef4b7d5c79dcfcc883083d2a9339a3b | |
Merge sc-dev-plus-aosp-without-vendor@7634622

Merged-In: I0e9307b76dfe9e0b9b5f1ad20ab52e84cc5f1b8b
Change-Id: Ib84685c96ac6b9448656882a2ad963b7248f3804
marisa-trie
MARISA: Matching Algorithm with Recursively Implemented StorAge
0.2.6
Matching Algorithm with Recursively Implemented StorAge (MARISA) is a static and space-efficient trie data structure. libmarisa is a C++ library that implements MARISA. The libmarisa package also contains a set of command line tools for building and operating a MARISA-based dictionary.
A MARISA-based dictionary supports not only lookup but also reverse lookup, common prefix search and predictive search.
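To make the four operations concrete, here is a minimal sketch of what each one returns. It does not use libmarisa: `ToyDictionary` is a hypothetical stand-in built on a sorted key list, and it assumes that a key's ID is its index in that list, which is not necessarily how a MARISA trie assigns IDs. Only the semantics of the operations carry over, not the storage layout or the API.

```cpp
#include <algorithm>
#include <cstddef>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a static dictionary (NOT the libmarisa API).
// Keys are kept sorted and unique; a key's ID is its index in that list,
// which is an assumption of this sketch.
class ToyDictionary {
 public:
  explicit ToyDictionary(std::vector<std::string> keys)
      : keys_(std::move(keys)) {
    std::sort(keys_.begin(), keys_.end());
    keys_.erase(std::unique(keys_.begin(), keys_.end()), keys_.end());
  }

  // Lookup: key -> ID, or nothing if the key is not registered.
  std::optional<std::size_t> lookup(const std::string& key) const {
    auto it = std::lower_bound(keys_.begin(), keys_.end(), key);
    if (it == keys_.end() || *it != key) return std::nullopt;
    return static_cast<std::size_t>(it - keys_.begin());
  }

  // Reverse lookup: ID -> key, or nothing if the ID is out of range.
  std::optional<std::string> reverse_lookup(std::size_t id) const {
    if (id >= keys_.size()) return std::nullopt;
    return keys_[id];
  }

  // Common prefix search: registered keys that are prefixes of the query.
  std::vector<std::string> common_prefix_search(
      const std::string& query) const {
    std::vector<std::string> matches;
    for (std::size_t len = 0; len <= query.size(); ++len) {
      std::string prefix = query.substr(0, len);
      if (lookup(prefix)) matches.push_back(prefix);
    }
    return matches;
  }

  // Predictive search: registered keys that start with the query.
  std::vector<std::string> predictive_search(const std::string& query) const {
    std::vector<std::string> matches;
    auto it = std::lower_bound(keys_.begin(), keys_.end(), query);
    for (; it != keys_.end() && it->compare(0, query.size(), query) == 0;
         ++it) {
      matches.push_back(*it);
    }
    return matches;
  }

 private:
  std::vector<std::string> keys_;
};
```

With the keys "a", "app", "apple", "apply" and "bat", a common prefix search for "apple" yields "a", "app" and "apple", while a predictive search for "app" yields "app", "apple" and "apply". A real MARISA dictionary answers the same queries from a compressed trie rather than a plain sorted list.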
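The biggest advantage of libmarisa is that its dictionaries are considerably more compact than those of other implementations. In the comparison below, the MARISA dictionary is about 7.4 times smaller than the darts-clone one and about 2.5 times smaller than the tx-trie one.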
Implementation | Size (bytes) | Remarks |
---|---|---|
darts-clone | 376,613,888 | Compacted double-array trie |
tx-trie | 127,727,058 | LOUDS-based trie |
marisa-trie | 50,753,560 | MARISA trie |
You can get the latest version via git clone. Then, you can generate a configure script via autoreconf -i. After that, you can build and install libmarisa and its command line tools via configure and make. For details, see also the documentation in docs.
$ git clone https://github.com/s-yata/marisa-trie.git
$ cd marisa-trie
$ autoreconf -i
$ ./configure --enable-native-code
$ make
$ make install