From c11da266aaa63ad7648fe2177505f5b334b5e26c Mon Sep 17 00:00:00 2001 From: Dirkjan Ochtman Date: Thu, 26 Nov 2020 11:39:57 +0100 Subject: [PATCH] Update performance claim in README --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 1d48041..b0ace9f 100644 --- a/README.md +++ b/README.md @@ -13,7 +13,7 @@ Corpus][corpus], as described by Thorsten Brants and Alex Franz, and [distribute Linguistic Data Consortium. Note that this data **"may only be used for linguistic education and research"**, so for any other usage you should acquire a different data set. -For the microbenchmark included in this repository, word-segmenters is ~5x faster than +For the microbenchmark included in this repository, word-segmenters is ~17x faster than the Python implementation. Further optimizations are planned -- see the [issues][issues]. The API has been carefully constructed so that multiple segmentations can share the underlying state (mainly the unigram and bigram maps) to allow parallel usage.