CHANGES IN sentencepiece VERSION 0.2.3

  • fix R CMD check warning introduced by the snprintf change in version 0.2.2.
    • in third_party/protobuf-lite/strutil.cc:506:33: warning: argument to ‘sizeof’ in ‘int snprintf(char*, size_t, const char*, ...)’ call is the same expression as the destination; did you mean to provide an explicit length? [-Wsizeof-pointer-memaccess]
    • this part of third_party/protobuf-lite/strutil.cc was not used in sentencepiece

CHANGES IN sentencepiece VERSION 0.2.2

  • use snprintf instead of sprintf to address the R CMD check deprecation note on M1 Mac

CHANGES IN sentencepiece VERSION 0.2.1

  • Fix for clang-UBSAN error

CHANGES IN sentencepiece VERSION 0.2

  • Fix wordpiece bug for 1-character words. (@jonthegeek, #4)
  • Upgraded to sentencepiece release v0.1.96

CHANGES IN sentencepiece VERSION 0.1.3

  • Fix wordpiece bug for 1-character words. (@jonthegeek, #4)
  • Fix Solaris installation issue related to incorrect usage of pointer as a function
  • Also download the binary model in sentencepiece_download_model as it can be loaded with word2vec::read.wordvectors
  • read_word2vec now uses word2vec::read.wordvectors from word2vec >= 0.2.0
  • added BPEembed and predict.BPEembed
  • allow subword regularisation by adding the nbest and alpha options to sentencepiece_encode; sentencepiece_decode changed accordingly
  • Added txt_remove_
  • Upgrade sentencepiece to release v0.1.91 commit a32d7dc6ce6f383a65ad6e1cbe1983f94ab11932 which has subword regularisation for BPE

CHANGES IN sentencepiece VERSION 0.1.2

  • Fix Solaris installation issue which used log of uint64 which is not defined on Solaris

CHANGES IN sentencepiece VERSION 0.1.1

  • Added verbose argument in sentencepiece

CHANGES IN sentencepiece VERSION 0.1.0