Skip to content

Releases: bnosac/udpipe

CRAN release 0.2

13 Nov 10:43
Compare
Choose a tag to compare
  • Added data preparation elements for standard text mining flows namely:
    cooccurrence
    collocation
    document_term_frequencies
    document_term_matrix
    dtm_tfidf
    dtm_remove_terms
    dtm_remove_lowfreq
    dtm_remove_tfidf
    dtm_reverse
    dtm_cor
    txt_collapse
    txt_sample
    txt_show
    txt_highlight
    txt_recode
    txt_previous
    txt_next
    txt_nextgram
    unique_identifier
  • Added predict.LDA_VEM and predict.LDA_Gibbs
  • Renamed dataset annotation_params to udpipe_annotation_params
  • Added example datasets called brussels_listings, brussels_reviews, brussels_reviews_anno
  • Use path.expand on conll-u files which are used for training
  • udpipe_download_model now downloads from https://raw.githubusercontent.com/jwijffels/udpipe.models.ud.2.0/master instead of https://github.com/jwijffels/udpipe.models.ud.2.0/raw/master
  • Another stab at trying to Fix Solaris enum issue

CRAN release version 0.1.1

13 Sep 17:47
Compare
Choose a tag to compare
  • Fix issue on Solaris builds at CRAN, namely: error: expected primary-expression before ‘enum’
  • Use ufal::udpipe namespace directly
  • Documentation fixes

CRAN release version 0.1

01 Sep 18:32
Compare
Choose a tag to compare

CHANGES IN udpipe VERSION 0.1

  • Initial release based on UDPipe commit a2ebb99d243546f64c95d0faf36882bb1d67a670
  • Allow to do annotation (tokenisation, POS tagging, Lemmatisation, Dependency parsing)
  • Allow to build your own UDPipe model based on data in CONLL-U format
  • Convert the output of udpipe_annotate to a data.frame
  • Allow to download models from https://github.com/jwijffels/udpipe.models.ud.2.0
  • Add vignettes