v0.7.0
TextAnalysis v0.7.0
Closed issues:
- Feature Request: Part of speech tagging (#2)
- Implement Named Entity Recognition (NER) (#117)
- Can a new release be tagged? (#139)
- Need API documentation (#146)
- Extend Naive Bayes Classifier to support the various document types (#152)
- Summarize function throws error for docs with less than 5 sentences. (#153)
- UndefVarError when
prepare!
called on Corpus (#171) - Need to export Flux, Tracker (#178)
- Docs and docstring for Sentiment Analysis model needs fixing (#182)
- NaiveBayesClassifier scope error. (#192)
- APIs to avoid datatype constraint between CorpusLoaders.jl and TextAnalysis.jl (#195)
- Add entry for ULMFiT in docs/make.jl (#196)
- Unexpected behaviour of ngram(sd, 3) (#202)
- "resulting" bug (#205)
- Statistical tokenization algorithms (#207)
- Trying to use NaiveBayesClassifier results in UndefVarError (#216)
Merged pull requests:
- Simple document classifier (AKA spam filter) (#106) (@MikeInnes)
- Average Perceptron POS Tagger (Issue #2) (#131) (@ComputerMaestro)
- Remove HTML style tags in preprocessing (#137) (@phereford)
- PR: To address performance issues with stopword removal (#141) (@asbisen)
- Indentation fix patch (#142) (@Ayushk4)
- Fix deprecated function in extended example (#144) (@ViralBShah)
- Add characters to list of punctuations (#145) (@asbisen)
- Add API documentation (#147) (@aquatiko)
- Update ngramizer.jl (#148) (@djokester)
- Add offline Documentation (Docstrings) to the codebase (#150) (@Ayushk4)
- Documentation for Bayes.jl (#151) (@Ayushk4)
- Update summarizer.jl (#154) (@Ayushk4)
- Fix deprecations in show.jl (#155) (@Ayushk4)
- Added ROUGE Score to TextAnalysis.jl (#156) (@djokester)
- allow multiple ngram complexity in NGramDocument, ngrams and ngrammize (#157) (@tanmaykm)
- Update the documentation reflecting changes in show.jl (#159) (@Ayushk4)
- Add functions for Tagging Schemes and Conversion. (#161) (@Ayushk4)
- Conditional Random Fields (#162) (@Ayushk4)
- BM25, Co-occurrence Matrix, faster ROUGE, Fixing LSA. (#165) (@Ayushk4)
- Use datadeps for AvgPerceptronTagger, add pos tagging over document types (#166) (@Ayushk4)
- Named Entity Recognition (#167) (@Ayushk4)
- Add API for Part of Speech Tagging (#169) (@Ayushk4)
- Add favicon to the docs (#170) (@Ayushk4)
- Fix prepare! on strip_whitespace (#172) (@Ayushk4)
- Readme updated. Docs edited to provide API Reference online. (#173) (@Ayushk4)
- ULMFiT (#179) (@aviks)
- Fix Sequence Labelling Models, fixes #178 (#180) (@Ayushk4)
- Drop support for 0.7 and add support for 1.3 (#181) (@Ayushk4)
- Minor fix of doc and docstring of Sentiment Analysis (#184) (@tejasvaidhyadev)
- Remove duplicate entries in Project.toml, and fix a broken build (#189) (@DilumAluthge)
- Bump version number from "0.6.0" to "0.7.0" (#190) (@DilumAluthge)
- Install TagBot as a GitHub Action (#194) (@JuliaTagBot)
- updated docs/make.jl (#198) (@tejasvaidhyadev)
- make DTM type generic (#199) (@baggepinnen)
- bug fix in get_sentiment function (#206) (@tejasvaidhyadev)
- Language Model Interface (#210) (@tejasvaidhyadev)
- Modify loop in initial assignments of lda to use sparse structure. (#213) (@jmoralez)
- export NaiveBayesClassifier (#217) (@agarie)
- Extend NaiveBayesClassifier to support Documents as input #152 (#219) (@KimBue)
- Minor Fixes (#220) (@tejasvaidhyadev)
- LM doc fix (#233) (@tejasvaidhyadev)
- Split project, separate TextModels (#234) (@aviks)