Release v0.7.0 · JuliaText/TextAnalysis.jl

TextAnalysis v0.7.0

Diff since v0.6.0

Closed issues:

Feature Request: Part of speech tagging (#2)
Implement Named Entity Recognition (NER) (#117)
Can a new release be tagged? (#139)
Need API documentation (#146)
Extend Naive Bayes Classifier to support the various document types (#152)
Summarize function throws error for docs with less than 5 sentences. (#153)
UndefVarError when prepare! called on Corpus (#171)
Need to export Flux, Tracker (#178)
Docs and docstring for Sentiment Analysis model needs fixing (#182)
NaiveBayesClassifier scope error. (#192)
APIs to avoid datatype constraint between CorpusLoaders.jl and TextAnalysis.jl (#195)
Add entry for ULMFiT in docs/make.jl (#196)
Unexpected behaviour of ngram(sd, 3) (#202)
"resulting" bug (#205)
Statistical tokenization algorithms (#207)
Trying to use NaiveBayesClassifier results in UndefVarError (#216)

Merged pull requests:

Simple document classifier (AKA spam filter) (#106) (@MikeInnes)
Average Perceptron POS Tagger (Issue #2) (#131) (@ComputerMaestro)
Remove HTML style tags in preprocessing (#137) (@phereford)
PR: To address performance issues with stopword removal (#141) (@asbisen)
Indentation fix patch (#142) (@Ayushk4)
Fix deprecated function in extended example (#144) (@ViralBShah)
Add characters to list of punctuations (#145) (@asbisen)
Add API documentation (#147) (@aquatiko)
Update ngramizer.jl (#148) (@djokester)
Add offline Documentation (Docstrings) to the codebase (#150) (@Ayushk4)
Documentation for Bayes.jl (#151) (@Ayushk4)
Update summarizer.jl (#154) (@Ayushk4)
Fix deprecations in show.jl (#155) (@Ayushk4)
Added ROUGE Score to TextAnalysis.jl (#156) (@djokester)
allow multiple ngram complexity in NGramDocument, ngrams and ngrammize (#157) (@tanmaykm)
Update the documentation reflecting changes in show.jl (#159) (@Ayushk4)
Add functions for Tagging Schemes and Conversion. (#161) (@Ayushk4)
Conditional Random Fields (#162) (@Ayushk4)
BM25, Co-occurrence Matrix, faster ROUGE, Fixing LSA. (#165) (@Ayushk4)
Use datadeps for AvgPerceptronTagger, add pos tagging over document types (#166) (@Ayushk4)
Named Entity Recognition (#167) (@Ayushk4)
Add API for Part of Speech Tagging (#169) (@Ayushk4)
Add favicon to the docs (#170) (@Ayushk4)
Fix prepare! on strip_whitespace (#172) (@Ayushk4)
Readme updated. Docs edited to provide API Reference online. (#173) (@Ayushk4)
ULMFiT (#179) (@aviks)
Fix Sequence Labelling Models, fixes #178 (#180) (@Ayushk4)
Drop support for 0.7 and add support for 1.3 (#181) (@Ayushk4)
Minor fix of doc and docstring of Sentiment Analysis (#184) (@tejasvaidhyadev)
Remove duplicate entries in Project.toml, and fix a broken build (#189) (@DilumAluthge)
Bump version number from "0.6.0" to "0.7.0" (#190) (@DilumAluthge)
Install TagBot as a GitHub Action (#194) (@JuliaTagBot)
updated docs/make.jl (#198) (@tejasvaidhyadev)
make DTM type generic (#199) (@baggepinnen)
bug fix in get_sentiment function (#206) (@tejasvaidhyadev)
Language Model Interface (#210) (@tejasvaidhyadev)
Modify loop in initial assignments of lda to use sparse structure. (#213) (@jmoralez)
export NaiveBayesClassifier (#217) (@agarie)
Extend NaiveBayesClassifier to support Documents as input #152 (#219) (@KimBue)
Minor Fixes (#220) (@tejasvaidhyadev)
LM doc fix (#233) (@tejasvaidhyadev)
Split project, separate TextModels (#234) (@aviks)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.7.0

TextAnalysis v0.7.0