Aperiodic data update on 2015-11-26(Thu)
overlast committed Nov 26, 2015
1 parent ee0baa1 commit bfe41cf
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README.ja.md
@@ -12,7 +12,7 @@ Web上の文書の解析をする際には、この辞書と標準のシステ

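 ## Features
 ### Advantages
-- Records about 2.03 million pairs (including duplicate entries) of surface form (written form) and furigana for words, such as named entities, that MeCab's standard system dictionary cannot segment correctly
+- Records about 2.015 million pairs (including duplicate entries) of surface form (written form) and furigana for words, such as named entities, that MeCab's standard system dictionary cannot segment correctly
 - This dictionary is updated automatically on the development server
 - Updates are planned for the beginning and the middle of each month
 - Because it draws on language resources on the Web, new named entities can be recorded at each update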
2 changes: 1 addition & 1 deletion README.md
@@ -19,7 +19,7 @@ When you analyze the Web documents, it's better to use this system dictionary an

 ## Pros and Cons
 ### Pros
-- Records about 2.03 million pairs (including duplicate entries) of surface form and furigana (kana indicating the pronunciation of kanji) for words, such as named entities, that cannot be tokenized correctly with the default system dictionary of MeCab.
+- Records about 2.015 million pairs (including duplicate entries) of surface form and furigana (kana indicating the pronunciation of kanji) for words, such as named entities, that cannot be tokenized correctly with the default system dictionary of MeCab.
 - The update process for this dictionary runs automatically on the development server.
 - I plan to renew this dictionary twice a month, at the beginning and in the middle of the month.
 - Because each renewal draws on language resources on the Web, new named entities can be recorded.
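
The figure changed in this commit is a count of surface/furigana pairs "including duplicate entries". As a rough illustration of what such a count means, the sketch below tallies both the total and the distinct pairs in dictionary CSV text. It assumes the 13-column mecab-ipadic CSV layout (surface form in column 0, reading in column 11). The `count_pairs` helper and the sample rows are hypothetical and are not taken from the dictionary or from the author's update scripts.

```python
import csv
import io

# Assumption: entries use the 13-column mecab-ipadic CSV layout, where
# column 0 is the surface form and column 11 is the reading in katakana.
SURFACE_COL = 0
READING_COL = 11


def count_pairs(csv_text):
    """Return (total_pairs, distinct_pairs) for dictionary CSV text."""
    total = 0
    distinct = set()
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) <= READING_COL:
            continue  # skip blank or malformed lines
        total += 1
        distinct.add((row[SURFACE_COL], row[READING_COL]))
    return total, len(distinct)


# Made-up sample rows for illustration only (not real dictionary data).
# The first two rows differ only in cost, so they form one duplicate pair.
SAMPLE = "\n".join([
    "東京スカイツリー,1288,1288,4000,名詞,固有名詞,一般,*,*,*,東京スカイツリー,トウキョウスカイツリー,トウキョウスカイツリー",
    "東京スカイツリー,1288,1288,5000,名詞,固有名詞,一般,*,*,*,東京スカイツリー,トウキョウスカイツリー,トウキョウスカイツリー",
    "羽田空港,1288,1288,4000,名詞,固有名詞,一般,*,*,*,羽田空港,ハネダクウコウ,ハネダクーコー",
])

total, distinct = count_pairs(SAMPLE)
print(total, distinct)
```

A count "including duplicate entries" corresponds to `total` here, while `distinct` shows how many pairs remain once repeated surface/reading combinations are collapsed.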