pySCDC

(S,C)-dense coding in Python 3

Usage

Put the text file (named text.txt by default) in the working directory and run the script. It creates a vocabulary in vocab.txt, as well as encode and decode directories.

The vocabulary consists of the source file's MD5 hash followed by the text elements (words and punctuation), sorted by descending frequency of occurrence. By default, the supported punctuation includes the following symbols: { .,!?:; }. This set can be modified via the PATTERN string defined at the top of the script.
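As an illustration of the vocabulary step described above, here is a minimal sketch. It is not the code from scdc.py: the `build_vocab` helper and the exact regular expression assigned to `PATTERN` are assumptions made for this example (in particular, whitespace is treated as a separator here rather than as a token).

```python
import hashlib
import re
from collections import Counter

# Assumed pattern for illustration: a run of word characters,
# or a single punctuation mark from the default set.
PATTERN = r"\w+|[.,!?:;]"


def build_vocab(text):
    """Return (md5_hex, tokens), tokens sorted by descending frequency."""
    # Hash of the source text, so a vocabulary can be matched to its file.
    digest = hashlib.md5(text.encode("utf-8")).hexdigest()
    counts = Counter(re.findall(PATTERN, text))
    # most_common() orders by descending count; equal counts keep
    # the order in which the tokens were first seen.
    tokens = [token for token, _ in counts.most_common()]
    return digest, tokens


digest, tokens = build_vocab("to be, or not to be, that is the question.")
print(digest)
print(tokens)
```

The position of a token in this list is its rank, and the rank is what the dense code encodes: the more frequent the token, the shorter its codeword.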

The encode directory will contain 255 variants of the dense code, one for each S value in [1, 255].
The decode directory will contain 255 copies of the original text if everything worked correctly, each decoded from the corresponding encode file.
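For reference, the sketch below shows how a single vocabulary rank can be turned into an (S,C)-dense codeword and back for a given S, following the scheme from the Brisaboa et al. paper listed in the references. It assumes byte-oriented codes with S + C = 256, where byte values below S are stoppers (they end a codeword) and the remaining values are continuers. The function names `scdc_encode` and `scdc_decode` are hypothetical, and the byte layout used by scdc.py may differ.

```python
def scdc_encode(rank, s):
    """Encode a 0-based vocabulary rank as an (S,C)-dense codeword."""
    c = 256 - s              # number of continuer byte values
    out = [rank % s]         # final byte is a stopper in [0, s)
    x = rank // s
    while x > 0:
        x -= 1
        out.append(s + x % c)    # continuer byte in [s, 256)
        x //= c
    # Bytes were produced last-to-first, so reverse them.
    return bytes(reversed(out))


def scdc_decode(data, s):
    """Decode a concatenation of codewords back into a list of ranks."""
    c = 256 - s
    ranks = []
    x = 0
    for b in data:
        if b >= s:               # continuer: keep accumulating
            x = x * c + (b - s) + 1
        else:                    # stopper: codeword is complete
            ranks.append(x * s + b)
            x = 0
    return ranks


# Round trip for every S value in [1, 255].
sample = [0, 1, 127, 128, 1000, 70000]
for s in range(1, 256):
    stream = b"".join(scdc_encode(r, s) for r in sample)
    assert scdc_decode(stream, s) == sample
```

With S = 128, ranks 0 to 127 take one byte each and rank 128 is the first two-byte codeword. A smaller S leaves fewer one-byte codewords but more continuers, so longer vocabularies fit in two bytes; writing one output per S value makes it possible to compare the resulting sizes.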

The script is only lightly tested; consider it a sample implementation.

References

(S,C)-Dense Coding: An Optimized Compression Code for Natural Language Text Databases (by Brisaboa, Fariña et al.; link).
On the Usefulness of Fibonacci Compression Codes (by Klein, Ben-Nissan; link).
http://vios.dc.fi.udc.es/codes/semistatic.html (archive.org)
