Skip to content
This repository has been archived by the owner on May 21, 2020. It is now read-only.

Standalone filter for pasted MS Word HTML content; forked from TinyMCE

License

Notifications You must be signed in to change notification settings

jasonphillips/tinymce-word-paste-filter-archived

 
 

Repository files navigation

TinyMCE Word Paste Filter (standalone)

npm version Build Status npm downloads

What?

This project provides a standalone version of the filter function used by TinyMCE to clean up pasted MS Word content.

Why?

TinyMCE has one of the best implementations of Word cleanup around; they have tackled pernicious problems like restoring list items to actual lists when these arrive as <p> tags with a lot of odd styles and bullet characters. However, TinyMCE is no tiny library, and does not export that capability by itself.

This repo will remain an up-to-date fork of TinyMCE that builds only the Word Filter function (and its internal dependencies) into a standalone library much smaller (contributes ~50k if you minify your build) than the full TinyMCE (contribrutes at minimum >300k to your project -- with the internal Word clean function not exposed), published on npm as tinymce-word-paste-filter for easy incorporation into other projects.

How?

import wordFilter from 'tinymce-word-paste-filter';

// if you have incoming pasted Word data in your program...
const awfulWordHTML = '...'; // content here

// clean it up in one step
const cleaned = wordFilter(awfulWordHTML); 

Please note that this is intended to be bundled into browser applications; if running in a node environment for some reason, you'll need to have browser / dom globals (document, navigator, etc) available.

See the brief test under ./standalone/test/ and its fixtures.

Installation

The easiest way to use tinymce-word-paste-filter is to install it from npm and include it in your own React build process (using Browserify, Webpack, etc).

npm install tinymce-word-paste-filter --save

Building from source, testing

npm install
npm run build
npm run test

About

Standalone filter for pasted MS Word HTML content; forked from TinyMCE

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • TypeScript 93.6%
  • JavaScript 2.7%
  • CSS 2.3%
  • HTML 1.4%