Skip to content

oerpub/gdocs_structured_html5_markdown

Repository files navigation

Convert Google Documents HTML to structured well formed HTML

Ubuntu 16.04

Setup

pip install -r requirements.txt

Usage

python run.py

Open web browser at http://127.0.0.1:9000

Alternative use VirtualBox + Vagrant setup

vagrant up
vagrant ssh
cd /vagrant
python run.py

Open web browser at http://127.0.0.1:9000

Some public test documents

Screenshot

Known issues

  • no session/multiuser support (one user at a time).
  • no good support for big Google Docs documents with lots of images. Images are fetched into memory (yes, that's bad).
  • no good temporary file handling and deletion

About

Convert HTML, GDocs and Word HTML to CNXML and structured HTML

Resources

License

LGPL-3.0, Unknown licenses found

Licenses found

LGPL-3.0
LICENSE
Unknown
COPYING

Stars

Watchers

Forks

Packages

No packages published