from Hacker News

Show HN: Diff for Formatted Documents (.docx)

by mkreis on 10/28/21, 12:27 PM with 3 comments

  • by mkreis on 10/28/21, 12:31 PM

    Co-Founder here. BlitzCompare is a spin-off of our main product at EDITIVE (we're building the "Github for documents").

    BlitzCompare is not just another diff: The big advantages are:

    * Understands complex structured documents like .docx Detects moved sections, even if there are small changes within those sections, which are then highlighted separately.

    * You can navigate between source and destination of those moves by clicking on them

    * Formatting is shown as an individual type of change, and what has been modified is shown as a tool-tip

    => Upload your own docs or click on the "Show example" button to have a look for yourself.

    A note regarding PDF: Please be aware that PDF files contain almost no formatting information, sometimes not even text (only images), therefore results with PDF will depend on the quality of the input file.

    Btw. all uploaded documents are immediately deleted after processing.

  • by newsbinator on 10/28/21, 2:44 PM

    Well done. I was working on something similar (diff for formatted html that handles moved sections) and there were a lot of gotchas.

    I'd love to find a node package for this, if anybody has a suggestion.