"# Apache tika + java-diff-utils + diff2html" Examples to follow
forked from bitsevn/apache-tika-file-diff
-
Notifications
You must be signed in to change notification settings - Fork 0
Uses Apache Tika parser libraries to extract text out of a variety of file formats (pdf, excel, word, mhtml, images, txt, csv, etc.) and then uses java-diff-utils to generate unified diff between two versions of the files. This unified diff can be fed to diff2html library to show side by side diff on browser
manon2333/apache-tika-file-diff
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Uses Apache Tika parser libraries to extract text out of a variety of file formats (pdf, excel, word, mhtml, images, txt, csv, etc.) and then uses java-diff-utils to generate unified diff between two versions of the files. This unified diff can be fed to diff2html library to show side by side diff on browser
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Java 100.0%