
.docx files are zip archives of plain-text XML files. In theory they could be tracked with git if you unzip them on every commit and re-zip them on every checkout.
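A minimal sketch of that unzip/zip round trip, using only Python's standard `zipfile` module. The function names `unpack_docx` and `pack_docx` are made up for illustration, and wiring them into git (hooks or filters) is left out. I have not checked whether Word accepts every archive repacked this way.

```python
import zipfile
from pathlib import Path


def unpack_docx(docx_path, out_dir):
    """Extract every member of the .docx zip archive into out_dir."""
    with zipfile.ZipFile(docx_path) as zf:
        zf.extractall(out_dir)


def pack_docx(src_dir, docx_path):
    """Zip the files under src_dir back into a single .docx archive."""
    src = Path(src_dir)
    with zipfile.ZipFile(docx_path, "w", zipfile.ZIP_DEFLATED) as zf:
        # Sorted traversal so the member order is the same on every run.
        for path in sorted(src.rglob("*")):
            if path.is_file():
                # Zip member names always use forward slashes.
                zf.write(path, path.relative_to(src).as_posix())
```

The idea would be to commit the unpacked directory and run `pack_docx` after checkout to get a file Word can open again.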



This doesn't work well when the XML is not hand-edited but is a dump of some in-memory structure. There are many cases where, for example, order is not important, so it can be essentially random during serialization, and that then shows up as spurious diffs.
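One partial mitigation is to canonicalize each XML part before committing it. The sketch below uses `xml.etree.ElementTree.canonicalize` (Python 3.8+), which writes attributes in a fixed order. The wrapper name `normalize_xml` is made up, and this only covers attribute ordering and similar serialization noise: it does not reorder child elements, so a serializer that shuffles those would still produce diffs.

```python
from xml.etree.ElementTree import canonicalize


def normalize_xml(xml_text):
    """Return the C14N 2.0 canonical form of an XML string.

    Attributes come out in a fixed order, so two dumps that differ
    only in attribute order normalize to the same text.
    """
    return canonicalize(xml_data=xml_text)


# Same element, attributes serialized in a different order.
first = '<item id="1" name="x"/>'
second = '<item name="x" id="1"/>'
print(normalize_xml(first) == normalize_xml(second))
```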


I actually tried hashing .docx files to keep track of them. It turns out they contain timestamp data, so the resulting file is different every time it is saved.
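A possible workaround, sketched under some assumptions: hash the uncompressed contents of each archive member instead of the raw file bytes, which ignores the per-entry modification times stored in the zip headers. The function name `docx_content_hash` is made up, and skipping `docProps/core.xml` (the part that holds the created/modified dates) by default is my guess at where the remaining timestamps live. Other parts may still vary between saves.

```python
import hashlib
import zipfile


def docx_content_hash(docx_path, skip=("docProps/core.xml",)):
    """Hash member names and uncompressed contents of a .docx.

    Zip header metadata (entry timestamps, compression level) is
    ignored, and members listed in `skip` are left out entirely.
    """
    digest = hashlib.sha256()
    with zipfile.ZipFile(docx_path) as zf:
        # Sort names so member order in the archive does not matter.
        for name in sorted(zf.namelist()):
            if name in skip:
                continue
            data = zf.read(name)
            # Delimit name and length so member boundaries are unambiguous.
            digest.update(name.encode("utf-8") + b"\0")
            digest.update(len(data).to_bytes(8, "big"))
            digest.update(data)
    return digest.hexdigest()
```

Two saves of the same document should then give the same digest as long as the only differences are in the zip metadata or the skipped parts.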



