Release History

Version 0.1.1 (2020-11-24)

This is a bugfix release that focuses on web_monitoring_diff.html_diff_render() and the server.

  • Fix an issue where the diffing server could reset the process pool that manages the actual diffs multiple times unnecessarily, wasting memory and CPU. If you are tracking logs and errors, this also makes error messages about the diffing server clearer: you’ll see “BrokenProcessPool” instead of “‘NoneType’ object does not support item assignment”. (#38)

  • Ensure the server shuts down gracefully when you press ctrl+c or otherwise send it a SIGINT signal. (#44)

  • Fix web_monitoring_diff.html_diff_render() to make sure the spacing of text and tags in the HTML source code of the diff matches the original. This resolves display issues on pages where CSS is used to treat spacing as significant. (#40)

  • Improve handling of lazy-loaded images in web_monitoring_diff.html_diff_render(). When images are lazy-loaded via JS, they usually use the data-src or data-srcset attributes, and we now check those, too. Additionally, if two images have no detectable URLs, we now treat them as the same, rather than different. (#39)

  • Stop showing inline scripts and styles in web_monitoring_diff.html_diff_render(). These still get wrapped with <del> or <ins> elements, but they don’t show up visually since they aren’t elements that should be visually rendered. (#43)
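
As a rough illustration of the lazy-loaded image handling described above, the comparison could be sketched as follows. This is not the actual code in web_monitoring_diff: the helper names image_urls() and same_image() are hypothetical, and the srcset parsing is deliberately naive.

```python
# Hypothetical sketch; NOT the actual web_monitoring_diff implementation.
# Attribute names come from the release note; everything else is assumed.
URL_ATTRIBUTES = ("src", "data-src")
SRCSET_ATTRIBUTES = ("srcset", "data-srcset")


def image_urls(attributes):
    """Collect every candidate URL from a dict of <img> attributes."""
    urls = set()
    for name in URL_ATTRIBUTES:
        value = attributes.get(name, "").strip()
        if value:
            urls.add(value)
    for name in SRCSET_ATTRIBUTES:
        # A srcset is a comma-separated list of "URL descriptor" entries.
        # Splitting on commas is a simplification: it breaks on URLs that
        # themselves contain commas.
        for candidate in attributes.get(name, "").split(","):
            parts = candidate.split()
            if parts:
                urls.add(parts[0])
    return urls


def same_image(a_attributes, b_attributes):
    """Treat two images as the same if they share a URL or both have none."""
    a_urls = image_urls(a_attributes)
    b_urls = image_urls(b_attributes)
    if not a_urls and not b_urls:
        # No detectable URLs on either side: consider them unchanged.
        return True
    return bool(a_urls & b_urls)
```

Under these assumptions, an image that moves its URL from src to data-src still compares as the same image, and two images with no URL at all are no longer flagged as a change.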

Version 0.1.0

This project used to be a part of web-monitoring-processing, which contains a wide variety of libraries, scripts, and other tools for working with data across all the various parts of EDGI’s Web Monitoring project. The goal of this initial release is to create a new, more focused package containing the diff-related tools so they can be more easily used by others.

This release is more or less the same code that was a part of web-monitoring-processing, although the public API has been rearranged very slightly to make sense in this new, stand-alone context.