mimic
Bloatectomy: a method for the identification and removal of duplicate text in the bloated notes of electronic health records and other documents.
I/O, HDF5, lazy loading, and processing pipelines for EHRs and generic PyTrees, aimed at JAX-friendly tasks, mainly ML.