common-voice
Python library for converting image and object annotations formats into each other.
Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).