tsne-algorithm
GPU-accelerated t-SNE for CUDA with Python bindings
CUDA-accelerated PyTorch implementation of t-SNE