PyRank

Spark Datasource Python Packages

Python packages with the GitHub topic spark-datasource. Sorted by relevance, with stars and monthly downloads.
StabRise/pyspark-pdf

PDF DataSource for Apache Spark that reads PDF files directly into a DataFrame and runs OCR on them.

243 81 4
    • Data from PyPI, GitHub, ClickHouse, and BigQuery