PyRank

Parquet Files Python Packages

Python packages with the GitHub topic parquet-files. Sorted by relevance, with stars and monthly downloads.
uber
petastorm

The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.

273K 2K 284
mongodb-labs
pymongoarrow

MongoDB integrations for Apache Arrow. Export MongoDB documents to NumPy arrays, Parquet files, and pandas DataFrames in one line of code.

97K 114 19
zachspar
parquet-py

A simple command-line interface and Python API for Parquet

5K 1 0
QTSurfer
lastra-convert

CLI converter for the Lastra columnar time series file format. Parquet / CSV / Arrow ↔️ Lastra round-trips.

2K 0 0
Tendo33
parq-cli

A powerful command-line tool for inspecting tabular files like Parquet, CSV, and XLSX

916 2 0
IgnacioMB
csvcli

A lightweight command-line tool to browse and query CSV, Excel, and Apache Parquet files, regardless of their size.

221 3 0
uber
hops-petastorm

The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.

168 2K 284
sami5001
parquet-converter

Python utility to convert TXT and CSV files to Parquet

153 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery