parquet
Here are 439 public repositories matching this topic...
A large-scale entity and relation database supporting aggregation of properties
Updated May 17, 2024 - Java
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Updated May 14, 2024 - Python
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.
Updated Dec 2, 2023 - Python
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Updated May 19, 2024 - Rust
Fully asynchronous, pure JavaScript implementation of the Parquet file format.
Updated Apr 13, 2023 - JavaScript
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Updated May 18, 2024 - Go
Quilt is a data mesh for connecting people with actionable data.
Updated May 17, 2024 - Jupyter Notebook
A tool for batch loading data files (JSON, Parquet, CSV, TSV) into Elasticsearch.
Updated Jul 3, 2022 - Python
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Updated Apr 18, 2024 - Java
A collection of tools for extracting FHIR resources, plus analytics services built on top of that data.
Updated May 17, 2024 - Java