Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Updated Jun 11, 2024 - Python
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Data sources used by the Big Data Innovation Team
CBRAIN is a flexible Ruby on Rails framework for accessing and processing large data on high-performance computing infrastructures.
Open pixelated STEM framework
LiberTEM correlation and refinement library
CrateDB Toolkit.
Kubernetes-native platform to run massively parallel data/streaming jobs
VIProject interprets Kinect camera data, showcasing RGB, depth, and IR videos. It serves as a tool for developing and teaching visual intelligence applications.
Data and tools for generating and inspecting OLMo pre-training data.
Advanced and Fast Data Transformation in R
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
Framework for processing and filtering datasets
This repository was created as part of the Data Science coursework at Birzeit University
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
📋 Acceptance testing of rules authored by the ACT Rules Community Group (@act-rules) and implemented by Alfa