Data Science
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
Here are 44,289 public repositories matching this topic...
Time series machine model to predict crop growth as part of an AgAID internship
Updated Jun 12, 2024 - Python
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Updated Jun 12, 2024 - Python
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Updated Jun 12, 2024 - Python
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
Updated Jun 12, 2024 - Go
PostgreSQL vector database extension for building AI applications
Updated Jun 12, 2024 - C
Using a bulldozer auction sale price dataset to predict the price of bulldozers.
Updated Jun 12, 2024 - Jupyter Notebook
Snowflake Snowpark Python API
Updated Jun 12, 2024 - Python
Incometric predicts income using machine learning on demographic and socio-economic data. It involves data preprocessing, feature engineering, model training, evaluation, and deployment. This tool aids financial planning and policy-making, offering accurate predictions through a web application and API.
Updated Jun 12, 2024 - Jupyter Notebook
Source code for https://www.datagaz.fr
Updated Jun 12, 2024 - JavaScript
Tracking outages for Puerto Rico's private electricity distributor with GitHub Actions.
Updated Jun 11, 2024 - Python
An ASL detection script utilizing a TensorFlow image classification model trained from scratch. It is tailored to recognize American Sign Language (ASL) alphabet letters from live video streams, and provides documentation covering the neural network architecture, installation, dataset details, training procedures, and real-time detection.
Updated Jun 11, 2024 - Python
🔥 A website showcasing my work
Updated Jun 11, 2024 - JavaScript
Statistical Machine Intelligence & Learning Engine
Updated Jun 11, 2024 - Java
Apache Superset is a Data Visualization and Data Exploration Platform
Updated Jun 12, 2024 - TypeScript
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Updated Jun 12, 2024 - Java
Python binder for maidr library
Updated Jun 11, 2024 - HTML
Project repository for the development of a Question-Answering (QA) information retrieval system fine-tuned on customer queries.
Updated Jun 11, 2024 - Jupyter Notebook
- 4k followers
- Wikipedia