[go: nahoru, domu]

Skip to content
View Joel-hanson's full-sized avatar
:octocat:
Looking for opportunities
:octocat:
Looking for opportunities

Organizations

@IBM @jazzband
Block or Report

Block or report Joel-hanson

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Apache DataFusion SQL Query Engine

Rust 5,480 1,014 Updated Jul 1, 2024

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,672 4,189 Updated Jul 1, 2024

Publicly available real-time data sets on Kafka, Redpanda, RabbitMQ & Apache Pulsar

Python 30 3 Updated Jul 14, 2022

Scalable datastore for metrics, events, and real-time analytics

Rust 28,175 3,505 Updated Jul 1, 2024

the portable Python dataframe library

Python 4,515 549 Updated Jul 1, 2024

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

Jupyter Notebook 21,602 14,950 Updated Dec 22, 2023

SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.

Rust 6,567 536 Updated Jul 1, 2024

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

Java 7,020 409 Updated Jul 1, 2024

Metriport is an open-source universal API for healthcare data.

JavaScript 460 39 Updated Jul 1, 2024

An Awesome List of Open-Source Data Engineering Projects

1,798 283 Updated Jun 19, 2024

TFX is an end-to-end platform for deploying production ML pipelines

Python 2,089 693 Updated Jun 28, 2024

Fluentd: Unified Logging Layer (project under CNCF)

Ruby 12,686 1,327 Updated Jun 20, 2024

🚀 A curated list of awesome articles, videos, and other resources to learn and practice software architecture, patterns, and principles.

6,714 510 Updated Jun 28, 2024

Lean and mean distributed stream processing system written in rust and web assembly.

Rust 2,763 198 Updated Jul 1, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,232 1,625 Updated Jul 1, 2024

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 15,317 1,498 Updated Jul 1, 2024

A production ready example Django app that's using Docker and Docker Compose.

Python 1,137 238 Updated Jun 30, 2024
Jupyter Notebook 110 38 Updated Dec 11, 2023

Open Source Event Monitoring

TypeScript 3,200 347 Updated Dec 13, 2023

Testcontainers that start Kubernetes in Docker.

Java 105 9 Updated Jun 27, 2024

Flower: A Friendly Federated Learning Framework

Python 4,429 784 Updated Jul 1, 2024

Real Time Big Data / IoT Machine Learning (Model Training and Inference) with HiveMQ (MQTT), TensorFlow IO and Apache Kafka - no additional data store like S3, HDFS or Spark required

Jupyter Notebook 393 142 Updated Nov 5, 2020

This project contains examples which demonstrate how to deploy analytic models to mission-critical, scalable production environments leveraging Apache Kafka and its Streams API. Models are built wi…

Java 832 302 Updated Dec 17, 2023

A dedicated scratchpad for developers

JavaScript 3,624 161 Updated Jun 30, 2024

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 5,032 1,085 Updated Jul 1, 2024

EventStoreDB, the event-native database. Designed for Event Sourcing, Event-Driven, and Microservices architectures

C# 5,157 637 Updated Jul 1, 2024

Data-Centric Pipelines and Data Versioning

Go 6,098 568 Updated Jul 1, 2024

High-Performance server for NATS.io, the cloud and edge native messaging system.

Go 15,101 1,364 Updated Jul 1, 2024

Python library for Wit.ai

Python 1,449 359 Updated Dec 10, 2023

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Python 18,000 4,481 Updated Jun 29, 2024
Next