[go: nahoru, domu]

Skip to content
View Joel-hanson's full-sized avatar
:octocat:
Looking for opportunities
:octocat:
Looking for opportunities

Organizations

@IBM @jazzband
Block or Report

Block or report Joel-hanson

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Apache DataFusion SQL Query Engine

Rust 5,493 1,018 Updated Jul 3, 2024

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,671 4,189 Updated Jul 3, 2024

Publicly available real-time data sets on Kafka, Redpanda, RabbitMQ & Apache Pulsar

Python 30 3 Updated Jul 14, 2022

Scalable datastore for metrics, events, and real-time analytics

Rust 28,189 3,506 Updated Jul 3, 2024

the portable Python dataframe library

Python 4,526 552 Updated Jul 3, 2024

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

Jupyter Notebook 21,610 14,952 Updated Dec 22, 2023

SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.

Rust 6,580 536 Updated Jul 3, 2024

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

Java 7,032 410 Updated Jul 3, 2024

Metriport is an open-source universal API for healthcare data.

JavaScript 462 39 Updated Jul 3, 2024

An Awesome List of Open-Source Data Engineering Projects

1,801 285 Updated Jun 19, 2024

TFX is an end-to-end platform for deploying production ML pipelines

Python 2,090 693 Updated Jul 3, 2024

Fluentd: Unified Logging Layer (project under CNCF)

Ruby 12,692 1,328 Updated Jun 20, 2024

🚀 A curated list of awesome articles, videos, and other resources to learn and practice software architecture, patterns, and principles.

7,004 531 Updated Jun 28, 2024

Lean and mean distributed stream processing system written in rust and web assembly.

Rust 2,767 199 Updated Jul 2, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,237 1,626 Updated Jul 3, 2024

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 15,328 1,499 Updated Jul 3, 2024

A production ready example Django app that's using Docker and Docker Compose.

Python 1,138 238 Updated Jun 30, 2024
Jupyter Notebook 110 38 Updated Dec 11, 2023

Open Source Event Monitoring

TypeScript 3,201 347 Updated Dec 13, 2023

Testcontainers that start Kubernetes in Docker.

Java 105 9 Updated Jul 2, 2024

Flower: A Friendly Federated Learning Framework

Python 4,440 784 Updated Jul 3, 2024

Real Time Big Data / IoT Machine Learning (Model Training and Inference) with HiveMQ (MQTT), TensorFlow IO and Apache Kafka - no additional data store like S3, HDFS or Spark required

Jupyter Notebook 393 142 Updated Nov 5, 2020

This project contains examples which demonstrate how to deploy analytic models to mission-critical, scalable production environments leveraging Apache Kafka and its Streams API. Models are built wi…

Java 832 302 Updated Dec 17, 2023

A dedicated scratchpad for developers

JavaScript 3,669 167 Updated Jul 2, 2024

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 5,035 1,086 Updated Jul 2, 2024

EventStoreDB, the event-native database. Designed for Event Sourcing, Event-Driven, and Microservices architectures

C# 5,159 637 Updated Jul 3, 2024

Data-Centric Pipelines and Data Versioning

Go 6,100 568 Updated Jul 3, 2024

High-Performance server for NATS.io, the cloud and edge native messaging system.

Go 15,109 1,364 Updated Jul 1, 2024

Python library for Wit.ai

Python 1,449 359 Updated Dec 10, 2023

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Python 18,011 4,482 Updated Jun 29, 2024
Next