[go: nahoru, domu]

Skip to content
View Je-Cp's full-sized avatar
Block or Report

Block or report Je-Cp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 28,501 1,782 Updated Aug 8, 2024
Jupyter Notebook 40 12 Updated Jul 10, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,570 5,516 Updated Aug 8, 2024

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Python 3,830 260 Updated Aug 8, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,432 779 Updated Aug 5, 2024

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 58,047 30,047 Updated Aug 8, 2024

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 1,402 207 Updated Aug 8, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 14,284 1,262 Updated Aug 8, 2024

Pythonic Iceberg REST Catalog

Python 58 7 Updated Jul 24, 2024

New file format for storage of large columnar datasets.

C++ 407 22 Updated Jul 25, 2024

Trino AI SQL Functions

Java 6 2 Updated May 17, 2024

Interagir avec starburst via python

Python 1 Updated Jun 17, 2024

Apache DataFusion SQL Query Engine

Rust 5,713 1,068 Updated Aug 8, 2024

CLI to manage your datacontract.yaml files

Python 382 74 Updated Aug 8, 2024

For distributed HA HDFS only.

Shell 1 Updated Sep 20, 2017

Avro XML Mapper is a Java library that converts XML formatted data to Apache Avro format

Java 16 6 Updated Jun 20, 2024

A vulnerability scanner for container images and filesystems

Go 8,301 540 Updated Aug 8, 2024

Apache Atlas development image for the Rokku project: https://github.com/ing-bank/rokku

Shell 20 31 Updated Jun 9, 2020

Cluster in docker with Apache Atlas and a minimal Hadoop ecosystem to perform some basic experiments.

Shell 22 25 Updated Apr 16, 2024

This end-end demo will walk the users through the process of extracting text from encoded PDF documents at scale using Apache PDFBox and Databricks using Scala and Spark.

HTML 2 1 Updated Jul 19, 2021

Simple project to expose a catalog over REST using a Java catalog backend

Java 94 40 Updated Aug 4, 2024

A tool that makes it easy to run modular Trino environments locally.

Python 30 3 Updated Jun 14, 2024

📊 Cube — The Semantic Layer for Building Data Applications

Rust 17,584 1,750 Updated Aug 8, 2024

An orchestration platform for the development, production, and observation of data assets.

Python 10,923 1,361 Updated Aug 8, 2024

High quality resources & applications for LLMs, multi-modal models and VectorDBs

Jupyter Notebook 539 92 Updated Aug 6, 2024

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 809 133 Updated Aug 7, 2024

Trino connectors for accessing APIs with an OpenAPI spec

Java 24 5 Updated Aug 4, 2024

dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

Python 825 71 Updated Aug 5, 2024

A GitOps based Anthos Multi Cloud installer framework.

HCL 8 11 Updated Jul 14, 2021

MicroK8s is a small, fast, single-package Kubernetes for datacenters and the edge.

Python 8,308 761 Updated Aug 7, 2024
Next