[go: nahoru, domu]

Skip to content
View Murhaf's full-sized avatar
Block or Report

Block or report Murhaf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results
Python 6 Updated Jul 2, 2024

The most streamlined road map to learn ML for free.

225 26 Updated Jul 5, 2024

AI Observability & Evaluation

Jupyter Notebook 3,021 218 Updated Jul 10, 2024

A reactive notebook for Python โ€” run reproducible experiments, execute as a script, deploy as an app, and version with git.

Python 5,412 162 Updated Jul 10, 2024

Reconquer the canvas: beautiful Tikz figures without clunky Tikz code

Python 370 33 Updated Nov 18, 2020

LLM101n: Let's build a Storyteller

15,259 729 Updated Jun 28, 2024

Python module (C extension and plain python) implementing Aho-Corasick algorithm

C 912 122 Updated Mar 21, 2024

Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)

Python 603 17 Updated Jul 8, 2024

AI + Data, online. https://vespa.ai

Java 5,471 584 Updated Jul 10, 2024

The ultimate Vim configuration (vimrc)

Vim Script 30,360 7,262 Updated May 27, 2024

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 944 67 Updated Jun 14, 2024

โš—๏ธ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Python 1,128 71 Updated Jul 9, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,139 2,438 Updated Jul 9, 2024

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,081 207 Updated Jul 3, 2024

Experiments for efforts to train a new and improved t5

Python 74 5 Updated Apr 15, 2024

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,479 175 Updated Jun 27, 2024

Paper List for Contrastive Learning for Natural Language Processing

507 55 Updated Apr 27, 2023

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,072 703 Updated Jul 9, 2024

Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)

Python 9 Updated Apr 29, 2024

MTEB: Massive Text Embedding Benchmark

Python 1,636 212 Updated Jul 10, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 2,939 169 Updated Jul 5, 2024

Scale LLM Engine public repository

Python 755 48 Updated Jul 10, 2024

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,172 100 Updated Mar 16, 2024

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Cโ€ฆ

Python 465 39 Updated Jun 22, 2024

AraT5: Text-to-Text Transformers for Arabic Language Understanding

84 18 Updated May 16, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,324 426 Updated May 3, 2024

Sentiment Corpus for Swedish ๐Ÿ‡ธ๐Ÿ‡ช Norwegian ๐Ÿ‡ณ๐Ÿ‡ด Danish ๐Ÿ‡ฉ๐Ÿ‡ฐ Finnish ๐Ÿ‡ซ๐Ÿ‡ฎ (and English ๐Ÿด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ)

15 1 Updated May 3, 2021

Modeling, training, eval, and inference code for OLMo

Python 4,202 393 Updated Jul 10, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 1,900 126 Updated Jul 10, 2024

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 5,791 721 Updated Jul 6, 2024
Next