Starred repositories
The most streamlined road map to learn ML for free.
A reactive notebook for Python: run reproducible experiments, execute as a script, deploy as an app, and version with git.
Reconquer the canvas: beautiful Tikz figures without clunky Tikz code
Python module (C extension and plain python) implementing Aho-Corasick algorithm
Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)
Fast & Simple repository for pre-training and fine-tuning T5-style models
distilabel is a framework for synthetic data and AI feedback for AI engineers who require high-quality outputs, full data ownership, and overall efficiency.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Efficient few-shot learning with Sentence Transformers
Experiments for efforts to train a new and improved t5
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Paper List for Contrastive Learning for Natural Language Processing
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Octopus is a neural machine generation toolkit for Arabic Natural Language Generation (NLG)
MTEB: Massive Text Embedding Benchmark
Sparsity-aware deep learning inference runtime for CPUs
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…
AraT5: Text-to-Text Transformers for Arabic Language Understanding
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Sentiment Corpus for Swedish, Norwegian, Danish, and Finnish (and English)
Modeling, training, eval, and inference code for OLMo
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Leveraging BERT and c-TF-IDF to create easily interpretable topics.