[go: nahoru, domu]

Skip to content
View delltower's full-sized avatar

Block or report delltower

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Python 426 38 Updated Jul 15, 2024

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 376 20 Updated Sep 24, 2024

4 bits quantization of LLaMA using GPTQ

Python 2,994 457 Updated Jul 13, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,672 505 Updated Jul 18, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 134,682 26,933 Updated Nov 9, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 5,988 520 Updated Sep 6, 2024

Inference code for Llama models

Python 56,337 9,557 Updated Aug 18, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,360 1,867 Updated Apr 30, 2024

Let ChatGPT teach your own chatbot in hours with a single GPU!

Python 3,166 285 Updated Mar 17, 2024

Making large AI models cheaper, faster and more accessible

Python 38,781 4,341 Updated Nov 8, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,799 3,303 Updated Jul 23, 2024

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

Python 934 158 Updated Oct 22, 2022

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,397 207 Updated Apr 3, 2024

微信大赛baseline

Python 237 60 Updated Aug 6, 2022

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。

Jupyter Notebook 18,331 5,396 Updated Oct 14, 2021

Encapsulate server requests using react custom hooks

TypeScript 4 Updated Aug 10, 2021

"我的阅历"

2,145 742 Updated Feb 6, 2021

PlaidML is a framework for making deep learning work everywhere.

C++ 4,583 398 Updated Jul 23, 2023

本项目整合了常用中文nlp资源,包括:工具、数据、学习资源和常用模型。

Python 30 7 Updated Dec 11, 2019

This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.

Jupyter Notebook 4,242 1,241 Updated Jul 25, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 44,098 7,810 Updated Nov 9, 2024

TensorFlow code and pre-trained models for BERT

Python 38,163 9,598 Updated Jul 23, 2024

LeetCode刷题记录与面试整理

Java 7,373 1,882 Updated Aug 17, 2024

结巴中文分词

Python 33,325 6,722 Updated Aug 21, 2024

中文命名实体识别,实体抽取,tensorflow,pytorch,BiLSTM+CRF

Python 1,390 394 Updated Mar 15, 2020

Language Technology Platform

Python 4,961 1,040 Updated Oct 12, 2024

Pytorch-Named-Entity-Recognition-with-BERT

Python 1,209 278 Updated May 6, 2021

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

HTML 140 48 Updated Feb 16, 2020
Next