Block or Report
Block or report xuejoy
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A unified evaluation framework for large language models
Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)
[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score ru…
A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
The official GitHub page for the survey paper "A Survey of Large Language Models".
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
🦜🔗 Build context-aware reasoning applications
A Comprehensive Assessment of Trustworthiness in GPT Models
✨✨Latest Advances on Multimodal Large Language Models
Specify what you want it to build, the AI asks for clarification, and then builds it.
Examples and guides for using the OpenAI API
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
OpenDAN is an open source Personal AI OS , which consolidates various AI modules in one place for your personal use.
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.