-
Notifications
You must be signed in to change notification settings - Fork 8.7k
Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
readme: add Paddler to the list of projects
#8239
opened Jul 1, 2024 by
mcharytoniuk
Loading…
2 of 4 tasks
Removing fsep token from GPTRefactForCausalLM
python
python script changes
#8237
opened Jul 1, 2024 by
jpodivin
Loading…
2 of 4 tasks
[SYCL] Enabled more data types for oneMKL's gemm_batch API
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#8236
opened Jul 1, 2024 by
OuadiElfarouki
Loading…
2 of 4 tasks
Stylistic adjustments for python scripts
python
python script changes
#8233
opened Jul 1, 2024 by
jpodivin
Loading…
2 of 4 tasks
tests : add _CRT_SECURE_NO_WARNINGS for WIN32
build
Compilation issues
#8231
opened Jul 1, 2024 by
danbev
Loading…
2 of 4 tasks
[SYCL] Fix win build conflict of math library
build
Compilation issues
#8230
opened Jul 1, 2024 by
luoyu-intel
Loading…
2 of 4 tasks
llama : fix mpt and olmo pre-tokenizer
bugfix
fixes an issue or bug
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8228
opened Jun 30, 2024 by
compilade
Loading…
2 tasks done
gemma2: add sliding window mask
help wanted
Extra attention is needed
merge ready
indicates that this may be ready to merge soon and is just holding out in case of objections
python
python script changes
#8227
opened Jun 30, 2024 by
ngxson
Loading…
2 tasks done
adding guile_llama_cpp to binding list
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8222
opened Jun 30, 2024 by
andy-tai
Loading…
2 of 4 tasks
CUDA: refactor and optimize IQ MMVQ
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
#8215
opened Jun 29, 2024 by
JohannesGaessler
Loading…
clip: don't throw exceptions from llava functions compiled as extern "C"
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8210
opened Jun 29, 2024 by
hackingthekernel
Loading…
2 of 4 tasks
server: Windows 7 compatibility
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server
#8208
opened Jun 29, 2024 by
Zor-X-L
Loading…
2 of 4 tasks
Added checks for cmake,make and ctest in ci->run.sh
devops
improvements to build systems and github actions
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8200
opened Jun 28, 2024 by
AlexsCode
Loading…
2 of 4 tasks
json
: $ref + object overhaul (https & recursive $refs, mix properties & allOf)
breaking change
#8199
opened Jun 28, 2024 by
ochafik
Loading…
server : fix templates for llama2, llama3 and zephyr in new UI
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server
#8196
opened Jun 28, 2024 by
mgroeber9110
Loading…
2 of 4 tasks
Fix for Deepseek-V2's low-rank attention weights being quantized
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8194
opened Jun 28, 2024 by
jukofyork
Loading…
2 of 4 tasks
llama-bench : Generate full token count during warm up
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8187
opened Jun 28, 2024 by
joeatodd
Loading…
2 of 4 tasks
convert-hf : print output file name when completed
python
python script changes
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8181
opened Jun 28, 2024 by
danbev
Loading…
2 of 4 tasks
Add alpaca chat template (repush of #7383)
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
testing
Everything test related
#8159
opened Jun 27, 2024 by
jukofyork
Loading…
2 of 4 tasks
ggml-quants : 1.625 bpw ternary packing for BitNet b1.58
enhancement
New feature or request
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
Tensor Encoding Scheme
https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
testing
Everything test related
#8151
opened Jun 27, 2024 by
compilade
Loading…
5 of 9 tasks
llama : suppress unref var in Windows MSVC
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8150
opened Jun 27, 2024 by
danbev
Loading…
2 of 4 tasks
Inference support for T5 and FLAN-T5 model families
examples
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#8141
opened Jun 26, 2024 by
fairydreaming
Loading…
2 of 4 tasks
Performance Tuning for Q4_K matmul CUDA kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8136
opened Jun 26, 2024 by
contentis
Loading…
2 of 4 tasks
json
: unified properties order across optional & required
examples
python
Quantize: use --pure, --output-tensor-type and --token-embedding-type as the same time
examples
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#8130
opened Jun 26, 2024 by
ZeusXuan
Loading…
2 of 4 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.