-
Notifications
You must be signed in to change notification settings - Fork 8.6k
Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[no ci] Add Nix and Flox install instructions
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7899
opened Jun 12, 2024 by
bryanhonof
Loading…
2 of 4 tasks
Implement non-mapped async IO for CUDA on Windows.
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7896
opened Jun 12, 2024 by
mtavenrath
Loading…
2 of 4 tasks
convert-hf-to-gguf-update.py: Added Ukrainian tokens into string
python
python script changes
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7888
opened Jun 11, 2024 by
AragonerUA
Loading…
2 of 4 tasks
Implemented Spellcheck for Llama.cpp
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7884
opened Jun 11, 2024 by
Ferruolo
Loading…
2 of 4 tasks
json
: better support for "type" unions (e.g. nullable arrays w/ typed items)
examples
python
#7863
opened Jun 10, 2024 by
ochafik
Loading…
[SYCL] Revert "use the correct SYCL context for host USM allocations"
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#7858
opened Jun 10, 2024 by
AidanBeltonS
Loading…
gguf-dump.py: add --markdown dump output
python
python script changes
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7853
opened Jun 10, 2024 by
mofosyne
Loading…
Type cast AVX512_BF16 data types based on compiler instead of the OS platform
ggml
changes relating to the ggml tensor library for machine learning
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7851
opened Jun 10, 2024 by
Srihari-mcw
Loading…
AVX IQ Quants
ggml
changes relating to the ggml tensor library for machine learning
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
#7845
opened Jun 10, 2024 by
netrunnereve
•
Draft
examples: refine tensor dump
examples
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7844
opened Jun 10, 2024 by
zhouwg
Loading…
Fix conversion of unnormalized BF16->BF16 weights
python
python script changes
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7843
opened Jun 10, 2024 by
CISC
Loading…
[WIP] python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
server
testing
Everything test related
json
: fix additionalProperties, allow space after enum/const
examples
python
update: support Qwen2-57B-A14B
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7835
opened Jun 8, 2024 by
legraphista
Loading…
Avoid division-by-zero on 0-weights
ggml
changes relating to the ggml tensor library for machine learning
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7825
opened Jun 7, 2024 by
CISC
Loading…
Add Qwen2MoE 57B-A14B
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7814
opened Jun 7, 2024 by
CISC
Loading…
build
: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc...
breaking change
#7809
opened Jun 6, 2024 by
ochafik
Loading…
[WIP] New feature or request
examples
python
python script changes
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
server
testing
Everything test related
json
: support integer minimum, maximum, exclusiveMinimum, exclusiveMaximum
enhancement
WIP: Use DirectStorage with CUDA interop to more efficient load tensors
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7796
opened Jun 6, 2024 by
mtavenrath
•
Draft
feat: add changes to handle jina v2 chinese code
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7795
opened Jun 6, 2024 by
JoanFM
Loading…
JSON Schema to GBNF integration tests
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
testing
Everything test related
#7790
opened Jun 6, 2024 by
HanClinto
Loading…
Fix missing libgomp.so.1 Error in Docker Container for llama.cpp
devops
improvements to build systems and github actions
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7775
opened Jun 5, 2024 by
0x4139
Loading…
Enable stream updating in the SwiftUI example
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7754
opened Jun 5, 2024 by
shu223
Loading…
Fix no gcc pragma on Windows
merge ready
indicates that this may be ready to merge soon and is just holding out in case of objections
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7751
opened Jun 4, 2024 by
jojorne
Loading…
[ci] add LLAMA_CURL flags to the prebuilt binaries
devops
improvements to build systems and github actions
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7747
opened Jun 4, 2024 by
Vaibhavs10
Loading…
Poro-34B-chat tokenizer support
enhancement
New feature or request
python
python script changes
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7713
opened Jun 3, 2024 by
ezosa
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-05-12.