[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support GGUF Mixtral format where experts are in one tensor #355

Merged
merged 2 commits into from
Jun 8, 2024

Conversation

EricLBuehler
Copy link
Owner

Refs #352. After this PR, both the ffn_*_exps and ffn_*.{i} formats for expert tensors will be accepted.

@EricLBuehler EricLBuehler added the models Additions to model or architectures label May 28, 2024
Copy link
github-actions bot commented May 28, 2024
Code Metrics Report
  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Dockerfile              1           34           25            0            9
 Happy                   1          442          369            0           73
 JSON                    9           21           21            0            0
 Python                 21          741          622           21           98
 TOML                   15          393          355            1           37
-------------------------------------------------------------------------------
 Jupyter Notebooks       1            0            0            0            0
 |- Markdown             1           60           30           22            8
 |- Python               1           96           87            1            8
 (Total)                            156          117           23           16
-------------------------------------------------------------------------------
 Markdown               16         1054            0          781          273
 |- BASH                 6          203          190            0           13
 |- Python               6          121          110            0           11
 |- Rust                 3          185          172            9            4
 (Total)                           1563          472          790          301
-------------------------------------------------------------------------------
 Rust                   86        28434        26022          377         2035
 |- Markdown            42          439            0          427           12
 (Total)                          28873        26022          804         2047
===============================================================================
 Total                 151        31119        27414         1180         2525
===============================================================================
  

@EricLBuehler EricLBuehler merged commit c8fef8e into master Jun 8, 2024
11 checks passed
@EricLBuehler EricLBuehler deleted the mixtral_other_format branch June 8, 2024 03:11
EricLBuehler added a commit that referenced this pull request Jun 10, 2024
* Implement gpt2 gguf tokenizer

* Fix unk tok calculation

* Remove normalizer

* Update gguf tokenizer

* Allow adding unk token when found

* Add unk token to builder if provided.

* Improve add_special_tokens

* Use tokenizerx builder

* Add useful comment

Co-authored-by: Brennan Kinney <5098581+polarathene@users.noreply.github.com>

* Bump version to 0.1.16 (#404)

* Bump version to 0.1.17

* Fix version bump

* Add and update template READMEs (#405)

* Add readmes

* Fix typos

* Improve Rust docs (#406)

* Expose phi3v loader and remove unused deps (#408)

* Support format for mixtral where experts are in one tensor (#355)

* Normal loading metadata for vision models (#409)

* Phi 3 vision ISQ support (#410)

* ISQ support for phi3v

* Document it

* Remove causal masks cache (#412)

* Fix: use new slice_assign (#415)

* Use new slice_assign

* Fix dead image links

* Fix Phi-3 GGUF (#414)

* Fix kv head usage

* Fix rope weights

* Clippy

* Work on the gpt2 conversion

* Add comment

* Add some tests

* Update readme

---------

Co-authored-by: Brennan Kinney <5098581+polarathene@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
models Additions to model or architectures
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant