Add support for cross-gpu device mapping #462

EricLBuehler · 2024-06-22T18:57:53Z

Refs #395.

github-actions · 2024-06-22T18:59:01Z

Code Metrics Report

  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Dockerfile              1           34           25            0            9
 Happy                   1          442          369            0           73
 JSON                    9           21           21            0            0
 Python                 32         1256         1075           37          144
 TOML                   16          444          403            1           40
-------------------------------------------------------------------------------
 Jupyter Notebooks       1            0            0            0            0
 |- Markdown             1           60           30           22            8
 |- Python               1           96           87            1            8
 (Total)                            156          117           23           16
-------------------------------------------------------------------------------
 Markdown               17         1297            0          965          332
 |- BASH                 5          100           97            0            3
 |- Python               6          122          110            0           12
 |- Rust                 3          151          135            6           10
 (Total)                           1670          342          971          357
-------------------------------------------------------------------------------
 Rust                  119        36082        32630          629         2823
 |- Markdown            59          659           13          609           37
 (Total)                          36741        32643         1238         2860
===============================================================================
 Total                 197        39576        34523         1632         3421
===============================================================================

chenwanqq · 2024-06-27T09:30:25Z

mistral.rs/mistralrs-core/src/models/llama.rs

Lines 302 to 303 in e04f840

	let vb = vb.set_dtype(mapper.get_min_dtype()?);

I think maybe this line copy the entire weights..
It does cost serious memory usage problem, can not even start a 7b llama model in 4090 GPU...

EricLBuehler · 2024-07-04T09:56:08Z

@chenwanqq can you please raise an issue about this?

chenwanqq · 2024-07-05T03:16:42Z

@chenwanqq can you please raise an issue about this?

The problem is that I can not track this problem again.. after using memory usage to check:

println!("vb.device: {:?}", vb.device());
        println!("memory usage: {:?}", memory_track.get_memory_available(vb.device()));
        let vb = vb.set_dtype(mapper.get_min_dtype()?);
        println!("memory usage: {:?}", memory_track.get_memory_available(vb.device()));

It is same before and after. And I did not encounter oom problem today...

Add support for cross-gpu device mapping

0c8b2ce

EricLBuehler added new feature New feature or request backend Backend work labels Jun 22, 2024

EricLBuehler added 9 commits June 22, 2024 15:02

Clippy

6cd0c1d

Handle multiple gpus in dtype selector

43d7354

Handle multiple gpus in dtype selector

d2b89cc

Fix

0ef0dc9

Get min supported dtype

97d562b

Fix

047ed7a

Fix vb_m dtype

90ae056

Add docs

be3ce9e

Fix llama

018a7b1

EricLBuehler merged commit 196c893 into master Jun 23, 2024
10 of 11 checks passed

EricLBuehler deleted the cross_gpu_map branch June 23, 2024 03:17

EricLBuehler mentioned this pull request Jun 23, 2024

Cross GPU device mapping feature #395

Closed

chenwanqq mentioned this pull request Jun 27, 2024

Add LLaVA Support #484

Merged

11 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for cross-gpu device mapping #462

Add support for cross-gpu device mapping #462

Add support for cross-gpu device mapping #462

Add support for cross-gpu device mapping #462

Conversation