Can i use 65b model and how? #149
To run this model, I modified chat.cpp, changing the LLAMA_N_PARTS entry { 8192, 1 } to { 8192, 8 }, then recompiled and ran with -m ggml-model-q4_0.bin. Worked smoothly; ~42 GB RAM used.
Yes, it works for me too, in the Windows chat.exe build. Thank you!
I would like to experiment with this. I have an ASRock board with 128 GB of RAM; as far as I know, ASRock is the only manufacturer still announcing new consumer motherboards with multiple slots for huge amounts of RAM (the other option is Micro-Star (MSI) server boards). I kind of anticipated all this back in 2020, but usable AIs have only surfaced now.
Have you posted your compiled file anywhere? I'm using Win 10 x64.
No, I didn't. I built the binary using the standard cmake-based Windows instructions.
I've spent several days on unsuccessful attempts to compile chat.cpp. Could you please share a ready-made executable for launching the 65B model?
I downloaded these files:
ggml-model-q4_0.bin
ggml-model-q4_0.bin.1
ggml-model-q4_0.bin.2
ggml-model-q4_0.bin.3
ggml-model-q4_0.bin.4
ggml-model-q4_0.bin.5
ggml-model-q4_0.bin.6
ggml-model-q4_0.bin.7
Can I run a command like ./chat -m ggml-model-q4_0.bin to load all of these parts and use the full 65B power?