
ollama run codellama:34b issue #4519

Open
Iliceth opened this issue May 19, 2024 · 11 comments
Labels
bug Something isn't working

Comments

@Iliceth

Iliceth commented May 19, 2024

What is the issue?

Every model I have tested with Ollama runs fine, but when trying ollama run codellama:34b, I get:

Error: llama runner process has terminated: signal: segmentation fault (core dumped)

I then tried the 13B version, and that works fine.

OS

Linux

GPU

Nvidia

CPU

Intel

Ollama version

0.1.37

@Iliceth Iliceth added the bug Something isn't working label May 19, 2024
@izzy84

izzy84 commented May 19, 2024

same here

@jmorganca
Member

So sorry about this, looking into why this is happening.

@jonz-secops

same

@dhiltgen dhiltgen self-assigned this May 22, 2024
@barrard

barrard commented May 23, 2024

I'm also getting this on 7b-python and 13b-python

@dhiltgen
Collaborator

Could you share your server log and your VRAM size? This is a larger model, so my suspicion is that we're off on our memory prediction.
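
If it helps, something along these lines should capture both, assuming a standard Linux install managed by systemd and an NVIDIA GPU (adjust for your setup):

journalctl -u ollama --no-pager > ollama_server.log
nvidia-smi --query-gpu=name,memory.total --format=csv

The first command dumps the Ollama server log to ollama_server.log; the second reports the GPU model and total VRAM.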

@9SMTM6

9SMTM6 commented May 23, 2024

Same here.

AMD CPU, in contrast to the OP.

16 GB VRAM (RTX 4070 Ti Super)

Log attached.
codellama_crashdump_shortend.log

@jonz-secops

To add some details:

AMD 5950X on an X570E board
32 GB system RAM

6900 XT
16 GB VRAM

The server log doesn't appear to have the events in it anymore, but I'll try to recreate the issue later today if I can.

@Iliceth
Author

Iliceth commented May 23, 2024

Here is my log.

ollama_logs.txt

I can run mixtral and dolphin-mixtral fine, and those are fairly big models too.

Anyhow, my current system is an i7-9700K + 32 GB RAM + RTX 3090 with 24 GB VRAM.

@izzy84

izzy84 commented May 23, 2024

i7-7700K with 32 GB RAM and 6 GPUs with 51 GB VRAM in total.

I can run all models without problems, even llama3:70b, which is already a big model. But I get the error with codellama:34b.

@FukangSun

FukangSun commented May 24, 2024

Same here. I use version 0.1.38 of the official Docker image.
E5-2670 + 96 GB RAM + 1 RTX 4090. I also get the error below when running codellama 34b:

ollama run codellama-34b-instruct:Q4_K_M
Error: llama runner process has terminated: signal: segmentation fault (core dumped)

Gentlemen, I found another way: downgrading to 0.1.32 lets me load codellama and other large models. I tested 0.1.33 through 0.1.39 and none of them worked.
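
Roughly how I pinned the older version (assuming the Docker image is tagged per release, and that the official install script honors an OLLAMA_VERSION variable):

docker pull ollama/ollama:0.1.32

or, for a native Linux install:

curl -fsSL https://ollama.com/install.sh | OLLAMA_VERSION=0.1.32 sh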

@AZ777xx

AZ777xx commented May 26, 2024

Same here. Phind-codellama from the repository works, but loading a GGUF doesn't.
